Advanced

The Database of Cross-Linguistic Colexifications, reproducible analysis of cross-linguistic polysemies

Rzymski, Christoph ; Tresoldi, Tiago ; Greenhill, Simon J. ; Wu, Mei Shin ; Schweikhard, Nathanael E. ; Koptjevskaja-Tamm, Maria ; Gast, Volker ; Bodt, Timotheus A. ; Hantgan, Abbie and Kaiping, Gereon A. , et al. (2020) In Scientific Data 7(1).
Abstract

Advances in computer-assisted linguistic research have been greatly influential in reshaping linguistic research. With the increasing availability of interconnected datasets created and curated by researchers, more and more interwoven questions can now be investigated. Such advances, however, are bringing high requirements in terms of rigorousness for preparing and curating datasets. Here we present CLICS, a Database of Cross-Linguistic Colexifications (CLICS). CLICS tackles interconnected interdisciplinary research questions about the colexification of words across semantic categories in the world's languages, and show-cases best practices for preparing data for cross-linguistic research. This is done by addressing shortcomings of an... (More)

Advances in computer-assisted linguistic research have been greatly influential in reshaping linguistic research. With the increasing availability of interconnected datasets created and curated by researchers, more and more interwoven questions can now be investigated. Such advances, however, are bringing high requirements in terms of rigorousness for preparing and curating datasets. Here we present CLICS, a Database of Cross-Linguistic Colexifications (CLICS). CLICS tackles interconnected interdisciplinary research questions about the colexification of words across semantic categories in the world's languages, and show-cases best practices for preparing data for cross-linguistic research. This is done by addressing shortcomings of an earlier version of the database, CLICS2, and by supplying an updated version with CLICS3, which massively increases the size and scope of the project. We provide tools and guidelines for this purpose and discuss insights resulting from organizing student tasks for database updates.

(Less)
Please use this url to cite or link to this publication:
author
, et al. (More)
(Less)
organization
publishing date
type
Contribution to journal
publication status
published
subject
in
Scientific Data
volume
7
issue
1
article number
13
publisher
Nature Publishing Group
external identifiers
  • pmid:31932593
  • scopus:85077786232
ISSN
2052-4463
DOI
10.1038/s41597-019-0341-x
language
English
LU publication?
yes
id
711d8455-0154-41ce-842c-390cb4a6c8d5
date added to LUP
2020-01-27 08:57:19
date last changed
2020-01-30 03:59:13
@article{711d8455-0154-41ce-842c-390cb4a6c8d5,
  abstract     = {<p>Advances in computer-assisted linguistic research have been greatly influential in reshaping linguistic research. With the increasing availability of interconnected datasets created and curated by researchers, more and more interwoven questions can now be investigated. Such advances, however, are bringing high requirements in terms of rigorousness for preparing and curating datasets. Here we present CLICS, a Database of Cross-Linguistic Colexifications (CLICS). CLICS tackles interconnected interdisciplinary research questions about the colexification of words across semantic categories in the world's languages, and show-cases best practices for preparing data for cross-linguistic research. This is done by addressing shortcomings of an earlier version of the database, CLICS2, and by supplying an updated version with CLICS3, which massively increases the size and scope of the project. We provide tools and guidelines for this purpose and discuss insights resulting from organizing student tasks for database updates.</p>},
  author       = {Rzymski, Christoph and Tresoldi, Tiago and Greenhill, Simon J. and Wu, Mei Shin and Schweikhard, Nathanael E. and Koptjevskaja-Tamm, Maria and Gast, Volker and Bodt, Timotheus A. and Hantgan, Abbie and Kaiping, Gereon A. and Chang, Sophie and Lai, Yunfan and Morozova, Natalia and Arjava, Heini and Hübler, Nataliia and Koile, Ezequiel and Pepper, Steve and Proos, Mariann and Van Epps, Briana and Blanco, Ingrid and Hundt, Carolin and Monakhov, Sergei and Pianykh, Kristina and Ramesh, Sallona and Gray, Russell D. and Forkel, Robert and List, Johann Mattis},
  issn         = {2052-4463},
  language     = {eng},
  month        = {01},
  number       = {1},
  publisher    = {Nature Publishing Group},
  series       = {Scientific Data},
  title        = {The Database of Cross-Linguistic Colexifications, reproducible analysis of cross-linguistic polysemies},
  url          = {http://dx.doi.org/10.1038/s41597-019-0341-x},
  doi          = {10.1038/s41597-019-0341-x},
  volume       = {7},
  year         = {2020},
}