Skip to content

Dataset Index

The NCCR Evolving Language offers the following datasets as open source downloads:

Hovsepyan, S., Olasagasti, I., & Giraud, A.-L. (2022). Rhythmic modulation of prediction errors: a possible role for the beta-range in speech processing. bioRxiv. https://doi.org/10.1101/2022.03.28.486037 Download
Su, Y., Olasagasti, I., & Giraud, A.-L. (2022). A deep hierarchy of predictions enables assignment of semantic roles in real-time speech comprehension. bioRxiv. https://doi.org/10.1101/2022.04.01.486694 Download
Proix, T., Delgado Saa, J., Christen, A., Martin, S., Pasley, B. N., Knight, R. T., … Giraud, A.-L. (2022). Imagined speech can be decoded from low- and cross-frequency intracranial EEG features. Nature Communications, 13(1), 48. https://doi.org/10.1038/s41467-021-27725-3
Pasqualotto, A., Altarelli, I., De Angeli, A., Menestrina, Z., Bavelier, D., & Venuti, P. (2022). Enhancing reading skills through a video game mixing action mechanics and cognitive training. Nature Human Behaviour, 6(4), 545–554. https://doi.org/10.1038/s41562-021-01254-x
Egurtzegi, A., Blasi, D. E., Bornkessel-Schlesewsky, I., Laka, I., Meyer, M., Bickel, B., & Sauppe, S. (2022). Cross-linguistic differences in case marking shape neural power dynamics and gaze behavior during sentence planning. Brain and Language, 230, 105127. https://doi.org/10.1016/j.bandl.2022.105127
Mansfield, J., Saldana, C., Hurst, P., Nordlinger, R., Stoll, S., Bickel, B., & Perfors, A. (2022). Category Clustering and Morphological Learning. Retrieved from https://osf.io/mwj54/
de Vevey, M., Bouchard, A., Soldati, A., & Zuberbühler, K. (2022). Thermal imaging reveals audience-dependent effects during cooperation and competition in wild chimpanzees. https://doi.org/10.6084/m9.figshare.17136965.v1
Bouchard, A., & Zuberbühler, K. (2022). An intentional cohesion call in male chimpanzees of Budongo Forest. Retrieved from https://figshare.com/projects/An_intentional_cohesion_call_in_male_chimpanzees_of_Budongo_Forest/97508
Sauppe, S., Choudhary, K. K., Giroud, N., Blasi, D. E., Norcliffe, E., Bhattamishra, S., … Bickel, B. (2021). Neural signatures of syntactic variation in speech planning. PLOS Biology, 19(1), e3001038. https://doi.org/10.1371/journal.pbio.3001038 Download
Neureiter, N., Ranacher, P., van Gijn, R., Bickel, B., & Weibel, R. (2021). Can Bayesian phylogeography reconstruct migrations and expansions in linguistic evolution? Royal Society Open Science, 8(1), 201079. https://doi.org/10.1098/rsos.201079
Aguirre-Fernández, G., Barbieri, C., Graff, A., Pérez de Arce, J., Moreno, H., & Sánchez-Villagra, M. R. (2021). Cultural macroevolution of musical instruments in South America. Humanities and Social Sciences Communications, 8(1), 1–12. https://doi.org/10.1057/s41599-021-00881-z Download
Matsumae, H., Ranacher, P., Savage, P. E., Blasi, D. E., Currie, T. E., Koganebuchi, K., … Bickel, B. (2021). Exploring correlations in genetic and cultural variation across language families in northeast Asia. Science Advances, 7(34), eabd9223. https://doi.org/10.1126/sciadv.abd9223
Ranacher, P., Neureiter, N., van Gijn, R., Sonnenhauser, B., Escher, A., Weibel, R., … Bickel, B. (2021). Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact. Journal of The Royal Society Interface, 18(181), 20201031. https://doi.org/10.1098/rsif.2020.1031
You, G., Daum, M., & Stoll, S. (2021). Processing Causatives in First Language Acquisition: A Computational Approach. Retrieved from https://github.com/G-You/cds_adaptation
You, G., Bickel, B., Daum, M. M., & Stoll, S. (2021). Child-directed speech is optimized for syntax-free semantic inference. Retrieved from https://osf.io/hcj7y/?view_only=87bf26f2343c4a9ebc93d69aaaf6eddb
Wirsich, J., Jorge, J., Iannotti, G. R., Shamshiri, E. A., Grouiller, F., Abreu, R., … Vulliémoz, S. (2021). The relationship between EEG and fMRI connectomes is reproducible across simultaneous EEG-fMRI studies from 1.5T to 7T. https://doi.org/10.5281/zenodo.3905103
Miani, A., Hills, T., & Bangerter, A. (2021). LOCO: The 88-million-word language of conspiracy corpus. Retrieved from https://osf.io/snpcg/
Sato, T., Adachi, N., Kimura, R., Hosomichi, K., Yoneda, M., Oota, H., … Ishida, H. (2021). Whole Genome Sequencing of a 900-year-old Human Skeleton Supports Two Past Migration Events from the Russian Far East to Northern Japan. Retrieved from https://ddbj.nig.ac.jp/resource/sra-submission/DRA010743
Mohammadshahi, A., & Henderson, J. (2021). Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement. Retrieved from https://github.com/idiap/g2g-transformer
Jing, Y., Blasi, D., & Bickel, B. (2021). Dependency length minimization and its limits: a possible role for a probabilistic version of the Final-Over-Final Condition (to appear in Language). https://doi.org/10.17605/OSF.IO/A2JVW
Heesen, R., Bangerter, A., Zuberbühler, K., Iglesias, K., Neumann, C., Pajot, A., … Genty, E. (2021). Assessing joint commitment as a process in great apes. https://doi.org/10.6084/m9.figshare.14723493
Fröhlich, M., Bartolotta, N., Fryns, C., Wagner, C., Momon, L., Jaffrezic, M., … Schaik, C. P. van. (2021). Wild-captive contrasts in non-vocal communicative repertoires and functional specificity in orang-utans. Retrieved from https://github.com/MarlenF/repertoire-orang
Le Floch, A., Bouchard, A., Gallot, Q., & Zuberbühler, K. (2021). Lesser spot-nosed monkeys coordinate alarm call production with associated Campbell’s monkeys. Retrieved from https://figshare.com/projects/Lesser_spot-nosed_monkeys_coordinate_alarm_call_production_with_associated_Campbell_s_monkeys/98576
Hollenstein, N., Pirovano, F., Zhang, C., Jäger, L., & Beinborn, L. (2021). Multilingual Language Models Predict Human Reading Behavior. Retrieved from https://github.com/DS3Lab/multilingual-gaze
Heesen, R., Zuberbühler, K., Bangerter, A., Iglesias, K., Rossano, F., Pajot, A., … Genty, E. (2021). Evidence of joint commitment in great apes’ natural joint actions. https://doi.org/10.6084/m9.figshare.14891865
Gönül, G., & Paulus, M. (2021). Children’s reasoning about the efficiency of others’ actions: The development of rational action prediction. https://doi.org/10.17605/OSF.IO/EJ43P
Fröhlich, M., Bartolotta, N., Fryns, C., Wagner, C., Momon, L., Jaffrezic, M., … van Schaik, C. P. (2021). Multicomponent and multisensory communicative acts in orang-utans may serve different functions. https://doi.org/10.5281/zenodo.4882719
Brügger, R. K., Willems, E. P., & Burkart, J. M. (2021). Do marmosets understand others’ conversations? A thermography approach. https://doi.org/10.17605/OSF.IO/BR95D
Bonvin, A., Brugger, L., & Berthele, R. (2021). Lexical measures as a proxy for bilingual language dominance? Retrieved from https://osf.io/vqe46/?view_only=61f5a453e1ee4629acf0fa52411af0b2
Watson, S. K., Burkart, J. M., Schapiro, S. J., Lambeth, S. P., Mueller, J. L., & Townsend, S. W. (2020). Nonadjacent dependency processing in monkeys, apes, and humans. Science Advances, 6(43), eabb0725. https://doi.org/10.1126/sciadv.abb0725
Marchesotti, S., Nicolle, J., Merlet, I., Arnal, L. H., Donoghue, J. P., & Giraud, A.-L. (2020). Selective enhancement of low-gamma activity by tACS improves phonemic processing and reading accuracy in dyslexia. PLoS Biology, 18(9), e3000833. https://doi.org/10.1371/journal.pbio.3000833
Widmer, M., Jenny, M., Behr, W., & Bickel, B. (2020). Morphological structure can escape reduction effects from mass admixture of second language speakers. Retrieved from https://osf.io/mt98r
Shorland, G., Genty, E., Guéry, J.-P., & Zuberbühler, K. (2020). Investigating self-recognition in bonobos: mirror exposure reduces looking time to self but not unfamiliar conspecifics. https://doi.org/10.6084/m9.figshare.12608786.v1
Shimizu, K. K., Copetti, D., Okada, M., Wicker, T., Tameshige, T., Hatakeyama, M., … Handa, H. (2020). De Novo Genome Assembly of the Japanese Wheat Cultivar Norin 61 Highlights Functional Variation in Flowering Time and Fusarium Resistance Genes in East Asian Genotypes. Retrieved from IPK, Germany, https://wheat.ipk-gatersleben.de/ (last accessed on Dec. 15, 2020), and BLAST server at the National Institute of Genetics, Japan, https://shigen.nig.ac.jp/wheat/komugi/about/norin61GenomeSequence.jsp (last accessed on Dec. 15, 2020). The gene annotation of the Fhb1 locus can be downloaded from https://de.cyverse.org/dl/d/6A85909D-942B-4C95-AEB8-7B5516680878/Fhb1_N61_340kbregion.tar.gz.
Watson, S. K., Heesen, R., Hedwig, D., Robbins, M. M., & Townsend, S. W. (2020). An exploration of Menzerath’s law in wild mountain gorilla vocal sequences. Retrieved from https://osf.io/5u3jf/
Sokoliuk, R., Degano, G., Banellis, L., Melloni, L., Hayton, T., Sturman, S., … Cruse, D. (2020). Covert Speech Comprehension Predicts Recovery From Acute Unresponsive States. Retrieved from https://osf.io/wu2vy/
Kurthen, I., Meyer, M., Schlesewsky, M., & Bornkessel-Schlesewsky, I. (2020). Individual differences in peripheral hearing and cognition reveal sentence processing differences in healthy older adults. Retrieved from https://osf.io/9qx8h/
Hovsepyan, S., Olasagasti, I., & Giraud, A.-L. (2020). Combining predictive coding and neural oscillations enables online syllable recognition in natural speech. Retrieved from https://github.com/sevadah/precoss/tree/master/data_construction
Collier, K., Radford, A. N., Stoll, S., Watson, S. K., Manser, M. B., Bickel, B., & Townsend, S. W. (2020). Dwarf mongoose alarm calls: investigating a complex non-human animal call. https://doi.org/10.6084/m9.figshare.c.5124519
Bangerter, A., Mayor, E., & Knutsen, D. (2020). Lexical entrainment without conceptual pacts? Revisiting the matching task. Retrieved from https://osf.io/a4m7k/
Andrieu, J., Penny, S. G., Bouchet, H., Malaivijitnond, S., Reichard, U. H., & Zuberbühler, K. (2020). White-handed gibbons discriminate context-specific song compositions. https://doi.org/10.6084/m9.figshare.12363050.v1