Dataset Index

The NCCR Evolving Language makes the following datasets available as open-source downloads:

Su, Y., Olasagasti, I., & Giraud, A.-L. (2022, April 5). A deep hierarchy of predictions enables assignment of semantic roles in real-time speech comprehension. bioRxiv. https://doi.org/10.1101/2022.04.01.486694
Hovsepyan, S., Olasagasti, I., & Giraud, A.-L. (2022). Rhythmic modulation of prediction errors: a possible role for the beta-range in speech processing. bioRxiv. https://doi.org/10.1101/2022.03.28.486037
Pasqualotto, A., Altarelli, I., De Angeli, A., Menestrina, Z., Bavelier, D., & Venuti, P. (2022). Enhancing reading skills through a video game mixing action mechanics and cognitive training. https://doi.org/10.17605/OSF.IO/4RZGC
Proix, T., Delgado Saa, J., Christen, A., Martin, S., Pasley, B. N., Knight, R. T., … Giraud, A.-L. (2022). Imagined speech can be decoded from low- and cross-frequency intracranial EEG features. https://doi.org/10.5281/zenodo.5702872
Mansfield, J., Saldana, C., Hurst, P., Nordlinger, R., Stoll, S., Bickel, B., & Perfors, A. (2022). Category Clustering and Morphological Learning. Retrieved from https://osf.io/mwj54/
Egurtzegi, A., Blasi, D. E., Bornkessel-Schlesewsky, I., Laka, I., Meyer, M., Bickel, B., & Sauppe, S. (2022). Cross-linguistic differences in case marking shape neural power dynamics and gaze behavior during sentence planning. Retrieved from https://osf.io/s8tq5/
de Vevey, M., Bouchard, A., Soldati, A., & Zuberbühler, K. (2022). Thermal imaging reveals audience-dependent effects during cooperation and competition in wild chimpanzees. https://doi.org/10.6084/m9.figshare.17136965.v1
Bouchard, A., & Zuberbühler, K. (2022). An intentional cohesion call in male chimpanzees of Budongo Forest. Retrieved from https://figshare.com/projects/An_intentional_cohesion_call_in_male_chimpanzees_of_Budongo_Forest/97508
You, G., Daum, M., & Stoll, S. (2021). Processing Causatives in First Language Acquisition: A Computational Approach. Retrieved from https://github.com/G-You/cds_adaptation
You, G., Bickel, B., Daum, M. M., & Stoll, S. (2021). Child-directed speech is optimized for syntax-free semantic inference. Retrieved from https://osf.io/hcj7y/?view_only=87bf26f2343c4a9ebc93d69aaaf6eddb
Wirsich, J., Jorge, J., Iannotti, G. R., Shamshiri, E. A., Grouiller, F., Abreu, R., … Vulliémoz, S. (2021). The relationship between EEG and fMRI connectomes is reproducible across simultaneous EEG-fMRI studies from 1.5T to 7T. https://doi.org/10.5281/zenodo.3905103
Ranacher, P., Neureiter, N., van Gijn, R., Sonnenhauser, B., Escher, A., Weibel, R., … Bickel, B. (2021). Contact-tracing in cultural evolution: a Bayesian mixture model to detect geographic areas of language contact. Retrieved from https://github.com/derpetermann/sbayes
Miani, A., Hills, T., & Bangerter, A. (2021). LOCO: The 88-million-word language of conspiracy corpus. Retrieved from https://osf.io/snpcg/
Sauppe, S., Choudhary, K. K., Giroud, N., Blasi, D. E., Norcliffe, E., Bhattamishra, S., … Bickel, B. (2021). Neural signatures of syntactic variation in speech planning. Retrieved from https://osf.io/uhtcn/
Sato, T., Adachi, N., Kimura, R., Hosomichi, K., Yoneda, M., Oota, H., … Ishida, H. (2021). Whole Genome Sequencing of a 900-year-old Human Skeleton Supports Two Past Migration Events from the Russian Far East to Northern Japan. Retrieved from https://ddbj.nig.ac.jp/resource/sra-submission/DRA010743
Neureiter, N., Ranacher, P., van Gijn, R., Bickel, B., & Weibel, R. (2021). Can Bayesian phylogeography reconstruct migrations and expansions in linguistic evolution? https://doi.org/10.5281/zenodo.4279082
Mohammadshahi, A., & Henderson, J. (2021). Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement. Retrieved from https://github.com/idiap/g2g-transformer
Matsumae, H., Ranacher, P., Savage, P. E., Blasi, D. E., Currie, T. E., Koganebuchi, K., … Bickel, B. (2021). Exploring correlations in genetic and cultural variation across language families in northeast Asia. Retrieved from https://github.com/derpetermann/music_languages_genes
Jing, Y., Blasi, D., & Bickel, B. (2021). Dependency length minimization and its limits: a possible role for a probabilistic version of the Final-Over-Final Condition (to appear in Language). https://doi.org/10.17605/OSF.IO/A2JVW
Heesen, R., Bangerter, A., Zuberbühler, K., Iglesias, K., Neumann, C., Pajot, A., … Genty, E. (2021). Assessing joint commitment as a process in great apes. https://doi.org/10.6084/m9.figshare.14723493
Fröhlich, M., Bartolotta, N., Fryns, C., Wagner, C., Momon, L., Jaffrezic, M., … van Schaik, C. P. (2021). Wild-captive contrasts in non-vocal communicative repertoires and functional specificity in orang-utans. Retrieved from https://github.com/MarlenF/repertoire-orang
Le Floch, A., Bouchard, A., Gallot, Q., & Zuberbühler, K. (2021). Lesser spot-nosed monkeys coordinate alarm call production with associated Campbell’s monkeys. Retrieved from https://figshare.com/projects/Lesser_spot-nosed_monkeys_coordinate_alarm_call_production_with_associated_Campbell_s_monkeys/98576
Hollenstein, N., Pirovano, F., Zhang, C., Jäger, L., & Beinborn, L. (2021). Multilingual Language Models Predict Human Reading Behavior. Retrieved from https://github.com/DS3Lab/multilingual-gaze
Heesen, R., Zuberbühler, K., Bangerter, A., Iglesias, K., Rossano, F., Pajot, A., … Genty, E. (2021). Evidence of joint commitment in great apes’ natural joint actions. https://doi.org/10.6084/m9.figshare.14891865
Gönül, G., & Paulus, M. (2021). Children’s reasoning about the efficiency of others’ actions: The development of rational action prediction. https://doi.org/10.17605/OSF.IO/EJ43P
Fröhlich, M., Bartolotta, N., Fryns, C., Wagner, C., Momon, L., Jaffrezic, M., … van Schaik, C. P. (2021). Multicomponent and multisensory communicative acts in orang-utans may serve different functions. https://doi.org/10.5281/zenodo.4882719
Brügger, R. K., Willems, E. P., & Burkart, J. M. (2021). Do marmosets understand others’ conversations? A thermography approach. https://doi.org/10.17605/OSF.IO/BR95D
Bonvin, A., Brugger, L., & Berthele, R. (2021). Lexical measures as a proxy for bilingual language dominance? Retrieved from https://osf.io/vqe46/?view_only=61f5a453e1ee4629acf0fa52411af0b2
Aguirre-Fernández, G., Barbieri, C., Graff, A., Pérez de Arce, J., Moreno, H., & Sánchez-Villagra, M. R. (2021). Cultural macroevolution of musical instruments in South America. Retrieved from https://github.com/chiarabarbieri/SouthAmerica_MusicInstruments
Widmer, M., Jenny, M., Behr, W., & Bickel, B. (2020). Morphological structure can escape reduction effects from mass admixture of second language speakers. Retrieved from https://osf.io/mt98r
Watson, S. K., Burkart, J. M., Schapiro, S. J., Lambeth, S. P., Mueller, J. L., & Townsend, S. W. (2020). Nonadjacent dependency processing in monkeys, apes, and humans. Retrieved from https://osf.io/4m3gv/
Shorland, G., Genty, E., Guéry, J.-P., & Zuberbühler, K. (2020). Investigating self-recognition in bonobos: mirror exposure reduces looking time to self but not unfamiliar conspecifics. https://doi.org/10.6084/m9.figshare.12608786.v1
Shimizu, K. K., Copetti, D., Okada, M., Wicker, T., Tameshige, T., Hatakeyama, M., … Handa, H. (2020). De Novo Genome Assembly of the Japanese Wheat Cultivar Norin 61 Highlights Functional Variation in Flowering Time and Fusarium Resistance Genes in East Asian Genotypes. Retrieved from IPK, Germany, https://wheat.ipk-gatersleben.de/ (last accessed on Dec. 15, 2020), and BLAST server at the National Institute of Genetics, Japan, https://shigen.nig.ac.jp/wheat/komugi/about/norin61GenomeSequence.jsp (last accessed on Dec. 15, 2020). The gene annotation of the Fhb1 locus can be downloaded from https://de.cyverse.org/dl/d/6A85909D-942B-4C95-AEB8-7B5516680878/Fhb1_N61_340kbregion.tar.gz.
Watson, S. K., Heesen, R., Hedwig, D., Robbins, M. M., & Townsend, S. W. (2020). An exploration of Menzerath’s law in wild mountain gorilla vocal sequences. Retrieved from https://osf.io/5u3jf/
Sokoliuk, R., Degano, G., Banellis, L., Melloni, L., Hayton, T., Sturman, S., … Cruse, D. (2020). Covert Speech Comprehension Predicts Recovery From Acute Unresponsive States. Retrieved from https://osf.io/wu2vy/
Marchesotti, S., Nicolle, J., Merlet, I., Arnal, L. H., Donoghue, J. P., & Giraud, A.-L. (2020). Selective enhancement of low-gamma activity by tACS improves phonemic processing and reading accuracy in dyslexia. Retrieved from https://osf.io/6j49q/
Kurthen, I., Meyer, M., Schlesewsky, M., & Bornkessel-Schlesewsky, I. (2020). Individual differences in peripheral hearing and cognition reveal sentence processing differences in healthy older adults. Retrieved from https://osf.io/9qx8h/
Hovsepyan, S., Olasagasti, I., & Giraud, A.-L. (2020). Combining predictive coding and neural oscillations enables online syllable recognition in natural speech. Retrieved from https://github.com/sevadah/precoss/tree/master/data_construction
Collier, K., Radford, A. N., Stoll, S., Watson, S. K., Manser, M. B., Bickel, B., & Townsend, S. W. (2020). Dwarf mongoose alarm calls: investigating a complex non-human animal call. https://doi.org/10.6084/m9.figshare.c.5124519
Bangerter, A., Mayor, E., & Knutsen, D. (2020). Lexical entrainment without conceptual pacts? Revisiting the matching task. Retrieved from https://osf.io/a4m7k/
Andrieu, J., Penny, S. G., Bouchet, H., Malaivijitnond, S., Reichard, U. H., & Zuberbühler, K. (2020). White-handed gibbons discriminate context-specific song compositions. https://doi.org/10.6084/m9.figshare.12363050.v1