ACQDIV Database
The ACQDIV Database brings together 17 corpora of first language acquisition, representing 15 maximally diverse languages, in a formally and semantically standardized format. It contains video and audio recordings, transcribed speech, and linguistic annotations from these corpora.
Learn more about the ACQDIV Database at https://evolvinglanguage.ch/databases-acqdiv/.
ATLAS Database
The Areal Typology of Languages of the Americas (ATLAs) is a database of typological features targeting areal linguistic structures in North and South America, collected from language descriptions by a team of 21 authors.
Visit the web service to view the database at https://atlas.evolvinglanguage.ch/.
CallMark
CallMark is a custom web application for annotating vocalizations. CallMark combines a unique set of features that we consider important to optimize the precision of vocal annotations and to reduce their existing biases between species and research fields. CallMark facilitates data standardization in VoCallBase.
View our latest development at https://annotation.evolvinglanguage.ch/ and learn more about CallMark at https://evolvinglanguage.ch/vocallbase-callmark.
iSpeechCont
iSpeechCont is database of week-long synchronized intracranial EEG and speech datasets from patients with epilepsy recorded at the Geneva University Hospital. The database contains the recording during naturalistic periods (i.e. in the absence of any experimental task) when the patients are talking to the medical staff, to family members or other people, watching TV or listening to the radio, or just resting. Currently, it contains 16 datasets, each corresponding to a different patient. Recordings include neural data from 100-150 recording contacts per patient, as well as transcribed and aligned speech with different language (French, German, and Portuguese, depending on the patient).
Learn more about iSpeechCont at https://evolvinglanguage.ch/databases-ispeechcont.
Library of Tasks
The Library of Tasks is a centralized repository developed by the TTF-DDG of the NCCR. This platform enables researchers across different work packages to share experimental tasks and paradigms, reuse validated tasks across studies, customize existing tasks for specific research needs, and discover tasks by domain, language, platform, or format.
Learn more about Library of Tasks at https://library.nccr-ttf-ddg.ch/.
Maximum Diversity Sampler
The Maximum Diversity Sampler address the challenge of disentangling particular variables from universal variables. The sampler systematically samples for maximum diversity, and the web service demonstrates its application to a few newly developed databases of linguistics and human ecology.
View our latest development at https://max-div-sampler.evolvinglanguage.ch/.
NEBULA101
NEBULA101 (Neuro-behavioural Understanding of Language Aptitude) is a multimodal dataset, which comprises behavioural and brain imaging data from 101 healthy adults to examine individual differences in language and cognition. The NEBULA101 dataset offers brain structural, diffusion-weighted, task-based and resting-state MRI data, alongside extensive linguistic and non-linguistic behavioural measures to explore the complex interaction of language and cognition in a highly multilingual sample.
Learn more about iSpeechCont at https://evolvinglanguage.ch/databases-nebula101.
VoCallBase
VoCallBase is a web-based platform designed to streamline research on animal communication. By standardizing data organization with a focus on Voice Activity (VA) Detection, VoCallBase empowers scientists to unlock new insights into the evolution of communication systems, signal structures, and social behaviors across thousands of species.
Learn more about VoCallBase at https://evolvinglanguage.ch/vocallbase-main.
