EXMARaLDA
   HOME

TheInfoList



OR:

EXMARaLDA (Extensible Markup Language for Discourse Annotation) is a set of free software tools for creating, managing and analyzing spoken language corpora. It consists of a transcription tool (comparable to tools like
Praat Praat ( , ; ) is a free, open-source computer software package widely used for speech analysis and synthesis in phonetics and other fields of linguistics. It was designed and continues to be developed by Paul Boersma and David Weenink at the ...
or
Transcriber Transcriber is an open-source software tool for the transcription and annotation of speech signals for linguistic research. It supports multiple hierarchical layers of segmentation, named entity annotation, speaker lists, topic lists, and over ...
), a tool for administering corpus meta data and a tool for doing queries ( KWIC searches) on spoken language corpora. EXMARaLDA is used for doing
conversation Conversation is interactive communication between two or more people. The development of conversational skills and etiquette is an important part of socialization. The development of conversational skills in a new language is a frequent focus ...
and
discourse analysis Discourse analysis (DA), or discourse studies, is an approach to the analysis of written, spoken, or sign language, including any significant semiotic event. The objects of discourse analysis (discourse, writing, conversation, communicative sy ...
,
dialectology Dialectology (from Ancient Greek, Greek , ''dialektos'', "talk, dialect"; and , ''-logy, -logia'') is the scientific study of dialects: subsets of languages. Though in the 19th century a branch of historical linguistics, dialectology is often now c ...
,
phonology Phonology (formerly also phonemics or phonematics: "phonemics ''n.'' 'obsolescent''1. Any procedure for identifying the phonemes of a language from a corpus of data. 2. (formerly also phonematics) A former synonym for phonology, often pre ...
and research into first and second
language acquisition Language acquisition is the process by which humans acquire the capacity to perceive and comprehend language. In other words, it is how human beings gain the ability to be aware of language, to understand it, and to produce and use words and s ...
in children and adults. EXMARaLDA is based on the open standards
XML Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing data. It defines a set of rules for encoding electronic document, documents in a format that is both human-readable and Machine-r ...
and
Unicode Unicode or ''The Unicode Standard'' or TUS is a character encoding standard maintained by the Unicode Consortium designed to support the use of text in all of the world's writing systems that can be digitized. Version 16.0 defines 154,998 Char ...
and programmed in
Java Java is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea (a part of Pacific Ocean) to the north. With a population of 156.9 million people (including Madura) in mid 2024, proje ...
.


References

* Schmidt, Thomas and Wörner, Kai (2014). "EXMARaLDA" In: ''Handbook on Corpus Phonology''. Oxford University Press, 402-419. * Schmidt, Thomas and Wörner, Kai (2009). "EXMARaLDA – Creating, analysing and sharing spoken language corpora for pragmatic research." In: ''Pragmatics 19''. * Schmidt, Thomas and Bennöhr, Jasmine (2008). "Rescuing Legacy Data." In: ''Language Documentation and Conservation 2'', 109–129.


External links


exmaralda.org
- Official project website
std.metu.edu.tr
- Website of the METU Spoken Turkish Corpus, a corpus constructed with EXMARaLDA {{DEFAULTSORT:Exmaralda Free science software Phonetics Phonology Free audio software Linguistic research software