Abstract
In this paper we present Kratylos, at www.kratylos.org/, a web application that creates searchable multimedia corpora from data collections in diverse formats, including collections of interlinearized glossed text (IGT) and dictionaries. There exists a crucial lacuna in the electronic ecology that supports language documentation and linguistic research: vast amounts of IGT are produced in stand-alone programs without an easy way to share them publicly as dynamic databases. Solving this problem will not only unlock an enormous amount of linguistic information that can be shared easily across the web, but will also improve accountability by allowing us to verify analyses across collections of primary data. We argue for a two-pronged approach to sharing language documentation, which involves a popular interface and a specialist interface. Finally, we briefly introduce the potential of regular expression queries for syntactic research.
Original language | English |
---|---|
Pages (from-to) | 124-146 |
Number of pages | 23 |
Journal | Language Documentation and Conservation |
Volume | 12 |
State | Published - 2018 |
Bibliographical note
Publisher Copyright: © 2018 University of Hawaii Press.
ASJC Scopus subject areas
- Linguistics and Language
- Computer Science Applications
- Library and Information Sciences