Kratylos: A tool for sharing interlinearized and lexical data in diverse formats

Daniel Kaufman, Raphael Finkel

Research output: Contribution to journal, Article, peer-review

4 Scopus citations

Abstract

In this paper we present Kratylos, at www.kratylos.org/, a web application that creates searchable multimedia corpora from data collections in diverse formats, including collections of interlinearized glossed text (IGT) and dictionaries. There exists a crucial lacuna in the electronic ecology that supports language documentation and linguistic research. Vast amounts of IGT are produced in stand-alone programs without an easy way to share them publicly as dynamic databases. Solving this problem will not only unlock an enormous amount of linguistic information that can be shared easily across the web, it will also improve accountability by allowing us to verify analyses across collections of primary data. We argue for a two-pronged approach to sharing language documentation, which involves a popular interface and a specialist interface. Finally, we briefly introduce the potential of regular expression queries for syntactic research.

Original language: English
Pages (from-to): 124-146
Number of pages: 23
Journal: Language Documentation and Conservation
Volume: 12
State: Published - 2018

Bibliographical note

Publisher Copyright:
© 2018 University of Hawaii Press.

ASJC Scopus subject areas

  • Linguistics and Language
  • Computer Science Applications
  • Library and Information Sciences
