Ir directamente a la navegación principal Ir directamente a la búsqueda Ir directamente al contenido principal

A deep learning pipeline for the palaeographical dating of ancient Greek papyrus fragments

  • Graham West
  • , Matthew I. Swindall
  • , James H. Brusuelas
  • , Francesca Maltomini
  • , Marius Gerhardt
  • , Marzia D’Angelo
  • , John F. Wallin

Producción científica: Conference contributionrevisión exhaustiva

Resumen

In this paper we present a deep learning pipeline for automatically dating ancient Greek papyrus fragments based solely on fragment images. The overall pipeline consists of several stages, including handwritten text recognition (HTR) to detect and classify characters, filtering and grouping of detected characters, 24 character-level date prediction models, and a fragment-level date prediction model that utilizes the per-character predictions. A new dataset (containing approximately 7,000 fragment images and 778,000 character images) was created by scraping papyrus databases, extracting fragment images with known dates, and running them through our HTR models to obtain labeled character images. Transfer learning was then used to fine-tune separate ResNets to predict dates for individual characters which are then used, in aggregate, to train the fragment-level date prediction model. Experiments show that even though the average accuracies of character-level dating models is low, between 35%-45%, the fragment-level model can achieve up to 79% accuracy in predicting a broad, two-century date range for fragments with many characters. We then discuss the limitations of this approach and outline future work to improve temporal resolution and further testing on additional papyri. This image-based deep learning approach has great potential to assist scholars in the palaeographical analysis and dating of ancient Greek manuscripts.

Idioma originalEnglish
Título de la publicación alojadaML4AL 2024 - 1st Workshop on Machine Learning for Ancient Languages, Proceedings of the Workshop
EditoresJohn Pavlopoulos, Thea Sommerschield, Yannis Assael, Shai Gordin, Kyunghyun Cho, Marco Passarotti, Rachele Sprugnoli, Yudong Liu, Bin Li, Adam Anderson
Páginas177-185
Número de páginas9
ISBN (versión digital)9798891761445
EstadoPublished - 2024
Evento1st Workshop on Machine Learning for Ancient Languages, ML4AL 2024 - Hybrid, Bangkok, Thailand
Duración: ago 15 2024 → …

Serie de la publicación

NombreML4AL 2024 - 1st Workshop on Machine Learning for Ancient Languages, Proceedings of the Workshop

Conference

Conference1st Workshop on Machine Learning for Ancient Languages, ML4AL 2024
País/TerritorioThailand
CiudadHybrid, Bangkok
Período8/15/24 → …

Nota bibliográfica

Publisher Copyright:
© 2024 Association for Computational Linguistics.

ASJC Scopus subject areas

  • Sociology and Political Science
  • Human-Computer Interaction

Huella

Profundice en los temas de investigación de 'A deep learning pipeline for the palaeographical dating of ancient Greek papyrus fragments'. En conjunto forman una huella única.

Citar esto