Pythia is an ancient text restoration model that recovers missing characters from a damaged text input using deep neural networks. It was created by
Yannis Assael,
Thea Sommerschield, and
Jonathan Prag, researchers from
Google DeepMind and the
University of Oxford
The University of Oxford is a collegiate university, collegiate research university in Oxford, England. There is evidence of teaching as early as 1096, making it the oldest university in the English-speaking world and the List of oldest un ...
.
To study the society and the history of ancient civilisations,
ancient history
Ancient history is a time period from the History of writing, beginning of writing and recorded human history through late antiquity. The span of recorded history is roughly 5,000 years, beginning with the development of Sumerian language, ...
relies on disciplines such as
epigraphy
Epigraphy () is the study of inscriptions, or epigraphs, as writing; it is the science of identifying graphemes, clarifying their meanings, classifying their uses according to dates and cultural contexts, and drawing conclusions about the wr ...
, the study of ancient inscribed texts. Hundreds of thousands of these texts, known as
inscriptions, have survived to our day, but are often damaged over the centuries. Illegible parts of the text must then be restored by specialists, called
epigraphists, in order to extract meaningful information from the text and use it to expand our knowledge of the context in which the text was written. Pythia takes as input the damaged text, and is trained to return hypothesised restorations of ancient Greek inscriptions, working as an assistive aid for ancient historians. Its
neural network
A neural network is a group of interconnected units called neurons that send signals to one another. Neurons can be either biological cells or signal pathways. While individual neurons are simple, many of them together in a network can perfor ...
architecture works at both the character- and word-level, thereby effectively handling long-term context information, and dealing efficiently with incomplete word representations. Pythia is applicable to any discipline dealing with ancient texts (
philology
Philology () is the study of language in Oral tradition, oral and writing, written historical sources. It is the intersection of textual criticism, literary criticism, history, and linguistics with strong ties to etymology. Philology is also de ...
,
papyrology,
codicology
Codicology (; from French ''codicologie;'' from Latin , genitive , "notebook, book" and Greek , ''-logia'') is the study of codices or manuscript books. It is often referred to as "the archaeology of the book," a term coined by François Masai. ...
) and can work in any language (ancient or modern).
References
{{reflist
Machine learning
Digital humanities projects
Digital humanities
Epigraphy
Ancient history