Text Encoding Initiative
   HOME



picture info

Text Encoding Initiative
The Text Encoding Initiative (TEI) is a text-centric community of practice in the academic field of digital humanities, operating continuously since the 1980s. The community currently runs a mailing list, meetings and conference series, and maintains the TEI technical standard, a journal, a wiki, a GitHub repository and a toolchain. TEI guidelines The ''TEI Guidelines'' collectively define a type of XML format, and are the defining output of the community of practice. The format differs from other well-known open formats for text (such as HTML and OpenDocument) in that it is primarily semantic rather than presentational: the semantics and interpretation of every tag and attribute are specified. There are some 500 different textual components and concepts: , , , , , etc. Each is grounded in one or more academic disciplines and examples are given. Technical details The standard is split into two parts, a discursive textual description with extended examples and discussion ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




TEI Logo 2024
Tei may refer to: * Sandister Tei, Ghanaian journalist and Wikimedian * Tei (singer) (born 1983), Korean pop singer * Tei, Bucharest, a neighborhood in Bucharest, Romania * Tellurium monoiodide, a chemical compound with the formula TeI * Towa Tei, an artist, record producer and DJ (b. 1965) TEI may refer to: * Tarlac Electric * Text Encoding Initiative, a consortium that develops standards for digital texts * Technological Educational Institute, a type of tertiary education institute in Greece * Terminal Endpoint Identifier (TEI), together with Service Access Point Identifier (SAPI) the unique identification of terminal equipment in the ISDN DSS1 standard * Tertiary Education Institution in New Zealand * Tezu Airport in India (IATA code) * Total Economic Impact, a financial metric (see Application Portfolio Management) * Trans Earth Injection, the procedure that a spacecraft performs to get back into the Earth's gravitational influence * Triethylindium, a chemical compound * Tus ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Schematron
Schematron is a rule-based validation language for making assertions about the presence or absence of patterns in XML trees. It is a structural schema language expressed in XML using a small number of elements and XPath languages. In many implementations, Schematron XML is processed into XSLT code for deployment anywhere that XSLT can be used. Schematron is capable of expressing constraints in ways that other XML schema languages like XML Schema and DTD cannot. For example, it can require that the content of an element be controlled by one of its siblings. Or it can request or require that the root element, regardless of what element that is, must have specific attributes. Schematron can also specify required relationships between multiple XML files. Constraints and content rules may be associated with "plain-English" (or any language) validation error messages, allowing translation of numeric Schematron error codes into meaningful user error messages. Users of Schematron define ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Greek Language
Greek (, ; , ) is an Indo-European languages, Indo-European language, constituting an independent Hellenic languages, Hellenic branch within the Indo-European language family. It is native to Greece, Cyprus, Italy (in Calabria and Salento), southern Albania, and other regions of the Balkans, Caucasus, the Black Sea coast, Asia Minor, and the Eastern Mediterranean. It has the list of languages by first written accounts, longest documented history of any Indo-European language, spanning at least 3,400 years of written records. Its writing system is the Greek alphabet, which has been used for approximately 2,800 years; previously, Greek was recorded in writing systems such as Linear B and the Cypriot syllabary. The Greek language holds a very important place in the history of the Western world. Beginning with the epics of Homer, ancient Greek literature includes many works of lasting importance in the European canon. Greek is also the language in which many of the foundational texts ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Perseus Project
The Perseus Digital Library, formerly known as the Perseus Project, is a free-access digital library founded by Gregory Crane in 1987 and hosted by the Department of Classical Studies of Tufts University. One of the pioneers of digital libraries, its self-proclaimed mission is to make the full record of humanity available to everyone. While originally focused on the ancient Greco -Roman world, it has since diversified and offers materials in Arabic, Germanic, English Renaissance literature, 19th century American documents and Italian poetry in Latin, and has sprouted several child projects and international cooperation. The current version, Perseus 4.0, is also known as the Perseus Hopper, and is mirrored by the University of Chicago. Purpose The Perseus Digital Library was created to provide access to materials of the history of humanity to everyone, with Gregory Crane, the editor-in-chief of the library, stating that "access to the cultural heritage of humanity is a righ ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Language
Language is a structured system of communication that consists of grammar and vocabulary. It is the primary means by which humans convey meaning, both in spoken and signed language, signed forms, and may also be conveyed through writing system, writing. Human language is characterized by its cultural and historical diversity, with significant variations observed between cultures and across time. Human languages possess the properties of Productivity (linguistics), productivity and Displacement (linguistics), displacement, which enable the creation of an infinite number of sentences, and the ability to refer to objects, events, and ideas that are not immediately present in the discourse. The use of human language relies on social convention and is acquired through learning. Estimates of the number of human languages in the world vary between and . Precise estimates depend on an arbitrary distinction (dichotomy) established between languages and dialects. Natural languages are ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Oxford Text Archive
Oxford Text Archive (OTA) is an archive of electronic texts and other literary and language resources at the University of Oxford, England which have been created, collected and distributed for the purpose of research into literary and linguistic topics. The OTA collection consists of deposits of primary-source academic electronic editions and linguistic corpora created by the academic community. The OTA was one of founding centres in the European research infrastructure ( CLARIN, the Common Language Resources and Technology Infrastructure). The OTA collection contains many scholarly documents marked up according to the guidelines of the Text Encoding Initiative, including copies of all of the Eighteenth Century Collections Online (ECCO) and Early English Books Online (EEBO) texts which are now in the public domain, linked data. The OTA also manages the distribution of the British National Corpus (BNC). History The OTA was founded by Lou Burnard and Susan Hockey of Oxford Un ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

British National Corpus
The British National Corpus (BNC) is a 100-million-word text corpus of samples of written and spoken English from a wide range of sources. The corpus covers British English of the late 20th century from a wide variety of genres, with the intention that it be a representative sample of spoken and written British English of that time. It is used in corpus linguistics for analysis of corpora. History The project to create the BNC involved the collaboration of three publishers (with the Oxford University Press as the lead collaborator, Longman and W. & R. Chambers), two universities (the University of Oxford and Lancaster University), and the British Library. The creation of the BNC started in 1991 under the management of the BNC consortium, and the project was finished by 1994. There have been no additions of new samples after 1994, but the BNC underwent slight revisions before the release of the second edition BNC World (2001) and the third edition BNC XML Edition (2007).
[...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

EpiDoc
EpiDoc is an international community that produces guidelines and tools for encoding in TEI XML scholarly and educational editions of ancient documents, especially inscriptions and papyri. The EpiDoc Guidelines were originally proposed as a recommendation for Greek and Latin epigraphy in 2000 by scholars at the University of North Carolina at Chapel Hill: Tom Elliott, the former director of the Ancient World Mapping Center, with Hugh Cayless and Amy Hawkins. The guidelines have since matured considerably through extensive discussion on the community mailing list (Markup) and other discussion fora, at several conferences, and through the experience of various pilot projects. The first major epigraphic projects to adopt and pilot the EpiDoc recommendations were the Inscriptions of Aphrodisias and Vindolanda Tablets Online in 2002–4, and the guidelines reached a degree of stability for the first time in that period. EpiDoc has since been adopted as the native format for the Gree ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  




Internationalization Tag Set
The Internationalization Tag Set (ITS) is a set of attributes and elements designed to provide internationalization and localization support in XML documents. The ITS specification identifies concepts (called "ITS data categories") which are important for internationalization and localization. It also defines implementation of these concepts through a set of elements and attributes grouped in the ITS namespace. XML developers can use this namespace to integrate internationalization features directly into their own XML schemas and documents. Overview ITS v1.0 includes seven data categories: * Translate: Defines what parts of a document are translatable or not. * Localization Note: Provides alerts, hints, instructions, or other information to help the localizers or the translators. * Terminology: Indicates which parts of the documents are terms and optionally points to information about these terms. * Directionality: Indicates what type of display directionality should be applied t ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

EPub
EPUB is an e-book file format that uses the ".epub" file extension. The term is short for ''electronic publication'' and is sometimes stylized as ''ePUB''. EPUB is supported by many e-readers, and compatible software is available for most smartphones, tablets, and computers. EPUB is a technical standard published by the International Digital Publishing Forum (IDPF). It became an official standard of the IDPF in September 2007, superseding the older Open eBook (OEB) standard. The Book Industry Study Group endorses EPUB 3 as the format of choice for packaging content and has stated that the global book publishing industry should rally around a single standard. Technically, a file in the EPUB format is a ZIP (file format), ZIP archive file consisting of XHTML files carrying the content, along with images and other supporting files. EPUB is the most widely supported vendor-independent XML-based e-book format; it is supported by almost all hardware readers and many software readers a ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Internationalization And Localization
In computing, internationalization and localization (American English, American) or internationalisation and localisation (British English, British), often abbreviated i18n and l10n respectively, are means of adapting to different languages, regional peculiarities and technical requirements of a target locale (computer software), locale. Internationalization is the process of designing a software application so that it can be adapted to various languages and regions without engineering changes. Localization is the process of adapting internationalized software for a specific region or language by translating text and adding locale-specific components. Localization (which is potentially performed multiple times, for different locales) uses the infrastructure or flexibility provided by internationalization (which is ideally performed only once before localization, or as an integral part of ongoing development). Naming The terms are frequently abbreviated to the numeronyms ''i18n ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


XML Schema
An XML schema is a description of a type of XML document, typically expressed in terms of constraints on the structure and content of documents of that type, above and beyond the basic syntactical constraints imposed by XML itself. These constraints are generally expressed using some combination of grammatical rules governing the order of elements, Boolean predicates that the content must satisfy, data types governing the content of elements and attributes, and more specialized rules such as uniqueness and referential integrity constraints. There are languages developed specifically to express XML schemas. The document type definition (DTD) language, which is native to the XML specification, is a schema language that is of relatively limited capability, but that also has other uses in XML aside from the expression of schemas. Two more expressive XML schema languages in widespread use are XML Schema (with a capital ''S'') and RELAX NG. The mechanism for associating an XML do ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]