HOME





IETF Language Tag
An IETF BCP 47 language tag is a standardized code that is used to identify human languages on the Internet. The tag structure has been standardized by the Internet Engineering Task Force (IETF) in ''Best Current Practice (BCP) 47''; the subtags are maintained by the ''IANA Language Subtag Registry''. To distinguish language variants for countries, Administrative division, regions, or writing systems (scripts), IETF language tags combine subtags from other standards such as ISO 639, ISO 15924, ISO 3166-1 and UN M.49. For example, the tag stands for English language, English; for Spanish language in the Americas, Latin American Spanish; for Romansh Sursilvan; for Serbian language, Serbian written in Cyrillic script; for Southern Min, Min Nan Chinese using Traditional Chinese characters, traditional Han characters, as spoken in Taiwanese Hokkien, Taiwan; for Cantonese language, Cantonese using Traditional Chinese characters, traditional Han characters, as spoken in Hong Kong; ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Human Language
Language is a structured system of communication that consists of grammar and vocabulary. It is the primary means by which humans convey meaning, both in spoken and signed language, signed forms, and may also be conveyed through writing system, writing. Human language is characterized by its cultural and historical diversity, with significant variations observed between cultures and across time. Human languages possess the properties of Productivity (linguistics), productivity and Displacement (linguistics), displacement, which enable the creation of an infinite number of sentences, and the ability to refer to objects, events, and ideas that are not immediately present in the discourse. The use of human language relies on social convention and is acquired through learning. Estimates of the number of human languages in the world vary between and . Precise estimates depend on an arbitrary distinction (dichotomy) established between languages and dialects. Natural languages are ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Cantonese Language
Cantonese is the traditional prestige variety of Yue Chinese, a Sinitic languages, Sinitic language belonging to the Sino-Tibetan language family. It originated in the city of Guangzhou (formerly known as Canton) and its surrounding Pearl River Delta. While the term ''Cantonese'' specifically refers to the prestige variety, in linguistics it has often been used to refer to the entire Yue subgroup of Chinese, including related but partially mutually intelligible varieties like Taishanese. Cantonese is viewed as a vital and inseparable part of the cultural identity for its Cantonese people, native speakers across large swaths of South China, southeastern China, Hong Kong and Macau, as well as in Overseas Chinese, overseas communities. In mainland China, it is the ''lingua franca'' of the province of Guangdong (being the majority language of the Pearl River Delta) and neighbouring areas such as Guangxi. It is also the dominant and co-official language of Hong Kong and Macau. Further ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Letter Case
Letter case is the distinction between the letters that are in larger uppercase or capitals (more formally ''majuscule'') and smaller lowercase (more formally '' minuscule'') in the written representation of certain languages. The writing systems that distinguish between the upper- and lowercase have two parallel sets of letters: each in the majuscule set has a counterpart in the minuscule set. Some counterpart letters have the same shape, and differ only in size (e.g. ), but for others the shapes are different (e.g., ). The two case variants are alternative representations of the same letter: they have the same name and pronunciation and are typically treated identically when sorting in alphabetical order. Letter case is generally applied in a mixed-case fashion, with both upper and lowercase letters appearing in a given piece of text for legibility. The choice of case is often denoted by the grammar of a language or by the conventions of a particular discipline. In ortho ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


ISO 639-1
ISO 639-1:2002, ''Codes for the representation of names of languages—Part 1: Alpha-2 code'', is the first part of the ISO 639 series of international standards for language codes. Part 1 covers the registration of "set 1" two-letter codes. There are 183 two-letter codes registered as of June 2021. The registered codes cover the world's major languages. Some languages do not have the ISO 639-1 codes because the standard was initially designed to represent major and primary national languages with well-established terminologies and lexicography. The ISO 639-1 is more restrictive than other ISO 639 standards, such as ISO 639-2 as well as ISO 639-3, which cover a wider range of languages and variations. These codes are a useful international and formal shorthand for indicating languages. Many multilingual websites use these codes to prefix URLs of specific language versions of their websites, for example, "ua." before the website name is the Ukrainian version of that website. ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


ISO 639-5
ISO 639-5:2008 "Codes for the representation of names of languages—Part 5: Alpha-3 code for language families and groups" is an international standard published by the International Organization for Standardization (ISO). It was developed by ISO Technical Committee 37, Subcommittee 2, and first published on May 15, 2008. It is part of the ISO 639 series of standards. Collective codes This is a list of ISO 639-5 codes, including the code hierarchy as given in the ISO 639-5 registry. The code und (''undetermined'') from ISO 639-2 can be seen as top of the hierarchy (for example, und:aav, und:euq:eu). The hierarchy is not a complete genetic hierarchy; some of the collection codes are based on geography (like nai) or category (like crp) instead. ISO 639-5 defines ''alpha-3'' (3-letter) codes, called "collective codes", that identify language families and groups. As of the February 11, 2013, update to ISO 639-5, the standard defines 115 collective codes. The United States Library of ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


ISO 639-3
ISO 639-3:2007, ''Codes for the representation of names of languages – Part 3: Alpha-3 code for comprehensive coverage of languages'', is an international standard for language codes in the ISO 639 series. It defines three-letter codes for identifying languages. The standard was published by International Organization for Standardization (ISO) on 1 February 2007. As of 2023, this edition of the standard has been officially withdrawn and replaced by ISO 639:2023. ISO 639-3 extends the ISO 639-2 alpha-3 codes with an aim to cover all known natural languages. The extended language coverage was based primarily on the language codes used in the ''Ethnologue'' (volumes 10–14) published by SIL International, which is now the registration authority for ISO 639-3. It provides an enumeration of languages as complete as possible, including living and extinct, ancient and constructed, major and minor, written and unwritten. However, it does not include reconstructed languages su ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Mark Davis (Unicode)
Mark Edward Davis (born September 13, 1952) is an American specialist in the internationalization and localization of software and the co-founder and chief technical officer of the Unicode Consortium, previously serving as its president until 2022. He is one of the key technical contributors to the Unicode specifications, being the primary author or co-author of bidirectional text algorithms (used worldwide to display Arabic language and Hebrew language text), collation (used by sorting algorithms and search algorithms), Unicode normalization, Unicode scripts, text segmentation, identifiers, regular expressions, data compression, character encoding and security. Education Davis was educated at Stanford University where he was awarded a PhD in Philosophy in 1979. Career and research Davis has specialized in Internationalization and localization of software for many years. After his PhD, he worked in Zurich, Switzerland for several years, then returned to the US to ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


ISO 639-2
ISO 639-2:1998, ''Codes for the representation of names of languages — Part 2: Alpha-3 code'', is the second part of the ISO 639 International standard, standard, which lists Language code, codes for the representation of the names of languages. The three-letter codes given for each language in this part of the standard are referred to as "Alpha-3" codes. There are 487 entries in the list of ISO 639-2 codes. The US Library of Congress is the registration authority for ISO 639-2 (referred to as ISO 639-2/RA). As registration authority, the LOC receives and reviews proposed changes; they also have representation on the ISO 639-RA Joint Advisory Committee responsible for maintaining the ISO 639 code tables. History and relationship to other ISO 639 standards Work was begun on the ISO 639-2 standard in 1989, because the ISO 639-1 standard, which uses only two-letter codes for languages, is not able to accommodate a sufficient number of languages. The ISO 639-2 standard was first ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


Harald Tveit Alvestrand
Harald Tveit Alvestrand (born 29 June 1959) is a Norwegian computer scientist. He was chair of the Internet Engineering Task Force (IETF) from 2001 until 2005, succeeding Fred Baker. Within the IETF, Alvestrand was earlier the chair of the Areas for Applications from 1995 until 1997, and of Operations and Management in 1998. Biography Alvestrand was born in Namsos, Norway, received his education from Bergen Cathedral School and the Norwegian Institute of Technology, and has worked for Norsk Data, UNINETT, EDB Maxware, Cisco Systems, and Google. He is an author of several important Request for Comments (RFCs), many in the general area of Internationalization and localization, most notable the documents required for interoperability between SMTP and X.400. Since the start of the use of OIDs he has run a front end to the hierarchy of assignments according to X.208. At the end of 2007 Alvestrand was selected for the ICANN Board, where he remained until December 2010. In 2001 ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

Portable Network Graphics
Portable Network Graphics (PNG, officially pronounced , colloquially pronounced ) is a raster graphics, raster-graphics file graphics file format, format that supports lossless data compression. PNG was developed as an improved, non-patented replacement for Graphics Interchange Format (GIF). PNG supports palette-based images (with palettes of 24-bit RGB color model, RGB or 32-bit RGBA color space, RGBA colors), grayscale images (with or without an Alpha compositing, alpha channel for transparency), and full-color non-palette-based RGB or RGBA images. The PNG working group designed the format for transferring images on the Internet, not for professional-quality print graphics; therefore, non-RGB color spaces such as CMYK color model, CMYK are not supported. A PNG file contains a single image in an extensible structure of ''chunks'', encoding the basic pixels and other information such as textual comments and Integrity checker, integrity checks documented in Request for Comments ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]  


picture info

HTML
Hypertext Markup Language (HTML) is the standard markup language for documents designed to be displayed in a web browser. It defines the content and structure of web content. It is often assisted by technologies such as Cascading Style Sheets (CSS) and scripting languages such as JavaScript, a programming language. Web browsers receive HTML documents from a web server or from local storage and browser engine, render the documents into multimedia web pages. HTML describes the structure of a web page Semantic Web, semantically and originally included cues for its appearance. HTML elements are the building blocks of HTML pages. With HTML constructs, HTML element#Images and objects, images and other objects such as Fieldset, interactive forms may be embedded into the rendered page. HTML provides a means to create structured documents by denoting structural semantics for text such as headings, paragraphs, lists, Hyperlink, links, quotes, and other items. HTML elements are delineated ...
[...More Info...]      
[...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]