Common Voice
Common Voice is a crowdsourcing project started by Mozilla to create a free and open speech corpus. The project is supported by volunteers who record sample sentences with a microphone and review recordings of other users. The transcribed sentences are collected in a voice database available under the public domain license CC0. This license ensures that developers can use the database for voice-to-text and text-to-voice applications without restrictions or costs. Aims Common Voice aims to provide diverse voice samples. According to Mozilla's Katharina Borchert, many existing projects took datasets from public radio or otherwise had datasets that underrepresented both women and people with pronounced accents. Voice database The first dataset was released in November 2017. More than 20,000 users worldwide had recorded 500 hours of English sentences. In February 2019, the first batch of languages was released for use. This included 18 languages: English, French, German and Ma ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Mozilla Foundation
The Mozilla Foundation is an American non-profit organization that exists to support and collectively lead the Open-source software, open source Mozilla project. Founded in July 2003, the organization sets the policies that govern development, operates critical infrastructure, and controls Mozilla trademarks and copyrights. It owns two taxable subsidiaries: the Mozilla Corporation, which employs many Mozilla developers and coordinates releases of the Mozilla Firefox web browser, and MZLA Technologies Corporation, which employs developers to work on the Mozilla Thunderbird email client and coordinate its releases. The Mozilla Foundation was founded by the Netscape-affiliated Mozilla Organization. The organization is currently based in the Silicon Valley city of Mountain View, California, United States. The Mozilla Foundation describes itself as "a non-profit organization that promotes openness, innovation and participation on the Internet." The Mozilla Foundation is guided by the ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
VentureBeat
''VentureBeat'' is an American technology website headquartered in San Francisco, California. ''VentureBeat'' is a tech news source that publishes news, analysis, long-form features, interviews, and videos. The ''VentureBeat'' company was founded in 2006 by Matt Marshall, an ex-correspondent for ''The Mercury News ''The Mercury News'' (formerly ''San Jose Mercury News'', often locally known as ''The Merc'') is a morning daily newspaper published in San Jose, California, in the San Francisco Bay Area. It is published by the Bay Area News Group, a subsidia ...''. History In March 2009, ''VentureBeat'' signed a partnership agreement with IDG to produce DEMO Conference, a conference for startups to announce their launches and raise funding from venture capitalists and angel investors. The partnership with IDG ended in 2012. In September 2009, Matt Marshall took on the role of executive producer for the DEMO conference. Over the years, a variety of companies have launched ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Belarusian Language
Belarusian (, ) is an East Slavic languages, East Slavic language. It is one of the two Languages of Belarus, official languages in Belarus, the other being Russian language, Russian. It is also spoken in parts of Russia, Lithuania, Latvia, Poland, Ukraine, and the United States by the Belarusian diaspora. Before Belarus Dissolution of the Soviet Union, gained independence in 1991, the language was known in English language, English as ''Byelorussian'' or ''Belorussian'', or alternatively as ''White Russian''. Following independence, it became known as ''Belarusian'', or alternatively as ''Belarusan''. As one of the East Slavic languages, Belarusian shares many grammatical and lexical features with other members of the group. To some extent, Russian, Ukrainian language, Ukrainian, and Belarusian retain a degree of mutual intelligibility. Belarusian descends from a language generally referred to as Ruthenian language, Ruthenian (13th to 18th centuries), which had, in turn, descend ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Euskara
Basque ( ; ) is a language spoken by Basques and other residents of the Basque Country, a region that straddles the westernmost Pyrenees in adjacent parts of northern Spain and southwestern France. Basque is classified as a language isolate (unrelated to any other known languages), the only one in Europe. The Basques are indigenous to and primarily inhabit the Basque Country. The Basque language is spoken by 806,000 Basques in all territories. Of them, 93.7% (756,000) are in the Spanish area of the Basque Country and the remaining 6.3% (50,000) are in the French portion. Native speakers live in a contiguous area that includes parts of four Spanish provinces and the three "ancient provinces" in France. Gipuzkoa, most of Biscay, a few municipalities on the northern border of Álava and the northern area of Navarre formed the core of the remaining Basque-speaking area before measures were introduced in the 1980s to strengthen Basque fluency. By contrast, most of Álava, th ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Basaa
Basaa (also spelled ''Bassa, Basa, Bissa''), or Mbene, is a Bantu language spoken in Cameroon by the Basaa people. It is spoken by about 300,000 people in the Centre and Littoral regions. Maho (2009) lists North and South Kogo as dialects. Background and Origin Basaa is spoken by 230,000 speakers. They live in Nyong-et-Kelle (Central Region) and Sanaga Maritime (with the exception of the Edéa commune, which has a Bakoko majority) and most of Nkam commune ( Littoral Region). In the western and northern parts of this department, the peripheral Basaa dialects are spoken: Yabasi in the commune of Yabassi, Diɓuum in the commune of Nkondjok (Diboum Canton), north of Ndemli and Dimbamban. Similarly, Basaa Baduala is spoken in Wouri Department (Littoral Region), traditional Basaa territory that is being transformed by the growth of Douala. Basaa is also found in Océan Department (commune of Bipindi, Southern Region). Hijuk is spoken only in the quarter of Niki in Bata ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Bashkir Language
Bashkir ( , ) or Bashkort (, ) is a Turkic languages, Turkic language belonging to the Kipchak languages, Kipchak branch. It is official language#Political alternatives, co-official with Russian language, Russian in Bashkortostan. Bashkir has approximately 750,000 native speakers. It has two dialect groups: Southern and Eastern. Bashkir has native speakers in Russia, as well as in Ukraine, Belarus, Kazakhstan, Uzbekistan, Estonia and other neighboring post-Soviet states, and among the Bashkirs, Bashkir diaspora. Speakers Speakers of Bashkir mostly live in the republic of Bashkortostan (a republic within the Russian Federation). Many speakers also live in Tatarstan, Chelyabinsk Oblast, Chelyabinsk, Orenburg Oblast, Orenburg, Tyumen Oblast, Tyumen, Sverdlovsk Oblast, Sverdlovsk and Kurgan Oblasts and other regions of Russia. Minor Bashkir groups also live in Kazakhstan and the United States. In a recent local media report in Bashkortostan, it was reported that some officials of t ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Asturian Language
Asturian (; )Art. 1 de lLey 1/1998, de 23 de marzo, de uso y promoción del bable/asturiano [Law 1/93, of March 23, on the Use and Promotion of the Asturian Language/nowiki>] is a West Iberian languages, West Iberian Romance languages, Romance language spoken in the Principality of Asturias, Spain. Asturian is part of a wider linguistic group, the Asturleonese languages. The number of speakers is estimated at 100,000 (native) and 450,000 (second language). The dialects of the Astur-Leonese language family are traditionally classified in three groups: Western, Central, and Eastern. For historical and demographic reasons, the Standard language, standard is based on #Dialects, Central Asturian. Asturian has a distinct grammar, dictionary, and orthography. It is regulated by the Academy of the Asturian Language. Although it is not an official language of Spain, it is protected under the Statute of Autonomy of Asturias and is an elective language in schools. For much of its history ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Assamese Language
Assamese () or Asamiya ( ) is an Indo-Aryan language spoken mainly in the north-eastern Indian state of Assam, where it is an official language. It has long served as a ''lingua franca'' in parts of Northeast India."Axomiya is the major language spoken in Assam, and serves almost as a lingua franca among the different speech communities in the whole area." It has over 15 million native speakers and 8.3 million second language, second language speakers according to ''Ethnologue''. Nefamese, an Assamese-based pidgin in Arunachal Pradesh, was used as a lingua franca till it was replaced by Hindi language, Hindi; and Nagamese Creole, Nagamese, an Assamese-based Creole language, continues to be widely used in Nagaland. The Kamtapuri language of Rangpur division of Bangladesh and the Cooch Behar district, Cooch Behar and Jalpaiguri district, Jalpaiguri districts of India is linguistically closer to Assamese, though the speakers identify with the Bengali culture and the literary lan ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Armenian Language
Armenian (endonym: , , ) is an Indo-European languages, Indo-European language and the sole member of the independent branch of the Armenian language family. It is the native language of the Armenians, Armenian people and the official language of Armenia. Historically spoken in the Armenian highlands, today Armenian is also widely spoken throughout the Armenian diaspora. Armenian is written in its own writing system, the Armenian alphabet, introduced in 405 AD by Saint Mesrop Mashtots. The estimated number of Armenian speakers worldwide is between five and seven million. History Classification and origins Armenian is an independent branch of the Indo-European languages. It is of interest to linguists for its distinctive phonological changes within that family. Armenian exhibits Centum and satem languages, more satemization than centumization, although it is not classified as belonging to either of these subgroups. Some linguists tentatively conclude that Armenian, Greek ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Arabic Language
Arabic (, , or , ) is a Central Semitic languages, Central Semitic language of the Afroasiatic languages, Afroasiatic language family spoken primarily in the Arab world. The International Organization for Standardization (ISO) assigns language codes to 32 varieties of Arabic, including its standard form of Literary Arabic, known as Modern Standard Arabic, which is derived from Classical Arabic. This distinction exists primarily among Western linguists; Arabic speakers themselves generally do not distinguish between Modern Standard Arabic and Classical Arabic, but rather refer to both as ( "the eloquent Arabic") or simply ' (). Arabic is the List of languages by the number of countries in which they are recognized as an official language, third most widespread official language after English and French, one of six official languages of the United Nations, and the Sacred language, liturgical language of Islam. Arabic is widely taught in schools and universities around the wo ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Abkhaz Language
Abkhaz, also known as Abkhazian, is a Northwest Caucasian languages, Northwest Caucasian language most closely related to Abaza language, Abaza. It is spoken mostly by the Abkhazians, Abkhaz people. It is one of the official languages of Abkhazia, where around 190,000 people speak it. Furthermore, it is spoken by thousands of members of the Abkhazian diaspora in Turkey, Georgia (country), Georgia's autonomous republic of Adjara, Syria, Jordan, and several Western countries. 27 October is the day of the Abkhazian language in Georgia (country), Georgia. Classification Abkhaz is a Northwest Caucasian languages, Northwest Caucasian language and is thus related to Adyghe language, Adyghe. The language of Abkhaz is especially close to Abaza language, Abaza, and they are sometimes considered dialects of the same language,''B. G. Hewitt Abkhaz 1979;'' page 1. Abazgi, of which the literary dialects of Abkhaz and Abaza are simply two ends of a dialect continuum. Grammatically, the two ar ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |
|
Twi Language
Twi (; ) is the common name of the Akan literary language of Asante and Akuapem. Effectively, it is a synonym for 'Akan' that is not used by the Fante people. It is not a linguistic grouping, but more of a common name used by inland Akans as Akuapem Twi is more closely related to the Fante dialects than it is to Asante Twi. Aside from the Fante, other Akan groups such as the Nzema, Ahanta, Chakossi, Sefwi, and Baoulé, all classified under the Akan Bia languages, do not use Twi as the name of their languages. Twi generally subsumes the following Akan tongues: Ahafo, Akuapem, Akyem, Asante, Assin, Bono, Denkyira and Kwawu, which have about 4.4 million speakers in southern and central Ghana Ghana, officially the Republic of Ghana, is a country in West Africa. It is situated along the Gulf of Guinea and the Atlantic Ocean to the south, and shares borders with Côte d’Ivoire to the west, Burkina Faso to the north, and Togo to t .... References Exter ... [...More Info...]       [...Related Items...]     OR:     [Wikipedia]   [Google]   [Baidu]   |