Predictive text
   HOME

TheInfoList



OR:

Predictive text is an input technology used where one key or button represents many letters, such as on the numeric keypads of
mobile phone A mobile phone, cellular phone, cell phone, cellphone, handphone, hand phone or pocket phone, sometimes shortened to simply mobile, cell, or just phone, is a portable telephone that can make and receive calls over a radio frequency link whi ...
s and in
accessibility Accessibility is the design of products, devices, services, vehicles, or environments so as to be usable by people with disabilities. The concept of accessible design and practice of accessible development ensures both "direct access" (i. ...
technologies. Each key press results in a ''prediction'' rather than repeatedly sequencing through the same group of "letters" it represents, in the same, invariable order. Predictive text could allow for an entire ''word'' to be input by single keypress. Predictive text makes efficient use of fewer device keys to input
writing Writing is a medium of human communication which involves the representation of a language through a system of physically inscribed, mechanically transferred, or digitally represented symbols. Writing systems do not themselves constitute h ...
into a
text message Text messaging, or texting, is the act of composing and sending electronic messages, typically consisting of alphabetic and numeric characters, between two or more users of mobile devices, desktops/laptops, or another type of compatible comput ...
, an
e-mail Electronic mail (email or e-mail) is a method of exchanging messages ("mail") between people using electronic devices. Email was thus conceived as the electronic (digital) version of, or counterpart to, mail, at a time when "mail" meant ...
, an
address book An address book or a name and address book is a book, or a database used for storing entries called contacts. Each contact entry usually consists of a few standard fields (for example: first name, last name, company name, address, telephone n ...
, a
calendar A calendar is a system of organizing days. This is done by giving names to periods of time, typically days, weeks, months and years. A date is the designation of a single and specific day within such a system. A calendar is also a phy ...
, and the like. The most widely used, general, predictive text systems are T9, iTap, eZiText, and LetterWise/WordWise. There are many ways to build a device that predicts text, but all predictive text systems have initial linguistic settings that offer predictions that are re-prioritized to adapt to each user. This ''
learning Learning is the process of acquiring new understanding, knowledge, behaviors, skills, values, attitudes, and preferences. The ability to learn is possessed by humans, animals, and some machines; there is also evidence for some kind of lea ...
'' adapts, by way of the device memory, to a user's ''disambiguating'' feedback that results in corrective key presses, such as pressing a "next" key to get to the intention. Most predictive text systems have a user database to facilitate this process. Theoretically the number of keystrokes required per desired character in the finished writing is, on average, comparable to using a
keyboard Keyboard may refer to: Text input * Keyboard, part of a typewriter * Computer keyboard ** Keyboard layout, the software control of computer keyboards and their mapping ** Keyboard technology, computer keyboard hardware and firmware Music * Mu ...
. This is approximately true providing that all words used are in its database, punctuation is ignored, and no input mistakes are made typing or spelling. The theoretical keystrokes per character, KSPC, of a keyboard is KSPC=1.00, and of multi-tap is KSPC=2.03. Eatoni's LetterWise is a predictive multi-tap hybrid, which when operating on a standard telephone keypad achieves KSPC=1.15 for English. The choice of which predictive text system is the best to use involves matching the user's preferred interface style, the user's level of learned ability to operate predictive text software, and the user's efficiency goal. There are various levels of risk in predictive text systems, versus
multi-tap Multi-tap (multi-press) refers to a text entry system for mobile phones. The alphabet is printed under each key (beginning on "2") in a three-letter sequence as follows; ABC under 2 key, DEF under 3 key, etc. Exceptions are the "7" key, which ...
systems, because the predicted text that is automatically written that provide the speed and mechanical efficiency benefit, could, if the user is not careful to review, result in transmitting misinformation. Predictive text systems take time to learn to use well, and so generally, a device's system has user options to set up the choice of multi-tap or of any one of several schools of predictive text methods.


Background

Short message service Short Message/Messaging Service, commonly abbreviated as SMS, is a text messaging service component of most telephone, Internet and mobile device systems. It uses standardized communication protocols that let mobile devices exchange short tex ...
(SMS) permits a
mobile phone A mobile phone, cellular phone, cell phone, cellphone, handphone, hand phone or pocket phone, sometimes shortened to simply mobile, cell, or just phone, is a portable telephone that can make and receive calls over a radio frequency link whi ...
user to send
text messages Text messaging, or texting, is the act of composing and sending electronic messages, typically consisting of alphabetic and numeric characters, between two or more users of mobile devices, desktops/ laptops, or another type of compatible compu ...
(also called messages, SMSes, texts, and txts) as a short message. The most common system of SMS text input is referred to as "
multi-tap Multi-tap (multi-press) refers to a text entry system for mobile phones. The alphabet is printed under each key (beginning on "2") in a three-letter sequence as follows; ABC under 2 key, DEF under 3 key, etc. Exceptions are the "7" key, which ...
". Using multi-tap, a key is pressed multiple times to access the list of letters on that key. For instance, pressing the "2" key once displays an "a", twice displays a "b" and three times displays a "c". To enter two successive letters that are on the same key, the user must either pause or hit a "next" button. A user can type by pressing an alphanumeric keypad without looking at the electronic equipment display. Thus, multi-tap is easy to understand, and can be used without any visual feedback. However, multi-tap is not very efficient, requiring potentially many keystrokes to enter a single letter. In ideal predictive text entry, all words used are in the dictionary, punctuation is ignored, no spelling mistakes are made, and no typing mistakes are made. The ideal dictionary would include all slang,
proper nouns A proper noun is a noun that identifies a single entity and is used to refer to that entity (''Africa'', ''Jupiter'', ''Sarah'', ''Microsoft)'' as distinguished from a common noun, which is a noun that refers to a class of entities (''continent, ...
,
abbreviations An abbreviation (from Latin ''brevis'', meaning ''short'') is a shortened form of a word or phrase, by any method. It may consist of a group of letters or words taken from the full version of the word or phrase; for example, the word ''abbrevia ...
,
URLs A Uniform Resource Locator (URL), colloquially termed as a web address, is a reference to a web resource that specifies its location on a computer network and a mechanism for retrieving it. A URL is a specific type of Uniform Resource Identi ...
, foreign-language words and other user-unique words. This ideal circumstance gives predictive text software the reduction in the number of key strokes a user is required to enter a word. The user presses the number corresponding to each letter and, as long as the word exists in the predictive text dictionary, or is correctly disambiguated by non-dictionary systems, it will appear. For instance, pressing "4663" will typically be interpreted as the word ''good'', provided that a linguistic database in English is currently in use, though alternatives such as ''home'', ''hood'' and ''hoof'' are also valid interpretations of the sequence of key strokes. The most widely used systems of predictive text are Tegic's T9, Motorola's iTap, and the Eatoni Ergonomics' LetterWise and WordWise. T9 and iTap use dictionaries, but Eatoni Ergonomics' products uses a disambiguation process, a set of statistical rules to recreate words from keystroke sequences. All predictive text systems require a linguistic database for every supported input language.


Dictionary vs. non-dictionary systems

Traditional disambiguation works by referencing a
dictionary A dictionary is a listing of lexemes from the lexicon of one or more specific languages, often arranged alphabetically (or by radical and stroke for ideographic languages), which may include information on definitions, usage, etymologie ...
of commonly used words, though Eatoni offers a dictionaryless disambiguation system. In dictionary-based systems, as the user presses the number buttons, an algorithm searches the dictionary for a list of possible words that match the keypress combination, and offers up the most probable choice. The user can then confirm the selection and move on, or use a key to cycle through the possible combinations. A non-dictionary system constructs words and other sequences of letters from the statistics of word parts. To attempt predictions of the intended result of keystrokes not yet entered, disambiguation may be combined with a word completion facility. Either system (disambiguation or predictive) may include a user database, which can be further classified as a "learning" system when words or phrases are entered into the user database without direct user intervention. The user database is for storing words or phrases which are not well disambiguated by the pre-supplied database. Some disambiguation systems further attempt to correct spelling, format text or perform other automatic rewrites, with the risky effect of either enhancing or frustrating user efforts to enter text.


History

The actuating keys of the Chinese typewriter created by Lin Yutang in the 1940s included suggestions for the characters following the one selected. In 1951, the Chinese typesetter Zhang Jiying arranged Chinese characters in associative clusters, a precursor of modern predictive text entry, and broke speed records by doing so. Predictive entry of text from a telephone keypad has been known at least since the 1970s (Smith and Goodwin, 1971). Predictive text was mainly used to look up names in directories over the phone, until mobile phone text messaging came into widespread use.


Example

On a typical phone keypad, if users wished to type ''the'' in a "multi-tap" keypad entry system, they would need to: * Press 8 (tuv) once to select ''t''. * Press 4 (ghi) twice to select ''h''. * Press 3 (def) twice to select ''e''. Meanwhile, in a phone with predictive text, they need only: * Press 8 once to select the (tuv) group for the first character. * Press 4 once to select the (ghi) group for the second character. * Press 3 once to select the (def) group for the third character. The system updates the display as each keypress is entered, to show the most probable entry. In this example, prediction reduced the number of button presses from five to three. The effect is even greater with longer words and those composed of letters later in each key's sequence. A dictionary-based predictive system is based on hope that the desired word is in the dictionary. That hope may be misplaced if the word differs in any way from common usage—in particular, if the word is not spelled or typed correctly, is slang, or is a
proper noun A proper noun is a noun that identifies a single entity and is used to refer to that entity (''Africa'', ''Jupiter'', '' Sarah'', ''Microsoft)'' as distinguished from a common noun, which is a noun that refers to a class of entities (''continent, ...
. In these cases, some other mechanism must be used to enter the word. Furthermore, the simple dictionary approach fails with
agglutinative language An agglutinative language is a type of synthetic language with morphology that primarily uses agglutination. Words may contain different morphemes to determine their meanings, but all of these morphemes (including stems and affixes) tend to rem ...
s, where a single word does not necessarily represent a single semantic entity.


Companies and products

Predictive text is developed and marketed in a variety of competing products, such as
Nuance Communications Nuance Communications, Inc. is an American multinational computer software technology corporation, headquartered in Burlington, Massachusetts, that markets speech recognition and artificial intelligence software. Nuance merged with its compe ...
's T9. Other products include
Motorola Motorola, Inc. () was an American multinational telecommunications company based in Schaumburg, Illinois, United States. After having lost $4.3 billion from 2007 to 2009, the company split into two independent public companies, Motorola ...
's iTap, Eatoni Ergonomic's LetterWise (character, rather than word-based prediction), WordWise (word-based prediction without a dictionary), EQ3 (a
QWERTY QWERTY () is a keyboard layout for Latin-script alphabets. The name comes from the order of the first six keys on the top left letter row of the keyboard ( ). The QWERTY design is based on a layout created for the Sholes and Glidden ty ...
-like layout compatible with regular telephone keypads); Prevalent Devices's Phraze-It; Xrgomics' TenGO (a six-key reduced QWERTY keyboard system); Adaptxt (considers language, context, grammar and semantics)
Lightkey
(a predictive typing software for Windows); Clevertexting (statistical nature of the language, dictionaryless, dynamic key allocation); and Oizea Type (temporal ambiguity); Intelab's Tauto; WordLogic's Intelligent Input Platform™ (patented, layer-based advanced text prediction, includes multi-language dictionary, spell-check, built-in Web search).


Textonyms

Words produced by the same combination of keypresses have been called "textonyms"; also "txtonyms"; or "T9onyms" (pronounced "tynonyms" ), though they are not specific to T9. Selecting the wrong textonym can occur with no misspelling or typo, if the wrong textonym is selected by default or user error. As mentioned above, the key sequence 4663 on a telephone keypad, provided with a linguistic database in English, will generally be disambiguated as the word ''good''. However, the same key sequence also corresponds to other words, such as ''home'', ''gone'', ''hoof'', ''hood'' and so on. For example, "Are you home?" could be rendered as "Are you good?" if the user neglects to alter the default 4663 word. This can lead to misunderstandings; for example sequence 735328 might correspond to either ''select'' or its
antonym In lexical semantics, opposites are words lying in an inherently incompatible binary relationship. For example, something that is ''long'' entails that it is not ''short''. It is referred to as a 'binary' relationship because there are two members ...
''reject''. A 2010 brawl that led to
manslaughter Manslaughter is a common law legal term for homicide considered by law as less culpable than murder. The distinction between murder and manslaughter is sometimes said to have first been made by the ancient Athenian lawmaker Draco in the 7th ce ...
was sparked by a textonym error. Predictive text choosing a default different from that which the user expects has similarities with the
Cupertino effect The Cupertino effect occurs when a spell checker erroneously replaces correctly spelled words that are not in its dictionary. Origin This term refers to the unhyphenated English word "cooperation" often being changed to "Cupertino" by older spe ...
, by which spell-check software changes a spelling to that of an unintended word. Textonyms have been used as
Millennial Millennials, also known as Generation Y or Gen Y, are the Western world, Western demography, demographic Cohort (statistics), cohort following Generation X and preceding Generation Z. Researchers and popular media use the early 1980s as start ...
slang; for example, the use of the word ''book'' to mean ''cool'', since ''book'' is the default in those predictive text systems that assume it is more frequent than ''cool''. This is related to
cacography Cacography is bad spelling or bad handwriting. The term in the sense of "poor spelling, accentuation, and punctuation" is a semantic antonym to orthography, and in the sense of "poor handwriting" it is an etymological antonym to the word calligra ...
.


Disambiguation failure and misspelling

Textonyms are not the only issue limiting the effectiveness of predictive text implementations. Another significant problem are words for which the disambiguation produces a single, incorrect response. The system may, for example, respond with ''Blairf'' upon input of 252473, when the intended word was ''Blaise'' or ''Claire'', both of which correspond to the keystroke sequence, but are not, in this example, found by the predictive text system. When typos or misspellings occur, they are very unlikely to be recognized correctly by a disambiguation system, though error correction mechanisms may mitigate that effect.


See also


Concepts

*
Assistive technology Assistive technology (AT) is a term for assistive, adaptive, and rehabilitative devices for Disability, people with disabilities and the elderly. Disabled people often have difficulty performing activities of daily living (ADLs) independently, ...
*
Autocomplete Autocomplete, or word completion, is a feature in which an application predicts the rest of a word a user is typing. In Android and iOS smartphones, this is called predictive text. In graphical user interfaces, users can typically press the ta ...
* Text entry interface *
Input method editor An input method (or input method editor, commonly abbreviated IME) is an operating system component or program that enables users to generate characters not natively available on their input devices by using sequences of characters (or mouse o ...
*
Text messaging Text messaging, or texting, is the act of composing and sending electronic messages, typically consisting of alphabetic and numeric characters, between two or more users of mobile devices, desktops/laptops, or another type of compatible comput ...
*
SMS language Short Message Service (SMS) language, textism, or textese is the abbreviated language and slang commonly used in the late 1990s and early 2000s with mobile phone text messaging, and occasionally through Internet-based communication such as ema ...
*
Speech-to-text reporter A speech-to-text reporter (STTR), also known as a captioner, is a person who listens to what is being said and inputs it, word for word (verbatim), using an electronic shorthand keyboard, speech recognition software, or a CAT (Computer Aided Tra ...


References


Further reading

*


External links


New Scientist article on textonymsTechnical notes on iTap (including lists of textonyms)
{{Natural Language Processing Input/output Mobile technology Natural language processing