Optical mark recognition (OMR) collects data from people by identifying markings on a
paper
Paper is a thin sheet material produced by mechanically or chemically processing cellulose fibres derived from wood, Textile, rags, poaceae, grasses, Feces#Other uses, herbivore dung, or other vegetable sources in water. Once the water is dra ...
.
OMR enables the hourly processing of hundreds or even thousands of documents. A common application of this technology is used in exams, where students mark cells as their answers. This allows for very fast automated grading of exam sheets.
Background

Many
OMR devices have a
scanner that shines a light onto a form. The device then looks at the contrasting
reflectivity
The reflectance of the surface of a material is its effectiveness in Reflection (physics), reflecting radiant energy. It is the fraction of incident electromagnetic power that is reflected at the boundary. Reflectance is a component of the respon ...
of the light at certain positions on the form. It will detect the black marks because they reflect less light than the blank areas on the form.
Some OMR devices use forms that are printed on transoptic paper. The device can then measure the amount of light that passes through the paper. It will pick up any black marks on either side of the paper because they reduce the amount of light passing through.
In contrast to the dedicated OMR device, desktop OMR
software
Software consists of computer programs that instruct the Execution (computing), execution of a computer. Software also includes design documents and specifications.
The history of software is closely tied to the development of digital comput ...
allows a user to create their own forms in a word processor or computer and
print them on a
laser printer
Laser printing is an electrostatic digital printing process. It produces high-quality text and graphics (and moderate-quality photographs) by repeatedly passing a laser beam back and forth over a Electric charge, negatively charged cylinder call ...
. The OMR software then works with a common desktop image scanner with a
document
A document is a writing, written, drawing, drawn, presented, or memorialized representation of thought, often the manifestation of nonfiction, non-fictional, as well as fictional, content. The word originates from the Latin ', which denotes ...
feeder to process the forms once filled out.
OMR is generally distinguished from
optical character recognition
Optical character recognition or optical character reader (OCR) is the electronics, electronic or machine, mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo ...
(OCR) by the fact that a complicated
pattern recognition
Pattern recognition is the task of assigning a class to an observation based on patterns extracted from data. While similar, pattern recognition (PR) is not to be confused with pattern machines (PM) which may possess PR capabilities but their p ...
engine is not required. That is, the marks are constructed in such a way that there is little chance that the OMR device will not read them correctly. This does require the image to have high contrast and an easily recognizable or irrelevant shape. A related field to OMR and OCR is the recognition of
barcodes
A barcode or bar code is a method of representing data in a visual, Machine-readable data, machine-readable form. Initially, barcodes represented data by varying the widths, spacings and sizes of parallel lines. These barcodes, now commonly ref ...
, such as the
UPC bar code found on product packaging.
One of the most familiar applications of OMR is the use of
#2 pencil (HB in Europe) bubble
optical answer sheets in multiple choice question
examinations. Students mark their answers, or other personal information, by darkening circles on a forms. The sheet is then graded by a scanning machine.
* In the
United States
The United States of America (USA), also known as the United States (U.S.) or America, is a country primarily located in North America. It is a federal republic of 50 U.S. state, states and a federal capital district, Washington, D.C. The 48 ...
and most
European countries, a horizontal or vertical "tick" in a rectangular "lozenge" is the most commonly used type of OMR form;
* The most familiar form in the United Kingdom is the UK National lottery form.
Lozenge marks represent a later technology that is easier to mark and easier to erase. The large "bubble" marks are legacy technology from very early
OMR machines that were so insensitive a large mark was required for reliability. In most Asian countries, a special marker is used to fill in an
optical answer sheet. Students, likewise, mark answers or other information by darkening circles marked on a pre-printed sheet. Then the sheet is automatically graded by a scanning machine.
Many of today's OMR applications involve people filling in specialized forms. These forms are optimized for computer scanning, with careful registration in the printing, and careful design so that
ambiguity
Ambiguity is the type of meaning (linguistics), meaning in which a phrase, statement, or resolution is not explicitly defined, making for several interpretations; others describe it as a concept or statement that has no real reference. A com ...
is reduced to the minimum possible. Due to its extremely low error rate, low cost and ease-of-use, OMR is a
popular method of tallying votes.
[Haag, S., Cummings, M., McCubbrey, D., Pinsonnault, A., Donovan, R. (2006). Management Information Systems for the Information Age (3rd ed.). Canada: McGraw-Hill Ryerson.]
OMR marks are also added to items of printed mail so folder inserter equipment can be used. The marks are added to each (normally facing/odd) page of a mail document and consist of a sequence of black dashes that folder inserter equipment scans in order to determine when the mail should be folded then inserted in an
envelope
An envelope is a common packaging item, usually made of thin, flat material. It is designed to contain a flat object, such as a letter (message), letter or Greeting card, card.
Traditional envelopes are made from sheets of paper cut to one o ...
.
Optical answer sheet

An optical answer sheet or bubble sheet is a special type of
form
Form is the shape, visual appearance, or configuration of an object. In a wider sense, the form is the way something happens.
Form may also refer to:
*Form (document), a document (printed or electronic) with spaces in which to write or enter dat ...
used in multiple choice question examinations. OMR is used to detect answers. The
Scantron Corporation creates many optical answer sheets, although certain uses require their own customized system.
Optical answer sheets usually have a set of blank ovals or boxes that correspond to each question, often on separate sheets of paper.
Bar code
A barcode or bar code is a method of representing data in a visual, Machine-readable data, machine-readable form. Initially, barcodes represented data by varying the widths, spacings and sizes of parallel lines. These barcodes, now commonly ref ...
s may mark the sheet for automatic processing, and each series of ovals filled will return a certain value when read. In this way students' answers can be digitally recorded, or identity given.
Reading
The first optical answer sheets were read by shining a light through the sheet and measuring how much of the light was blocked using
phototubes
A phototube or photoelectric cell is a type of gas filled tube, gas-filled or vacuum tube that is sensitive to light. Such a tube is more correctly called a 'photoemissive cell' to distinguish it from photovoltaic or photoconductive cells. Photo ...
on the opposite side.
As some phototubes are mostly sensitive to the blue end of the
visible spectrum
The visible spectrum is the spectral band, band of the electromagnetic spectrum that is visual perception, visible to the human eye. Electromagnetic radiation in this range of wavelengths is called ''visible light'' (or simply light).
The optica ...
, blue pens could not be used, as blue inks reflect and
transmit blue light. Because of this, number two pencils had to be used to fill in the bubbles—
graphite
Graphite () is a Crystallinity, crystalline allotrope (form) of the element carbon. It consists of many stacked Layered materials, layers of graphene, typically in excess of hundreds of layers. Graphite occurs naturally and is the most stable ...
is a very
opaque substance which absorbs or reflects most of the light which hits it.
Modern optical answer sheets are read based on reflected light, measuring lightness and darkness. They do not need to be filled in with a number two pencil, though these are recommended over other types (this is due to the lighter marks made by higher-number pencils and the smudges from number 1 pencils). Black ink will be read, though many systems will ignore marks that are the same color the form is printed in.
This also allows optical answer sheets to be double-sided because marks made on the opposite side will not interfere with reflectance readings as much as with opacity readings.
Most systems accommodate for
human error
Human error is an action that has been done but that was "not intended by the actor; not desired by a set of rules or an external observer; or that led the task or system outside its acceptable limits".Senders, J.W. and Moray, N.P. (1991) Human Er ...
in filling in ovals imprecisely—as long as they do not stray into the other ovals and the oval is almost filled, the scanner will detect it as filled in.
Designing and printing
There are specific dimensions of designing OMR sheets with 0.05 mm precision on scale. If the dimensions are not up to the precision scale, the accuracy of the OMR sheet may vary, so the sheet should be designed, printed and cut perfectly.
* Single Part – Sheets are printed on 105 gsm to 120 gsm paper on A4/Legal sheets.
* Double Part (Carbonless) – Two sheets are printed; one on 105 gsm paper and one on 60-70 gsm paper on A4 sheets. The bottom of the first sheet and the top of the second sheet are chemically treated so that the impression of the first sheet comes on the second sheet.
* Three Part (Carbonless) – Three sheets are printed on one 105 gsm paper and the other two are printed on 60-70 gsm paper on A4 sheets. The bottom of the first sheet, the top and bottom of the second sheet, and the top of the third sheet are chemically treated so that the impression of the first sheet comes on the second and third sheets.
Errors
It is possible for optical answer sheets to be printed incorrectly, such that all ovals will be read as filled. This occurs if the outline of the ovals is too thick, or is irregular. During the
2008 U.S. presidential election
Presidential elections were held in the United States on November 4, 2008. The Democratic ticket of Barack Obama, the junior senator from Illinois, and Joe Biden, the senior senator from Delaware, defeated the Republican ticket of John Mc ...
, this occurred with over 19,000
absentee ballot
Absentee or The Absentee may refer to:
* Absentee (band), a British band
* The Absentee, a novel by Maria Edgeworth, published in 1812 in ''Tales of Fashionable Life''
* ''The Absentee'' (1915 film), a 1915 American silent film directed by Christy ...
s in the
Georgia
Georgia most commonly refers to:
* Georgia (country), a country in the South Caucasus
* Georgia (U.S. state), a state in the southeastern United States
Georgia may also refer to:
People and fictional characters
* Georgia (name), a list of pe ...
county of
Gwinnett, and was discovered after around 10,000 had already been returned. The slight difference was not apparent to the
naked eye
Naked eye, also called bare eye or unaided eye, is the practice of engaging in visual perception unaided by a magnification, magnifying, Optical telescope#Light-gathering power, light-collecting optical instrument, such as a telescope or microsc ...
, and was not detected until a test run was made in late October. This required all
ballot
A ballot is a device used to cast votes in an election and may be found as a piece of paper or a small ball used in voting. It was originally a small ball (see blackballing) used to record decisions made by voters in Italy around the 16th cent ...
s to be transferred to correctly printed ones, by
sequestered workers of the
board of elections, under close
observation
Observation in the natural sciences is an act or instance of noticing or perceiving and the acquisition of information from a primary source. In living beings, observation employs the senses. In science, observation can also involve the percep ...
by members of the
Democratic and
Republican (but not
other) political parties, and county
sheriff
A sheriff is a government official, with varying duties, existing in some countries with historical ties to England where the office originated. There is an analogous, although independently developed, office in Iceland, the , which is common ...
deputies
A legislator, or lawmaker, is a person who writes and passes laws, especially someone who is a member of a legislature. Legislators are often elected by the people, but they can be appointed, or hereditary. Legislatures may be supra-nati ...
. The transfer, by law, could not occur until
election day (November 4).
OMR software

OMR software is a computer software application that makes OMR possible on a desktop computer by using an
Image scanner
An image scanner (often abbreviated to just scanner) is a device that optically scans images, printed text, handwriting, or an object and converts it to a digital image. The most common type of scanner used in the home and the office is the flatbe ...
to process surveys, tests, attendance sheets, checklists, and other plain-paper forms printed on a laser printer.
OMR software is used to capture data from OMR sheets. While data capturing scanning devices focus on many factors like thickness of paper dimensions of OMR sheet and the designing pattern.
Commercial OMR software
One of the first OMR software packages that used images from common image scanners was Remark Office OMR, made by Gravic, Inc. (originally named Principia Products, Inc.). Remark Office OMR 1.0 was released in 1991.
The need for OMR software originated because early optical mark recognition systems used dedicated scanners and special pre-printed forms with drop-out colors and registration marks. Such forms typically cost US$0.10 to $0.19 a page. In contrast, OMR software users design their own mark-sense forms with a word processor or built-in form editor, print them locally on a printer, and can save thousands of dollars on large numbers of forms.
Identifying optical marks within a form, such as for processing
census
A census (from Latin ''censere'', 'to assess') is the procedure of systematically acquiring, recording, and calculating population information about the members of a given Statistical population, population, usually displayed in the form of stati ...
forms, has been offered by many forms-processing (Batch Transaction Capture) companies since the late 1980s. Mostly this is based on a
bitonal image and pixel count with minimum and maximum
pixel
In digital imaging, a pixel (abbreviated px), pel, or picture element is the smallest addressable element in a Raster graphics, raster image, or the smallest addressable element in a dot matrix display device. In most digital display devices, p ...
counts to eliminate extraneous marks, such as those erased with a dirty eraser that when converted into a black-and-white image (bitonal) can look like a legitimate mark. So this method can cause problems when a user changes their mind, and so some products started to use grayscale to better identify the intent of the marker—internally scantron and NCS scanners used
grayscale
In digital photography, computer-generated imagery, and colorimetry, a greyscale (more common in Commonwealth English) or grayscale (more common in American English) image is one in which the value of each pixel is a single sample (signal), s ...
.
OMR software is also used for adding OMR marks to mail documents so they can be scanned by folder inserter equipment. An example of OMR software is Mail Markup from UK developer Funasset Limited. This software allows the user to configure and select an OMR sequence then apply the OMR marks to mail documents prior to printing.
History
Optical mark recognition (OMR) is the scanning of paper to detect the presence or absence of a mark in a predetermined position.
Optical mark recognition has evolved from several other technologies. In the early 19th century and 20th century patents were given for machines that would aid the blind.
OMR is now used as an input device for data entry. Two early forms of OMR are
paper tape
Five- and eight-hole wide punched paper tape
Paper tape reader on the Harwell computer with a small piece of five-hole tape connected in a circle – creating a physical program loop
Punched tape or perforated paper tape is a form of data st ...
and
punch card
A punched card (also punch card or punched-card) is a stiff paper-based medium used to store digital information via the presence or absence of holes in predefined positions. Developed over the 18th to 20th centuries, punched cards were wide ...
s which use actual holes punched into the medium instead of pencil filled circles on the medium. Paper tape was used as early as 1857 as an input device for telegraph.
Punch cards were created in 1890 and were used as input devices for computers. The use of punch cards declined greatly in the early 1970s with the introduction of personal computers.
[Palmer, Roger C. (1989, Sept) The Basics of Automatic Identification lectronic version Canadian Datasystems, 21 (9), 30-33] With modern OMR, where the presence of a pencil filled in bubble is recognized, the recognition is done via an optical scanner.
The first
mark sense scanner was the
IBM 805 Test Scoring Machine; this read marks by sensing the electrical conductivity of graphite pencil lead using pairs of wire brushes that scanned the page. In the 1930s, Richard Warren at
IBM
International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...
experimented with optical mark sense systems for test scoring, as documented in US Patents 2,150,256 (filed in 1932, granted in 1939) and 2,010,653 (filed in 1933, granted in 1935). The first successful optical mark-sense scanner was developed by
Everett Franklin Lindquist
Everett Franklin Lindquist (June 4, 1901 – May 13, 1978) was a professor of education at the University of Iowa College of Education. He is best known as the creator of the ACT (test), ACT and other standardized tests. His contributions to the fi ...
as documented in US Patent 3,050,248 (filed in 1955, granted in 1962). Lindquist had developed numerous standardized educational tests, and needed a better test scoring machine than the then-standard IBM 805. The rights to Lindquist's patents were held by the Measurement Research Center until 1968, when the
University of Iowa
The University of Iowa (U of I, UIowa, or Iowa) is a public university, public research university in Iowa City, Iowa, United States. Founded in 1847, it is the oldest and largest university in the state. The University of Iowa is organized int ...
sold the operation to
Westinghouse Corporation.
During the same period,
IBM
International Business Machines Corporation (using the trademark IBM), nicknamed Big Blue, is an American Multinational corporation, multinational technology company headquartered in Armonk, New York, and present in over 175 countries. It is ...
also developed a successful optical mark-sense test-scoring machine, as documented in US Patent 2,944,734 (filed in 1957, granted in 1960). IBM commercialized this as the IBM 1230 Optical mark scoring reader in 1962. This and a variety of related machines allowed IBM to migrate a wide variety of applications developed for its
mark sense machines to the new optical technology. These applications included a variety of inventory management and trouble reporting forms, most of which had the dimensions of a standard
punched card
A punched card (also punch card or punched-card) is a stiff paper-based medium used to store digital information via the presence or absence of holes in predefined positions. Developed over the 18th to 20th centuries, punched cards were widel ...
.
While the other players in the educational testing arena focused on selling scanning services,
Scantron Corporation, founded in 1972, had a different model; it would distribute inexpensive scanners to schools and make profits from selling the test forms. As a result, many people came to think of all mark-sense forms (whether optically sensed or not) as ''scantron'' forms.
In 1983, Westinghouse Learning Corporation was acquired by National Computer Systems (NCS). In 2000, NCS was acquired by
Pearson Education
Pearson Education, known since 2011 as simply Pearson, is the educational publishing and services subsidiary of the international corporation Pearson plc. The subsidiary was formed in 1998, when Pearson plc acquired Simon & Schuster's educatio ...
, where the OMR technology formed the core of Pearson's Data Management group. In February 2008, M&F Worldwide purchased the Data Management group from Pearson; the group is now part of the Scantron brand.
OMR has been used in many situations as mentioned below. The use of OMR in inventory systems was a transition between punch cards and bar codes and is not used as much for this purpose.
OMR is still used extensively for surveys and testing though.
Usage
The use of OMR is not limited to schools or
data collection
Data collection or data gathering is the process of gathering and measuring information on targeted variables in an established system, which then enables one to answer relevant questions and evaluate outcomes. Data collection is a research com ...
agencies; many businesses and health care agencies use OMR to streamline their data input processes and reduce input error. OMR, OCR, and ICR technologies all provide a means of data collection from paper forms. OMR may also be done using an OMR (discrete read head) scanner or an imaging scanner.
Applications

There are many other applications for OMR, for examples:
* In the process of
institutional research
* Community surveys
* Consumer surveys
*
Tests and assessments
* Evaluations and feedback
* Data compilation
* Product evaluation
*
Time sheets and inventory counts
* Membership subscription forms
*
Lotteries
A lottery (or lotto) is a form of gambling that involves the drawing of numbers at random for a prize. Some governments outlaw lotteries, while others endorse it to the extent of organizing a national or state lottery. It is common to find som ...
and voting
* Geocoding (e.g.
postal code
A postal code (also known locally in various English-speaking countries throughout the world as a postcode, post code, PIN or ZIP Code) is a series of letters or numerical digit, digits or both, sometimes including spaces or punctuation, inclu ...
s)
*
Mortgage loan
A mortgage loan or simply mortgage (), in civil law (legal system), civil law jurisdictions known also as a hypothec loan, is a loan used either by purchasers of real property to raise funds to buy real estate, or by existing property owners t ...
, banking, and insurance applications
Field types
OMR has different fields to provide the format the questioner desires. These fields include:
*Multiple, where there are several options but only one is chosen. For example, the form might ask for one of the options ABCDE; 12345; completely disagree, disagree, indifferent, agree, completely agree; or similar.
*Grid: the bubbles or lines are set up in a grid format for the user to fill in a phone number, name, ID number and so on.
*Add, total the answers to a single value
*
Boolean, answering yes or no to all that apply
*Binary, answering yes or no to only one
*Dotted lines fields, developed by Smartshoot OMR, allow border dropping like traditional color dropping.
Capabilities/requirements
In the past and presently, some OMR systems require special paper, special ink and a special input reader (Bergeron, 1998). This restricts the types of questions that can be asked and does not allow for much variability when the form is being input. Progress in OMR now allows users to create and print their own forms and use a scanner (preferably with a document feeder) to read the information.
[Bergeron, ] The user is able to arrange questions in a format that suits their needs while still being able to easily input the data. OMR systems approach one hundred percent
accuracy
Accuracy and precision are two measures of ''observational error''.
''Accuracy'' is how close a given set of measurements (observations or readings) are to their ''true value''.
''Precision'' is how close the measurements are to each other.
The ...
and only take 5
millisecond
A millisecond (from '' milli-'' and second; symbol: ms) is a unit of time in the International System of Units equal to one thousandth (0.001 or 10−3 or 1/1000) of a second or 1000 microseconds.
A millisecond is to one second, as one second i ...
s on average to recognize marks.
Users can use squares, circles, ellipses and hexagons for the mark zone. The software can then be set to recognize filled in bubbles, crosses or check marks.
OMR can also be used for personal use. There are all-in-one printers in the market that will print the photos the user selects by filling in the bubbles for size and paper selection on an index sheet that has been printed. Once the sheet has been filled in, the individual places the sheet on the scanner to be scanned and the printer will print the photos according to the marks that were indicated.
Disadvantages
There are also some disadvantages and limitations to OMR. If the user wants to gather large amounts of text, then OMR complicates the data collection.
[Green, 2000 ] There is also the possibility of missing data in the scanning process, and incorrectly or unnumbered pages can lead to their being scanned in the wrong order. Also, unless safeguards are in place, a page could be rescanned, providing duplicate data and skewing the data.
As a result of the widespread adoption and ease of use of OMR,
standardized examinations can consist primarily of multiple-choice questions, changing the nature of what is being tested.
See also
*
AI effect
The AI effect is the discounting of the behavior of an artificial intelligence program as not "real" intelligence.
The author Pamela McCorduck writes: "It's part of the history of the field of artificial intelligence that every time somebody fi ...
*
Applications of artificial intelligence
Artificial intelligence (AI) has been used in applications throughout industry and academia. In a manner analogous to electricity or computers, AI serves as a general-purpose technology. AI programs are designed to simulate human perception and u ...
*
Clock mark
*
Electronic data capture
*
Mark sense
*
Object recognition
Object recognition – technology in the field of computer vision for finding and identifying objects in an image or video sequence. Humans recognize a multitude of objects in images with little effort, despite the fact that the image of the ...
*
Optical character recognition
Optical character recognition or optical character reader (OCR) is the electronics, electronic or machine, mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo ...
*
Pattern recognition
Pattern recognition is the task of assigning a class to an observation based on patterns extracted from data. While similar, pattern recognition (PR) is not to be confused with pattern machines (PM) which may possess PR capabilities but their p ...
*
Benjamin D. Wood
; Lists
*
List of emerging technologies
This is a list of emerging technologies, which are emerging technologies, in-development technical innovations that have significant potential in their applications. The criteria for this list is that the technology must:
# Exist in some way; ...
*
Outline of artificial intelligence
The following outline is provided as an overview of and topical guide to artificial intelligence:
Artificial intelligence (AI) is intelligence exhibited by machines or software. It is also the name of the scientific field which studies how to ...
References
{{DEFAULTSORT:Optical Mark Recognition
Optical character recognition
Paper data storage
ja:マークシート