OCR Systems, Inc., was an American computer hardware manufacturer and software publisher dedicated to
optical character recognition
Optical character recognition or optical character reader (OCR) is the electronics, electronic or machine, mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo ...
technologies. The company's first product, the System 1000 in 1970, was used by numerous large corporations for bill processing and mail sorting. Following a series of pitfalls in the 1970s and early 1980s, founder Theodor Herzl Levine put the company in the hands of Gregory Boleslavsky and Vadim Brikman, the company's vice presidents and recent immigrants from the
Soviet Ukraine
The Ukrainian Soviet Socialist Republic, abbreviated as the Ukrainian SSR, UkrSSR, and also known as Soviet Ukraine or just Ukraine, was one of the constituent republics of the Soviet Union from 1922 until 1991. Under the Soviet one-party m ...
, who were able to turn OCR System's fortunes around and expand its employee base. The company released the software-based OCR application ReadRight for
DOS
DOS (, ) is a family of disk-based operating systems for IBM PC compatible computers. The DOS family primarily consists of IBM PC DOS and a rebranded version, Microsoft's MS-DOS, both of which were introduced in 1981. Later compatible syste ...
, later ported to
Windows
Windows is a Product lining, product line of Proprietary software, proprietary graphical user interface, graphical operating systems developed and marketed by Microsoft. It is grouped into families and subfamilies that cater to particular sec ...
, in the late 1980s.
Adobe Inc.
Adobe Inc. ( ), formerly Adobe Systems Incorporated, is an American computer software company based in San Jose, California. It offers a wide range of programs from web design tools, photo manipulation and vector creation, through to video/audi ...
bought the company in 1992.
History
OCR Systems was co-founded by Theodor Herzl Levine ( 1923 – May 30, 2005). Levine served in the
U.S. Army Signal Corps during
World War II
World War II or the Second World War (1 September 1939 – 2 September 1945) was a World war, global conflict between two coalitions: the Allies of World War II, Allies and the Axis powers. World War II by country, Nearly all of the wo ...
in the
Solomon Islands
Solomon Islands, also known simply as the Solomons,John Prados, ''Islands of Destiny'', Dutton Caliber, 2012, p,20 and passim is an island country consisting of six major islands and over 1000 smaller islands in Melanesia, part of Oceania, t ...
, where he helped develop a
sonar
Sonar (sound navigation and ranging or sonic navigation and ranging) is a technique that uses sound propagation (usually underwater, as in submarine navigation) to navigate, measure distances ( ranging), communicate with or detect objects o ...
to find
ejected pilots in the ocean. After the war, Levine spent 22 years at the
University of Pennsylvania
The University of Pennsylvania (Penn or UPenn) is a Private university, private Ivy League research university in Philadelphia, Pennsylvania, United States. One of nine colonial colleges, it was chartered in 1755 through the efforts of f ...
, earning his
bachelor's degree
A bachelor's degree (from Medieval Latin ''baccalaureus'') or baccalaureate (from Modern Latin ''baccalaureatus'') is an undergraduate degree awarded by colleges and universities upon completion of a course of study lasting three to six years ...
in 1951, his
master's degree
A master's degree (from Latin ) is a postgraduate academic degree awarded by universities or colleges upon completion of a course of study demonstrating mastery or a high-order overview of a specific field of study or area of professional prac ...
in
electrical engineering
Electrical engineering is an engineering discipline concerned with the study, design, and application of equipment, devices, and systems that use electricity, electronics, and electromagnetism. It emerged as an identifiable occupation in the l ...
in 1957, and his
doctorate
A doctorate (from Latin ''doctor'', meaning "teacher") or doctoral degree is a postgraduate academic degree awarded by universities and some other educational institutions, derived from the ancient formalism '' licentia docendi'' ("licence to teach ...
in 1968. Alongside his studies, Levine taught statistics and calculus at
Temple University
Temple University (Temple or TU) is a public university, public Commonwealth System of Higher Education, state-related research university in Philadelphia, Philadelphia, Pennsylvania, United States. It was founded in 1884 by the Baptist ministe ...
,
Rutgers University
Rutgers University ( ), officially Rutgers, The State University of New Jersey, is a Public university, public land-grant research university consisting of three campuses in New Jersey. Chartered in 1766, Rutgers was originally called Queen's C ...
,
La Salle University
La Salle University () is a private university, private, Catholic university in Philadelphia, Pennsylvania, United States. The university was founded in 1863 by the Institute of the Brothers of the Christian Schools and named for St. Jean-Bapt ...
and
Penn State Abington
Penn State Abington is a commonwealth campus of the Pennsylvania State University located in Abington, Pennsylvania. The campus is set on of wooded land. The roughly 4000 undergraduate students (full-time and part-time students combined) ar ...
. Sometime in the 1960s, Levine was hired at
Philco
Philco (an acronym for Philadelphia Battery Company) is an American electronics industry, electronics manufacturer headquartered in Philadelphia. Philco was a pioneer in battery, radio, and television production. In 1961, the company was purchase ...
. He and two of his co-workers decided to form their own company dedicated to
optical character recognition
Optical character recognition or optical character reader (OCR) is the electronics, electronic or machine, mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo ...
, founding OCR Systems in 1969 in
Bensalem, Pennsylvania.
OCR Systems's first product, the System 1000, was announced in 1970. OCR Systems entered a partnership with
3M to resell the System 1000 throughout the United States in March 1973. This was 3M's entry into the
data entry
Data entry is the process of digitizing data by entering it into a computer system for organization and management purposes. It is a person-based process and is "one of the important basic" tasks needed when no machine-readable version of the in ...
field, managed by the company's Microfilm Products Division and accompanying 3M's suite of
data retrieval
Data retrieval means obtaining data from a database management system (DBMS), like for example an object-oriented database (ODBMS). In this case, it is considered that data is represented in a structured way, and there is no ambiguity in data.
I ...
systems. It soon found use among
Texas Instruments
Texas Instruments Incorporated (TI) is an American multinational semiconductor company headquartered in Dallas, Texas. It is one of the top 10 semiconductor companies worldwide based on sales volume. The company's focus is on developing analog ...
,
AT&T
AT&T Inc., an abbreviation for its predecessor's former name, the American Telephone and Telegraph Company, is an American multinational telecommunications holding company headquartered at Whitacre Tower in Downtown Dallas, Texas. It is the w ...
,
Ricoh
is a Japanese multinational imaging and electronics company. It was founded by the now-defunct commercial division of the Institute of Physical and Chemical Research (Riken) known as the ''Riken Concern'', on 6 February 1936 as . Ricoh's hea ...
,
Panasonic
is a Japanese multinational electronics manufacturer, headquartered in Kadoma, Osaka, Kadoma, Japan. It was founded in 1918 as in Fukushima-ku, Osaka, Fukushima by Kōnosuke Matsushita. The company was incorporated in 1935 and renamed and c ...
and
Canon
Canon or Canons may refer to:
Arts and entertainment
* Canon (fiction), the material accepted as officially written by an author or an ascribed author
* Literary canon, an accepted body of works considered as high culture
** Western canon, th ...
for bill processing and mail sorting. Later in the mid-1970s an unspecified
Fortune 500
The ''Fortune'' 500 is an annual list compiled and published by ''Fortune (magazine), Fortune'' magazine that ranks 500 of the largest United States Joint-stock company#Closely held corporations and publicly traded corporations, corporations by ...
company reneged on a contract to distribute the System 1000; later still a Canadian company distributing the System 1000 in Canada went defunct. Both incidents led OCR Systems to go nearly bankrupt, although it eventually recovered.
By the early 1980s, however, the company was almost insolvent. In 1983 Levine had only $8,000 in his savings and became bedridden with an illness. He left the company in the hands of Gregory Boleslavsky and Vadim Brikman, two
Soviet Ukraine
The Ukrainian Soviet Socialist Republic, abbreviated as the Ukrainian SSR, UkrSSR, and also known as Soviet Ukraine or just Ukraine, was one of the constituent republics of the Soviet Union from 1922 until 1991. Under the Soviet one-party m ...
expats whom Levine had hired earlier in the 1980s. Boleslavsky was hired as a
wire wrap
Close-up of a wire-wrap connection
Typical wire wrap construction of crossbar_switch.html" ;"title="Bell System telephone crossbar switch">Bell System telephone crossbar switch. Some types of connection were soldered.
Wire wrap is an electron ...
per for the System 1000 and as a programmer and beta tester for ReadRight—a software package developed by Levine implementing patents from Nonlinear Technology, another OCR-centric company from
Greenbelt, Maryland
Greenbelt is a city in Prince George's County, Maryland, United States, and a suburb of Washington, D.C. At the 2020 United States census, 2020 census, the population was 24,921.
Greenbelt is the first and the largest of the three experimental ...
. Boleslavsky in turn recommended Brikman to Levine. The two soon became vice presidents of the company while Levine was bedridden; in Boleslavsky's case, he worked 14-hour work days for over half a year in pursuit of the title. The two presented OCR Systems' products to the
National Computer Conference The Joint Computer Conferences were a series of computer conferences in the United States held under various names between 1951 and 1987. The conferences were the venue for presentations and papers representing "cumulative work in the omputerfield ...
in
Chicago
Chicago is the List of municipalities in Illinois, most populous city in the U.S. state of Illinois and in the Midwestern United States. With a population of 2,746,388, as of the 2020 United States census, 2020 census, it is the List of Unite ...
, where they were massively popular. The company soon gained such clients as
Allegheny Energy in Pennsylvania and the
postal service of Belgium and received an influx of employees—mostly expats from Russia but also
Poland
Poland, officially the Republic of Poland, is a country in Central Europe. It extends from the Baltic Sea in the north to the Sudetes and Carpathian Mountains in the south, bordered by Lithuania and Russia to the northeast, Belarus and Ukrai ...
and
South Korea
South Korea, officially the Republic of Korea (ROK), is a country in East Asia. It constitutes the southern half of the Korea, Korean Peninsula and borders North Korea along the Korean Demilitarized Zone, with the Yellow Sea to the west and t ...
, as well as American-born workers. To accommodate the company's employee base, which had grown to over 30 in 1988, Levine moved OCR System's headquarters from Bensalem to the Masons Mill Business Park in
Bryn Athyn.
Chinon Industries
was a Japanese camera manufacturer. Kodak took a majority stake in the company in 1997, and made it a fully owned subsidiary of Kodak Japan, , in 2004. As a subsidiary, it continues to develop digital camera models.
They manufactured several ...
of Japan signed an agreement with OCR Systems in 1987 to distribute OCR's ReadRight 1.0 software with Chinon's scanners, starting with their N-205
overhead scanner. In 1988, OCR opened their agreement to distribute ReadRight to other scanner manufacturers, including Canon,
Hewlett-Packard
The Hewlett-Packard Company, commonly shortened to Hewlett-Packard ( ) or HP, was an American multinational information technology company. It was founded by Bill Hewlett and David Packard in 1939 in a one-car garage in Palo Alto, California ...
,
Skyworld,
Taxan
TAXAN was a brand of Kaga Create, Kaga Electronics Co. Ltd, Tokyo, Japan. It was founded in July 1981. In the late 1980s and early 1990s, the US division published several video games on the Nintendo Entertainment System, NES and Game Boy. The com ...
,
Diamond Flower and
Abaton. That year, the company posted a revenue of $3 million. OCR Systems extended their agreement with Chinon in 1989 and introduced version 2.0 of ReadRight.
OCR Systems faced stiff competition in the software OCR market in the turn of the 1990s. The
Toronto
Toronto ( , locally pronounced or ) is the List of the largest municipalities in Canada by population, most populous city in Canada. It is the capital city of the Provinces and territories of Canada, Canadian province of Ontario. With a p ...
-based software firm
Delrina
Delrina Corporation was a Canadian software company active from 1988 to 1995. The company was best known for WinFax, a software package which enabled computers equipped with fax modems to transmit copies of documents to standalone fax machines or ...
signed a letter of intent to purchase the company in November 1991, expecting the deal to close in December and have OCR software available by Christmas. OCR was to receive $3 million worth of Delrina shares in a
stock swap
In corporate finance, a stock swap is the exchange of one equity-based asset for another, where, during the merger or acquisition, the swap provides an opportunity to pay with stock rather than with cash; see .
Overview
The acquiring company ...
, but the deal collapsed in January 1992. Delrine later marketed its own Extended Character Recognition, or XCR, software package to compete with ReadRight. In July 1992, OCR Systems was purchased by
Adobe Inc.
Adobe Inc. ( ), formerly Adobe Systems Incorporated, is an American computer software company based in San Jose, California. It offers a wide range of programs from web design tools, photo manipulation and vector creation, through to video/audi ...
for an undisclosed sum.
Products
System 1000
The System 1000 was based on the 16-bit
Varian Data 620/i
minicomputer
A minicomputer, or colloquially mini, is a type of general-purpose computer mostly developed from the mid-1960s, built significantly smaller and sold at a much lower price than mainframe computers . By 21st century-standards however, a mini is ...
with 4 KB of
core memory
Core or cores may refer to:
Science and technology
* Core (anatomy), everything except the appendages
* Core (laboratory), a highly specialized shared research resource
* Core (manufacturing), used in casting and molding
* Core (optical fiber), ...
. The system used the 620/i for controlling the paper feed, interpreting the format of the documents, the optical character recognition process itself,
error detection
In information theory and coding theory with applications in computer science and telecommunications, error detection and correction (EDAC) or error control are techniques that enable reliable delivery of digital data over unreliable communi ...
, sequencing and output. The System was initially programmed to recognize 1428 OCR (used by
Selectrics);
IBM 407
The IBM 407 Accounting Machine, introduced in 1949, was one of a long line of IBM tabulating machines dating back to the days of Herman Hollerith. It had a card reader and printer; a summary punch could be attached. Processing was directed by ...
print; and the full character sets of
OCR-A
OCR-A is a font issued in 1966 and first implemented in 1968. A special font was needed in the early days of computer optical character recognition, when there was a need for a font that could be recognized not only by the computers of that day, b ...
,
OCR-B and
Farrington 7B; as well as
optical marks and handwritten numbers. OCR Systems promised added compatibility with more fonts available down the line—per request—in 1970. The number of fonts supported was limited by the amount of core memory, which was expandable in 4 KB increments up to 32 KB. The System 1000 later supported generalized typewriter and
photocopier
A photocopier (also called copier or copy machine, and formerly Xerox machine, the generic trademark) is a machine that makes copies of documents and other visual images onto paper or plastic film quickly and cheaply. Most modern photocopiers ...
fonts.
The rest of the System 1000 comprised the document transport, one or more scanner elements, a CRT display and a
Teletype Model 33
The Teletype Model 33 is an electromechanical teleprinter designed for light-duty office use. It is less rugged and cost less than earlier Teletype models. The Teletype Corporation introduced the Model 33 as a commercial product in 1963, after ...
or 35. Pages are fed via friction with a rubber belt. Up to three lines could be scanned per document, while the rest of the scanned document could be laid out in any manner granted there was enough space around the fields to be read. The reader initially supported pages as small as 3.25 in by 3.5 in dimension (later supporting 2.6 in by 3.5 in utility cash stubs) all the way to the standard
ANSI letter size (8.5 in by 11 in; later 8.5 in by 12 in as used in
stock certificate
In company (law), corporate law, a stock certificate (also known as certificate of stock or share certificate) is a legal document that certifies the legal interest (a bundle of several legal rights) of ownership of a specific number of share ...
s). The initial System 1000 had a maximum
throughput
Network throughput (or just throughput, when in context) refers to the rate of message delivery over a communication channel in a communication network, such as Ethernet or packet radio. The data that these messages contain may be delivered ov ...
of 420 documents per minute per transport (later 500 documents per minute), contingent on document size and content.
A feature unique to the System 1000 over other optical character recognition systems of the time was its ability to alert the operator when a field was unreadable or otherwise invalid. This feature, called Document Referral, placed the document in front of the operator and displayed a blank field on the screen of the included
CRT
CRT or Crt most commonly refers to:
* Cathode-ray tube, a display
* Critical race theory, an academic framework of analysis
CRT may also refer to:
Law
* Charitable remainder trust, United States
* Civil Resolution Tribunal, Canada
* Columbia ...
monitor for manual re-entry via keyboard. Once input, data could be output to
7- or
9-track tape
9-track tape is a format for magnetic-tape data storage, introduced with the IBM System/360 in 1964. The wide magnetic tape media and reels have the same size as the earlier IBM 7-track format it replaced, but the new format has eight data ...
,
paper tape
Five- and eight-hole wide punched paper tape
Paper tape reader on the Harwell computer with a small piece of five-hole tape connected in a circle – creating a physical program loop
Punched tape or perforated paper tape is a form of data st ...
,
punched card
A punched card (also punch card or punched-card) is a stiff paper-based medium used to store digital information via the presence or absence of holes in predefined positions. Developed over the 18th to 20th centuries, punched cards were widel ...
s and other
mass storage
In computing, mass storage refers to the storage of large amounts of data in a persisting and machine-readable fashion. In general, the term ''mass'' in ''mass storage'' is used to mean ''large'' in relation to contemporaneous hard disk drive ...
media or to
System/360
The IBM System/360 (S/360) is a family of mainframe computer systems announced by IBM on April 7, 1964, and delivered between 1965 and 1978. System/360 was the first family of computers designed to cover both commercial and scientific applicati ...
mainframes for further processing.
The complete System 1000 could be purchased for US$69,000. Options for renting were $1,800 per month on a three-year lease or $1,600 per month for five years. ''
Computerworld
''Computerworld'' (abbreviated as CW) is a computer magazine published since 1967 aimed at information technology (IT) and Business computing, business technology professionals. Original a print magazine, ''Computerworld'' published its final pr ...
'' wrote that it was less than half the cost of its competitors while more capable and user-friendly. Competing systems included the Recognition Equipment Retina, the
Scan-Optics IC/20 and the Scan-Data 250/350.
ReadRight
ReadRight processes individual letters topographically: it breaks down the scanned letter into parts—strokes, curves, angles,
ascenders and
descender
In typography and handwriting, a descender is the portion of a grapheme that extends below the Baseline (typography), baseline of a typeface, font.
For example, in the letter ''y'', the descender is the "tail", or that portion of the diagonal li ...
s—and follows a
tree structure
A tree structure, tree diagram, or tree model is a way of representing the hierarchical nature of a structure in a graphical form. It is named a "tree structure" because the classic representation resembles a tree, although the chart is gen ...
of letters broken down into these parts to determine the corresponding
character code
Character encoding is the process of assigning numbers to graphical character (computing), characters, especially the written characters of human language, allowing them to be stored, transmitted, and transformed using computers. The numerical v ...
. ReadRight was entirely software-based, requiring no
expansion card
In computing, an expansion card (also called an expansion board, adapter card, peripheral card or accessory card) is a printed circuit board that can be inserted into an electrical connector, or expansion slot (also referred to as a bus sl ...
to work. Version 2.01, the last version released for
DOS
DOS (, ) is a family of disk-based operating systems for IBM PC compatible computers. The DOS family primarily consists of IBM PC DOS and a rebranded version, Microsoft's MS-DOS, both of which were introduced in 1981. Later compatible syste ...
, runs in
real mode
Real mode, also called real address mode, is an operating mode of all x86-compatible CPUs. The mode gets its name from the fact that addresses in real mode always correspond to real locations in memory. Real mode is characterized by a 20- bit s ...
in under 640 KB of RAM. OCR Systems released the
Windows
Windows is a Product lining, product line of Proprietary software, proprietary graphical user interface, graphical operating systems developed and marketed by Microsoft. It is grouped into families and subfamilies that cater to particular sec ...
-only version 3.0 in 1991 while offering version 2.01 alongside it. The company unveiled a sister product, ReadRight Personal, dedicated to
handheld scanner
An image scanner (often abbreviated to just scanner) is a device that optically scans images, printed text, handwriting, or an object and converts it to a digital image. The most common type of scanner used in the home and the office is the flatbe ...
s and for Windows only in October 1991. This version adds real-time scanning—each word is updated to the screen while lines are being scanned. ReadRight proper was later made a Windows-only product with version 3.1 in 1992.
The inclusion of ReadRight 2.0 with Canon's IX-12F flatbed scanner led ''
PC Magazine
''PC Magazine'' (shortened as ''PCMag'') is an American computer magazine published by Ziff Davis. A print edition was published from 1982 to January 2009. Publication of online editions started in late 1994 and continues .
Overview
''PC Mag ...
'' to award it an Editor's Choice rating in 1989. Despite this, reviewer Robert Kendall found qualification with ReadRight's ability to parse
proportional typefaces such as
Helvetica
Helvetica, also known by its original name Neue Haas Grotesk, is a widely-used sans-serif typeface developed in 1957 by Swiss typeface designer Max Miedinger and Eduard Hoffmann.
Helvetica is a neo-grotesque design, one influenced by the f ...
and
Times New Roman
Times New Roman is a serif typeface commissioned for use by the British newspaper ''The Times'' in 1931. It has become one of the most popular typefaces of all time and is installed on most personal computers. The typeface was conceived by Stanl ...
. Mitt Jones of the same publication found version 2.01 to have improved its ability to read such typefaces and praised its ease of use and low resource intensiveness. Jones disliked the inability to handle uneven page paragraph column widths and graphics, noting that the manual recommended the user block out graphics with a
Post-it Note
A Post-it note (or sticky note) is a small piece of paper with a re-adherable strip of glue on its back, made for temporarily attaching notes to documents and other surfaces. A low-adhesion, tack pressure-sensitive adhesive allows the notes to ...
.
Version 3.1 for Windows received mixed reviews. Mike Heck of ''
InfoWorld
''InfoWorld'' (''IW'') is an American information technology media business. Founded in 1978, it began as a monthly magazine. In 2007, it transitioned to a Web-only publication. Its parent company is International Data Group, and its sister pu ...
'' wrote that its "low cost and rich collection of features are hard to ignore" but rated its speed and accuracy average. Barry Simon of ''PC Magazine'' called it economical but inaccurate, unable to correct errors it did not detect, and found its spellchecker flawed and its speed lacking compared to Calera's WordScan Plus. Gary Berline of the same publication wrote that "ReadRight produced serviceable accuracy on clean files with simple layouts, but at a less than sprightly pace", finding it unable to process small type and multicolumn text with small margins between columns. The software also regularly interpreted graphical illustrations as text in his experience. OCR Systems announced a follow-up release promising to correcting these issues in July 1992, which never came to fruition on account of Adobe buying the company.
Citations
References
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
*
{{Adobe Inc.
Adobe Inc.
American companies established in 1969
American companies disestablished in 1992
Computer companies established in 1969
Computer companies disestablished in 1992
Defunct computer companies of the United States
Defunct computer hardware companies
Defunct software companies of the United States
Optical character recognition