.doc (an abbreviation of "document") is a filename extension used for word processing documents stored in Microsoft's proprietary Microsoft Word Binary File Format; it was the primary format for Microsoft Word until the 2007 version replaced it with Office Open XML .docx files. Microsoft has used the extension since 1983.
Overview
Binary DOC files often contain more text formatting information (as well as scripts and undo information) than some other document file formats like Rich Text Format and Hypertext Markup Language, but are usually less widely compatible.
DOC files created by different versions of Microsoft Word differ: Word for Windows 1.0 and 2.0 use a format that was changed in the subsequent Word 6.0 and 95 ("7.0") releases. The format was changed once again into an OLE- and CFBF-based format used from Word 97 ("8.0") to Word 2003 ("11.0"). Word for MS-DOS used its own specific DOC format.
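The OLE/CFBF container used from Word 97 to 2003 can be recognized by a fixed 8-byte signature at the start of the file. The following is a minimal sketch in Python, assuming the well-known Compound File signature `D0 CF 11 E0 A1 B1 1A E1`. The function name `looks_like_cfbf` is hypothetical, and a match only indicates a Compound File container (which other formats such as XLS and PPT also use), not necessarily a Word document.

```python
# Signature that opens a Compound File Binary Format (CFBF) container.
CFBF_SIGNATURE = bytes.fromhex("D0CF11E0A1B11AE1")


def looks_like_cfbf(data: bytes) -> bool:
    """Return True if the data starts with the CFBF signature.

    This identifies the container only, not the Word-specific
    streams stored inside it.
    """
    return data[:8] == CFBF_SIGNATURE
```

To check a file on disk, it is enough to pass the first 8 bytes, for example `open(path, "rb").read(8)`.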
To allow users of Word 6.0 and Word 95 to open and read documents created in the newer (97–2003) format, Microsoft released the downloadable Word 97 Import Converter.
In Microsoft Word 2007 and later, the binary file format was replaced as the default format by the Office Open XML (.docx) format, though Microsoft Word can still produce and open DOC files. Microsoft released the Office Compatibility Pack, allowing users of Word 2003, 2002, 2000 and 97 to open files in the new format as well as edit and save them. Modern versions of Word can also open documents in the pre-97 DOC formats, but this must be manually enabled in the File Block Settings.
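Because .docx files are ZIP archives of XML parts while legacy DOC files are binary, the two can be told apart without relying on the filename extension. The following is a rough sketch, assuming the main document part is stored at `word/document.xml` (the usual location in files written by Word, though the package format permits other names). `is_probably_docx` is a hypothetical helper, not part of any library.

```python
import io
import zipfile


def is_probably_docx(data: bytes) -> bool:
    """Heuristic: a ZIP archive that contains word/document.xml."""
    buffer = io.BytesIO(data)
    if not zipfile.is_zipfile(buffer):
        return False
    buffer.seek(0)
    with zipfile.ZipFile(buffer) as archive:
        # Assumption: Word's conventional name for the main document part.
        return "word/document.xml" in archive.namelist()
```

A file with a .doc extension that passes this check is most likely an Office Open XML document saved under the older extension.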
Application support
The DOC format is native to Microsoft Word. Other word processors, such as OpenOffice.org Writer, IBM Lotus Symphony, Apple Pages and AbiWord, can also create and read DOC files, although with some limitations. Command-line programs for Unix-like operating systems that can convert files from the DOC format to plain text or other standard formats include the wv library, which itself is used directly by AbiWord.
Specification
Because the DOC file format was a closed specification for many years, inconsistent handling of the format persists and may cause some loss of formatting information when the same file is handled by multiple word processing programs. Some specifications for Microsoft Office 97 binary file formats were published in 1997 under a restrictive license, but these specifications were removed from online download in 1999. Specifications of later versions of Microsoft Office binary file formats were not publicly available. From 2006 until February 2008, the DOC format specification was available from Microsoft on request under restrictive RAND-Z terms. Sun Microsystems and OpenOffice.org reverse engineered the file format. On February 15, 2008, Microsoft released a .DOC format specification under the Microsoft Open Specification Promise. However, this specification does not describe all of the features used by the DOC format, and reverse-engineering work remains necessary. Since 2008 the specification has been updated several times; the latest change was made in May 2022.
The formats used in earlier, pre-97 versions of Word ("1.0" in 1989 through "7.0" in 1995) are less well known, but both OpenOffice and LibreOffice contain open-source code for reading them. These formats are probably related to the "Stream" format found in similar Excel versions. Word 95 also seems to have an OLE-wrapped form.
Other .doc file formats
Some historical documentation may use the DOC filename extension for plain-text files, indicating documentation for software or hardware. The DOC filename extension was also used during the 1980s by WordPerfect for its proprietary format.
''DOC'' is sometimes used by users of Palm OS as shorthand for PalmDoc, an unrelated format (commonly using the PDB filename extension) used to encode text files such as ebooks.
See also
* docx, the file format used by modern versions of Word
* ''De facto'' standard
* Dominant design
External links
* DOC, XLS, and PPT specifications
* Microsoft Compound Document Format - OpenOffice.org