HOME

TheInfoList



OR:

.doc (an abbreviation of "
document A document is a written, drawn, presented, or memorialized representation of thought, often the manifestation of non-fictional, as well as fictional, content. The word originates from the Latin ''Documentum'', which denotes a "teaching" o ...
") is a
filename extension A filename extension, file name extension or file extension is a suffix to the name of a computer file (e.g., .txt, .docx, .md). The extension indicates a characteristic of the file contents or its intended use. A filename extension is typically ...
used for
word processing A word is a basic element of language that carries an objective or practical meaning, can be used on its own, and is uninterruptible. Despite the fact that language speakers often have an intuitive grasp of what a word is, there is no consen ...
documents stored on
Microsoft Microsoft Corporation is an American multinational corporation, multinational technology company, technology corporation producing Software, computer software, consumer electronics, personal computers, and related services headquartered at th ...
's proprietary Microsoft Word Binary File Format. Microsoft has used the extension since 1983.


Microsoft Word Binary File Format

Binary DOC files often contain more
text formatting Typesetting is the composition of text by means of arranging physical ''type'' (or ''sort'') in mechanical systems or '' glyphs'' in digital systems representing '' characters'' (letters and other symbols).Dictionary.com Unabridged. Random ...
information (as well as
script Script may refer to: Writing systems * Script, a distinctive writing system, based on a repertoire of specific elements or symbols, or that repertoire * Script (styles of handwriting) ** Script typeface, a typeface with characteristics of ha ...
s and undo information) than some other
document file format A document file format is a text or binary file format for storing documents on a storage media, especially for use by computers. There currently exist a multitude of incompatible document file formats. Examples of XML-based open standards ar ...
s like
Rich Text Format ) As an example, the following RTF code would be rendered as follows: This is some bold text. Character encoding A standard RTF file can only consist of 7-bit ASCII characters, but can use escape sequences to encode other characters. T ...
and
Hypertext Markup Language The HyperText Markup Language or HTML is the standard markup language for documents designed to be displayed in a web browser. It can be assisted by technologies such as Cascading Style Sheets (CSS) and scripting languages such as JavaScr ...
, but are usually less widely compatible. The DOC files created with Microsoft Word versions differ. Microsoft Word versions before Word 97 ("8.0") used a different format from the OLE and CFBF-based Microsoft Word 97 – 2003. In
Microsoft Word 2007 Microsoft Office 2007 (codenamed Office 12) is an office suite for Windows, developed and published by Microsoft. It was officially revealed on March 9, 2006 and was the 12th version of Microsoft Office. It was released to manufacturing on Nove ...
and later, the binary file format was replaced as the default format by the
Office Open XML Office Open XML (also informally known as OOXML) is a zipped, XML-based file format developed by Microsoft for representing spreadsheets, charts, presentations and word processing documents. Ecma International standardized the initial versi ...
format, though Microsoft Word can still produce DOC files.


Application support

The DOC format is native to Microsoft Word. Other
word processor A word processor (WP) is a device or computer program that provides for input, editing, formatting, and output of text, often with some additional features. Word processor (electronic device), Early word processors were stand-alone devices ded ...
s, such as OpenOffice.org Writer, IBM Lotus Symphony, Apple Pages and AbiWord, can also create and read DOC files, although with some limitations.
Command line A command-line interpreter or command-line processor uses a command-line interface (CLI) to receive commands from a user in the form of lines of text. This provides a means of setting parameters for the environment, invoking executables and pro ...
programs for
Unix-like A Unix-like (sometimes referred to as UN*X or *nix) operating system is one that behaves in a manner similar to a Unix system, although not necessarily conforming to or being certified to any version of the Single UNIX Specification. A Unix-li ...
operating systems that can convert files from the DOC format to plain text or other standard formats include the wv library, which itself is used directly by AbiWord.


Specification

Because the DOC file format was a closed specification for many years, inconsistent handling of the format persists and may cause some loss of formatting information when handling the same file with multiple word processing programs. Some specifications for Microsoft Office 97 binary file formats were published in 1997 under a restrictive license, but these specifications were removed from online download in 1999. Specifications of later versions of Microsoft Office binary file formats were not publicly available. The DOC format specification was available from Microsoft on request since 2006 under restrictive
RAND-Z Reasonable and non-discriminatory (RAND) terms, also known as fair, reasonable, and non-discriminatory (FRAND) terms, denote a voluntary licensing commitment that standards organizations often request from the owner of an intellectual property ...
terms until February 2008.
Sun Microsystems Sun Microsystems, Inc. (Sun for short) was an American technology company that sold computers, computer components, software, and information technology services and created the Java programming language, the Solaris operating system, ZFS, ...
and OpenOffice.org reverse engineered the file format. On February 15, 2008, Microsoft released a .DOC format specification under the Microsoft Open Specification Promise. However, this specification does not describe all of the features used by DOC format and reverse engineered work remains necessary. Since 2008 the specification has been updated several times; the latest change was made in May 2022. The format used in earlier, pre-97 ("1.0" 1989 through "7.0" 1995) versions of Word are less known, but both OpenOffice and LibreOffice contain open-source code for reading these formats. The format is probably related to the "Stream" format found in similar Excel versions. Word 95 also seems to have an OLE-wrapped form


Other file formats

Some historical documentations may use the DOC filename extension for plain-text files, indicating
documentation Documentation is any communicable material that is used to describe, explain or instruct regarding some attributes of an object, system or procedure, such as its parts, assembly, installation, maintenance and use. As a form of knowledge manageme ...
for software or hardware. The DOC filename extension was also used during the 1980s by WordPerfect for its proprietary format. ''DOC'' is sometimes used by users of
Palm OS Palm OS (also known as Garnet OS) was a mobile operating system initially developed by Palm, Inc., for personal digital assistants (PDAs) in 1996. Palm OS was designed for ease of use with a touchscreen-based graphical user interface. It is provi ...
as shorthand for PalmDoc, an unrelated format (commonly using PDB filename extension) used to encode text files such as
ebook An ebook (short for electronic book), also known as an e-book or eBook, is a book publication made available in digital form, consisting of text, images, or both, readable on the flat-panel display of computers or other electronic devices. Al ...
s.


See also

* docx, the file format used by modern versions of Word * ''De facto'' standard *
Dominant design Dominant design is a technology management concept introduced by James M. Utterback and William J. Abernathy in 1975, identifying key technological features that become a de facto standard. A dominant design is the one that wins the allegiance of ...


References


External links


DOC
XLS, and PPT specifications
Microsoft Compound Document Format
- OpenOffice.org {{Office document file formats Computer file formats Microsoft Office