HTML Tidy is a
console application for correcting invalid
HyperText Markup Language
The HyperText Markup Language or HTML is the standard markup language for documents designed to be displayed in a web browser. It can be assisted by technologies such as Cascading Style Sheets (CSS) and scripting languages such as JavaScr ...
(HTML), detecting potential
web accessibility errors, and for improving the layout and
indent style
In computer programming, an indentation style is a convention governing the indentation of blocks of code to convey program structure. This article largely addresses the free-form languages, such as C and its descendants, but can be (and oft ...
of the resulting markup. It is also a cross-platform
library
A library is a collection of materials, books or media that are accessible for use and not just for display purposes. A library provides physical (hard copies) or digital access (soft copies) materials, and may be a physical location or a vi ...
for
computer applications that provides HTML Tidy's features.
History
HTML Tidy was developed by
Dave Raggett
Dave Raggett is an English computer specialist who has played a major role in implementing the World Wide Web since 1992.
He has been a W3C Fellow at the World Wide Web Consortium since 1995 and worked on many of the key web protocols, including ...
of the
World Wide Web Consortium (W3C). Later it was released as a
SourceForge
SourceForge is a web service that offers software consumers a centralized online location to control and manage open-source software projects and research business software. It provides source code repository hosting, bug tracking, mirrori ...
project in 2003 and managed by various maintainers.
In 2012, the project was moved to
GitHub
GitHub, Inc. () is an Internet hosting service for software development and version control using Git. It provides the distributed version control of Git plus access control, bug tracking, software feature requests, task management, co ...
, and maintained by Michael Smith, also of
W3C, where
HTML5 support was added.
In 2015, the
HTML Tidy Advocacy Community Group (HTACG) was formed for management and development of HTML Tidy as a W3C Community Group.
HTML Tidy source code is written in
ANSI C for portability. Compiled binary files are available for a variety of platforms. It is available under the
W3C Software Notice and License, a permissive
BSD-style license. Up-to-date versions are available as source code cloned from its GitHub
Git version control repository, or in binary packages for multiple operating systems from its GitHub Releases repository.
Features
Examples of corrections to invalid or poorly constructed HTML:
* Reorder mixed-up tags
* Complete missing or mismatched end tags
* Add missing syntax elements (some tags, quotes, etc.)
* Report proprietary HTML extensions
* Change layout of markup to a predefined style
* Transform characters from some encodings into HTML entities
See also
*
Comparison of HTML parsers
*
CSSTidy
References
External links
HTML Tidy Project PageCurrent Source CodeOfficial Binary ReleasesHTML Tidy Project Page(legacy)
HTML Tidy serviceWas an online version of HTML Tidy at the W3C. The W3C online tidy service was decommissioned on 29 November, 2017.
{{HTML editors
Free software programmed in C
HTML editors
Software using the W3C license