SubRip

SubRip is a free software program for Microsoft Windows which extracts subtitles and their timings from various video formats to a text file. It is released under the GNU GPL. Its subtitle format uses the file extension .srt and is widely supported. An .srt file is a human-readable text file in which the subtitles are stored sequentially along with their timing information. Most subtitles distributed on the Internet are in this format.


SubRip software

Using optical character recognition, SubRip can extract subtitles from live video, video files and DVDs, then record the extracted subtitles and timings as a SubRip format text file. It can optionally save the recognized subtitles as bitmaps for later subtraction (erasure) from the source video. In practice, SubRip is configured with the correct codec for the video source, then trained by the user on the specific text area, fonts, styles, colors and video processing requirements needed to recognize subtitles (Zuggy, Guide). After trial and fine tuning, SubRip can automatically extract subtitles for the whole video source file during its playback. SubRip records the beginning and end times and the text of each subtitle in the output .srt text file. SubRip uses AviSynth to extract video frames from source video, and can rip subtitles from all video files supported by that program.


SubRip file format

The SubRip file format is described on the Matroska multimedia container format website as "perhaps the most basic of all subtitle formats." SubRip (SubRip Text) files are named with the extension .srt, and contain formatted lines of plain text in groups separated by a blank line. Subtitles are numbered sequentially, starting at 1. The timecode format used is hours:minutes:seconds,milliseconds with time units fixed to two zero-padded digits and fractions fixed to three zero-padded digits (00:00:00,000). The fractional separator used is the comma, since the program was written in France.

Each subtitle group consists of:

1. A numeric counter identifying each sequential subtitle
2. The time that the subtitle should appear on the screen, followed by --> and the time it should disappear
3. Subtitle text itself on one or more lines
4. A blank line containing no text, indicating the end of this subtitle

Example for Star Wars: Episode II – Attack of the Clones:
1
00:02:16,612 --> 00:02:19,376
Senator, we're making
our final approach into Coruscant.

2
00:02:19,482 --> 00:02:21,609
Very good, Lieutenant.

3
00:03:13,336 --> 00:03:15,167
We made it.

4
00:03:18,608 --> 00:03:20,371
I guess I was wrong.

5
00:03:20,476 --> 00:03:22,671
There was no danger at all.
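
The structure described above (counter, timing line, text lines, blank separator) can be read with a short parser. The following Python sketch is illustrative only and is not taken from the SubRip source code; the names Cue, timecode_to_ms, ms_to_timecode and parse_srt are hypothetical, and the input is assumed to be already decoded to a string.

```python
import re
from dataclasses import dataclass
from typing import List

# Timecode layout from the text: hours:minutes:seconds,milliseconds
TIMECODE = re.compile(r"(\d{2}):(\d{2}):(\d{2}),(\d{3})")
# Timing line: start timecode, "-->", end timecode (anything after is ignored)
TIMING_LINE = re.compile(
    r"(\d{2}:\d{2}:\d{2},\d{3})\s*-->\s*(\d{2}:\d{2}:\d{2},\d{3})"
)


@dataclass
class Cue:
    """One subtitle group (hypothetical container for this sketch)."""
    index: int
    start_ms: int
    end_ms: int
    text: str


def timecode_to_ms(timecode: str) -> int:
    """Convert 'HH:MM:SS,mmm' to a number of milliseconds."""
    match = TIMECODE.fullmatch(timecode.strip())
    if match is None:
        raise ValueError(f"not a SubRip timecode: {timecode!r}")
    hours, minutes, seconds, millis = (int(part) for part in match.groups())
    return ((hours * 60 + minutes) * 60 + seconds) * 1000 + millis


def ms_to_timecode(total_ms: int) -> str:
    """Convert milliseconds back to the zero-padded 'HH:MM:SS,mmm' form."""
    seconds, millis = divmod(total_ms, 1000)
    minutes, seconds = divmod(seconds, 60)
    hours, minutes = divmod(minutes, 60)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def parse_srt(content: str) -> List[Cue]:
    """Split decoded .srt text into cues; malformed groups are skipped."""
    cues = []
    # Drop a leading BOM character if present and normalise line endings.
    normalised = content.lstrip("\ufeff").replace("\r\n", "\n").strip()
    # Groups are separated by one or more blank lines.
    for block in re.split(r"\n\s*\n", normalised):
        lines = block.split("\n")
        if len(lines) < 2:
            continue
        timing = TIMING_LINE.match(lines[1].strip())
        if timing is None or not lines[0].strip().isdigit():
            continue
        cues.append(
            Cue(
                index=int(lines[0]),
                start_ms=timecode_to_ms(timing.group(1)),
                end_ms=timecode_to_ms(timing.group(2)),
                text="\n".join(lines[2:]),
            )
        )
    return cues
```

For the first subtitle in the example, timecode_to_ms("00:02:16,612") gives 136612. Storing times as integer milliseconds is a design choice of this sketch; it avoids floating-point rounding when shifting or comparing cues.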


Formatting

Unofficially the format has very basic text formatting, which can be either interpreted or passed through for rendering depending on the processing application. Formatting is derived from HTML tags for bold, italic, underline and color:

* Bold – <b>…</b>
* Italic – <i>…</i>
* Underline – <u>…</u>
* Font color – <font color="color name or #code">…</font> (as in HTML)
* Line position – X1:… X2:… Y1:… Y2:… after the timestamp would denote the text coordinates

Nested tags are allowed; some implementations prefer whole-line formatting only.
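
An application that does not interpret these tags may prefer to remove them before display. The following Python sketch shows one way to do that, assuming only the four tags listed above need handling; the name strip_formatting is hypothetical and the pattern is not drawn from any particular player.

```python
import re

# Matches only the tags listed above (<b>, <i>, <u>, <font ...>) and their
# closing forms. Other angle-bracket text is left untouched.
FORMAT_TAG = re.compile(r"</?(?:b|i|u|font)(?:\s[^>]*)?>", re.IGNORECASE)


def strip_formatting(text: str) -> str:
    """Return subtitle text with the basic formatting tags removed."""
    return FORMAT_TAG.sub("", text)
```

Because each tag is removed independently, nested tags such as <b><i>…</i></b> are handled without tracking nesting depth.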


Compatibility

The SubRip .srt file format is supported by most software video players. For Windows software video players that do not support subtitle playback directly, the VSFilter DirectX filter displays SubRip and other subtitle formats. The SubRip format is supported directly by many subtitle creation and editing tools, as well as some hardware home media players. In August 2008, YouTube added subtitle support to its Flash video player under the "Closed Captioning" option; content producers can upload subtitles in SubRip format.


Text encoding

SubRip's default output encoding is configured as Windows-1252. However, output options are also given for many Windows code pages as well as Unicode encodings, such as UTF-8 and UTF-16, with or without a byte order mark (BOM). Therefore, there is no de facto character encoding standard for .srt files, which means that any SubRip file parser must attempt to use charset detection. Unicode BOMs are typically used to aid detection.
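
A simple form of such detection checks for a BOM first and falls back to a guess. The Python sketch below is a minimal illustration under stated assumptions: the name decode_srt is hypothetical, only UTF-8 and UTF-16 BOMs are recognized, BOM-less input is tried as UTF-8 and then as Windows-1252, and UTF-16 without a BOM or other code pages are not detected. A real parser would likely need a fuller charset detector.

```python
import codecs

# BOM byte sequences and the codec that both honours and strips each one.
BOM_ENCODINGS = (
    (codecs.BOM_UTF8, "utf-8-sig"),
    (codecs.BOM_UTF16_LE, "utf-16"),
    (codecs.BOM_UTF16_BE, "utf-16"),
)


def decode_srt(raw: bytes, fallback: str = "cp1252") -> str:
    """Decode .srt bytes using a BOM if present, else a best-effort guess."""
    for bom, encoding in BOM_ENCODINGS:
        if raw.startswith(bom):
            return raw.decode(encoding)
    try:
        # Assumption: BOM-less text that is valid UTF-8 is treated as UTF-8.
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Windows-1252 is SubRip's default output encoding per the text;
        # bytes undefined in that code page become replacement characters.
        return raw.decode(fallback, errors="replace")
```

Trying UTF-8 before the single-byte fallback is a deliberate ordering: text that is not UTF-8 rarely decodes as valid UTF-8 by accident, whereas a single-byte code page accepts almost any byte sequence.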


SubViewer

In 1999, Brain created SubRip, and a friend, David Dolinski, created SubViewer, which he offered for download on his personal website. SubViewer was included in the DivX media player. On August 28, 2008, YouTube included support for SubViewer and SubRip, allowing existing videos to be retroactively subtitled.


WebVTT

A format originally called WebSRT (Web Subtitle Resource Tracks) was specified in 2010 by the Web Hypertext Application Technology Working Group for the proposed HTML5 <track> element. It shared the .srt file extension and was based on parts of the SubRip format, but was not fully compatible with it. The prospective format was later renamed WebVTT (Web Video Text Track). Google's Chrome and Microsoft's Internet Explorer 10 browsers were the first to support <track> tags with WebVTT files for HTML5 videos. Mozilla Firefox implemented WebVTT in its nightly builds (Firefox 24), and as of Firefox 31 (July 24, 2014), Mozilla enabled WebVTT on Firefox by default. YouTube began supporting WebVTT in April 2013.


See also

* Closed captioning
* Timed text
* Avidemux
* List of free television software
* MicroDVD
* SubStation Alpha
* Universal Subtitle Format
* DirectVobSub – able to extract subtitles from a DVD without first extracting the files from it


External links

* .SRT SubRip file format specification. Doom9. http://forum.doom9.org/showthread.php?p=470941#post470941 (accessed April 7, 2004). Derived from the SubRip source code.