
A programming language is a system of notation for writing
computer program
A computer program is a sequence or set of instructions in a programming language for a computer to Execution (computing), execute. It is one component of software, which also includes software documentation, documentation and other intangibl ...
s.
Programming languages are described in terms of their
syntax
In linguistics, syntax ( ) is the study of how words and morphemes combine to form larger units such as phrases and sentences. Central concerns of syntax include word order, grammatical relations, hierarchical sentence structure (constituenc ...
(form) and
semantics
Semantics is the study of linguistic Meaning (philosophy), meaning. It examines what meaning is, how words get their meaning, and how the meaning of a complex expression depends on its parts. Part of this process involves the distinction betwee ...
(meaning), usually defined by a
formal language. Languages usually provide features such as a
type system,
variables, and mechanisms for
error handling. An
implementation
Implementation is the realization of an application, execution of a plan, idea, scientific modelling, model, design, specification, Standardization, standard, algorithm, policy, or the Management, administration or management of a process or Goal ...
of a programming language is required in order to
execute programs, namely an
interpreter or a
compiler
In computing, a compiler is a computer program that Translator (computing), translates computer code written in one programming language (the ''source'' language) into another language (the ''target'' language). The name "compiler" is primaril ...
. An interpreter directly executes the source code, while a
compiler
In computing, a compiler is a computer program that Translator (computing), translates computer code written in one programming language (the ''source'' language) into another language (the ''target'' language). The name "compiler" is primaril ...
produces an
executable
In computer science, executable code, an executable file, or an executable program, sometimes simply referred to as an executable or binary, causes a computer "to perform indicated tasks according to encoded instruction (computer science), in ...
program.
Computer architecture has strongly influenced the design of programming languages, with the most common type (
imperative languages—which implement operations in a specified order) developed to perform well on the popular
von Neumann architecture. While early programming languages were closely tied to the
hardware, over time they have developed more
abstraction
Abstraction is a process where general rules and concepts are derived from the use and classifying of specific examples, literal (reality, real or Abstract and concrete, concrete) signifiers, first principles, or other methods.
"An abstraction" ...
to hide implementation details for greater simplicity.
Thousands of programming languages—often classified as imperative,
functional,
logic
Logic is the study of correct reasoning. It includes both formal and informal logic. Formal logic is the study of deductively valid inferences or logical truths. It examines how conclusions follow from premises based on the structure o ...
, or
object-oriented—have been developed for a wide variety of uses. Many aspects of programming language design involve tradeoffs—for example,
exception handling simplifies error handling, but at a performance cost.
Programming language theory
Programming language theory (PLT) is a branch of computer science that deals with the design, implementation, analysis, characterization, and classification of formal languages known as programming languages. Programming language theory is clos ...
is the subfield of
computer science
Computer science is the study of computation, information, and automation. Computer science spans Theoretical computer science, theoretical disciplines (such as algorithms, theory of computation, and information theory) to Applied science, ...
that studies the design, implementation, analysis, characterization, and classification of programming languages.
Definitions
Programming languages differ from
natural languages in that natural languages are used for interaction between people, while programming languages are designed to allow humans to communicate instructions to machines.
The term ''
computer language
A computer language is a formal language used to communicate with a computer. Types of computer languages include:
* Software construction#Construction languages, Construction language – all forms of communication by which a human can Comput ...
'' is sometimes used interchangeably with "programming language". However, usage of these terms varies among authors.
In one usage, programming languages are described as a subset of computer languages. Similarly, the term "computer language" may be used in contrast to the term "programming language" to describe languages used in computing but not considered programming languages. Most practical programming languages are Turing complete,
and as such are equivalent in what programs they can compute.
Another usage regards programming languages as theoretical constructs for programming
abstract machines and computer languages as the subset thereof that runs on physical computers, which have finite hardware resources.
John C. Reynolds emphasizes that
formal specification languages are just as much programming languages as are the languages intended for execution. He also argues that textual and even graphical input formats that affect the behavior of a computer are programming languages, despite the fact they are commonly not Turing-complete, and remarks that ignorance of programming language concepts is the reason for many flaws in input formats.
History
Early developments
The first programmable computers were invented at the end of the 1940s, and with them, the first programming languages. The earliest computers were programmed in
first-generation programming languages (1GLs),
machine language (simple instructions that could be directly executed by the processor). This code was very difficult to debug and was not
portable between different computer systems. In order to improve the ease of programming,
assembly languages (or
second-generation programming languages—2GLs) were invented, diverging from the machine language to make programs easier to understand for humans, although they did not increase portability.
Initially, hardware resources were scarce and expensive, while
human resources
Human resources (HR) is the set of people who make up the workforce of an organization, business sector, industry, or economy. A narrower concept is human capital, the knowledge and skills which the individuals command. Similar terms include ' ...
were cheaper. Therefore, cumbersome languages that were time-consuming to use, but were closer to the hardware for higher efficiency were favored. The introduction of
high-level programming language
A high-level programming language is a programming language with strong Abstraction (computer science), abstraction from the details of the computer. In contrast to low-level programming languages, it may use natural language ''elements'', be ea ...
s (
third-generation programming languages—3GLs)—revolutionized programming. These languages
abstracted away the details of the hardware, instead being designed to express algorithms that could be understood more easily by humans. For example, arithmetic expressions could now be written in symbolic notation and later translated into machine code that the hardware could execute. In 1957,
Fortran (FORmula TRANslation) was invented. Often considered the first
compiled high-level programming language, Fortran has remained in use into the twenty-first century.
1960s and 1970s

Around 1960, the first
mainframes—general purpose computers—were developed, although they could only be operated by professionals and the cost was extreme. The data and instructions were input by
punch cards, meaning that no input could be added while the program was running. The languages developed at this time therefore are designed for minimal interaction. After the invention of the
microprocessor
A microprocessor is a computer processor (computing), processor for which the data processing logic and control is included on a single integrated circuit (IC), or a small number of ICs. The microprocessor contains the arithmetic, logic, a ...
, computers in the 1970s became dramatically cheaper. New computers also allowed more user interaction, which was supported by newer programming languages.
Lisp
Lisp (historically LISP, an abbreviation of "list processing") is a family of programming languages with a long history and a distinctive, fully parenthesized Polish notation#Explanation, prefix notation.
Originally specified in the late 1950s, ...
, implemented in 1958, was the first
functional programming
In computer science, functional programming is a programming paradigm where programs are constructed by Function application, applying and Function composition (computer science), composing Function (computer science), functions. It is a declarat ...
language. Unlike Fortran, it supported
recursion
Recursion occurs when the definition of a concept or process depends on a simpler or previous version of itself. Recursion is used in a variety of disciplines ranging from linguistics to logic. The most common application of recursion is in m ...
and
conditional expressions, and it also introduced
dynamic memory management on a
heap and automatic
garbage collection. For the next decades, Lisp dominated
artificial intelligence
Artificial intelligence (AI) is the capability of computer, computational systems to perform tasks typically associated with human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making. It is a field of re ...
applications. In 1978, another functional language,
ML, introduced
inferred types and polymorphic
parameters.
After
ALGOL (ALGOrithmic Language) was released in 1958 and 1960, it became the standard in computing literature for describing
algorithm
In mathematics and computer science, an algorithm () is a finite sequence of Rigour#Mathematics, mathematically rigorous instructions, typically used to solve a class of specific Computational problem, problems or to perform a computation. Algo ...
s. Although its commercial success was limited, most popular imperative languages—including
C,
Pascal,
Ada,
C++,
Java
Java is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea (a part of Pacific Ocean) to the north. With a population of 156.9 million people (including Madura) in mid 2024, proje ...
, and
C#—are directly or indirectly descended from ALGOL 60. Among its innovations adopted by later programming languages included greater portability and the first use of
context-free,
BNF grammar.
Simula
Simula is the name of two simulation programming languages, Simula I and Simula 67, developed in the 1960s at the Norwegian Computing Center in Oslo, by Ole-Johan Dahl and Kristen Nygaard. Syntactically, it is an approximate superset of AL ...
, the first language to support
object-oriented programming
Object-oriented programming (OOP) is a programming paradigm based on the concept of '' objects''. Objects can contain data (called fields, attributes or properties) and have actions they can perform (called procedures or methods and impl ...
(including
subtypes,
dynamic dispatch, and
inheritance
Inheritance is the practice of receiving private property, titles, debts, entitlements, privileges, rights, and obligations upon the death of an individual. The rules of inheritance differ among societies and have changed over time. Offi ...
), also descends from ALGOL and achieved commercial success. C, another ALGOL descendant, has sustained popularity into the twenty-first century. C allows access to lower-level machine operations more than other contemporary languages. Its power and efficiency, generated in part with flexible
pointer operations, comes at the cost of making it more difficult to write correct code.
Prolog
Prolog is a logic programming language that has its origins in artificial intelligence, automated theorem proving, and computational linguistics.
Prolog has its roots in first-order logic, a formal logic. Unlike many other programming language ...
, designed in 1972, was the first
logic programming language, communicating with a computer using formal logic notation. With logic programming, the programmer specifies a desired result and allows the
interpreter to decide how to achieve it.
1980s to 2000s

During the 1980s, the invention of the
personal computer
A personal computer, commonly referred to as PC or computer, is a computer designed for individual use. It is typically used for tasks such as Word processor, word processing, web browser, internet browsing, email, multimedia playback, and PC ...
transformed the roles for which programming languages were used. New languages introduced in the 1980s included C++, a
superset of C that can compile C programs but also supports
classes and
inheritance
Inheritance is the practice of receiving private property, titles, debts, entitlements, privileges, rights, and obligations upon the death of an individual. The rules of inheritance differ among societies and have changed over time. Offi ...
.
Ada and other new languages introduced support for
concurrency. The Japanese government invested heavily into the so-called
fifth-generation languages that added support for concurrency to logic programming constructs, but these languages were outperformed by other concurrency-supporting languages.
Due to the rapid growth of the
Internet
The Internet (or internet) is the Global network, global system of interconnected computer networks that uses the Internet protocol suite (TCP/IP) to communicate between networks and devices. It is a internetworking, network of networks ...
and the
World Wide Web
The World Wide Web (WWW or simply the Web) is an information system that enables Content (media), content sharing over the Internet through user-friendly ways meant to appeal to users beyond Information technology, IT specialists and hobbyis ...
in the 1990s, new programming languages were introduced to support
Web pages and
networking.
Java
Java is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea (a part of Pacific Ocean) to the north. With a population of 156.9 million people (including Madura) in mid 2024, proje ...
, based on C++ and designed for increased portability across systems and security, enjoyed large-scale success because these features are essential for many Internet applications. Another development was that of
dynamically typed scripting languages—
Python,
JavaScript
JavaScript (), often abbreviated as JS, is a programming language and core technology of the World Wide Web, alongside HTML and CSS. Ninety-nine percent of websites use JavaScript on the client side for webpage behavior.
Web browsers have ...
,
PHP, and
Ruby—designed to quickly produce small programs that coordinate existing
applications. Due to their integration with
HTML
Hypertext Markup Language (HTML) is the standard markup language for documents designed to be displayed in a web browser. It defines the content and structure of web content. It is often assisted by technologies such as Cascading Style Sheets ( ...
, they have also been used for building web pages hosted on
servers.
2000s to present
During the 2000s, there was a slowdown in the development of new programming languages that achieved widespread popularity. One innovation was
service-oriented programming, designed to exploit
distributed systems whose components are connected by a network. Services are similar to objects in object-oriented programming, but run on a separate process.
C# and
F# cross-pollinated ideas between imperative and functional programming. After 2010, several new languages—
Rust,
Go,
Swift,
Zig and
Carbon
Carbon () is a chemical element; it has chemical symbol, symbol C and atomic number 6. It is nonmetallic and tetravalence, tetravalent—meaning that its atoms are able to form up to four covalent bonds due to its valence shell exhibiting 4 ...
—competed for the performance-critical software for which C had historically been used. Most of the new programming languages use
static typing while a few numbers of new languages use
dynamic typing like
Ring and
Julia.
Some of the new programming languages are classified as
visual programming languages like
Scratch,
LabVIEW and
PWCT. Also, some of these languages mix between textual and visual programming usage like
Ballerina. Also, this trend lead to developing projects that help in developing new VPLs like
Blockly by
Google
Google LLC (, ) is an American multinational corporation and technology company focusing on online advertising, search engine technology, cloud computing, computer software, quantum computing, e-commerce, consumer electronics, and artificial ...
. Many game engines like
Unreal and
Unity added support for visual scripting too.
Elements
Every programming language includes fundamental elements for describing data and the operations or transformations applied to them, such as adding two numbers or selecting an item from a collection. These elements are governed by syntactic and semantic rules that define their structure and meaning, respectively.
Syntax

A programming language's surface form is known as its
syntax
In linguistics, syntax ( ) is the study of how words and morphemes combine to form larger units such as phrases and sentences. Central concerns of syntax include word order, grammatical relations, hierarchical sentence structure (constituenc ...
. Most programming languages are purely textual; they use sequences of text including words, numbers, and punctuation, much like written natural languages. On the other hand, some programming languages are
graphical, using visual relationships between symbols to specify a program.
The syntax of a language describes the possible combinations of symbols that form a syntactically correct program. The meaning given to a combination of symbols is handled by semantics (either
formal or hard-coded in a
reference implementation). Since most languages are textual, this article discusses textual syntax.
The programming language syntax is usually defined using a combination of
regular expressions (for
lexical structure) and
Backus–Naur form
In computer science, Backus–Naur form (BNF, pronounced ), also known as Backus normal form, is a notation system for defining the Syntax (programming languages), syntax of Programming language, programming languages and other Formal language, for ...
(for
grammatical structure). Below is a simple grammar, based on
Lisp
Lisp (historically LISP, an abbreviation of "list processing") is a family of programming languages with a long history and a distinctive, fully parenthesized Polish notation#Explanation, prefix notation.
Originally specified in the late 1950s, ...
:
expression ::= atom , list
atom ::= number , symbol
number ::= - 0'-'9'
symbol ::= A'-'Z''a'-'z'*
list ::= '(' expression* ')'
This grammar specifies the following:
* an ''expression'' is either an ''atom'' or a ''list'';
* an ''atom'' is either a ''number'' or a ''symbol'';
* a ''number'' is an unbroken sequence of one or more decimal digits, optionally preceded by a plus or minus sign;
* a ''symbol'' is a letter followed by zero or more of any alphabetical characters (excluding whitespace); and
* a ''list'' is a matched pair of parentheses, with zero or more ''expressions'' inside it.
The following are examples of well-formed token sequences in this grammar:
12345
,
()
and
(a b c232 (1))
.
Not all syntactically correct programs are semantically correct. Many syntactically correct programs are nonetheless ill-formed, per the language's rules; and may (depending on the language specification and the soundness of the implementation) result in an error on translation or execution. In some cases, such programs may exhibit
undefined behavior. Even when a program is well-defined within a language, it may still have a meaning that is not intended by the person who wrote it.
Using
natural language as an example, it may not be possible to assign a meaning to a grammatically correct sentence or the sentence may be false:
* "
Colorless green ideas sleep furiously." is grammatically well-formed but has no generally accepted meaning.
* "John is a married bachelor." is grammatically
well-formed but expresses a meaning that cannot be true.
The following
C language fragment is syntactically correct, but performs operations that are not semantically defined (the operation
*p >> 4
has no meaning for a value having a complex type and
p->im
is not defined because the value of
p
is the
null pointer
In computing, a null pointer (sometimes shortened to nullptr or null) or null reference is a value saved for indicating that the Pointer (computer programming), pointer or reference (computer science), reference does not refer to a valid Object (c ...
):
complex *p = NULL;
complex abs_p = sqrt(*p >> 4 + p->im);
If the
type declaration on the first line were omitted, the program would trigger an error on the undefined variable
p
during compilation. However, the program would still be syntactically correct since type declarations provide only semantic information.
The grammar needed to specify a programming language can be classified by its position in the
Chomsky hierarchy. The syntax of most programming languages can be specified using a Type-2 grammar, i.e., they are
context-free grammars. Some languages, including Perl and Lisp, contain constructs that allow execution during the parsing phase. Languages that have constructs that allow the programmer to alter the behavior of the parser make syntax analysis an
undecidable problem, and generally blur the distinction between parsing and execution. In contrast to
Lisp's macro system and Perl's
BEGIN
blocks, which may contain general computations, C macros are merely string replacements and do not require code execution.
Semantics
The term
''semantics'' refers to the meaning of languages, as opposed to their form (
syntax
In linguistics, syntax ( ) is the study of how words and morphemes combine to form larger units such as phrases and sentences. Central concerns of syntax include word order, grammatical relations, hierarchical sentence structure (constituenc ...
).
Static semantics
Static semantics defines restrictions on the structure of valid texts that are hard or impossible to express in standard syntactic formalisms.
For compiled languages, static semantics essentially include those semantic rules that can be checked at compile time. Examples include checking that every
identifier is declared before it is used (in languages that require such declarations) or that the labels on the arms of a
case statement are distinct. Many important restrictions of this type, like checking that identifiers are used in the appropriate context (e.g. not adding an integer to a function name), or that
subroutine calls have the appropriate number and type of arguments, can be enforced by defining them as rules in a
logic
Logic is the study of correct reasoning. It includes both formal and informal logic. Formal logic is the study of deductively valid inferences or logical truths. It examines how conclusions follow from premises based on the structure o ...
called a
type system. Other forms of
static analyses like
data flow analysis may also be part of static semantics. Programming languages such as
Java
Java is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea (a part of Pacific Ocean) to the north. With a population of 156.9 million people (including Madura) in mid 2024, proje ...
and
C# have
definite assignment analysis, a form of data flow analysis, as part of their respective static semantics.
Dynamic semantics
Once data has been specified, the machine must be instructed to perform operations on the data. For example, the semantics may define the
strategy
Strategy (from Greek στρατηγία ''stratēgia'', "troop leadership; office of general, command, generalship") is a general plan to achieve one or more long-term or overall goals under conditions of uncertainty. In the sense of the " a ...
by which expressions are evaluated to values, or the manner in which
control structures conditionally execute
statements. The ''dynamic semantics'' (also known as ''execution semantics'') of a language defines how and when the various constructs of a language should produce a program behavior. There are many ways of defining execution semantics. Natural language is often used to specify the execution semantics of languages commonly used in practice. A significant amount of academic research goes into
formal semantics of programming languages
In programming language theory, semantics is the rigorous mathematical study of the meaning of programming languages. Semantics assigns computational meaning to valid string (computer science), strings in a programming language syntax. It is cl ...
, which allows execution semantics to be specified in a formal manner. Results from this field of research have seen limited application to programming language design and implementation outside academia.
Type system
A
data type is a set of allowable values and operations that can be performed on these values. Each programming language's
type system defines which data types exist, the type of an
expression, and how
type equivalence and
type compatibility function in the language.
According to
type theory
In mathematics and theoretical computer science, a type theory is the formal presentation of a specific type system. Type theory is the academic study of type systems.
Some type theories serve as alternatives to set theory as a foundation of ...
, a language is fully typed if the specification of every operation defines types of data to which the operation is applicable.
In contrast, an untyped language, such as most
assembly languages, allows any operation to be performed on any data, generally sequences of bits of various lengths.
In practice, while few languages are fully typed, most offer a degree of typing.
Because different types (such as
integer
An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...
s and
floats) represent values differently, unexpected results will occur if one type is used when another is expected.
Type checking will flag this error, usually at
compile time (runtime type checking is more costly). With
strong typing,
type errors can always be detected unless variables are explicitly
cast to a different type.
Weak typing occurs when languages allow implicit casting—for example, to enable operations between variables of different types without the programmer making an explicit type conversion. The more cases in which this
type coercion is allowed, the fewer type errors can be detected.
Commonly supported types
Early programming languages often supported only built-in, numeric types such as the
integer
An integer is the number zero (0), a positive natural number (1, 2, 3, ...), or the negation of a positive natural number (−1, −2, −3, ...). The negations or additive inverses of the positive natural numbers are referred to as negative in ...
(signed and unsigned) and
floating point (to support operations on
real number
In mathematics, a real number is a number that can be used to measure a continuous one- dimensional quantity such as a duration or temperature. Here, ''continuous'' means that pairs of values can have arbitrarily small differences. Every re ...
s that are not integers). Most programming languages support multiple sizes of floats (often called
float and
double) and integers depending on the size and precision required by the programmer. Storing an integer in a type that is too small to represent it leads to
integer overflow. The most common way of representing negative numbers with signed types is
twos complement, although
ones complement is also used. Other common types include
Boolean—which is either true or false—and
character—traditionally one
byte, sufficient to represent all
ASCII
ASCII ( ), an acronym for American Standard Code for Information Interchange, is a character encoding standard for representing a particular set of 95 (English language focused) printable character, printable and 33 control character, control c ...
characters.
Arrays
An array is a systematic arrangement of similar objects, usually in rows and columns.
Things called an array include:
{{TOC right
Music
* In twelve-tone and serial composition, the presentation of simultaneous twelve-tone sets such that the ...
are a data type whose elements, in many languages, must consist of a single type of fixed length. Other languages define arrays as references to data stored elsewhere and support elements of varying types. Depending on the programming language, sequences of multiple characters, called
strings, may be supported as arrays of characters or their own
primitive type. Strings may be of fixed or variable length, which enables greater flexibility at the cost of increased storage space and more complexity. Other data types that may be supported include
lists,
associative (unordered) arrays accessed via keys,
records in which data is mapped to names in an ordered structure, and
tuples—similar to records but without names for data fields.
Pointers store memory addresses, typically referencing locations on the
heap where other data is stored.
The simplest
user-defined type is an
ordinal type, often called an
enumeration, whose values can be mapped onto the set of positive integers. Since the mid-1980s, most programming languages also support
abstract data types, in which the representation of the data and operations are
hidden from the user, who can only access an
interface. The benefits of
data abstraction can include increased reliability, reduced complexity, less potential for
name collision, and allowing the underlying
data structure
In computer science, a data structure is a data organization and storage format that is usually chosen for Efficiency, efficient Data access, access to data. More precisely, a data structure is a collection of data values, the relationships amo ...
to be changed without the client needing to alter its code.
Static and dynamic typing
In
static typing, all expressions have their types determined before a program executes, typically at compile-time.
Most widely used, statically typed programming languages require the types of variables to be specified explicitly. In some languages, types are implicit; one form of this is when the compiler can
infer types based on context. The downside of
implicit typing is the potential for errors to go undetected. Complete type inference has traditionally been associated with functional languages such as
Haskell and
ML.
With dynamic typing, the type is not attached to the variable but only the value encoded in it. A single variable can be reused for a value of a different type. Although this provides more flexibility to the programmer, it is at the cost of lower reliability and less ability for the programming language to check for errors. Some languages allow variables of a
union type to which any type of value can be assigned, in an exception to their usual static typing rules.
Concurrency
In computing, multiple instructions can be executed simultaneously. Many programming languages support instruction-level and subprogram-level concurrency. By the twenty-first century, additional processing power on computers was increasingly coming from the use of additional processors, which requires programmers to design software that makes use of multiple processors simultaneously to achieve improved performance.
Interpreted language
In computer science, an interpreter is a computer program that directly executes instructions written in a programming or scripting language, without requiring them previously to have been compiled into a machine language program. An inter ...
s such as
Python and
Ruby do not support the concurrent use of multiple processors. Other programming languages do support managing data shared between different threads by controlling the order of execution of key instructions via the use of
semaphores, controlling access to shared data via
monitor, or enabling
message passing between threads.
Exception handling
Many programming languages include exception handlers, a section of code triggered by
runtime errors that can deal with them in two main ways:
*Termination: shutting down and handing over control to the
operating system
An operating system (OS) is system software that manages computer hardware and software resources, and provides common daemon (computing), services for computer programs.
Time-sharing operating systems scheduler (computing), schedule tasks for ...
. This option is considered the simplest.
*Resumption: resuming the program near where the exception occurred. This can trigger a repeat of the exception, unless the exception handler is able to modify values to prevent the exception from reoccurring.
Some programming languages support dedicating a block of code to run regardless of whether an exception occurs before the code is reached; this is called finalization.
There is a tradeoff between increased ability to handle exceptions and reduced performance. For example, even though array index errors are common C does not check them for performance reasons. Although programmers can write code to catch user-defined exceptions, this can clutter a program. Standard libraries in some languages, such as C, use their return values to indicate an exception. Some languages and their compilers have the option of turning on and off error handling capability, either temporarily or permanently.
Design and implementation
One of the most important influences on programming language design has been
computer architecture.
Imperative languages, the most commonly used type, were designed to perform well on
von Neumann architecture, the most common computer architecture. In von Neumann architecture, the
memory
Memory is the faculty of the mind by which data or information is encoded, stored, and retrieved when needed. It is the retention of information over time for the purpose of influencing future action. If past events could not be remembe ...
stores both data and instructions, while the
CPU that performs instructions on data is separate, and data must be piped back and forth to the CPU. The central elements in these languages are variables,
assignment, and
iteration
Iteration is the repetition of a process in order to generate a (possibly unbounded) sequence of outcomes. Each repetition of the process is a single iteration, and the outcome of each iteration is then the starting point of the next iteration.
...
, which is more efficient than
recursion
Recursion occurs when the definition of a concept or process depends on a simpler or previous version of itself. Recursion is used in a variety of disciplines ranging from linguistics to logic. The most common application of recursion is in m ...
on these machines.
Many programming languages have been designed from scratch, altered to meet new needs, and combined with other languages. Many have eventually fallen into disuse. The birth of programming languages in the 1950s was stimulated by the desire to make a universal programming language suitable for all machines and uses, avoiding the need to write code for different computers. By the early 1960s, the idea of a universal language was rejected due to the differing requirements of the variety of purposes for which code was written.
Tradeoffs
Desirable qualities of programming languages include readability, writability, and reliability. These features can reduce the cost of training programmers in a language, the amount of time needed to write and maintain programs in the language, the cost of compiling the code, and increase runtime performance.
*Although early programming languages often prioritized efficiency over readability, the latter has grown in importance since the 1970s. Having multiple operations to achieve the same result can be detrimental to readability, as is
overloading operators, so that the same operator can have multiple meanings. Another feature important to readability is
orthogonality, limiting the number of constructs that a programmer has to learn. A syntax structure that is easily understood and
special words that are immediately obvious also supports readability.
*Writability is the ease of use for writing code to solve the desired problem. Along with the same features essential for readability,
abstraction
Abstraction is a process where general rules and concepts are derived from the use and classifying of specific examples, literal (reality, real or Abstract and concrete, concrete) signifiers, first principles, or other methods.
"An abstraction" ...
—interfaces that enable hiding details from the client—and
expressivity—enabling more concise programs—additionally help the programmer write code. The earliest programming languages were tied very closely to the underlying hardware of the computer, but over time support for abstraction has increased, allowing programmers express ideas that are more remote from simple translation into underlying hardware instructions. Because programmers are less tied to the complexity of the computer, their programs can do more computing with less effort from the programmer. Most programming languages come with a
standard library of commonly used functions.
*Reliability means that a program performs as specified in a wide range of circumstances.
Type checking,
exception handling, and restricted
aliasing (multiple variable names accessing the same region of memory) all can improve a program's reliability.
Programming language design often involves tradeoffs. For example, features to improve reliability typically come at the cost of performance. Increased expressivity due to a large number of operators makes writing code easier but comes at the cost of readability.
Natural-language programming has been proposed as a way to eliminate the need for a specialized language for programming. However, this goal remains distant and its benefits are open to debate.
Edsger W. Dijkstra
Edsger Wybe Dijkstra ( ; ; 11 May 1930 – 6 August 2002) was a Dutch computer scientist, programmer, software engineer, mathematician, and science essayist.
Born in Rotterdam in the Netherlands, Dijkstra studied mathematics and physics and the ...
took the position that the use of a formal language is essential to prevent the introduction of meaningless constructs.
Alan Perlis was similarly dismissive of the idea.
Specification
The specification of a programming language is an artifact that the language
users and the
implementors can use to agree upon whether a piece of
source code
In computing, source code, or simply code or source, is a plain text computer program written in a programming language. A programmer writes the human readable source code to control the behavior of a computer.
Since a computer, at base, only ...
is a valid
program in that language, and if so what its behavior shall be.
A programming language specification can take several forms, including the following:
* An explicit definition of the syntax, static semantics, and execution semantics of the language. While syntax is commonly specified using a formal grammar, semantic definitions may be written in
natural language (e.g., as in the
C language), or a
formal semantics (e.g., as in
Standard ML and
Scheme specifications).
* A description of the behavior of a
translator for the language (e.g., the
C++ and
Fortran specifications). The syntax and semantics of the language have to be inferred from this description, which may be written in natural or formal language.
* A
''reference'' or ''model'' implementation, sometimes
written in the language being specified (e.g.,
Prolog
Prolog is a logic programming language that has its origins in artificial intelligence, automated theorem proving, and computational linguistics.
Prolog has its roots in first-order logic, a formal logic. Unlike many other programming language ...
or
ANSI REXX). The syntax and semantics of the language are explicit in the behavior of the reference implementation.
Implementation
An implementation of a programming language is the conversion of a program into
machine code that can be executed by the hardware. The machine code then can be executed with the help of the
operating system
An operating system (OS) is system software that manages computer hardware and software resources, and provides common daemon (computing), services for computer programs.
Time-sharing operating systems scheduler (computing), schedule tasks for ...
. The most common form of interpretation in
production code is by a
compiler
In computing, a compiler is a computer program that Translator (computing), translates computer code written in one programming language (the ''source'' language) into another language (the ''target'' language). The name "compiler" is primaril ...
, which translates the source code via an intermediate-level language into machine code, known as an
executable
In computer science, executable code, an executable file, or an executable program, sometimes simply referred to as an executable or binary, causes a computer "to perform indicated tasks according to encoded instruction (computer science), in ...
. Once the program is compiled, it will run more quickly than with other implementation methods. Some compilers are able to provide further
optimization
Mathematical optimization (alternatively spelled ''optimisation'') or mathematical programming is the selection of a best element, with regard to some criteria, from some set of available alternatives. It is generally divided into two subfiel ...
to reduce memory or computation usage when the executable runs, but increasing compilation time.
Another implementation method is to run the program with an
interpreter, which translates each line of software into machine code just before it executes. Although it can make debugging easier, the downside of interpretation is that it runs 10 to 100 times slower than a compiled executable. Hybrid interpretation methods provide some of the benefits of compilation and some of the benefits of interpretation via partial compilation. One form this takes is
just-in-time compilation, in which the software is compiled ahead of time into an intermediate language, and then into machine code immediately before execution.
Proprietary languages
Although most of the most commonly used programming languages have fully open specifications and implementations, many programming languages exist only as proprietary programming languages with the implementation available only from a single vendor, which may claim that such a proprietary language is their intellectual property. Proprietary programming languages are commonly
domain-specific languages or internal
scripting language
In computing, a script is a relatively short and simple set of instructions that typically automation, automate an otherwise manual process. The act of writing a script is called scripting. A scripting language or script language is a programming ...
s for a single product; some proprietary languages are used only internally within a vendor, while others are available to external users.
Some programming languages exist on the border between proprietary and open; for example,
Oracle Corporation asserts proprietary rights to some aspects of the
Java programming language
Java is a high-level, general-purpose, memory-safe, object-oriented programming language. It is intended to let programmers ''write once, run anywhere'' ( WORA), meaning that compiled Java code can run on all platforms that support Jav ...
, and
Microsoft
Microsoft Corporation is an American multinational corporation and technology company, technology conglomerate headquartered in Redmond, Washington. Founded in 1975, the company became influential in the History of personal computers#The ear ...
's
C# programming language, which has open implementations of most parts of the system, also has
Common Language Runtime (CLR) as a closed environment.
Many proprietary languages are widely used, in spite of their proprietary nature; examples include
MATLAB,
VBScript, and
Wolfram Language. Some languages may make the transition from closed to open; for example,
Erlang was originally Ericsson's internal programming language.
Open source programming languages are particularly helpful for
open science
Open science is the movement to make scientific research (including publications, data, physical samples, and software) and its dissemination accessible to all levels of society, amateur or professional. Open science is transparent and accessib ...
applications, enhancing the capacity for
replication and code sharing.
Use
Thousands of different programming languages have been created, mainly in the computing field.
Individual software projects commonly use five programming languages or more.
Programming languages differ from most other forms of human expression in that they require a greater degree of precision and completeness. When using a natural language to communicate with other people, human authors and speakers can be ambiguous and make small errors, and still expect their intent to be understood. However, figuratively speaking, computers "do exactly what they are told to do", and cannot "understand" what code the programmer intended to write. The combination of the language definition, a program, and the program's inputs must fully specify the external behavior that occurs when the program is executed, within the domain of control of that program. On the other hand, ideas about an algorithm can be communicated to humans without the precision required for execution by using
pseudocode, which interleaves natural language with code written in a programming language.
A programming language provides a structured mechanism for defining pieces of data, and the operations or transformations that may be carried out automatically on that data. A
programmer uses the
abstractions present in the language to represent the concepts involved in a computation. These concepts are represented as a collection of the simplest elements available (called
primitives). ''
Programming'' is the process by which programmers combine these primitives to compose new programs, or adapt existing ones to new uses or a changing environment.
Programs for a computer might be
executed in a
batch process without any human interaction, or a user might type
commands in an
interactive session of an
interpreter. In this case the "commands" are simply programs, whose execution is chained together. When a language can run its commands through an interpreter (such as a
Unix shell
A Unix shell is a Command-line_interface#Command-line_interpreter, command-line interpreter or shell (computing), shell that provides a command line user interface for Unix-like operating systems. The shell is both an interactive command languag ...
or other
command-line interface
A command-line interface (CLI) is a means of interacting with software via command (computing), commands each formatted as a line of text. Command-line interfaces emerged in the mid-1960s, on computer terminals, as an interactive and more user ...
), without compiling, it is called a
scripting language
In computing, a script is a relatively short and simple set of instructions that typically automation, automate an otherwise manual process. The act of writing a script is called scripting. A scripting language or script language is a programming ...
.
Measuring language usage
Determining which is the most widely used programming language is difficult since the definition of usage varies by context. One language may occupy the greater number of programmer hours, a different one has more lines of code, and a third may consume the most CPU time. Some languages are very popular for particular kinds of applications. For example,
COBOL is still strong in the corporate data center, often on large
mainframes;
Fortran in scientific and engineering applications;
Ada in aerospace, transportation, military, real-time, and embedded applications; and
C in embedded applications and operating systems. Other languages are regularly used to write many different kinds of applications.
Various methods of measuring language popularity, each subject to a different bias over what is measured, have been proposed:
* counting the number of job advertisements that mention the language
* the number of books sold that teach or describe the language
* estimates of the number of existing lines of code written in the language which may underestimate languages not often found in public searches
* counts of language references (i.e., to the name of the language) found using a web search engine.
Combining and averaging information from various internet sites, stackify.com reported the ten most popular programming languages (in descending order by overall popularity):
Java
Java is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea (a part of Pacific Ocean) to the north. With a population of 156.9 million people (including Madura) in mid 2024, proje ...
,
C,
C++,
Python,
C#,
JavaScript
JavaScript (), often abbreviated as JS, is a programming language and core technology of the World Wide Web, alongside HTML and CSS. Ninety-nine percent of websites use JavaScript on the client side for webpage behavior.
Web browsers have ...
,
VB .NET,
R,
PHP, and
MATLAB.
As of June 2024, the top five programming languages as measured by
TIOBE index are
Python,
C++,
C,
Java
Java is one of the Greater Sunda Islands in Indonesia. It is bordered by the Indian Ocean to the south and the Java Sea (a part of Pacific Ocean) to the north. With a population of 156.9 million people (including Madura) in mid 2024, proje ...
and
C#. TIOBE provides a list of top 100 programming languages according to popularity and update this list every month.
Dialects, flavors and implementations
A dialect of a programming language or a
data exchange language is a (relatively small) variation or extension of the language that does not change its intrinsic nature. With languages such as
Scheme and
Forth, standards may be considered insufficient, inadequate, or illegitimate by implementors, so often they will deviate from the standard, making a new
dialect
A dialect is a Variety (linguistics), variety of language spoken by a particular group of people. This may include dominant and standard language, standardized varieties as well as Vernacular language, vernacular, unwritten, or non-standardize ...
. In other cases, a dialect is created for use in a
domain-specific language, often a subset. In the
Lisp
Lisp (historically LISP, an abbreviation of "list processing") is a family of programming languages with a long history and a distinctive, fully parenthesized Polish notation#Explanation, prefix notation.
Originally specified in the late 1950s, ...
world, most languages that use basic
S-expression syntax and Lisp-like semantics are considered Lisp dialects, although they vary wildly as do, say,
Racket and
Clojure. As it is common for one language to have several dialects, it can become quite difficult for an inexperienced programmer to find the right documentation. The
BASIC
Basic or BASIC may refer to:
Science and technology
* BASIC, a computer programming language
* Basic (chemistry), having the properties of a base
* Basic access authentication, in HTTP
Entertainment
* Basic (film), ''Basic'' (film), a 2003 film
...
language has
many dialects.
Classifications
Programming languages are often placed into four main categories:
imperative,
functional,
logic
Logic is the study of correct reasoning. It includes both formal and informal logic. Formal logic is the study of deductively valid inferences or logical truths. It examines how conclusions follow from premises based on the structure o ...
, and
object oriented.
*Imperative languages are designed to implement an algorithm in a specified order; they include
visual programming languages such as
.NET for generating
graphical user interface
A graphical user interface, or GUI, is a form of user interface that allows user (computing), users to human–computer interaction, interact with electronic devices through Graphics, graphical icon (computing), icons and visual indicators such ...
s.
Scripting languages, which are partly or fully
interpreted rather than
compiled, are sometimes considered a separate category but meet the definition of imperative languages.
*Functional programming languages work by successively applying functions to the given parameters. Although appreciated by many researchers for their simplicity and elegance, problems with efficiency have prevented them from being widely adopted.
*Logic languages are designed so that the software, rather than the programmer, decides what order in which the instructions are executed.
*Object-oriented programming—whose characteristic features are
data abstraction,
inheritance
Inheritance is the practice of receiving private property, titles, debts, entitlements, privileges, rights, and obligations upon the death of an individual. The rules of inheritance differ among societies and have changed over time. Offi ...
, and
dynamic dispatch—is supported by most popular imperative languages and some functional languages.
Although
markup languages are not programming languages, some have extensions that support limited programming. Additionally, there are special-purpose languages that are not easily compared to other programming languages.
See also
*
Comparison of programming languages (basic instructions)
*
Comparison of programming languages
*
Computer programming
Computer programming or coding is the composition of sequences of instructions, called computer program, programs, that computers can follow to perform tasks. It involves designing and implementing algorithms, step-by-step specifications of proc ...
*
Computer science
Computer science is the study of computation, information, and automation. Computer science spans Theoretical computer science, theoretical disciplines (such as algorithms, theory of computation, and information theory) to Applied science, ...
and
Outline of computer science
*
Domain-specific language
*
Domain-specific modeling
*
Educational programming language
*
Esoteric programming language
*
Extensible programming
*
:Extensible syntax programming languages
*
Invariant-based programming
*
List of BASIC dialects
*
List of open-source programming languages
*
Lists of programming languages
*
List of programming language researchers
*
Programming languages used in most popular websites
*
Language-oriented programming
*
Logic programming
*
Literate programming
*
Metaprogramming
**
*
Modeling language
*
Programming language theory
Programming language theory (PLT) is a branch of computer science that deals with the design, implementation, analysis, characterization, and classification of formal languages known as programming languages. Programming language theory is clos ...
*
Pseudocode
*
*
Reflective programming
*
Scientific programming language
*
Scripting language
In computing, a script is a relatively short and simple set of instructions that typically automation, automate an otherwise manual process. The act of writing a script is called scripting. A scripting language or script language is a programming ...
*
Software engineering
Software engineering is a branch of both computer science and engineering focused on designing, developing, testing, and maintaining Application software, software applications. It involves applying engineering design process, engineering principl ...
and
List of software engineering topics
References
Further reading
*
*
Raphael Finkel:
Advanced Programming Language Design', Addison Wesley 1995.
*
Daniel P. Friedman,
Mitchell Wand,
Christopher T. Haynes: ''
Essentials of Programming Languages'', The MIT Press 2001.
*
David Gelernter,
Suresh Jagannathan: ''Programming Linguistics'',
The MIT Press 1990.
*
Ellis Horowitz (ed.): ''Programming Languages, a Grand Tour'' (3rd ed.), 1987.
* Ellis Horowitz: ''Fundamentals of Programming Languages'', 1989.
*
Shriram Krishnamurthi: ''
Programming Languages: Application and Interpretation''
online publication.
*
*
Bruce J. MacLennan: ''Principles of Programming Languages: Design, Evaluation, and Implementation'',
Oxford University Press
Oxford University Press (OUP) is the publishing house of the University of Oxford. It is the largest university press in the world. Its first book was printed in Oxford in 1478, with the Press officially granted the legal right to print books ...
1999.
*
John C. Mitchell: ''Concepts in Programming Languages'',
Cambridge University Press
Cambridge University Press was the university press of the University of Cambridge. Granted a letters patent by King Henry VIII in 1534, it was the oldest university press in the world. Cambridge University Press merged with Cambridge Assessme ...
2002.
*
*
Benjamin C. Pierce: ''
Types and Programming Languages'', The MIT Press 2002.
*
Terrence W. Pratt and
Marvin Victor Zelkowitz: ''Programming Languages: Design and Implementation'' (4th ed.), Prentice Hall 2000.
*
Peter H. Salus. ''Handbook of Programming Languages'' (4 vols.). Macmillan 1998.
*
Ravi Sethi: ''Programming Languages: Concepts and Constructs'', 2nd ed.,
Addison-Wesley 1996.
*
Michael L. Scott and Jonathan Aldrich: ''Programming Language Pragmatics'', 5th ed.,
Morgan Kaufmann Publishers 2025.
*
* Franklyn Turbak and David Gifford with Mark Sheldon: ''Design Concepts in Programming Languages'', The MIT Press 2009.
*
Peter Van Roy and
Seif Haridi. ''
Concepts, Techniques, and Models of Computer Programming'', The MIT Press 2004.
*
David A. Watt. ''Programming Language Concepts and Paradigms''. Prentice Hall 1990.
* David A. Watt and
Muffy Thomas. ''Programming Language Syntax and Semantics''. Prentice Hall 1991.
* David A. Watt. ''Programming Language Processors''. Prentice Hall 1993.
* David A. Watt. ''Programming Language Design Concepts''. John Wiley & Sons 2004.
*
{{DEFAULTSORT:Programming Language
Programming language classification
Notation
Articles with example C code