The C standard library or libc is the
standard library
In computer programming, a standard library is the library made available across implementations of a programming language. These libraries are conventionally described in programming language specifications; however, contents of a language's as ...
for the
C programming language
''The C Programming Language'' (sometimes termed ''K&R'', after its authors' initials) is a computer programming book written by Brian Kernighan and Dennis Ritchie, the latter of whom originally designed and implemented the language, as well as ...
, as specified in the
ISO C
ANSI C, ISO C, and Standard C are successive standards for the C programming language published by the American National Standards Institute (ANSI) and ISO/IEC JTC 1/SC 22/WG 14 of the International Organization for Standardization (ISO) and t ...
standard.
[ ISO/ IEC (2018). '' ISO/IEC 9899:2018(E): Programming Languages - C §7''] Starting from the original
ANSI C standard, it was developed at the same time as
the C library POSIX specification, which is a superset of it. Since ANSI C was adopted by the
International Organization for Standardization
The International Organization for Standardization (ISO ) is an international standard development organization composed of representatives from the national standards organizations of member countries. Membership requirements are given in Ar ...
,
the C standard library is also called the ISO C library.
The C standard library provides
macro
Macro (or MACRO) may refer to:
Science and technology
* Macroscopic, subjects visible to the eye
* Macro photography, a type of close-up photography
* Image macro, a picture with text superimposed
* Monopole, Astrophysics and Cosmic Ray Observat ...
s,
type
Type may refer to:
Science and technology Computing
* Typing, producing text via a keyboard, typewriter, etc.
* Data type, collection of values used for computations.
* File type
* TYPE (DOS command), a command to display contents of a file.
* Ty ...
definitions and functions for tasks such as
string
String or strings may refer to:
*String (structure), a long flexible structure made from threads twisted together, which is used to tie, bind, or hang other objects
Arts, entertainment, and media Films
* ''Strings'' (1991 film), a Canadian anim ...
handling, mathematical computations, input/output processing,
memory management, and several other
operating system services.
Application programming interface
Header files
The
application programming interface
An application programming interface (API) is a way for two or more computer programs to communicate with each other. It is a type of software interface, offering a service to other pieces of software. A document or standard that describes how ...
(API) of the C standard library is declared in a number of
header files. Each header file contains one or more function declarations, data type definitions, and macros.
After a long period of stability, three new header files (
iso646.h
,
wchar.h
, and
wctype.h
) were added with ''Normative Addendum 1'' (NA1), an addition to the C Standard ratified in 1995. Six more header files (
complex.h
,
fenv.h
,
inttypes.h
,
stdbool.h
,
stdint.h
, and
tgmath.h
) were added with
C99, a revision to the C Standard published in 1999, and five more files (
stdalign.h
,
stdatomic.h
,
stdnoreturn.h
,
threads.h
, and
uchar.h
) with
C11 C11, C.XI, C-11 or C.11 may refer to:
Transport
* C-11 Fleetster, a 1920s American light transport aircraft for use of the United States Assistant Secretary of War
* Fokker C.XI, a 1935 Dutch reconnaissance seaplane
* LET C-11, a license-build var ...
in 2011. In total, there are now 29 header files:
Three of the header files (
complex.h
,
stdatomic.h
, and
threads.h
) are conditional features that implementations are not required to support.
The
POSIX standard added several nonstandard C headers for Unix-specific functionality. Many have found their way to other architectures. Examples include
fcntl.h
In Unix and Unix-like computer operating systems, a file descriptor (FD, less frequently fildes) is a process-unique identifier (handle) for a file or other input/output resource, such as a pipe or network socket.
File descriptors typically have ...
and
unistd.h
. A number of other groups are using other nonstandard headers – the
GNU C Library has
alloca.h
, and HP
OpenVMS
OpenVMS, often referred to as just VMS, is a multi-user, multiprocessing and virtual memory-based operating system. It is designed to support time-sharing, batch processing, transaction processing and workstation applications. Customers using Ope ...
has the
va_count()
function.
Documentation
On Unix-like systems, the authoritative documentation of the actually implemented API is provided in the form of
man pages. On most systems, man pages on standard library functions are in section 3; section 7 may contain some more generic pages on underlying concepts (e.g.
man 7 math_error
in
Linux).
Implementations
Unix-like systems typically have a C library in
shared library form, but the header files (and compiler toolchain) may be absent from an installation so C development may not be possible. The C library is considered part of the operating system on Unix-like systems. The C functions, including the ISO C standard ones, are widely used by programs, and are regarded as if they were not only an implementation of something in the C language, but also ''de facto'' part of the operating system interface. Unix-like operating systems generally cannot function if the C library is erased. This is true for applications which are dynamically as opposed to statically linked. Further, the kernel itself (at least in the case of Linux) operates independently of any libraries.
On Microsoft Windows, the core system dynamic libraries (
DLLs) provide an implementation of the C standard library for the
Microsoft Visual C++ compiler v6.0; the C standard library for newer versions of the Microsoft Visual C++ compiler is provided by each compiler individually, as well as ''redistributable'' packages. Compiled applications written in C are either statically linked with a C library, or linked to a dynamic version of the library that is shipped with these applications, rather than relied upon to be present on the targeted systems. Functions in a compiler's C library are not regarded as interfaces to Microsoft Windows.
Many other implementations exist, provided with both various operating systems and C compilers. Some of the popular implementations are the following:
* The
BSD libc, various implementations distributed with
BSD-derived operating systems
*
GNU C Library (glibc), used in
GNU Hurd,
GNU/kFreeBSD and
Linux
*
Microsoft C run-time library, part of
Microsoft Visual C++
*
dietlibc, an alternative small implementation of the C standard library (MMU-less)
*
μClibc, a C standard library for embedded
μClinux systems (MMU-less)
**
uclibc-ng, an embedded C library, fork of μClibc, still maintained, with
memory management unit (MMU) support
*
Newlib, a C standard library for embedded systems (MMU-less) and used in the
Cygwin
Cygwin ( ) is a POSIX-compatible programming and runtime environment that runs natively on Microsoft Windows. Under Cygwin, source code designed for Unix-like operating systems may be compiled with minimal modification and executed.
The Cygwin in ...
GNU distribution for Windows
*
klibc, primarily for booting Linux systems
*
musl, another lightweight C standard library implementation for Linux systems
*
Bionic, originally developed by Google for the Android embedded system operating system, derived from BSD libc
Compiler built-in functions
Some compilers (for example,
GCC) provide built-in versions of many of the functions in the C standard library; that is, the implementations of the functions are written into the compiled
object file, and the program calls the built-in versions instead of the functions in the C library
shared object file. This reduces function-call overhead, especially if function calls are replaced with
inline variants, and allows other forms of
optimization (as the compiler knows the
control-flow characteristics of the built-in variants), but may cause confusion when debugging (for example, the built-in versions cannot be replaced with
instrumented
Instrumentation a collective term for measuring instruments that are used for indicating, measuring and recording physical quantities. The term has its origins in the art and science of scientific instrument-making.
Instrumentation can refer to ...
variants).
However, the built-in functions must behave like ordinary functions in accordance with ISO C. The main implication is that the program must be able to create a pointer to these functions by taking their address, and invoke the function by means of that pointer. If two pointers to the same function are derived in two different translation units in the program, these two pointers must compare equal; that is, the address comes by resolving the name of the function, which has external (program-wide) linkage.
Linking, libm
Under FreeBSD and glibc, some functions such as sin() are not linked in by default and are instead bundled in the mathematical library
libm. If any of them are used, the linker must be given the directive
-lm
. POSIX requires that the c99 compiler supports
-lm
, and that the functions declared in the headers
math.h
,
complex.h
, and
fenv.h
are available for linking if
-lm
is specified, but does not specify if the functions are linked by default. musl satisfies this requirement by putting everything into a single libc library and providing an empty libm.
Detection
According to the C standard the macro
__STDC_HOSTED__
shall be defined to 1 if the implementation is hosted. A hosted implementation has all the headers specified by the C standard. An implementation can also be ''freestanding'' which means that these headers will not be present. If an implementation is ''freestanding'', it shall define
__STDC_HOSTED__
to 0.
Problems and workarounds
Buffer overflow vulnerabilities
Some functions in the C standard library have been notorious for having
buffer overflow vulnerabilities and generally encouraging buggy programming ever since their adoption. The most criticized items are:
*
string-manipulation routines, including
strcpy()
and
strcat()
, for lack of
bounds checking and possible buffer overflows if the bounds aren't checked manually;
* string routines in general, for
side-effects
In medicine, a side effect is an effect, whether therapeutic or adverse, that is secondary to the one intended; although the term is predominantly employed to describe adverse effects, it can also apply to beneficial, but unintended, consequence ...
, encouraging irresponsible buffer usage, not always guaranteeing valid
null-terminated output, linear length calculation;
*
printf()
family of routines, for spoiling the
execution stack when the format string doesn't match the arguments given. This fundamental flaw created an entire class of attacks:
format string attack
Uncontrolled format string is a type of software vulnerability discovered around 1989 that can be used in security exploits. Originally thought harmless, format string exploits can be used to crash a program or to execute harmful code. The problem ...
s;
*
gets()
and
scanf()
family of I/O routines, for lack of (either any or easy) input length checking.
Except the extreme case with
gets()
, all the security vulnerabilities can be avoided by introducing auxiliary code to perform memory management, bounds checking, input checking, etc. This is often done in the form of wrappers that make standard library functions safer and easier to use. This dates back to as early as ''
The Practice of Programming
''The Practice of Programming'' () by Brian W. Kernighan and Rob Pike is a 1999 book about computer programming and software engineering, published by Addison-Wesley.
According to the preface, the book is about "topics like testing, debuggi ...
'' book by B. Kernighan and R. Pike where the authors commonly use wrappers that print error messages and quit the program if an error occurs.
The ISO C committee published Technical reports TR 24731-1 and is working on TR 24731-2 to propose adoption of some functions with bounds checking and automatic buffer allocation, correspondingly. The former has met severe criticism with some praise, the latter received mixed responses. Despite this, TR 24731-1 has been implemented into Microsoft's C standard library and its compiler issues warnings when using old "insecure" functions.
Threading problems, vulnerability to race conditions
The
strerror
The C programming language has a set of functions implementing operations on strings (character strings and byte strings) in its standard library. Various operations, such as copying, concatenation, tokenization and searching are supported. Fo ...
()
routine is criticized for being
thread unsafe and otherwise vulnerable to
race conditions.
Error handling
The error handling of the functions in the C standard library is not consistent and sometimes confusing. According to the Linux manual page
math_error
, "The current (version 2.8) situation under glibc is messy. Most (but not all) functions raise exceptions on errors. Some also set ''errno''. A few functions set ''errno'', but don't raise an exception. A very few functions do neither."
Standardization
The original
C language provided no built-in functions such as I/O operations, unlike traditional languages such as
COBOL
COBOL (; an acronym for "common business-oriented language") is a compiled English-like computer programming language designed for business use. It is an imperative, procedural and, since 2002, object-oriented language. COBOL is primarily us ...
and
Fortran. Over time, user communities of C shared ideas and implementations of what is now called C standard libraries. Many of these ideas were incorporated eventually into the definition of the standardized C language.
Both
Unix and C were created at
AT&T's Bell Laboratories in the late 1960s and early 1970s. During the 1970s the C language became increasingly popular. Many universities and organizations began creating their own variants of the language for their own projects. By the beginning of the 1980s compatibility problems between the various C implementations became apparent. In 1983 the
American National Standards Institute
The American National Standards Institute (ANSI ) is a private non-profit organization that oversees the development of voluntary consensus standards for products, services, processes, systems, and personnel in the United States. The organi ...
(ANSI) formed a committee to establish a standard specification of C known as "
ANSI C". This work culminated in the creation of the so-called C89 standard in 1989. Part of the resulting standard was a set of
software libraries
In computer science, a library is a collection of non-volatile resources used by computer programs, often for software development. These may include configuration data, documentation, help data, message templates, pre-written code and subro ...
called the ANSI C standard library.
POSIX standard library
POSIX, as well as
SUS
Sus or SUS may refer to:
Places
* Sus, Lachin, a village
* Sus, Pune, India, a neighborhood
*Sus, Pyrénées-Atlantiques, France, a commune
*Susch, Graubünden, Switzerland, a municipality formerly called Süs
People
* Martin Sus (footballer ...
, specify a number of routines that should be available over and above those in the basic C standard library. The POSIX specification includes header files for, among other uses,
multi-threading,
networking
Network, networking and networked may refer to:
Science and technology
* Network theory, the study of graphs as a representation of relations between discrete objects
* Network science, an academic field that studies complex networks
Mathematics
...
, and
regular expressions. These are often implemented alongside the C standard library functionality, with varying degrees of closeness. For example,
glibc implements functions such as
fork
within
libc.so
, but before
NPTL was merged into glibc it constituted a separate library with its own linker flag argument. Often, this POSIX-specified functionality will be regarded as part of the library; the basic C library may be identified as the ANSI or
ISO C library.
BSD libc
BSD libc is a superset of the POSIX standard library supported by the C libraries included with
BSD operating systems such as
FreeBSD
FreeBSD is a free and open-source Unix-like operating system descended from the Berkeley Software Distribution (BSD), which was based on Research Unix. The first version of FreeBSD was released in 1993. In 2005, FreeBSD was the most popular ...
,
NetBSD
NetBSD is a free and open-source Unix operating system based on the Berkeley Software Distribution (BSD). It was the first open-source BSD descendant officially released after 386BSD was forked. It continues to be actively developed and is a ...
,
OpenBSD
OpenBSD is a security-focused, free and open-source, Unix-like operating system based on the Berkeley Software Distribution (BSD). Theo de Raadt created OpenBSD in 1995 by forking NetBSD 1.0. According to the website, the OpenBSD project em ...
and
macOS. BSD libc has some extensions that are not defined in the original standard, many of which first appeared in 1994's
4.4BSD release (the first to be largely developed after the first standard was issued in 1989). Some of the extensions of BSD libc are:
*
sys/tree.h
contains an implementation of
red–black tree
In computer science, a red–black tree is a kind of self-balancing binary search tree. Each node stores an extra bit representing "color" ("red" or "black"), used to ensure that the tree remains balanced during insertions and deletions.
When the ...
and
splay tree
*
sys/queue.h
implementations of
Linked list
In computer science, a linked list is a linear collection of data elements whose order is not given by their physical placement in memory. Instead, each element points to the next. It is a data structure consisting of a collection of nodes whic ...
,
queues,
tail queue
The tail is the section at the rear end of certain kinds of animals’ bodies; in general, the term refers to a distinct, flexible appendage to the torso. It is the part of the body that corresponds roughly to the sacrum and coccyx in mammals, r ...
, etc.
*
fgetln()
defined in
stdio.h
. This can be used to read a file line by line.
*
fts.h
contains some functions to traverse a file hierarchy
*
db.h
some functions to connect to the
Berkeley DB
*
strlcat()
and
strlcpy()
secure alternatives for
strncat()
and
strncpy()
*
err.h
contains some functions to print formatted error messages
*
vis.h
contains the
vis()
function. This function is used to display non-printable characters in a visual format.
The C standard library in other languages
Some languages include the functionality of the standard C library in their own libraries. The library may be adapted to better suit the language's structure, but the
operational semantics are kept similar. The
C++ language, for example, includes the functionality of the C standard library in the
namespace std
(e.g.,
std::printf
,
std::atoi
,
std::feof
), in header files with similar names to the C ones (
cstdio
,
cmath
,
cstdlib
, etc.). Other languages that take similar approaches are
D,
Perl,
Ruby and the main implementation of
Python known as
CPython. In Python 2, for example, the built-in file objects are defined as "implemented using C's
stdio
package", so that the available operations (open, read, write, etc.) are expected to have the same behavior as the corresponding C functions.
Rust has a crate called which allows several C functions, structs, and other type definitions to be used.
Comparison to standard libraries of other languages
The C standard library is small compared to the standard libraries of some other languages. The C library provides a basic set of mathematical functions, string manipulation,
type conversions, and file and console-based I/O. It does not include a standard set of "
container types" like the
C++ Standard Template Library, let alone the complete
graphical user interface (GUI) toolkits, networking tools, and profusion of other functionality that
Java and the
.NET Framework
The .NET Framework (pronounced as "''dot net"'') is a proprietary software framework developed by Microsoft that runs primarily on Microsoft Windows. It was the predominant implementation of the Common Language Infrastructure (CLI) until bein ...
provide as standard. The main advantage of the small standard library is that providing a working ISO C environment is much easier than it is with other languages, and consequently porting C to a new platform is comparatively easy.
See also
*
C++ Standard Library
Notes
References
Further reading
*
External links
The C Library Reference Guide* Microsof
Universal C runtime routines by categoryon MSDN
* NetBS
C libraries manual an
full C library sourceManual pages for the original C standard libraries in Unix
{{DEFAULTSORT:C Standard Library
C (programming language)