HOME

TheInfoList



OR:

The following is provided as an overview of and topical guide to databases:
Database In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and a ...
– organized collection of data, today typically in digital form. The data are typically organized to model relevant aspects of reality (for example, the availability of rooms in hotels), in a way that supports processes requiring this information (for example, finding a hotel with vacancies).


What type of things are databases?

Databases can be described as all of the following: *
Information Information is an Abstraction, abstract concept that refers to something which has the power Communication, to inform. At the most fundamental level, it pertains to the Interpretation (philosophy), interpretation (perhaps Interpretation (log ...
– sequence of symbols that can be interpreted as a message. Information can be recorded as signs, or transmitted as signals. *
Data Data ( , ) are a collection of discrete or continuous values that convey information, describing the quantity, quality, fact, statistics, other basic units of meaning, or simply sequences of symbols that may be further interpreted for ...
– values of qualitative or quantitative variables, belonging to a set of items. Data in computing (or data processing) are often represented by a combination of items organized in rows and multiple variables organized in columns. Data are typically the results of measurements and can be visualised using graphs or images. *
Computer data ''In computer science, data (treated as singular, plural, or as a mass noun) is any sequence of one or more symbols; datum is a single symbol of data. Data requires interpretation to become information. Digital data is data that is represen ...
– information in a form suitable for use with a computer. Data is often distinguished from programs. A program is a sequence of instructions that detail a task for the computer to perform. In this sense, data is everything in software that is not program code.


Types of databases

*
Active database In computing, an active database is a database that includes an event-driven architecture (often in the form of ECA rules) that can respond to conditions both inside and outside the database. Possible uses include security monitoring, alerting, s ...
– includes an event driven architecture (often in the form of ECA rules) which can respond to conditions both inside and outside the database. * Animation database – stores fragments of animations or human movements and which can be accessed, analyzed and queried to develop and assemble new animations. *
Back-end database {{No footnotes, date=December 2021 A back-end database is a database that is accessed by users indirectly through an external application rather than by application programming stored within the database itself or by low level manipulation of the ...
– accessed by users indirectly through an external application rather than by application programming stored within the database itself or by low level manipulation of the data (e.g. through
SQL Structured Query Language (SQL) (pronounced ''S-Q-L''; or alternatively as "sequel") is a domain-specific language used to manage data, especially in a relational database management system (RDBMS). It is particularly useful in handling s ...
commands). *
Bibliographic database A bibliographic database is a database of bibliographic records. This is an organised online collection of references to published written works like academic journal, journal and newspaper articles, conference proceedings, reports, government an ...
– database of bibliographic records, an organized digital collection of references to published literature, including journal and newspaper articles, conference proceedings, reports, government and legal publications, patents, books, etc. *
Centralized database A centralized database (sometimes abbreviated CDB) is a database that is located, stored, and maintained in a single location. This location is most often a central computer or database system, for example a desktop or server CPU, or a mainframe co ...
– database located and maintained in one location, unlike a distributed database. *
Cloud database A cloud database is a database that typically runs on a cloud computing platform and access to the database is provided as-a-service. There are two common deployment models: users can run databases on the cloud independently, using a virtual machin ...
– runs on a cloud computing platform, such as Amazon EC2, GoGrid and Rackspace. * Collection database – collection catalog of a museum or archive implemented using a computerized database, in which the institution's objects or material are catalogued. * Collective Optimization Database – open repository to enable sharing of benchmarks, data sets and optimization cases from the community, provide web services and Plug-in (computing), plugins to analyze optimization data and predict program transformations or better hardware designs for multi-objective optimizations based on statistical and machine learning techniques provided there is enough information collected in the repository from multiple users. *
Configuration management database A configuration management database (CMDB) is an ITIL term for a database used by an organization to store information about hardware and software assets (commonly referred to as configuration items). It is useful to break down configuration ite ...
– * Cooperative database – holds information on customers and their transactions. *
Correlation database In statistics, correlation or dependence is any statistical relationship, whether causal or not, between two random variables or bivariate data. Although in the broadest sense, "correlation" may indicate any type of association, in statistic ...
– database management system (DBMS) that is data model independent and designed to efficiently handle unplanned, ad hoc queries in an analytical system environment. * Current database – conventional database that stores data that is valid now. * Directory – repository or database of information which is optimized for reading, under the assumption that data updates are very rare compared to data reads. Commonly, a directory supports search and browsing in addition to simple lookups. *
Distributed database A distributed database is a database in which data is stored across different physical locations. It may be stored in multiple computers located in the same physical location (e.g. a data centre); or maybe dispersed over a computer network, netwo ...
– database in which storage devices are not all attached to a common CPU. *
Document-oriented database A document-oriented database, or document store, is a computer program and data storage system designed for storing, retrieving and managing document-oriented information, also known as semi-structured data. Document-oriented databases are one ...
– computer program designed for storing, retrieving, and managing document-oriented, or Semi-structured model, semi structured data, information. * EDA database – database specialized for the purpose of electronic design automation. *
Endgame tablebase In chess, the endgame tablebase, or simply the tablebase, is a computerised database containing precalculated evaluations of chess endgame, endgame positions. Tablebases are used to analyse finished games, as well as by chess engines to evaluate ...
– computerized database that contains precalculated exhaustive analysis of a chess endgame position. * Food composition database (FCDB) – provides detailed information on the nutritional composition of foods. *
Full-text database A full-text database or a complete-text database is a database that contains the complete text of books, dissertations, journals, magazines, newspapers or other kinds of textual documents. They differ from bibliographic databases (which contain ...
– database that contains the complete text of books, dissertations, journals, magazines, newspapers or other kinds of textual documents. Also called a "complete-text database". * Government database – collects personal information for various reasons (mass surveillance, Schengen Information System in the European Union, social security, statistics, etc.). *
Graph database A graph database (GDB) is a database that uses graph structures for semantic queries with nodes, edges, and properties to represent and store data. A key concept of the system is the graph (or edge or relationship). The graph relates the dat ...
– uses graph structures with nodes, edges, and properties to represent and store data. *
Knowledge base In computer science, a knowledge base (KB) is a set of sentences, each sentence given in a knowledge representation language, with interfaces to tell new sentences and to ask questions about what is known, where either of these interfaces migh ...
– special kind of database for knowledge management. A knowledge base provides a means for information to be collected, organised, shared, searched and utilised. *
Mobile database Mobile computing devices (e.g., smartphones and Personal digital assistant, PDAs) store and share data over a mobile network, or access a database which is actually stored by the mobile device. This could be a list of contacts, price information, di ...
– can be connected to by a mobile computing device over a mobile network. * Navigational database – database in which objects (or records) in it are found primarily by following references from other objects. *
Non-native speech database A non-native speech database is a Speech corpus, speech database of non-native pronunciations of English. Such databases are used in the development of: multilingual automatic speech recognition systems, Text-to-speech, text to speech systems, pron ...
– speech database of non-native pronunciations of English. *
Online database In computing, a database is an organized collection of Data (computing), data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, Application software, applications, and ...
– database accessible from a network, including from the Internet. * Operational database – accessed by an Operational System to carry out regular operations of an organization. *
Parallel database A parallel database system seeks to improve performance through parallelization of various operations, such as loading data, building indexes and evaluating queries. Although data may be stored in a distributed fashion, the distribution is governed ...
– improves performance through parallelization of various operations, such as loading data, building indexes and evaluating queries. *
Probabilistic database Most real databases contain data whose correctness is uncertain. In order to work with such data, there is a need to quantify the integrity of the data. This is achieved by using probabilistic databases. A probabilistic database is an uncertain da ...
– uncertain database in which the possible worlds have associated probabilities. * Real-time database – processing system designed to handle workloads whose state is constantly changing (Buchmann). *
Relational database A relational database (RDB) is a database based on the relational model of data, as proposed by E. F. Codd in 1970. A Relational Database Management System (RDBMS) is a type of database management system that stores data in a structured for ...
– collection of data items organized as a set of formally described tables from which data can be accessed easily. *
Spatial database A spatial database is a general-purpose database (usually a relational database) that has been enhanced to include spatial data that represents objects defined in a geometric space, along with tools for querying and analyzing such data. Most ...
– database that is optimized to store and query data that is related to objects in space, including points, lines and polygons. *
Temporal database A temporal database stores data relating to time instances. It offers temporal data types and stores information relating to past, present and future time. Temporal databases can be uni-temporal, bi-temporal or tri-temporal. More specifically the ...
– database with built-in time aspects, for example a temporal data model and a temporal version of Structured Query Language (SQL). *
Time series database A time series database is a software system that is optimized for storing and serving time series through associated pairs of time(s) and value(s). In some fields, ''time series'' may be called profiles, curves, traces or trends. Several early tim ...
– a time series is an associative array of numbers indexed by a datetime or a datetime range. These time series are often called profiles or curves, depending upon the market. A time series of stock prices might be called a price curve, or a time series of energy consumption might be called a load profile. Despite the disparate naming, the operations performed on them are sufficiently common as to demand special database treatment. *
Triplestore A triplestore or RDF store is a purpose-built database for the storage and retrieval of triples through semantic queries. A triple is a data entity composed of subject– predicate– object, like "Bob is 35" (i.e., Bob's age measured in years i ...
– purpose-built database for the storage and retrieval of triples, a triple being a data entity composed of subject-predicate-object, like "Bob is 35" or "Bob knows Fred". * Very large database (VLDB) – contains an extremely high number of tuples (database rows), or occupies an extremely large physical filesystem storage space. *
Virtual private database A virtual private database or VPD masks data in a larger database so that only a subset of the data appears to exist, without actually segregating data into different tables, schemas or databases. A typical application is constraining sites, departm ...
(VPD) – masks data in a larger database so that security allows only the use of apparently private data. * Vulnerability database – platform aimed at collecting, maintaining, and disseminating information about discovered vulnerabilities targeting real computer systems. * XLDB – Stands for "eXtremely Large Data Base". *
XML database An XML database is a data persistence software system that allows data to be specified, and stored, in XML format. This data can be queried, transformed, exported and returned to a calling system. XML databases are a flavor of document-oriented ...
– data stored in XML format, where it can be queried, exported and serialized into the desired format.


History of databases

* History of databases – * History of database management systems –:


Database use

* Database usage requirements – *
Database theory Database theory encapsulates a broad range of topics related to the study and research of the theoretical realm of databases and database management systems. Theoretical aspects of data management include, among other areas, the foundations of q ...
– encapsulates a broad range of topics related to the study and research of the theoretical realm of databases and database management systems. *
Database machine A database machines or back end processor is a computer or special Computer hardware, hardware that stores and retrieves data from a database. It is specially designed for database access and is tightly coupled to the main (Front and back ends, fron ...
– or is a computer or special hardware that stores and retrieves data from a database. Also called a "back end processor" *
Database server A database server is a server which uses a database application that provides database services to other computer programs or to computers, as defined by the client–server model. Database management systems (DBMSs) frequently provide database- ...
– computer program that provides database services to other computer programs or computers, as defined by the client-server model. * Database application – computer program whose primary purpose is entering and retrieving information from a computer-managed database. **
Database management system In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and an ...
(DBMS) – software package with computer programs that control the creation, maintenance, and use of a database. *
Database connection A database connection is a facility in computer science that allows client software to talk to database server software, whether on the same machine or not. A connection is required to send commands and receive answers, usually in the form of a ...
– facility in computer science that allows client software to communicate with database server software, whether on the same machine or not. ** Datasource – name given to the connection set up to a database from a server. The name is commonly used when creating a query to the database. The Database Source Name (DSN) does not have to be the same as the filename for the database. For example, a database file named "friends.mdb" could be set up with a DSN of "school". Then DSN "school" would then be used to refer to the database when performing a query. ** Data Source Name (DSN) – are data structures used to describe a connection to a data source. Sometimes known as a database source name though data sources are not limited to databases. *
Database administrator A database administrator (DBA) manages computer databases. The role may include capacity planning, installation, configuration, database design, migration, performance monitoring, security, troubleshooting, as well as backup and data re ...
(DBA) – is a person responsible for the installation, configuration, upgrade, administration, monitoring and maintenance of physical databases. *
Lock Lock(s) or Locked may refer to: Common meanings *Lock and key, a mechanical device used to secure items of importance *Lock (water navigation), a device for boats to transit between different levels of water, as in a canal Arts and entertainme ...
– *
Comparison of database tools The following tables compare general and technical information for a number of available database administration tools. Please see individual product articles for further information. This article is neither all-inclusive nor necessarily up to dat ...
– (provides tables for comparing general and technical information for a number of available database administrator tools.) *
Database-centric architecture Database-centric Architecture or data-centric architecture has several distinct meanings, generally relating to software architectures in which databases play a crucial role. Often this description is meant to contrast the design to an alternative ...
– software architectures in which databases play a crucial role. Also called "data-centric architecture". * Intelligent database – was put forward as a system that manages information (rather than data) in a way that appears natural to users and which goes beyond simple record keeping. *
Two-phase locking In databases and transaction processing, two-phase locking (2PL) is a pessimistic concurrency control method that guarantees conflict-serializability. Philip A. Bernstein, Vassos Hadzilacos, Nathan Goodman (1987) ''Concurrency Control and Recov ...
(2PL) – is a concurrency control method that guarantees serializability. * Locks with ordered sharing – comprises several variants of the ''Two phase locking'' (2PL) concurrency control protocol generated by changing the blocking semantics of locks upon conflicts. *
Load file A load file in the litigation community is commonly referred to as the file used to import data (coded, captured or extracted data from Electronically stored information (Federal Rules of Civil Procedure), ESI processing) into a database; or the ...
– in the litigation community is commonly referred to as the file used to import data (coded, captured or extracted data from ESI processing) into a database; or the file used to link images. * Database publishing – area of automated media production in which specialized techniques are used to generate paginated documents from source data residing in traditional databases. * Halloween Problem – a phenomenon in databases in which an update operation causes a change in the physical location of a row, potentially allowing the row to be visited more than once during the operation. * Log shipping – process of automating the backup of a database and transaction log files on a primary (production) database server, and then restoring them onto a standby server.


Database languages

Database languages – *
Data definition language In the context of SQL, data definition or data description language (DDL) is a syntax for creating and modifying database objects such as tables, indices, and users. DDL statements are similar to a computer programming language for defining d ...
– *
Data manipulation language A data manipulation language (DML) is a computer programming language used for adding (inserting), deleting, and modifying (updating) data in a database. A DML is often a sublanguage of a broader database language such as SQL, with the DML com ...
– *
Query language A query language, also known as data query language or database query language (DQL), is a computer language used to make queries in databases and information systems. In database systems, query languages rely on strict theory to retrieve informa ...
– ** Information retrieval query language – query language used to make queries into database, where the semantics of the query are defined not by a precise rendering of a formal syntax, but by an interpretation of the most suitable results of the query. **
SQL Structured Query Language (SQL) (pronounced ''S-Q-L''; or alternatively as "sequel") is a domain-specific language used to manage data, especially in a relational database management system (RDBMS). It is particularly useful in handling s ...
(Structured Query Language) – special-purpose programming language designed for managing data held in a relational database management system (RDBMS), or for stream processing in a relational data stream management system (RDSMS). **
XQuery XQuery (XML Query) is a query language and functional programming language designed to query and transform collections of structured and unstructured data, primarily in the form of XML. It also supports text data and, through implementation-sp ...
– a query and functional programming language that queries and transforms collections of structured and unstructured data.


Database security

Database security Database security concerns the use of a broad range of information security controls to protect databases against compromises of their confidentiality, integrity and availability. It involves various types or categories of controls, such as tech ...
– * Database activity monitoring (DAM) – database security technology for monitoring and analyzing database activity that operates independently of the database management system (DBMS) and does not rely on any form of native (DBMS-resident) auditing or native logs such as trace or transaction logs. * Database audit – *
Database forensics Database forensics is a branch of digital forensic science relating to the forensic study of databases and their related metadata. The discipline is similar to computer forensics, following the normal forensic process and applying investigative ...
– branch of digital forensic science relating to the forensic study of databases and their related metadata. * Negative database – credit card terminology for a list of credit card owners who chargeback a lot.


Database design

Database design Database design is the organization of data according to a database model. The designer determines what data must be stored and how the data elements interrelate. With this information, they can begin to fit the data to the database model.Teorey, T ...
– *
Entity–relationship model An entity–relationship model (or ER model) describes interrelated things of interest in a specific domain of knowledge. A basic ER model is composed of entity types (which classify the things of interest) and specifies relationships that can e ...
(ER model) – abstract and conceptual representation of data. *
Database normalization Database normalization is the process of structuring a relational database in accordance with a series of so-called '' normal forms'' in order to reduce data redundancy and improve data integrity. It was first proposed by British computer scien ...
– process of organizing the fields and tables of a relational database to minimize redundancy and dependency. * Database refactoring – simple change to a database schema that improves its design while retaining both its behavioral and informational semantics.


Database programming

* Database abstraction layer – application programming interface which unifies the communication between a computer application and databases such as SQL Server, DB2, MySQL, PostgreSQL, Oracle or SQLite. * Object–relational mapping (ORM, O/RM, and O/R mapping) – in computer software is a programming technique for converting data between incompatible type systems in object-oriented programming languages.


Database management

* Database virtualization – it is the decoupling of the database layer, which lies between the storage and application layers within the application stack. *
Database tuning Database tuning describes a group of activities used to optimize and homogenize the performance of a database. It usually overlaps with query tuning, but refers to design of the database files, selection of the database management system (DBMS) ...
– describes a group of activities used to optimize and homogenize the performance of a database. ** Database caching – effective approach to achieve high scalability and performance. * – *
Database preservation Database preservation usually involves converting the information stored in a database to a form likely to be accessible in the long term as technology changes, without losing the initial characteristics (context, content, structure, appearance and ...
– usually involves converting the information stored in a database, without losing the characteristics (Context, Content, Structure, Appearance and Behaviour) of the data, to a format which can be used in the long term, even if the technology and daily life knowledge changes. * Database integrity – ensures that data entered into the database is accurate, valid, and consistent.


Database management systems

Database management system In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and an ...
– *
Database model A database model is a type of data model that determines the logical structure of a database. It fundamentally determines in which manner data can be stored, organized and manipulated. The most popular example of a database model is the relatio ...
– *
Database normalization Database normalization is the process of structuring a relational database in accordance with a series of so-called '' normal forms'' in order to reduce data redundancy and improve data integrity. It was first proposed by British computer scien ...
– organizing tables based on their attributes so that the data presented can avoid having redundancy and dependency. * Database storage structures – *
Distributed database management system A distributed database is a database in which data is stored across different physical locations. It may be stored in multiple computers located in the same physical location (e.g. a data centre); or maybe dispersed over a network of interconnec ...
– *
Federated database system A federated database system (FDBS) is a type of Meta (prefix), meta-database management system (DBMS), which transparently maps multiple autonomous Database management system, database systems into a single federated database. The constituent data ...
– type of meta-database management system (DBMS), which transparently maps multiple autonomous database systems into a single federated database. *
Referential integrity Referential integrity is a property of data stating that all its references are valid. In the context of relational databases, it requires that if a value of one attribute (column) of a relation (table) references a value of another attribute (e ...
– . *
Relational algebra In database theory, relational algebra is a theory that uses algebraic structures for modeling data and defining queries on it with well founded semantics (computer science), semantics. The theory was introduced by Edgar F. Codd. The main applica ...
– offshoot of first-order logic (and of algebra of sets), deals with a set of finitary relations (see also relation (database)) that is closed under certain operators. *
Relational calculus The relational calculus consists of two calculi, the tuple relational calculus and the domain relational calculus, that is part of the relational model for databases and provide a declarative way to specify database queries. The raison d'être ...
– consists of two calculi, the tuple relational calculus and the domain relational calculus, that are part of the relational model for databases and provide a declarative way to specify database queries. *
Relational database A relational database (RDB) is a database based on the relational model of data, as proposed by E. F. Codd in 1970. A Relational Database Management System (RDBMS) is a type of database management system that stores data in a structured for ...
– collection of data items organized as a set of formally described tables from which data can be accessed easily. *
Relational database management system A relational database (RDB) is a database based on the relational model of data, as proposed by E. F. Codd in 1970. A Relational Database Management System (RDBMS) is a type of database management system that stores data in a structured for ...
(RDBMS) – database management system (DBMS) that is based on the relational model as introduced by E. *
Relational model The relational model (RM) is an approach to managing data using a structure and language consistent with first-order predicate logic, first described in 1969 by English computer scientist Edgar F. Codd, where all data are represented in terms of t ...
– for database management is a database model based on first-order predicate logic, first formulated and proposed in 1969 by Edgar F. *
Object–relational database An object–relational database (ORD), or object–relational database management system (ORDBMS), is a database management system (DBMS) similar to a relational database, but with an object-oriented database model: objects, classes and inherit ...
(ORD) – database management system (DBMS) similar to a relational database, but with an object-oriented database model: objects, classes and inheritance are directly supported in database schemas and in the query language. Also called object–relational database management system (ORDBMS). *
Transaction processing In computer science, transaction processing is information processing that is divided into individual, indivisible operations called ''transactions''. Each transaction must succeed or fail as a complete unit; it can never be only partially c ...


Concepts

*
Database In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and a ...
– organized collection of data, today typically in digital form. *
ACID An acid is a molecule or ion capable of either donating a proton (i.e. Hydron, hydrogen cation, H+), known as a Brønsted–Lowry acid–base theory, Brønsted–Lowry acid, or forming a covalent bond with an electron pair, known as a Lewis ...
– (''atomicity, consistency, isolation, durability'') is a set of properties that guarantee that database transactions are processed reliably. *
Create, read, update and delete In computer programming, create, read, update, and delete (CRUD) are the four basic operations (actions) of persistent storage. CRUD is also sometimes used to describe user interface conventions that facilitate viewing, searching, and changing info ...
(CRUD) – are the four basic functions of persistent storage. *
Null Null may refer to: Science, technology, and mathematics Astronomy *Nuller, an optical tool using interferometry to block certain sources of light Computing *Null (SQL) (or NULL), a special marker and keyword in SQL indicating that a data value do ...
– *
Candidate key A candidate key, or simply a key, of a relational database is any set of columns that have a unique combination of values in each row, with the additional constraint that removing any column could produce duplicate combinations of values. A candi ...
– minimal superkey for a relation. *
Foreign key A foreign key is a set of attributes in a table that refers to the primary key of another table, linking these two tables. In the context of relational databases, a foreign key is subject to an inclusion dependency constraint that the tuples ...
– referential constraint between two tables. *
Primary key In the relational model of databases, a primary key is a designated attribute (column) that can reliably identify and distinguish between each individual record in a table. The database creator can choose an existing unique attribute or combinati ...
– * Superkey – set of attributes of a relation variable for which it holds that in all relations assigned to that variable, there are no two distinct tuples (rows) that have the same values for the attributes in this set. *
Surrogate key A surrogate key (or synthetic key, pseudokey, entity identifier, factless key, or technical key) in a database is a unique identifier for either an ''entity'' in the modeled world or an ''object'' in the database. The surrogate key is ''not'' deri ...
– unique identifier in a database for either an ''entity'' in the modeled world or an ''object'' in the database. * Armstrong's axioms – set of axioms (or, more precisely, inference rules) used to infer all the functional dependencies on a relational database. *
NoSQL NoSQL (originally meaning "Not only SQL" or "non-relational") refers to a type of database design that stores and retrieves data differently from the traditional table-based structure of relational databases. Unlike relational databases, which ...
– class of database management system identified by its non-adherence to the widely used relational database management system (RDBMS) model:


Objects

*
Relation Relation or relations may refer to: General uses * International relations, the study of interconnection of politics, economics, and law on a global level * Interpersonal relationship, association or acquaintance between two or more people * ...
– **
Table Table may refer to: * Table (database), how the table data arrangement is used within the databases * Table (furniture), a piece of furniture with a flat surface and one or more legs * Table (information), a data arrangement with rows and column ...
– **
Column A column or pillar in architecture and structural engineering is a structural element that transmits, through compression, the weight of the structure above to other structural elements below. In other words, a column is a compression member ...
– ** Row – * View – *
Database transaction A database transaction symbolizes a unit of work, performed within a database management system (or similar system) against a database, that is treated in a coherent and reliable way independent of other transactions. A transaction generally rep ...
– *
Transaction log In the field of databases in computer science, a transaction log (also transaction journal, database log, binary log or audit trail) is a history of actions executed by a database management system used to guarantee ACID properties over crashes ...
– history of actions executed by a database management system to guarantee ACID properties over crashes or hardware failures. Also called "transaction journal", "database log" or "binary log". *
Database trigger A database trigger is procedural code that is automatically executed in response to certain Event (computing), events on a particular Table (database), table or View (database), view in a database. The trigger is mostly used for maintaining the Dat ...
– procedural code that is automatically executed in response to certain events on a particular table or view in a database. *
Index Index (: indexes or indices) may refer to: Arts, entertainment, and media Fictional entities * Index (''A Certain Magical Index''), a character in the light novel series ''A Certain Magical Index'' * The Index, an item on the Halo Array in the ...
– *
Stored procedure A stored procedure (also termed prc, proc, storp, sproc, StoPro, StoredProc, StoreProc, sp, or SP) is a subroutine available to applications that access a relational database management system (RDBMS). Such procedures are stored in the database d ...
– subroutine available to applications that access a relational database system. * Cursor – * Partition


Components

*
Concurrency control In information technology and computer science, especially in the fields of computer programming, operating systems, multiprocessors, and databases, concurrency control ensures that correct results for concurrent operations are generated, whil ...
– ensures that correct results for concurrent operations are generated, while getting those results as quickly as possible. *
Data dictionary A data dictionary, or metadata repository, as defined in the ''IBM Dictionary of Computing'', is a "centralized repository of information about data such as meaning, relationships to other data, origin, usage, and format". ''Oracle Corporation, ...
– as defined in the ''IBM Dictionary of Computing'', is a "centralized repository of information about data such as meaning, relationships to other data, origin, usage, and format." Also called a "metadata repository". *
Java Database Connectivity Java Database Connectivity (JDBC) is an application programming interface (API) for the Java programming language which defines how a client may access a database. It is a Java-based data access technology used for Java database connectivity. It is ...
– . *
Open Database Connectivity In computing, Open Database Connectivity (ODBC) is a standard application programming interface (API) for accessing database management systems (DBMS). The designers of ODBC aimed to make it independent of database systems and operating systems. An ...
– *
Query language A query language, also known as data query language or database query language (DQL), is a computer language used to make queries in databases and information systems. In database systems, query languages rely on strict theory to retrieve informa ...
– *
Query optimizer Query optimization is a feature of many relational database management systems and other databases such as NoSQL and graph databases. The query optimizer attempts to determine the most efficient way to execute a given query by considering the pos ...
– component of a database management system that attempts to determine the most efficient way to execute a query. *
Query plan A query plan (or query execution plan) is a sequence of steps used to access data in a SQL relational database management system. This is a specific case of the relational model concept of access plans. Since SQL is declarative, there are typical ...
– ordered set of steps used to access or modify information in a SQL relational database management system. Also called a "query execution plan".


Functions

*
Database administration Database administration is the function of managing and maintaining database management systems (DBMS) software. Mainstream DBMS software such as Oracle, IBM Db2 and Microsoft SQL Server need ongoing management. As such, corporations that use D ...
– work done by a
database administrator A database administrator (DBA) manages computer databases. The role may include capacity planning, installation, configuration, database design, migration, performance monitoring, security, troubleshooting, as well as backup and data re ...
, some of which may be automated. *
Query optimization Query optimization is a feature of many relational database management systems and other databases such as NoSQL and graph databases. The query optimizer attempts to determine the most efficient way to execute a given query by considering the po ...
– function of many relational database management systems in which multiple query plans for satisfying a query are examined and a good query plan is identified. * Database replication


Database products

* List of object-oriented database management systems – ** Comparison of object–relational database management systems – *
List of relational database management systems This is a list of relational database management systems. List of software Front-end User interfaces Only * Apache OpenOffice Base ** HSQLDB * LibreOffice Base ** Firebird ** HSQLDB *Microsoft Access ** Access Database Engine Discontinued * Bri ...
– **
Comparison of relational database management systems The following tables compare general and technical information for a number of relational database management systems. Please see the individual products' articles for further information. Unless otherwise specified in footnotes, comparisons are ba ...
– **
MySQL MySQL () is an Open-source software, open-source relational database management system (RDBMS). Its name is a combination of "My", the name of co-founder Michael Widenius's daughter My, and "SQL", the acronym for Structured Query Language. A rel ...
( outline) – *
Document-oriented database A document-oriented database, or document store, is a computer program and data storage system designed for storing, retrieving and managing document-oriented information, also known as semi-structured data. Document-oriented databases are one ...
– computer program designed for storing, retrieving, and managing document-oriented, or Semi-structured model, semi structured data, information.


Database models

*
Database model A database model is a type of data model that determines the logical structure of a database. It fundamentally determines in which manner data can be stored, organized and manipulated. The most popular example of a database model is the relatio ...
– theoretical foundation of a database and fundamentally determines in which manner data can be stored, organized, and manipulated in a database system. It thereby defines the infrastructure offered by a particular database system. The most popular example of a database model is the relational model.


Models

*
Flat file database A flat-file database is a database stored in a file called a flat file. Records follow a uniform format, and there are no structures for indexing or recognizing relationships between records. The file is simple. A flat file can be a plain t ...
– various means to encode a database model (most commonly a table) as a single file. *
Hierarchical database model A hierarchical database model is a data model in which the data is organized into a tree-like structure. The data are stored as records which is a collection of one or more fields. Each field contains a single value, and the collection of fields i ...
– data model in which the data is organized into a tree-like structure. * – * Network model – database model conceived as a flexible way of representing objects and their relationships. *
Relational model The relational model (RM) is an approach to managing data using a structure and language consistent with first-order predicate logic, first described in 1969 by English computer scientist Edgar F. Codd, where all data are represented in terms of t ...
– *
Entity–relationship model An entity–relationship model (or ER model) describes interrelated things of interest in a specific domain of knowledge. A basic ER model is composed of entity types (which classify the things of interest) and specifies relationships that can e ...
– *
Graph database A graph database (GDB) is a database that uses graph structures for semantic queries with nodes, edges, and properties to represent and store data. A key concept of the system is the graph (or edge or relationship). The graph relates the dat ...
– uses graph structures with nodes, edges, and properties to represent and store data. *
Object database An object database or object-oriented database is a database management system in which information is represented in the form of objects as used in object-oriented programming. Object databases are different from relational databases which are ...
– database management system in which information is represented in the form of objects as used in object-oriented programming. Also called an "object-oriented database management system". * Entity–attribute–value model


Other models

* – *
Semantic data model A semantic data model (SDM) is a :wiktionary:high-level, high-level semantics-based database description and structuring formalism (database model) for databases. This database model is designed to capture more of the meaning of an application e ...
– *
Star schema In computing, the star schema or star model is the simplest style of data mart Logical schema, schema and is the approach most widely used to develop data warehouses and dimensional data marts. The star schema consists of one or more fact tables ...
– is the simplest style of data warehouse schema. Also called "star-join schema", "
data cube In computer programming contexts, a data cube (or datacube) is a multi-dimensional ("n-D") array of values. Typically, the term data cube is applied in contexts where these arrays are massively larger than the hosting computer's main memory; exa ...
", or "multi-dimensional schema". *
XML database An XML database is a data persistence software system that allows data to be specified, and stored, in XML format. This data can be queried, transformed, exported and returned to a calling system. XML databases are a flavor of document-oriented ...


Implementations

*
Flat file database A flat-file database is a database stored in a file called a flat file. Records follow a uniform format, and there are no structures for indexing or recognizing relationships between records. The file is simple. A flat file can be a plain t ...
– *
Deductive database A deductive database is a database system that can make deductions (i.e. conclude additional facts) based on rules and facts stored in its database. Datalog is the language typically used to specify facts, rules and queries in deductive database ...
– database system that can make deductions. *
Document-oriented database A document-oriented database, or document store, is a computer program and data storage system designed for storing, retrieving and managing document-oriented information, also known as semi-structured data. Document-oriented databases are one ...
– *
Object–relational database An object–relational database (ORD), or object–relational database management system (ORDBMS), is a database management system (DBMS) similar to a relational database, but with an object-oriented database model: objects, classes and inherit ...
– *
Temporal database A temporal database stores data relating to time instances. It offers temporal data types and stores information relating to past, present and future time. Temporal databases can be uni-temporal, bi-temporal or tri-temporal. More specifically the ...
– database with built-in time aspects, for example a temporal data model and a temporal version of Structured Query Language (SQL). *
XML database An XML database is a data persistence software system that allows data to be specified, and stored, in XML format. This data can be queried, transformed, exported and returned to a calling system. XML databases are a flavor of document-oriented ...
– *
Triplestore A triplestore or RDF store is a purpose-built database for the storage and retrieval of triples through semantic queries. A triple is a data entity composed of subject– predicate– object, like "Bob is 35" (i.e., Bob's age measured in years i ...
– purpose-built database for the storage and retrieval of triples, a triple being a data entity composed of subject-predicate-object, like "Bob is 35" or "Bob knows Fred".


Data warehouse

Data warehouse In computing, a data warehouse (DW or DWH), also known as an enterprise data warehouse (EDW), is a system used for Business intelligence, reporting and data analysis and is a core component of business intelligence. Data warehouses are central Re ...


Creating the data warehouse


Concepts

*
Dimension In physics and mathematics, the dimension of a mathematical space (or object) is informally defined as the minimum number of coordinates needed to specify any point within it. Thus, a line has a dimension of one (1D) because only one coo ...
– * Dimensional modeling (DM) – is the name of a set of techniques and concepts used in data warehouse design. *
Fact A fact is a truth, true data, datum about one or more aspects of a circumstance. Standard reference works are often used to Fact-checking, check facts. Science, Scientific facts are verified by repeatable careful observation or measurement by ...
– *
Online analytical processing In computing, online analytical processing (OLAP) (), is an approach to quickly answer multi-dimensional analytical (MDA) queries. The term ''OLAP'' was created as a slight modification of the traditional database term online transaction proces ...
(OLAP) – or is an approach to swiftly answer multi-dimensional analytical (multi-dimensional analytical, MDA) queries. *
Star schema In computing, the star schema or star model is the simplest style of data mart Logical schema, schema and is the approach most widely used to develop data warehouses and dimensional data marts. The star schema consists of one or more fact tables ...
– * Aggregate


Variants

* Anchor Modeling – *
Column-oriented DBMS Data orientation is the representation of tabular data in a linear memory model such as in-disk or in-memory. The two most common representations are column-oriented (columnar format) and row-oriented (row format). The choice of data orienta ...
– database management system (DBMS) that stores data tables as sections of columns of data rather than as rows of data, like most relational DBMSs. * Data Vault Modeling – *
HOLAP In computing, online analytical processing (OLAP) (), is an approach to quickly answer multi-dimensional analytical (MDA) queries. The term ''OLAP'' was created as a slight modification of the traditional database term online transaction proces ...
– *
MOLAP In computing, online analytical processing (OLAP) (), is an approach to quickly answer multi-dimensional analytical (MDA) queries. The term ''OLAP'' was created as a slight modification of the traditional database term online transaction proces ...
– stands for Multidimensional Online Analytical Processing. *
ROLAP In computing, online analytical processing (OLAP) (), is an approach to quickly answer multi-dimensional analytical (MDA) queries. The term ''OLAP'' was created as a slight modification of the traditional database term online transaction proces ...
– stands for Relational Online Analytical Processing. * Operational data store (ODS) – database designed to integrate data from multiple sources for additional operations on the data.


Elements

*
Data dictionary A data dictionary, or metadata repository, as defined in the ''IBM Dictionary of Computing'', is a "centralized repository of information about data such as meaning, relationships to other data, origin, usage, and format". ''Oracle Corporation, ...
– /
Metadata Metadata (or metainformation) is "data that provides information about other data", but not the content of the data itself, such as the text of a message or the image itself. There are many distinct types of metadata, including: * Descriptive ...
– *
Data mart A data mart is a structure/access pattern specific to ''data warehouse'' environments. The data mart is a subset of the data warehouse that focuses on a specific business line, department, subject area, or team. Whereas data warehouses have an en ...
– access layer of the data warehouse environment that is used to get data out to the users. *
Sixth normal form Sixth normal form (6NF) is a normal form used in relational database normalization which extends the relational algebra and generalizes relational operators (such as join) to support interval data, which can be useful in temporal databases. Th ...
(6NF) – term in relational database theory, used in two different ways. *
Surrogate key A surrogate key (or synthetic key, pseudokey, entity identifier, factless key, or technical key) in a database is a unique identifier for either an ''entity'' in the modeled world or an ''object'' in the database. The surrogate key is ''not'' deri ...


Fact

*
Fact table In data warehousing, a fact table consists of the measurements, metrics or Fact (data warehouse), facts of a business process. It is located at the center of a star schema or a snowflake schema surrounded by dimension tables. Where multiple fact t ...
– consists of the measurements, metrics or facts of a business process. * Early-arriving fact – * Measure


Dimension

*
Dimension table A dimension is a structure that categorizes facts and measures in order to enable users to answer business questions. Commonly used dimensions are people, products, place and time. (Note: People and time sometimes are not modeled as dimensions. ...
– one of the set of companion tables to a fact table. * Degenerate dimension – dimension key in the fact table that does not have its own dimension table, because all the interesting attributes have been placed in analytic dimensions. *
Slowly changing dimension In data management and data warehousing, a slowly changing dimension (SCD) is a dimension that stores data which, while generally stable, may change over time, often in an unpredictable manner. This contrasts with a rapidly changing dimension, su ...


Filling

* Extract-Transform-Load (ETL) – * Data extraction – act or process of retrieving data out of (usually unstructured or poorly structured) data sources for further data processing or data storage (data migration). *
Data transformation In computing, data transformation is the process of converting data from one format or structure into another format or structure. It is a fundamental aspect of most data integrationCIO.com. Agile Comes to Data Integration. Retrieved from: https ...
– converts data from a source data format into destination data. * Data loading


Using the data warehouse


Concepts

*
Business intelligence Business intelligence (BI) consists of strategies, methodologies, and technologies used by enterprises for data analysis and management of business information. Common functions of BI technologies include Financial reporting, reporting, online an ...
(BI) – is defined as the ability for an organization to take all its capabilities and convert them into knowledge, ultimately, getting the right information to the right people, at the right time, via the right channel. *
Dashboard A dashboard (also called dash, instrument panel or IP, or fascia) is a control panel (engineering), control panel set within the central console of a vehicle, boat, or cockpit of an aircraft or spacecraft. Usually located directly ahead of the ...
– *
Data mining Data mining is the process of extracting and finding patterns in massive data sets involving methods at the intersection of machine learning, statistics, and database systems. Data mining is an interdisciplinary subfield of computer science and ...
– is the process that results in the discovery of new patterns in large data sets. It is the analysis step of the "Knowledge Discovery in Databases" process, or KDD. * Decision support system (DSS) – *
OLAP cube An OLAP cube is a multi-dimensional array of data. Online analytical processing (OLAP) is a computer-based technique of analyzing data to look for insights. The term ''cube'' here refers to a multi-dimensional dataset, which is also sometimes cal ...
– set of data, organized in a way that facilitates non-predetermined queries for aggregated information, or in other words, online analytical processing.


Languages

*
Data Mining Extensions Data Mining Extensions (DMX) is a query language for data mining models supported by Microsoft's SQL Server Analysis Services product. Like SQL, it supports a data definition language (DDL), data manipulation language (DML) and a data query lan ...
(DMX) – *
MultiDimensional eXpressions Multidimensional Expressions (MDX) is a query language for online analytical processing (OLAP) using a database management system. Much like SQL, it is a query language for OLAP cubes. It is also a calculation language, with syntax similar to spre ...
(MDX) – *
XML for Analysis XML for Analysis (XMLA) is an industry standard for data access in analytical systems, such as online analytical processing (OLAP) and data mining. XMLA is based on other industry standards such as XML, SOAP and HTTP. XMLA is maintained by XMLA Coun ...
(XMLA) –


Tools

*
Business intelligence tools Business intelligence software is a type of application software designed to retrieve, analyze, transform and report data for business intelligence (BI). The applications generally read data that has been previously stored, often - though not nece ...
– *
List of reporting software The following is a list of notable report generator software. Reporting software is used to generate human-readable reports from various data sources. Commercial software * ActiveReports * Actuate Corporation * BOARD * Business Objects * ...
– *
Spreadsheets A spreadsheet is a computer application for computation, organization, analysis and storage of data in tabular form. Spreadsheets were developed as computerized analogs of paper accounting worksheets. The program operates on data entered in ce ...


People

*
Edgar F. Codd Edgar Frank "Ted" Codd (19 August 1923 – 18 April 2003) was a British computer scientist who, while working for IBM, invented the relational model for database management, the theoretical basis for relational databases and relational database ...
– English Computer scientist who introduced the relational database model * Bill Inmon – *
Ralph Kimball Ralph Kimball (born July 18, 1944) is an author on the subject of data warehousing and business intelligence. He is one of the original architects of data warehousing and is known for long-term convictions that data warehouses must be designed to ...
(Born 1944) – author on the subject of data warehousing and business intelligence.


Products

*
Comparison of OLAP servers The following tables compare general and technical information for a number of online analytical processing (OLAP) servers. Please see the individual products articles for further information. General information Data storage modes APIs and q ...


Database-related organizations

* ''
Database In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and a ...
'' * ''
Journal of Database Management The ''Journal of Database Management'' is a top-tier quarterly peer-reviewed academic journal. It was established in 1990 and is published by IGI Global. The editor-in-chief is Professor Keng SIAU (City University of Hong Kong). The scope of the jo ...
''


Database-related publications

* Ling Liu and Tamer M. Özsu (Eds.) (2009).
Encyclopedia of Database Systems
4100 p. 60 illus. . Table of Content available at http://refworks.springer.com/mrw/index.php?id=1217 * Beynon-Davies, P. (2004). Database Systems. 3rd Edition. Palgrave, Houndmills, Basingstoke. * Connolly, Thomas and Carolyn Begg. ''Database Systems.'' New York: Harlow, 2002. * * Gray, J. and Reuter, A. ''Transaction Processing: Concepts and Techniques'', 1st edition, Morgan Kaufmann Publishers, 1992. * Kroenke, David M. and David J. Auer. ''Database Concepts.'' 3rd ed. New York: Prentice, 2007. * * Teorey, T.; Lightstone, S. and Nadeau, T. ''Database Modeling & Design: Logical Design'', 4th edition, Morgan Kaufmann Press, 2005.


Database scholars

*
Serge Abiteboul Serge Joseph Abiteboul (born 25 August 1953 in Paris, France) is a French computer scientist working in the areas of data management, database theory, and finite model theory. Education The son of two hardware store owners, Abiteboul attended hig ...
* David Maier *
Ralph Kimball Ralph Kimball (born July 18, 1944) is an author on the subject of data warehousing and business intelligence. He is one of the original architects of data warehousing and is known for long-term convictions that data warehouses must be designed to ...
* Kamran Parsaye * C. J. Date *
Peter Chen Peter Pin-Shan Chen (; born 3 January 1947) is a Taiwanese-American computer scientist and applied mathematician. He is a retired distinguished career scientist and faculty member at Carnegie Mellon University and Distinguished Chair Professor E ...


See also

* Outline of computing ** Outline of software * Databank *
Database model A database model is a type of data model that determines the logical structure of a database. It fundamentally determines in which manner data can be stored, organized and manipulated. The most popular example of a database model is the relatio ...
*
Database schema The database schema is the structure of a database described in a formal language supported typically by a relational database management system (RDBMS). The term "wikt:schema, schema" refers to the organization of data as a blueprint of how the ...
* Record (database) * Key (database) *
Internet research In its widest sense, Internet research comprises any kind of research done on the Internet or the World Wide Web. Unlike simple fact-checking or web scraping, it often involves synthesizing from diverse sources and verifying the credibility of e ...
*
List of academic databases and search engines This page contains a representative list of major databases and search engines useful in an academic setting for finding and accessing articles in academic journals, institutional repository, institutional repositories, archives, or other collecti ...
*
List of biodiversity databases This is a list of biodiversity databases. Biodiversity databases store taxonomic information alone or more commonly also other information like distribution (spatial) data and ecological data, which provide information on the biodiversity of a pa ...
* List of Internet forums * List of online databases


References


External links


DB File extension
– information about files with the DB extension {{Outline footer *
Databases In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and ana ...
Databases In computing, a database is an organized collection of data or a type of data store based on the use of a database management system (DBMS), the software that interacts with end users, applications, and the database itself to capture and ana ...