In database normalization, unnormalized form (UNF or 0NF), also known as an unnormalized relation or non-first normal form (N1NF or NF²), is a database data model (organization of data in a database) which does not meet any of the conditions of database normalization defined by the relational model. Database systems which support unnormalized data are sometimes called non-relational or NoSQL databases. In the relational model, unnormalized relations can be considered the starting point for a process of normalization.
"Unnormalized form" should not be confused with denormalization, where normalization is deliberately compromised for selected tables in a relational database.
History
In 1970, E. F. Codd proposed the relational data model, now widely accepted as the standard data model. At that time, office automation was the major use of data storage systems, which resulted in the proposal of many UNF/NF² data models like the Schek model, Jaeschke models (non-recursive and recursive algebra), and the nested table data model (NTD). IBM organized the first international workshop exclusively on this topic in 1987, which was held in Darmstadt, Germany.
Moreover, a considerable body of research has been published to address the shortcomings of the relational model. Since the turn of the millennium, NoSQL databases have become popular owing to the demands of Web 2.0.
Relational form
Normalization to first normal form requires the initial data to be viewed as relations. In database systems, relations are represented as tables. The relation view implies some constraints on the tables:
* No duplicate rows. In practice, this is ensured by defining one or more columns as the primary key.
* Rows do not have an intrinsic order. While tables have to be stored and presented in ''some'' order, this is unstable and implementation dependent. If a specific ordering needs to be represented, it has to be in the form of data, e.g. a "number" column.
* Columns have unique names within the same table.
* Each column has a domain (or data type) which defines the allowed values in the column.
* All rows in a table have the same set of columns.
This definition does not preclude columns having sets or relations as values, e.g. nested tables. This is the major difference from first normal form.
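The constraints above can be sketched as a small check. This is a minimal illustration rather than a definitive implementation: rows are assumed to be Python dicts keyed by column name, domains are approximated by Python types, and the names `freeze` and `is_relation` are hypothetical. Note that a column whose values are nested tables still passes, which is exactly what separates the relational view from first normal form.

```python
from typing import Any


def freeze(value: Any) -> Any:
    """Turn nested dicts/lists into hashable values so rows can be compared."""
    if isinstance(value, dict):
        return frozenset((k, freeze(v)) for k, v in value.items())
    if isinstance(value, (list, set, frozenset)):
        # A nested relation is treated as an unordered set of rows.
        return frozenset(freeze(v) for v in value)
    return value


def is_relation(rows: list[dict[str, Any]], domains: dict[str, type]) -> bool:
    """Return True if the rows satisfy the relational-view constraints."""
    seen = set()
    for row in rows:
        # All rows have the same set of columns
        # (dict keys are unique, so column names are unique by construction).
        if set(row) != set(domains):
            return False
        # Each value belongs to the domain of its column.
        if not all(isinstance(row[col], typ) for col, typ in domains.items()):
            return False
        # No duplicate rows.
        key = freeze(row)
        if key in seen:
            return False
        seen.add(key)
    return True


# Illustrative data only; the column names and values are assumptions.
domains = {"customer": str, "transactions": list}
nested = [
    {"customer": "A", "transactions": [{"id": 1, "amount": 10}]},
    {"customer": "B", "transactions": []},
]
print(is_relation(nested, domains))                # True: nested tables are allowed
print(is_relation(nested + [nested[0]], domains))  # False: duplicate row
```

Row order plays no part in the check, in line with the rule that rows have no intrinsic order.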
NoSQL databases like document databases typically do not conform to the relational view. For example, a JSON or XML database might support duplicate records and intrinsic ordering. Such databases can be described as non-relational. But there are also database models which support the relational view but do not embrace first normal form. Such models are called non-first normal form relations (abbreviated NFR, N1NF or NF²).
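The difference between a document and a relation can be illustrated with a short sketch. The JSON document and its `tags` field are hypothetical; the point is only that a JSON array keeps order and duplicates, whereas a relation, modelled here as a Python set of tuples, keeps neither.

```python
import json

# Hypothetical document; the field name and values are illustrative only.
doc = json.loads('{"tags": ["red", "blue", "red"]}')

# As stored in the document: a list, so order and duplicates are preserved.
as_document = doc["tags"]

# Viewed as a relation: a set of 1-tuples, so duplicates collapse and order is lost.
as_relation = {(tag,) for tag in doc["tags"]}

print(as_document)       # ['red', 'blue', 'red']
print(len(as_relation))  # 2
```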
Example with a table valued column
This table represents a relation where one of the columns (Transactions) is itself relation-valued. This is a valid relation but does not conform to first normal form, which does not allow nested relations. The table is therefore unnormalized.
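As a rough sketch of such a relation and of the first step of normalization, the snippet below models a relation-valued Transactions column and flattens it. All names, IDs, dates, and amounts are invented for illustration, and `unnest` is a hypothetical helper, not an API of any particular database.

```python
# Hypothetical data: every value here is invented for illustration.
customers = [
    {
        "customer": "Alice",
        "cust_id": 1,
        "transactions": [
            {"tr_id": 101, "date": "2024-01-05", "amount": 30},
            {"tr_id": 102, "date": "2024-01-09", "amount": 15},
        ],
    },
    {
        "customer": "Bob",
        "cust_id": 2,
        "transactions": [
            {"tr_id": 201, "date": "2024-02-11", "amount": 42},
        ],
    },
]


def unnest(rows, nested_column):
    """Flatten a relation-valued column into one flat row per nested row."""
    flat = []
    for row in rows:
        # Copy the outer (atomic) columns, leaving out the nested relation.
        outer = {k: v for k, v in row.items() if k != nested_column}
        for inner in row[nested_column]:
            # Combine outer and inner columns; assumes their names do not clash.
            flat.append({**outer, **inner})
    return flat


flat_rows = unnest(customers, "transactions")
for row in flat_rows:
    print(row)
```

Every value in `flat_rows` is atomic, so the result no longer contains nested relations. In this sketch a customer with no transactions would vanish from the output, and customer details are repeated on every row; splitting the data into separate customer and transaction tables is the usual way to avoid both effects.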
Modern applications
As of 2016, companies like Google, Amazon and Facebook deal with large amounts of data that are difficult to store efficiently. To address this storage problem, they use NoSQL databases, which are based on the principles of the unnormalized relational model. Some examples of NoSQL databases are MongoDB, Apache Cassandra and Redis.
See also
* Denormalization
* Normalization
* First normal form
* Second normal form
* Third normal form
* Boyce–Codd normal form
* NoSQL