A data management platform (DMP) is a software platform used for collecting and managing
data
Data ( , ) are a collection of discrete or continuous values that convey information, describing the quantity, quality, fact, statistics, other basic units of meaning, or simply sequences of symbols that may be further interpreted for ...
. DMPs allow businesses to identify
audience segments, which can be used to target specific users and contexts in online
advertising
Advertising is the practice and techniques employed to bring attention to a Product (business), product or Service (economics), service. Advertising aims to present a product or service in terms of utility, advantages, and qualities of int ...
campaigns. They may use
big data
Big data primarily refers to data sets that are too large or complex to be dealt with by traditional data processing, data-processing application software, software. Data with many entries (rows) offer greater statistical power, while data with ...
and
artificial intelligence
Artificial intelligence (AI) is the capability of computer, computational systems to perform tasks typically associated with human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making. It is a field of re ...
algorithms to process and analyze large data sets about users from various sources.
Advantages of using DMPs include data organization, increased insight on audiences and markets, and more effective advertisement budgeting.
On the other hand, DMPs often have to deal with
privacy
Privacy (, ) is the ability of an individual or group to seclude themselves or information about themselves, and thereby express themselves selectively.
The domain of privacy partially overlaps with security, which can include the concepts of a ...
concerns due to the integration of third-party software with private data. This technology is continuously being developed by global entities such as
Nielsen and
Oracle
An oracle is a person or thing considered to provide insight, wise counsel or prophetic predictions, most notably including precognition of the future, inspired by deities. If done through occultic means, it is a form of divination.
Descript ...
.
More generally, the term data platform can refer to any
software
Software consists of computer programs that instruct the Execution (computing), execution of a computer. Software also includes design documents and specifications.
The history of software is closely tied to the development of digital comput ...
platform used for collecting and managing
data
Data ( , ) are a collection of discrete or continuous values that convey information, describing the quantity, quality, fact, statistics, other basic units of meaning, or simply sequences of symbols that may be further interpreted for ...
. It is an integrated solution which as of the 2010s can combine functionalities of for example a
data lake
A data lake is a system or data repository, repository of data stored in its natural/raw format, usually object binary large object, blobs or files. A data lake is usually a single store of data including raw copies of source system data, sensor ...
,
data warehouse
In computing, a data warehouse (DW or DWH), also known as an enterprise data warehouse (EDW), is a system used for Business intelligence, reporting and data analysis and is a core component of business intelligence. Data warehouses are central Re ...
or
data hub for
business intelligence
Business intelligence (BI) consists of strategies, methodologies, and technologies used by enterprises for data analysis and management of business information. Common functions of BI technologies include Financial reporting, reporting, online an ...
purposes.
However, this article discusses the use such technology platforms used for collecting and managing data for digital marketing purposes specifically.
Characteristics
Purpose
A DMP is any kind of software that manages the gathering, storage, and organization of data so that useful information can be leveraged from it by marketers, publishers, and other businesses. The data stored may include customer information,
demographics
Demography () is the statistical study of human populations: their size, composition (e.g., ethnic group, age), and how they change through the interplay of fertility (births), mortality (deaths), and migration.
Demographic analysis examin ...
, and mobile identifiers or
cookie IDs, which the DMP will analyze to allow businesses to create targeting segments for advertisements. DMPs can help brands learn more about their customer segments to inform acquisitions strategies and increase their sales. They also allow businesses to gauge the effectiveness of their advertising campaigns.
History
First and second generation programming languages
During the 1950s, data management became a problem for companies as computers were not quick with computations and needed a great amount of labor to deliver results. Companies started by storing their data in warehouses. Early programs were written in
binary and
decimal
The decimal numeral system (also called the base-ten positional numeral system and denary or decanary) is the standard system for denoting integer and non-integer numbers. It is the extension to non-integer numbers (''decimal fractions'') of th ...
and this was known as
absolute machine language, which later was called the first generation programming language.

After this,
assembly language
In computing, assembly language (alternatively assembler language or symbolic machine code), often referred to simply as assembly and commonly abbreviated as ASM or asm, is any low-level programming language with a very strong correspondence bet ...
- which came to be known as second generation programming languages - came into existence. This symbolic machine code grew popular among programmers as they were able to utilize alphabet letters for coding. This led to less errors in programs and improved code readability.
High-level languages
Throughout the 1960s and 1970s, as technology continued to progress and programmers became more in touch with computers, the First and Second Generation Programming Languages evolved into
high-level languages (HLL). These languages are known for being easily readable by a human and were important for allowing one to write a generic program that does not depend on the kind of computer used. HLL were known for emphasizing
memory
Memory is the faculty of the mind by which data or information is encoded, stored, and retrieved when needed. It is the retention of information over time for the purpose of influencing future action. If past events could not be remembe ...
and data management and many of the languages that came out in this era (i.e.
COBOL
COBOL (; an acronym for "common business-oriented language") is a compiled English-like computer programming language designed for business use. It is an imperative, procedural, and, since 2002, object-oriented language. COBOL is primarily ...
,
C, and
C++) are still widely used today.
Online data management and databases
Online transactions soon were a big part of many industries. This was made possible by online
data management systems. These systems can analyze information quickly and they allow programs to read, update and send information to the user.
In the 1970s,
Edgar F. Codd
Edgar Frank "Ted" Codd (19 August 1923 – 18 April 2003) was a British computer scientist who, while working for IBM, invented the relational model for database management, the theoretical basis for relational databases and relational database ...
developed an easy-to-learn language, Structured Query Language (
SQL
Structured Query Language (SQL) (pronounced ''S-Q-L''; or alternatively as "sequel")
is a domain-specific language used to manage data, especially in a relational database management system (RDBMS). It is particularly useful in handling s ...
) that had English commands. This language dealt with
relational database
A relational database (RDB) is a database based on the relational model of data, as proposed by E. F. Codd in 1970.
A Relational Database Management System (RDBMS) is a type of database management system that stores data in a structured for ...
s, improved data processing and decreased duplicated data. This relational model allowed large amounts of data to be processed quickly and improved
parallel processing,
client-server computing, and
graphical user interface
A graphical user interface, or GUI, is a form of user interface that allows user (computing), users to human–computer interaction, interact with electronic devices through Graphics, graphical icon (computing), icons and visual indicators such ...
s and it made multiple users to interact simultaneously.
To deal with the processing and research of Big Data,
NoSQL
NoSQL (originally meaning "Not only SQL" or "non-relational") refers to a type of database design that stores and retrieves data differently from the traditional table-based structure of relational databases. Unlike relational databases, which ...
came into existence. NoSQL's greatest power is its ability to store vast amounts of data. NoSQL was present in 1998, however its popularity among developers grew after 2005.
Cloud and artificial intelligence
Nowadays, data management has transferred over from local storage to the
cloud
In meteorology, a cloud is an aerosol consisting of a visible mass of miniature liquid droplets, frozen crystals, or other particles, suspended in the atmosphere of a planetary body or similar space. Water or various other chemicals may ...
. In the late 1990s and early 2000s, Salesforce and Amazon popularized the concept of internet-based services, which appealed to customers as it reduced in-house maintenance costs and increased flexibility in changing the needs of a business. With the rising prevalence of
artificial intelligence
Artificial intelligence (AI) is the capability of computer, computational systems to perform tasks typically associated with human intelligence, such as learning, reasoning, problem-solving, perception, and decision-making. It is a field of re ...
(AI), it is now easier than ever to store and sort through immense sets of data. It is in this era that DMPs have experienced their rise to prominence as the astronomical amount of user data in the world can now be processed and presented to companies for marketing purposes.
Data pipeline

DMPs first begin by gathering raw data. This requires
data collection
Data collection or data gathering is the process of gathering and measuring information on targeted variables in an established system, which then enables one to answer relevant questions and evaluate outcomes. Data collection is a research com ...
from various sources such as webpage visits and registration forms found online. Moreover, the available data is not limited to one's computer, as mobile devices, social networks, and smart devices all act as constant sources of raw data. From a technical standpoint,
JavaScript
JavaScript (), often abbreviated as JS, is a programming language and core technology of the World Wide Web, alongside HTML and CSS. Ninety-nine percent of websites use JavaScript on the client side for webpage behavior.
Web browsers have ...
trackers and
APIs are used to inform the server when the user is performing an action that should be recorded and stored.
Once the DMP has collected the data, it moves on to then process and integrate all of it. It first cleans the data by filtering out any junk or missing values. Then, it utilizes machine learning algorithms to find patterns across sets of users and organize them on a broad scale. This helps create a 360 degree customer view which in turn helps to integrate first-, second-, and third-party data types into one database.
Next comes the data management phase. Here, the DMP assists its clients - which are other companies seeking to leverage their user data - in creating user profiles. User profiles are segments of specific customer demographics that are intended to help visualize patterns and tendencies across an industry. They are also useful for shedding light on undiscovered market opportunities.
The final step in this process is the activation stage. Once all the data has been gathered, processed, and properly organized and segmented, it is put into use in the marketplace through servers or
DSPs. From here, advertisers uses other third-party services to access a DMP and provide targeted content to their intended audiences.
Functionalities
DMPs are used for
profiling, analyzing, and
targeting online customers in
digital marketing
Digital marketing is the component of marketing that uses the Internet and online-based Information technology, digital technologies such as desktop computers, mobile phones, and other digital media and platforms to promote products and service ...
. They work in the following areas:
* Ad targeting – creating audience segments and targeting specific users with personalized ad campaigns, such as displaying advertisements for cars to users who demonstrate interest in buying a new car.
* User profiling – representing real people using sets of data on user related information, such as needs, interests, and behaviors. Profiles can be created manually or through machine learning algorithms that automatically analyze and profile internet users.
* Look-alike modeling – identifying new clients who behave similarly to current customers for targeted ad campaigns.
* Business insights – discovering new insights about customers and services through data analysis, as well as supplementing existing
CRM systems with additional external data, such as users’ attributes or their interactions with online products.
* Content and product recommendations – using DMP recommendations to develop a personalized experience for all users.
* Monetizing or selling data – selling DMP data to generate additional revenue.
* Audience enrichment – analyzing and getting to know an audience through DMP analytics to know their specific needs.
* Grow your customer base – discovering a new customer segment with DMP analytics and growing awareness and brand loyalty.
Advantages and disadvantages
Advantages
There are six major advantages to using a data management platform: gathering data in one place, using third party data to discover new markets, gaining audience insights, creating a full view of customers, targeting your audience, and effectively budgeting your expenditures on marketing.
[{{cite web, url=https://piwik.pro/blog/data-gathering-platform-dmp-cdp-dw-crm/, title=Data Gathering Platforms: Pros and Cons of DMPs, CDPs, DWs & CRMs, last=Matuszewska, first=Karolina, date=2018-08-31, website=Piwik PRO, access-date=2019-10-30]
Disadvantages
Data management platforms rely heavily on Cookie technology to identify behaviours. Recent moves from Apple and now Google are moving towards blocking third party advertising cookies which places the Data management platform value proposition at risk.
Moreover, the adoption of such a platform may be difficult in an organization's current environment. This is because data aggregation technology is complex and requires the correct technical knowledge for implementation.
Another area of concern is the quality of the imported data: if it is of low quality, then the DMP will fail to provide meaningful results.
Ownership of collected data and privacy concerns
There are three main types of data in general:
* 1st party data – data collected and owned by the company itself. For example, website data, mobile application data, and CRM data.
* 2nd party data – data collected as a result of corporate cooperation. This includes online campaign data and
customer journey
Customer experience, sometimes abbreviated to CX, is the totality of cognitive, affective, sensory, and behavioral responses of a customer during all stages of the consumption process including pre-purchase, consumption, and post-purchase sta ...
data.
* 3rd party data – data delivered by data providers, which is available on the market for purchase.
There are also three main types of data collected by DMPs:
* Observed data – the digital footprint of internet users, i.e. search history or type of web browser used.
* Inferred data – conclusions based on a user's internet behavior.
* Declared data – data explicitly provided from users such as online forms or application sign ups.
DMPs are beneficial in helping digital marketers discover new audiences based on third-party data. Although this is the case, the
General Data Protection Regulation
The General Data Protection Regulation (Regulation (EU) 2016/679), abbreviated GDPR, is a European Union regulation on information privacy in the European Union (EU) and the European Economic Area (EEA). The GDPR is an important component of ...
(GDPR) makes it harder for DMPs to obtain third-party data. Previously, DMPs processed third-party data through
cookies
A cookie is a sweet biscuit with high sugar and fat content. Cookie dough is softer than that used for other types of biscuit, and they are cooked longer at lower temperatures. The dough typically contains flour, sugar, egg, and some type of ...
and existing laws did not require user consent for such data collection. However, the GDPR now demands that personal data - which includes data collected through the use of cookies - can only be used with user consent. Going forward, this means that collecting third-party data will become harder for companies, and DMPs will have greater legal obligations. As a result, future DMPs may rely more heavily on first-party and second-party data.
References
Digital marketing
Big data
Data management software
Data engineering