MarkLogic Server is a
document-oriented database
A document-oriented database, or document store, is a computer program and data storage system designed for storing, retrieving and managing document-oriented information, also known as semi-structured data.
Document-oriented databases are one ...
developed by
MarkLogic. It is a
NoSQL
NoSQL (originally meaning "Not only SQL" or "non-relational") refers to a type of database design that stores and retrieves data differently from the traditional table-based structure of relational databases. Unlike relational databases, which ...
multi-model database that evolved from an
XML database
An XML database is a data persistence software system that allows data to be specified, and stored, in XML format. This data can be queried, transformed, exported and returned to a calling system. XML databases are a flavor of document-oriented ...
to natively store
JSON
JSON (JavaScript Object Notation, pronounced or ) is an open standard file format and electronic data interchange, data interchange format that uses Human-readable medium and data, human-readable text to store and transmit data objects consi ...
documents and
RDF triples, the data model for semantics. MarkLogic is designed to be a
data hub for
operational and
analytical data.
History
MarkLogic Server was built to address shortcomings with existing search and data products. The product first focused on using
XML
Extensible Markup Language (XML) is a markup language and file format for storing, transmitting, and reconstructing data. It defines a set of rules for encoding electronic document, documents in a format that is both human-readable and Machine-r ...
as the document markup standard and
XQuery as the query standard for accessing collections of documents up to hundreds of terabytes in size.
Currently the MarkLogic platform is widely used in publishing, government, finance and other sectors.
MarkLogic's customers are mostly Global 2000 companies.
Technology
MarkLogic uses documents without upfront schemas to maintain a flexible data model. In addition to having a flexible data model, MarkLogic uses a distributed, scale-out architecture that can handle hundreds of billions of documents and hundreds of terabytes of data. It has received Common Criteria certification, and has
high availability and
disaster recovery. MarkLogic is designed to run on-premises and within public or private cloud environments like
Amazon Web Services
Amazon Web Services, Inc. (AWS) is a subsidiary of Amazon.com, Amazon that provides Software as a service, on-demand cloud computing computing platform, platforms and Application programming interface, APIs to individuals, companies, and gover ...
.
Features
;
Indexing
MarkLogic indexes the content and structure of documents including words, phrases, relationships, and values in over 200 languages with tokenization, collation, and stemming for core languages. Functionality includes the ability to toggle range indexes, geospatial indexes, the RDF triple index, and reverse indexes on or off based on your data, the kinds of queries that you will run, and your desired performance.
;
Full-text search
MarkLogic supports search across its data and metadata using a word or phrase and incorporates
Boolean logic
In mathematics and mathematical logic, Boolean algebra is a branch of algebra. It differs from elementary algebra in two ways. First, the values of the variable (mathematics), variables are the truth values ''true'' and ''false'', usually denot ...
, stemming, wildcards, case sensitivity, punctuation sensitivity, diacritic sensitivity, and search term weighting. Data can be searched using JavaScript, XQuery, SPARQL, and SQL.
Semantics
Semantics is the study of linguistic Meaning (philosophy), meaning. It examines what meaning is, how words get their meaning, and how the meaning of a complex expression depends on its parts. Part of this process involves the distinction betwee ...
MarkLogic uses
RDF triples to provide semantics for ease of storing metadata and querying.
ACID
An acid is a molecule or ion capable of either donating a proton (i.e. Hydron, hydrogen cation, H+), known as a Brønsted–Lowry acid–base theory, Brønsted–Lowry acid, or forming a covalent bond with an electron pair, known as a Lewis ...
Unlike other NoSQL databases, MarkLogic maintains
ACID
An acid is a molecule or ion capable of either donating a proton (i.e. Hydron, hydrogen cation, H+), known as a Brønsted–Lowry acid–base theory, Brønsted–Lowry acid, or forming a covalent bond with an electron pair, known as a Lewis ...
consistency for transactions.
; Replication
MarkLogic provides high availability with replica sets.
;
Scalability
Scalability is the property of a system to handle a growing amount of work. One definition for software systems specifies that this may be done by adding resources to the system.
In an economic context, a scalable business model implies that ...
MarkLogic scales horizontally using
sharding.
MarkLogic can run over multiple servers, balancing the load or replicating data to keep the system up and running in the event of hardware failure.
Security
MarkLogic has built in security features such as element-level permissions and data redaction.
Optic API for Relational Operations
An API that lets developers view their data as documents, graphs or rows.
Security
MarkLogic provides redaction, encryption, and element-level security (allowing for control on read and write rights on parts of a document).
Applications
*
Banking
A bank is a financial institution that accepts Deposit account, deposits from the public and creates a demand deposit while simultaneously making loans. Lending activities can be directly performed by the bank or indirectly through capital m ...
*
Big Data
Big data primarily refers to data sets that are too large or complex to be dealt with by traditional data processing, data-processing application software, software. Data with many entries (rows) offer greater statistical power, while data with ...
*
Fraud prevention
* Insurance Claims Management and Underwriting
* Master data management
* Recommendation engines
Licensing
MarkLogic is available under various licensing and delivery models, namely a free ''Developer'' or an ''Essential Enterprise'' license.
">Licenses are available from MarkLogic or directly from cloud marketplaces such as
Amazon Web Services
Amazon Web Services, Inc. (AWS) is a subsidiary of Amazon.com, Amazon that provides Software as a service, on-demand cloud computing computing platform, platforms and Application programming interface, APIs to individuals, companies, and gover ...
and
Microsoft Azure.
Releases
* 2003—Cerisent XQE 1.0
* 2004—Cerisent XQE 2.0
* 2005—MarkLogic Server 3.0
* 2006—MarkLogic Server 3.1
* 2007—MarkLogic Server 3.2
* 2008—MarkLogic Server 4.0
* 2009—MarkLogic Server 4.1
* 2010—MarkLogic Server 4.2
* 2011—MarkLogic Server 5.0
* 2012—MarkLogic Server 6.0
* 2013—MarkLogic Server 7.0
* 2015—MarkLogic Server 8.0: Ability to store
JSON
JSON (JavaScript Object Notation, pronounced or ) is an open standard file format and electronic data interchange, data interchange format that uses Human-readable medium and data, human-readable text to store and transmit data objects consi ...
data and process data using
JavaScript
JavaScript (), often abbreviated as JS, is a programming language and core technology of the World Wide Web, alongside HTML and CSS. Ninety-nine percent of websites use JavaScript on the client side for webpage behavior.
Web browsers have ...
.
* 2017—MarkLogic Server 9.0: Data integration across Relational and Non-Relational data.
* 2019—MarkLogic Server 10.0
* 2022—MarkLogic Server 11.0
See also
*
Document database
A document-oriented database, or document store, is a computer program and data storage system designed for storing, retrieving and managing document-oriented information, also known as Semi-structured model, semi-structured data.
Document-orie ...
*
Graph database
*
Multi-model database
*
NoSQL
NoSQL (originally meaning "Not only SQL" or "non-relational") refers to a type of database design that stores and retrieves data differently from the traditional table-based structure of relational databases. Unlike relational databases, which ...
*
Triple store
*
MongoDB
*
Operational data store
*
Online analytical processing
In computing, online analytical processing (OLAP) (), is an approach to quickly answer multi-dimensional analytical (MDA) queries. The term ''OLAP'' was created as a slight modification of the traditional database term online transaction proces ...
*
Resource description framework
The Resource Description Framework (RDF) is a method to describe and exchange graph data. It was originally designed as a data model for metadata by the World Wide Web Consortium (W3C). It provides a variety of syntax notations and formats, of whi ...
References
{{reflist
Triplestores
XML databases
Document-oriented databases
NoSQL
Data management