The Yahoo! Cloud Serving Benchmark (YCSB) is an open-source specification and program suite for evaluating retrieval and maintenance capabilities of computer programs. It is often used to compare the relative performance of
NoSQL
NoSQL (originally meaning "Not only SQL" or "non-relational") refers to a type of database design that stores and retrieves data differently from the traditional table-based structure of relational databases. Unlike relational databases, which ...
database management systems.
The original benchmark was developed by workers in the research division of
Yahoo!
Yahoo (, styled yahoo''!'' in its logo) is an American web portal that provides the search engine Yahoo Search and related services including My Yahoo, Yahoo Mail, Yahoo News, Yahoo Finance, Yahoo Sports, y!entertainment, yahoo!life, and its a ...
who released it in 2010 with the stated goal of "facilitating performance comparisons of the new generation
of cloud data serving systems", particularly for transaction-processing workloads which differed from ones measured by benchmarks designed for more traditional database management systems.
YCSB was contrasted with the TPC-H benchmark from the
Transaction Processing Performance Council
The Transaction Processing Performance Council (TPC), founded in 1988, is a non-profit organization founded to define benchmarks for transaction processing and databases, and to publish objective, verifiable TPC performance data to the industry ...
, with YCSB being called a
big data
Big data primarily refers to data sets that are too large or complex to be dealt with by traditional data processing, data-processing application software, software. Data with many entries (rows) offer greater statistical power, while data with ...
benchmark while TPC-H is a
decision support system
A decision support system (DSS) is an information system that supports business or organizational decision-making activities. DSSs serve the management, operations and planning levels of an organization (usually mid and higher management) and ...
benchmark.
YCSB was used by DBMS vendors for "benchmark marketing". It has been used in scholarly or tutorial discussions, particularly for
Apache HBase
HBase is an open-source non-relational distributed database modeled after Google's Bigtable and written in Java. It is developed as part of Apache Software Foundation's Apache Hadoop project and runs on top of HDFS (Hadoop Distributed Fil ...
. It has been used for multiple-product comparisons by industry observers such as
Network World
International Data Group (IDG, Inc.) is an American market intelligence and demand generation company focused on the technology industry. IDG, Inc.'s mission is centered around supporting the technology industry through research, data, market ...
(comparing
Cassandra
Cassandra or Kassandra (; , , sometimes referred to as Alexandra; ) in Greek mythology was a Trojan priestess dedicated to the god Apollo and fated by him to utter true prophecy, prophecies but never to be believed. In modern usage her name is e ...
,
MongoDB
MongoDB is a source-available, cross-platform, document-oriented database program. Classified as a NoSQL database product, MongoDB uses JSON-like documents with optional database schema, schemas. Released in February 2009 by 10gen (now MongoDB ...
, and
Riak
Riak (pronounced "ree-ack" ) is a distributed NoSQL key-value data store that offers high availability, fault tolerance, operational simplicity, and scalability. Riak moved to an entirely open-source project in August 2017, with many of the ...
), Thumbtack Technologies (comparing
Aerospike, Cassandra,
Couchbase
Couchbase Server, originally known as Membase, is a source-available, distributed ( shared-nothing architecture) multi-model NoSQL document-oriented database software package optimized for interactive applications. These applications may serv ...
, and MongoDB), and the
Polytechnic Institute
An institute of technology (also referred to as technological university, technical university, university of technology, polytechnic university) is an institution of tertiary education that specializes in engineering, technology, applied science ...
and
University of Coimbra
The University of Coimbra (UC; , ) is a Public university, public research university in Coimbra, Portugal. First established in Lisbon in 1290, it went through a number of relocations until moving permanently to Coimbra in 1537. The university ...
(comparing Cassandra, HBase,
Elasticsearch
Elasticsearch is a Search engine (computing), search engine based on Apache Lucene, a free and open-source search engine. It provides a distributed, Multitenancy, multitenant-capable full-text search engine with an HTTP web interface and schema ...
, MongoDB,
Oracle NoSQL,
OrientDB
OrientDB is an open source NoSQL database management system written in Java. It is a Multi-model database, supporting graph, document and object models, the relationships are managed as in graph databases with direct connections between records. ...
,
Redis
Redis (; Remote Dictionary Server) is an in-memory key–value database, used as a distributed cache and message broker, with optional durability. Because it holds all data in memory and because of its design, Redis offers low- latency reads ...
, Scalaris,
Tarantool
Tarantool is an in-memory computing platform with a flexible data schema, best used for creating high-performance applications. Two main parts of it are an in-memory database and a Lua application server.
Tarantool maintains data in memory and ...
, and
Voldemort
Lord Voldemort ( , in the films) is a fictional character and the main antagonist in the ''Harry Potter'' series of novels by J. K. Rowling. He first appears in '' Harry Potter and the Philosopher's Stone'' (1997) and returns either in pe ...
).
SanDisk Corporation published results measured on the
Oracle NoSQL Database.
Implementations
Original Java ImplementationGoLang ImplementationC++ Implementation for LevelDB, RocksDB, LMDB Embedded Key-Value StoresC++ Implementation for LevelDB, RocksDB, LMDB, WiredTiger, and UDisk Embedded Key-Value Stores
References
{{Reflist
2010 software
Benchmarks (computing)
Yahoo!