Apache Kudu is a
free and open source
Free and open-source software (FOSS) is a term used to refer to groups of software consisting of both free software and open-source software where anyone is freely licensed to use, copy, study, and change the software in any way, and the source ...
column-oriented data store of the
Apache Hadoop ecosystem. It is compatible with most of the data processing frameworks in the
Hadoop
Apache Hadoop () is a collection of open-source software utilities that facilitates using a network of many computers to solve problems involving massive amounts of data and computation. It provides a software framework for distributed storage ...
environment. It provides completeness to Hadoop's storage layer to enable fast analytics on fast data.
The
open source
Open source is source code that is made freely available for possible modification and redistribution. Products include permission to use the source code, design documents, or content of the product. The open-source model is a decentralized sof ...
project to build Apache Kudu began as internal project at
Cloudera
Cloudera, Inc. is an American software company providing enterprise data management systems that make significant use of Apache Hadoop. As of January 31, 2021, the company had approximately 1,800 customers.
History
Cloudera, Inc. was formed on ...
. The first version Apache Kudu 1.0 was released 19 September 2016.
Comparison with other storage engines
Kudu was designed and optimized for
OLAP
Online analytical processing, or OLAP (), is an approach to answer multi-dimensional analytical (MDA) queries swiftly in computing. OLAP is part of the broader category of business intelligence, which also encompasses relational databases, repo ...
workloads. Like HBase, it is a real-time store that supports key-indexed record lookup and mutation.
Kudu differs from HBase since Kudu's datamodel is a more traditional relational model, while HBase is schemaless. Kudu's "on-disk representation is truly columnar and follows an entirely different storage design than HBase/
Bigtable
Bigtable is a fully managed wide-column and key-value NoSQL database service for large analytical and operational workloads as part of the Google Cloud portfolio.
History
Bigtable development began in 2004.. It is now used by a number of Googl ...
".
See also
*
List of column-oriented DBMSes
This article is a list of column-oriented database management system software.
Free and open-source software (FOSS)
Platform as a Service (PaaS)
*Amazon Redshift
* Microsoft Azure SQL Data Warehouse
* Google BigQuery
* Oracle Autonomous ...
References
External links
*
Apache Kudu GitHub repository
{{DEFAULTSORT:Kudu
Kudu
The kudus are two species of antelope of the genus ''Tragelaphus'':
* Lesser kudu, ''Tragelaphus imberbis'', of eastern Africa
* Greater kudu, ''Tragelaphus strepsiceros'', of eastern and southern Africa
The two species look similar, thoug ...
Software using the Apache license
C++ software
Free system software
Free software
Free database management systems
Cloud computing
Online analytical processing
Data warehousing
Data warehousing products
Data analysis software
Distributed data stores
Structured storage