Apache Kudu
   HOME

TheInfoList



OR:

Apache Kudu is a
free and open source Free and open-source software (FOSS) is software available under a license that grants users the right to use, modify, and distribute the software modified or not to everyone free of charge. FOSS is an inclusive umbrella term encompassing free ...
column-oriented data store of the
Apache Hadoop Apache Hadoop () is a collection of open-source software utilities for reliable, scalable, distributed computing. It provides a software framework for distributed storage and processing of big data using the MapReduce programming model. Hadoop wa ...
ecosystem. It is compatible with most of the data processing frameworks in the
Hadoop Apache Hadoop () is a collection of Open-source software, open-source software utilities for reliable, scalable, distributed computing. It provides a software framework for Clustered file system, distributed storage and processing of big data usin ...
environment. It provides completeness to Hadoop's storage layer to enable fast analytics on fast data. The
open source Open source is source code that is made freely available for possible modification and redistribution. Products include permission to use and view the source code, design documents, or content of the product. The open source model is a decentrali ...
project to build Apache Kudu began as internal project at
Cloudera Cloudera, Inc. is an American data lake software company. History Cloudera, Inc. was formed on June 27, 2008 in Burlingame, California by Christophe Bisciglia, Amr Awadallah, Jeff Hammerbacher, and chief executive Mike Olson. Prior to Cloude ...
. The first version Apache Kudu 1.0 was released 19 September 2016.


Comparison with other storage engines

Kudu was designed and optimized for
OLAP In computing, online analytical processing (OLAP) (), is an approach to quickly answer multi-dimensional analytical (MDA) queries. The term ''OLAP'' was created as a slight modification of the traditional database term online transaction processi ...
workloads. Like HBase, it is a real-time store that supports key-indexed record lookup and mutation. Kudu differs from HBase since Kudu's datamodel is a more traditional relational model, while HBase is schemaless. Kudu's "on-disk representation is truly columnar and follows an entirely different storage design than HBase/
Bigtable Bigtable is a fully managed wide-column and key-value NoSQL database service for large analytical and operational workloads as part of the Google Cloud portfolio. History Bigtable development began in 2004.. It is now used by a number of Goo ...
".


See also

*
List of column-oriented DBMSes This article is a list of column-oriented database management system software. Free and open-source software (FOSS) Platform as a Service (PaaS) * Amazon Redshift * Microsoft Azure Synapse Analytics (formerly Azure SQL Data Warehouse) * ...


References


External links

*
Apache Kudu GitHub repository
{{DEFAULTSORT:Kudu
Kudu The kudus are two species of antelope of the genus '' Tragelaphus'': * Lesser kudu, ''Tragelaphus imberbis'', of eastern Africa * Greater kudu, ''Tragelaphus strepsiceros'', of eastern and southern Africa The two species look similar, th ...
Software using the Apache license Free software programmed in C++ Free system software Free database management systems Cloud computing Online analytical processing Data warehousing products Data analysis software Distributed data stores Structured storage