Hortonworks, Inc. was a data software company based in Santa Clara, California that developed and supported open-source software (primarily around Apache Hadoop) designed to manage big data and associated processing.
Hortonworks software was used to build enterprise data services and applications such as IoT (connected cars, for example), single view of X (such as customer, risk, patient), and advanced analytics and machine learning (such as next best action and real-time cybersecurity). Hortonworks had three interoperable product lines:
* Hortonworks Data Platform (HDP): based on Apache Hadoop, Apache Hive, and Apache Spark
* Hortonworks DataFlow (HDF): based on Apache NiFi, Apache Storm, and Apache Kafka
* Hortonworks DataPlane services (DPS): based on Apache Atlas and Cloudbreak and a pluggable architecture into which partners such as IBM can add their services.
In January 2019, Hortonworks completed its merger with Cloudera.
History
Hortonworks was formed in June 2011 as an independent company, funded by $23 million in venture capital from Yahoo! and Benchmark Capital. Its first office was in Sunnyvale, California.
The company employed contributors to the open-source software project Apache Hadoop. The Hortonworks Data Platform (HDP) product, first released in June 2012, included Apache Hadoop and was used for storing, processing, and analyzing large volumes of data. The platform was designed to deal with data from many sources and formats. It included Hadoop technology such as the Hadoop Distributed File System, MapReduce, Pig, Hive, HBase, ZooKeeper, and additional components.
Eric Baldeschweiler (from Yahoo) was the initial chief executive, and Rob Bearden, formerly of SpringSource, was chief operating officer. Benchmark partner Peter Fenton was a board member. The company name refers to the character Horton the Elephant, since the elephant is the symbol for Hadoop.
In October 2018, Hortonworks and Cloudera announced they would be merging in an all-stock merger of equals.
After the merger, the Apache products of Hortonworks became Cloudera Data Platform.