- Elasticsearch for Apache Hadoop and Spark: other versions:
- Preface
- Elasticsearch for Apache Hadoop
- Documentation sections
- Key features
- Requirements
- Installation
- Architecture
- Configuration
- Runtime options
- Security
- Logging
- Map/Reduce integration
- Cascading support
- Apache Hive integration
- Apache Pig support
- Apache Spark support
- Apache Storm support
- Mapping and Types
- Error Handlers
- Hadoop Metrics
- Performance considerations
- Cloud/restricted environments
- Troubleshooting
- Resources
- License
- Breaking Changes
- Release Notes
- Elasticsearch for Apache Hadoop version 6.5.4
- Elasticsearch for Apache Hadoop version 6.5.3
- Elasticsearch for Apache Hadoop version 6.5.2
- Elasticsearch for Apache Hadoop version 6.5.1
- Elasticsearch for Apache Hadoop version 6.5.0
- Elasticsearch for Apache Hadoop version 6.4.3
- Elasticsearch for Apache Hadoop version 6.4.2
- Elasticsearch for Apache Hadoop version 6.4.1
- Elasticsearch for Apache Hadoop version 6.4.0
- Elasticsearch for Apache Hadoop version 6.3.2
- Elasticsearch for Apache Hadoop version 6.3.1
- Elasticsearch for Apache Hadoop version 6.3.0
- Elasticsearch for Apache Hadoop version 6.2.4
- Elasticsearch for Apache Hadoop version 6.2.3
- Elasticsearch for Apache Hadoop version 6.2.2
- Elasticsearch for Apache Hadoop version 6.2.1
- Elasticsearch for Apache Hadoop version 6.2.0
- Elasticsearch for Apache Hadoop version 6.1.4
- Elasticsearch for Apache Hadoop version 6.1.3
- Elasticsearch for Apache Hadoop version 6.1.2
- Elasticsearch for Apache Hadoop version 6.1.1
- Elasticsearch for Apache Hadoop version 6.1.0
- Elasticsearch for Apache Hadoop version 6.0.1
- Elasticsearch for Apache Hadoop version 6.0.0
- Elasticsearch for Apache Hadoop version 6.0.0-rc2
- Elasticsearch for Apache Hadoop version 6.0.0-rc1
- Elasticsearch for Apache Hadoop version 6.0.0-beta2
- Elasticsearch for Apache Hadoop version 6.0.0-beta1
- Elasticsearch for Apache Hadoop version 6.0.0-alpha2
- Elasticsearch for Apache Hadoop version 6.0.0-alpha1
Elasticsearch for Apache Hadoop
editElasticsearch for Apache Hadoop
editElasticsearch for Apache Hadoop is an open-source, stand-alone, self-contained, small library that allows Hadoop jobs (whether using Map/Reduce or libraries built upon it such as Hive, Pig or Cascading or new upcoming libraries like Apache Spark ) to interact with Elasticsearch. One can think of it as a connector that allows data to flow bi-directionaly so that applications can leverage transparently the Elasticsearch engine capabilities to significantly enrich their capabilities and increase the performance.
Elasticsearch for Apache Hadoop offers first-class support for vanilla Map/Reduce, Cascading, Pig and Hive so that using Elasticsearch is literally like using resources within the Hadoop cluster. As such, Elasticsearch for Apache Hadoop is a passive component, allowing Hadoop jobs to use it as a library and interact with Elasticsearch through Elasticsearch for Apache Hadoop APIs.
While the official name of the project is Elasticsearch for Apache Hadoop throughout the documentation the term elasticsearch-hadoop will be used instead to increase readability.
If you are looking for Elasticsearch HDFS Snapshot/Restore plugin (a separate project), please refer to its home page.