How to connect Hive to Pentaho

Discover how to get data from Hive (and from other sources) into Pentaho by loading it into a data warehouse that is connected to Pentaho.
Load your Hive data into your central data warehouse to analyze it with Pentaho.
To analyze Hive data with Pentaho, Pipes gives you fast and easy access to all your data by automatically loading it into your data warehouse. Your data is always up to date, with no performance issues and without writing a single line of code.
1. Connect your data warehouse

This will be the central database for your Hive data. Pipes supports the most popular relational data warehouses in the cloud and on-premises.

2. Connect to Hive

You just need to enter the associated credentials to allow Pipes to access the Hive API.

3. Create a data pipeline

Create a pipeline from Hive to your central data warehouse. The pipeline will run automatically on your defined schedule, so you will always have fresh data available.

4. Access your Hive data with Pentaho

Connect Pentaho to your data warehouse. You will see your Hive data there in the form of standardized tables. Now you can analyze your data without performance issues!

About Hive

Apache Hive is a data warehouse infrastructure built on top of Hadoop that provides data querying, summarization, and analysis. The Apache Hive data warehouse software facilitates reading, writing, and managing large datasets in distributed storage using SQL. A JDBC driver and a command line tool are provided to connect users to Hive.
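
As a rough illustration of the JDBC route mentioned above, the sketch below assembles a HiveServer2 connection URL in the `jdbc:hive2://host:port/database` form. The helper name `build_hive_jdbc_url` and the host `hive.example.com` are placeholders chosen for illustration and are not part of Hive or Pipes. Port 10000 and the `default` database are the usual HiveServer2 defaults, which your cluster may override.

```python
# Usual HiveServer2 defaults; a given cluster may be configured differently.
HIVESERVER2_DEFAULT_PORT = 10000
HIVE_DEFAULT_DATABASE = "default"


def build_hive_jdbc_url(
    host: str,
    port: int = HIVESERVER2_DEFAULT_PORT,
    database: str = HIVE_DEFAULT_DATABASE,
) -> str:
    """Return a HiveServer2 JDBC URL (hypothetical helper, for illustration)."""
    if not host:
        raise ValueError("host must not be empty")
    if not 0 < port < 65536:
        raise ValueError(f"port out of range: {port}")
    return f"jdbc:hive2://{host}:{port}/{database}"


if __name__ == "__main__":
    # hive.example.com is a placeholder host name.
    print(build_hive_jdbc_url("hive.example.com"))
```

A URL of this shape can be handed to the Hive JDBC driver or to the Beeline command line client via `beeline -u <url>`. Authentication settings depend on how your cluster is configured and are left out here.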

About Pentaho

Pentaho provides Business Intelligence solutions in a community and an enterprise edition. Users can create reports and informative dashboards, as well as run data mining and extract, transform, load (ETL) processes with Pentaho.

Your benefits with Pipes

Get central access to all your data

Access data from 200+ data sources with our ready-to-use connectors and replicate it to your central data warehouse.

Automate your data workflows

Stop manually extracting data and automate your data integration without any coding. We maintain all pipelines for you and cover all API changes!

Enable data-driven decision-making

Empower everyone in your company with consistent and standardized data, automate data delivery, and measure KPIs across different systems.