How to integrate Parquet File in Cassandra

Learn how to connect Parquet File and Cassandra and instantly get access to your data.

About Parquet File

Apache Parquet is an open-source data repository of the Apache Hadoop ecosystem. It is comparable to the other columnar storage formats RCFile and Optimized RCFile available in Hadoop. It is compatible with most data processing frameworks in the Hadoop environment. It provides efficient data compression and encryption systems with improved performance for processing complex data in large volumes.

About Cassandra

Apache Cassandra is an open-source distributed NoSQL database management system that is scalable while also placing a high value on performance. The Java-based system can be managed and monitored via Java Management Extensions (JMX).  

What is Data Virtuality?

Data Virtuality enables companies to build an agile BI stack in 1 day. It connects to Parquet File, Cassandra and more than 200 other databases and cloud services. All connected data sources can be directly queried with SQL and data can be moved into any analytical database. Customers of the Data Virtuality Logical Data Warehouse are digital businesses with the highest flexibility needs.

IMMEDIATE ACCESS TO DATA

Connect over 200+ databases, cloud services and files (XML, CSV, etc.) in minutes. Query data with your favorite analysis tools.

CENTRAL DATA MODEL

Set uniform definitions for their data and apply them to their analysis tools, regardless of the underlying data source.

ALL QUESTIONS IN SQL

With Data Virtuality you can query all data sources with SQL. NoSQL, CSV or XML File: We transform any connected data source in SQL.

REAL TIME REPORTING

Data Virtuality controls the exchange of data between all databases, cloud services, and analysis tools to help everyone in their organization get the information they need.