How to integrate Parquet File in MySQL CDC

Learn how to connect Parquet File and MySQL CDC and instantly get access to your data.

About Parquet File

Apache Parquet is an open-source data repository of the Apache Hadoop ecosystem. It is comparable to the other columnar storage formats RCFile and Optimized RCFile available in Hadoop. It is compatible with most data processing frameworks in the Hadoop environment. It provides efficient data compression and encryption systems with improved performance for processing complex data in large volumes.

About MySQL CDC

MySQL is a popular open-source RDBMS (relational database management system) and offered under two different editions: the open source MySQL Community Server and the proprietary Enterprise Server. The MySQL Enterprise Server is differentiated by a series of proprietary extensions which install as server plugins but is built from the same code base. In databases, change data capture (CDC) is a set of software patterns used to determine and track data that has changed so that action can be taken using the changed data.

What is DataVirtuality?

DataVirtuality enables companies to build an agile BI stack in 1 day. It connects to Parquet File, MySQL CDC and more than 150 other databases and cloud services. All connected data sources can be directly queried with SQL and data can be moved into any analytical database. Customers of the DataVirtuality Logical Data Warehouse are digital businesses with the highest flexibility needs.

IMMEDIATE ACCESS TO DATA

Connect over 150+ databases, cloud services and files (XML, CSV, etc.) in minutes. Query data with your favorite analysis tools.

CENTRAL DATA MODEL

Set uniform definitions for their data and apply them to their analysis tools, regardless of the underlying data source.

ALL QUESTIONS IN SQL

With DataVirtuality you can query all data sources with SQL. NoSQL, CSV or XML File: We transform any connected data source in SQL.

REAL TIME REPORTING

DataVirtuality controls the exchange of data between all databases, cloud services, and analysis tools to help everyone in their organization get the information they need.