How to integrate Parquet File in MongoDB

Learn how to connect Parquet File and MongoDB and instantly get access to your data.

About Parquet File

Apache Parquet is an open-source data repository of the Apache Hadoop ecosystem. It is comparable to the other columnar storage formats RCFile and Optimized RCFile available in Hadoop. It is compatible with most data processing frameworks in the Hadoop environment. It provides efficient data compression and encryption systems with improved performance for processing complex data in large volumes.

About MongoDB

MongoDB is an open-source document-oriented cross-platform database program that uses JSON-like documents with schemas and is classified as a NoSQL database. The main features of MongoDB among others cover file storage, load balancing, indexing, ad-hoc queries.

What is Data Virtuality?

Data Virtuality enables companies to build an agile BI stack in 1 day. It connects to Parquet File, MongoDB and more than 200 other databases and cloud services. All connected data sources can be directly queried with SQL and data can be moved into any analytical database. Customers of the Data Virtuality Logical Data Warehouse are digital businesses with the highest flexibility needs.

IMMEDIATE ACCESS TO DATA

Connect over 200+ databases, cloud services and files (XML, CSV, etc.) in minutes. Query data with your favorite analysis tools.

CENTRAL DATA MODEL

Set uniform definitions for their data and apply them to their analysis tools, regardless of the underlying data source.

ALL QUESTIONS IN SQL

With Data Virtuality you can query all data sources with SQL. NoSQL, CSV or XML File: We transform any connected data source in SQL.

REAL TIME REPORTING

Data Virtuality controls the exchange of data between all databases, cloud services, and analysis tools to help everyone in their organization get the information they need.