
Google BigQuery — 101 (Things you should know)

6 min read · Jan 1, 2021

BigQuery (Source: cloud.google.com/bigquery)

What is BigQuery?
-Serverless
-Highly Scalable
-Highly Efficient
-Cloud-based Data Warehouse
-Includes a Distributed SQL Query Engine that can process terabytes of data in seconds

The traditional way to work with a data warehouse is to start with an ETL job: Extract the raw data from the source, Transform it, and Load it into the data warehouse. An ETL pipeline that loads data into BigQuery is typically written in Apache Beam or Apache Spark. It extracts raw data (either streaming data or batch files), transforms it (cleanup and/or aggregations), and then loads the result into BigQuery.
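To make the three ETL stages concrete, here is a minimal, dependency-free sketch in plain Python. It is not Apache Beam or Apache Spark code, and it does not talk to BigQuery: the sample data in `RAW_CSV`, the function names, and the in-memory `warehouse_table` list standing in for a BigQuery table are all assumptions made up for illustration.

```python
import csv
import io
from collections import defaultdict

# Hypothetical raw input: a small batch file with some messy values.
RAW_CSV = """user_id,country,amount
1, us ,10.50
2,DE,3.00
3,us,
4,de,7.25
"""


def extract(raw_text):
    """Extract: read raw rows from the source (a CSV string in this sketch)."""
    return list(csv.DictReader(io.StringIO(raw_text)))


def transform(rows):
    """Transform: skip rows with no amount, normalise country, sum per country."""
    totals = defaultdict(float)
    for row in rows:
        amount = row["amount"].strip()
        if not amount:
            continue  # cleanup: drop incomplete records
        country = row["country"].strip().upper()
        totals[country] += float(amount)  # aggregation
    return [
        {"country": country, "total_amount": round(total, 2)}
        for country, total in sorted(totals.items())
    ]


def load(rows, table):
    """Load: append rows to a list that stands in for a warehouse table."""
    table.extend(rows)
    return len(rows)


warehouse_table = []  # assumption: placeholder for a real BigQuery table
loaded = load(transform(extract(RAW_CSV)), warehouse_table)
print(loaded, warehouse_table)
```

In a real pipeline the `load` step would write to BigQuery rather than to a Python list, and a framework such as Beam or Spark would take care of distributing the extract and transform work.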

Alternatively, you can configure an EL or ELT pipeline to load data into BigQuery: the raw data is loaded first and, in the ELT case, transformed afterwards inside the warehouse.
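The ELT ordering (load the raw data first, then transform it with SQL inside the warehouse) can be sketched as follows. This is only an illustration under loud assumptions: Python's built-in `sqlite3` module stands in for BigQuery, and the table names `raw_orders` and `orders_by_country` and the sample rows are hypothetical.

```python
import sqlite3

# Assumption: an in-memory SQLite database stands in for the warehouse.
conn = sqlite3.connect(":memory:")

# E + L: land the raw records untouched, messy values included.
conn.execute("CREATE TABLE raw_orders (user_id TEXT, country TEXT, amount TEXT)")
raw_rows = [
    ("1", " us ", "10.50"),
    ("2", "DE", "3.00"),
    ("3", "us", ""),
    ("4", "de", "7.25"),
]
conn.executemany("INSERT INTO raw_orders VALUES (?, ?, ?)", raw_rows)

# T: clean up and aggregate with SQL, inside the warehouse itself.
conn.execute(
    """
    CREATE TABLE orders_by_country AS
    SELECT UPPER(TRIM(country)) AS country,
           SUM(CAST(amount AS REAL)) AS total_amount
    FROM raw_orders
    WHERE TRIM(amount) <> ''
    GROUP BY UPPER(TRIM(country))
    """
)

result = conn.execute(
    "SELECT country, total_amount FROM orders_by_country ORDER BY country"
).fetchall()
print(result)
```

The point of the design is that the raw table stays available in the warehouse, so the transformation can be changed and re-run later without extracting the source data again.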

BigQuery Highlights/Features:

(From the official documentation: https://cloud.google.com/bigquery/docs)


Written by Praneeth Kandula
