
Google BigQuery — 101 (Things you should know)

Praneeth Kandula
6 min read · Jan 1, 2021


BigQuery (Source: cloud.google.com/bigquery)

What is BigQuery?
BigQuery is a serverless, highly scalable, highly efficient, cloud-based data warehouse with a distributed SQL query engine that can process terabytes of data in seconds.

The traditional way to work with a data warehouse is to start with an ETL job: Extract the raw data from the source, Transform it, and Load it into the data warehouse. An ETL pipeline for BigQuery is typically written in Apache Beam or Apache Spark. It extracts raw data (streaming data or batch files), transforms it (cleanup and/or aggregations), and then loads it into BigQuery.

ETL Pipeline into BigQuery
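
To make the three ETL stages concrete, here is a minimal sketch in plain Python. It is not Apache Beam or Spark code and it does not talk to BigQuery: the sample data, the function names, and the list used as the "warehouse table" are all assumptions made for illustration. In a real pipeline the load step would call a BigQuery client or a Beam/Spark BigQuery sink instead.

```python
import csv
import io
from collections import defaultdict

# Hypothetical raw input: a tiny batch file containing one malformed row.
RAW_CSV = """user_id,amount
u1,10.5
u2,abc
u1,4.5
u3,7
"""


def extract(text):
    """Extract: parse the raw CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(records):
    """Transform: skip rows with a non-numeric amount, then total per user."""
    totals = defaultdict(float)
    for rec in records:
        try:
            amount = float(rec["amount"])
        except ValueError:
            continue  # cleanup: ignore malformed rows
        totals[rec["user_id"]] += amount  # aggregation
    return [
        {"user_id": user, "total_amount": total}
        for user, total in sorted(totals.items())
    ]


def load(rows, sink):
    """Load: append the finished rows to the destination.

    `sink` is a plain list standing in for a warehouse table (an assumption
    for this sketch, not a BigQuery API).
    """
    sink.extend(rows)
    return len(rows)


warehouse_table = []
loaded = load(transform(extract(RAW_CSV)), warehouse_table)
print(loaded, warehouse_table)
```

The point of the sketch is the ordering: the data is already cleaned and aggregated before it reaches the warehouse.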

Alternatively, you can configure an EL or ELT pipeline: load the raw data into BigQuery first and, in the ELT case, transform it there with SQL.

Workflows
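
The ELT ordering can be sketched the same way. The example below uses Python's built-in sqlite3 module purely as a stand-in for the warehouse so that it runs anywhere. It is not BigQuery, and the table names and sample rows are hypothetical. What it illustrates is only the pattern: land the raw data untouched, then transform it with SQL inside the warehouse.

```python
import sqlite3

# Stand-in warehouse: an in-memory SQLite database (an assumption, NOT BigQuery).
conn = sqlite3.connect(":memory:")

# Load: land the raw records as-is, with every column stored as text.
conn.execute("CREATE TABLE raw_orders (user_id TEXT, amount TEXT)")
raw_rows = [("u1", "10.5"), ("u2", "abc"), ("u1", "4.5"), ("u3", "7")]
conn.executemany("INSERT INTO raw_orders VALUES (?, ?)", raw_rows)

# Transform: clean and aggregate with SQL after the data is already loaded.
# The GLOB filter keeps only rows whose amount starts with a digit.
conn.execute(
    """
    CREATE TABLE user_totals AS
    SELECT user_id, SUM(CAST(amount AS REAL)) AS total_amount
    FROM raw_orders
    WHERE amount GLOB '[0-9]*'
    GROUP BY user_id
    """
)

user_totals = conn.execute(
    "SELECT user_id, total_amount FROM user_totals ORDER BY user_id"
).fetchall()
print(user_totals)
```

Because the raw table is kept, the transformation can be rewritten and rerun later without extracting the source data again, which is a common reason to prefer ELT.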

BigQuery Highlights/Features:

(From the official Documentation: https://cloud.google.com/bigquery/docs)
