Data engineering Google Cloud PlatformAutomate data collection with GCP Cloud Functions Introduction In the previous post, we have demonstrate steps to build an Airflow ELT data pipeline for crime… 4 minute read
Data engineering Data pipeline Google Cloud PlatformBuild a data pipeline with Airflow and GCP Using London crime data to create an automatic end-to-end pipeline. 3 minute read
Data engineering Data pipeline Google Cloud PlatformTransforming data in BigQuery We will go through three different transformations on a dataset in BigQuery. Filter for unique data Using REGEX_CONTAINS… 3 minute read
Data engineering Data pipeline Google Cloud PlatformRunning BigQuery DAG in Airflow locally with Docker Before we deploy new DAG to production, it’s best practice to test it out locally to spot any… 5 minute read
Data engineering Google Cloud PlatformUsing pandas_gbq to import dataframe to BigQuery Given data in a text file which is the format of a python dictionary, we can first turn… 1 minute read
Data engineering Google Cloud PlatformHow do I prepared and passed the Google Cloud Professional Data Engineer Certification Exam As a data analyst who wants to up-skill myself and hopefully become a data engineer, I figure that… 4 minute read
Case study Data analyticsLondon crime rate study with BigQuery and Data Studio Finding the safest neighborhood in London with DataStudio 1. Getting our data and process it Following step 1… 4 minute read
Case study Data analyticsLondon crime rate study with GeoViz Try to find the safest neighborhood in London. 1. Get crime raw data from London Datastore No better… 3 minute read
Data engineeringSetup Cloud Spanner database and exporting it Cloud spanner is a transactional database that designed for cloud with horizontal and vertical scaling capability. We will… 2 minute read
Data engineeringStreaming data to BigQuery with Pub/Sub In order to create a streaming data pipeline in Google Cloud Platform, we are going to make use… 2 minute read