PRINCIPAL CONSULTANT BigQuery is just one service that Google Cloud Platform provides, which is a database service, but if your...
A Custom Data Quality and Profiling Solution With Apache Spark on AWS
PRINCIPAL CONSULTANT, DATA ENGINEER One of the most important steps when doing data integration is understanding the data we’re...
Connecting VSCode using SSH to a VM Instance on GCP
ASSOCIATE DATA ENGINEER Introduction A virtual machine, or a VM, is a virtualized instance of a computer that can perform almost...
Go’s Concurrency Model
ASSOCIATE DATA ENGINEER This post attempts to demonstrate the power of utilizing concurrency, specifically the power of Go's...
Noria
ASSOCIATE DATA ENGINEERS Introduction Nowadays, modern web applications are read-heavy, interactive, and have significant skew....
Using Azure Databricks for Batch and Streaming Processing
JUNIOR DATA ENGINEER, ASSOCIATE DATA ENGINEER Introduction DATABRICKS is an organization and big data processing platform founded...
Machine learning on GCP – Cloud TPU vs Cloud Functions
ASSOCIATE DATA ENGINEERS Introduction In the PREVIOUS BLOG POST, we talked about machine learning on Google Cloud Platform using...
Machine learning using Persistor & Google Cloud Functions
Introduction Machine learning is a very hot topic in today's world. From classification and regression to clustering, we’ve all...
How we speed up a Google Cloud Function… by a factor of 10!
DATA ENGINEER Introduction Recently, we faced a challenge regarding a Google Cloud Function (referred to as CF hereafter)....