News
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua.
Cloudera cluster on Alibaba Cloud
CLOUD INFRASTRUCTURE ENGINEERCloudera Enterprise is a modern platform for machine learning and analytics, optimized...
Syntio Janitor
If you are managing an elaborate data platform, it is crucial to get the right data in the right place at the right...
Why Open Source?
BRAND & MARKETING SPECIALIST On November the 25th we took a giant step in our almost four-year history by...
Syntio presents Persistor
Imagine if you could access all your raw, unfiltered data in an instant, never having to ask any of your data...
Serverless ETL orchestration using AWS Step functions and on-demand Redshift cluster
DATA ENGINEER We're building a recommendation engine that is based on customer usage, billing data and a set of...
Back To Work
SENIOR DATA ENGINEER Now that you shopped all the shops, walked all the walks and re-friended all your friends, it is...
GCP pipeline: pub/sub-lookup-storage (part 2/2)
DATA ENGINEERS This post will briefly describe how to create Cloud Run service and showcase two different cases for...
High Performance Computing with Slurm on GCP
DATA ENGINEER We are talking about what is HPC, the motivation to use it, and then we choose the world of Slurm to...
Data processing with Dataflow SQL (part 2/2)
DATA ENGINEERIn previous blog post we had a short introduction to Dataflow SQL and Apache Beam in general, which will...