In this blog post we present a way to version your Airflow DAGs on a single server through isolated pipeline and... ...
June 10, 2022
In this blog post we present a way to version your Airflow DAGs on a single server through isolated pipeline and... ...
June 10, 2022
By Vikram Srivastava and Marcelo Mayworm
As part of our “Data Engineers of Netflix” series, Pallavi Phadnis shares her journey to Data Engineering at Netflix.
October 28, 2021
This post is part of our “Data Engineers of Netflix” series, where our very own data engineers talk about their journ...
July 14, 2021
By Alok Tiagi, Hariharan Ananthakrishnan, Ivan Porto Carrero and Keerti Lakshminarayan
Data Engineers of Netflix — Interview with Samuel Setegne Netflix Technology Blog Follow Apr 26, 2021 · 5 min read Sa...
This post is part of our “Data Engineers of Netflix” interview series, where our very own data engineers talk about t...
March 15, 2021
Co-authors: Chris Li, Kevin Lau, and Subbu Sanka Editor’s Note: Recently, the Apache Software Foundation (ASF) announ...
February 24, 2021
By Anupom Syam
By Hariharan Ananthakrishnan and Angela Ho
Realtime and batch analytics at Airbnb and the role Druid plays in our analytics system architecture
November 13, 2018
Apache’s lightning fast engine for data analysis and machine learning ...
Solving the many small files problem for AVRO ...
Over the last two years at LinkedIn, I’ve been working on a distributed key-value database called “Venice.” Venice is...
December 20, 2017
Editor's note: This blog has been updated. Typical distributed data systems are clusters composed of a set of machine...
July 26, 2017
We look at the design, implementation, and generation of complex events. ...
July 13, 2017
What is the shape of your big data? While we do love to talk about the size of our big data—terabytes, petabytes, and...
Training models using larger amounts of data for a great Zalando Hack Week project. ...
March 29, 2017