By Vikram Srivastava and Marcelo Mayworm
Blog posts about Big Data
As part of our “Data Engineers of Netflix” series, Pallavi Phadnis shares her journey to Data Engineering at Netflix.
Oct 28, 2021
This post is part of our “Data Engineers of Netflix” series, where our very own data engineers talk about their journeys to Data…
Jul 14, 2021
By Alok Tiagi, Hariharan Ananthakrishnan, Ivan Porto Carrero and Keerti Lakshminarayan
This post is part of our “Data Engineers of Netflix” interview series, where our very own data engineers talk about their journeys to Data…
Co-authors: Chris Li, Kevin Lau, and Subbu Sanka
Editor’s Note: Recently, the Apache Software Foundation (ASF) announced Apache® Gobblin™ as a Top-Level Pr...
Feb 24, 2021
By Anupom Syam
By Hariharan Ananthakrishnan and Angela Ho
Realtime and batch analytics at Airbnb and the role Druid plays in our analytics system architecture
Nov 13, 2018
Over the last two years at LinkedIn, I’ve been working on a distributed key-value database called “Venice.” Venice is designed to be a significant improvem...
Dec 20, 2017
Editor's note: This blog has been updated. Typical distributed data systems are clusters composed of a set of machines. If the dataset...
What is the shape of your big data? While we do love to talk about the size of our big data—terabytes, petabytes, and beyond—perhaps we are not paying due ...
This post is the second in a series discussing asynchronous processing and multithreading in Apache Samza. In the previous post, we...
Jan 6, 2017
This post is the second in a series of posts that discuss some of the hard problems in stream processing. In the previous post, we explored the use of lamb...
Aug 22, 2016
About a year ago, we open sourced Gobblin, a universal data ingestion framework that aimed to solve data integration challenges faced by people working on ...
We live in an age where we want to know relevant things happening around the world as soon as they happen; an age where digital content is updated instantl...
LinkedIn leverages professional and educational data from our more than 400 million members to create Student Decision tools that can help students make sm...