Co-authors: Khai Tran and Steve Weiss. Batch and streaming computations are often combined in the Lambda architecture, but carry the cost of mainta...
Blog posts about Samza
If Apache Kafka is the lifeblood of all nearline processing at LinkedIn, then Apache Samza is the beating heart pumping that blood...
Jun 20, 2019
We are pleased to announce today the release of Samza 1.0, a significant milestone in the history of the project. Apache Samza is a...
Nov 27, 2018
Co-authors: Vivek Nelamangala and PJ Xiao. Introduction to Notifications: Social media are computer-mediated platforms that facilitate creation and sharing o...
Co-authors: Max Wolffe and Akhilesh Gupta. Introduction: You can’t fix something if you don’t know there’s a problem. Measuring and...
Apr 19, 2018
Co-author: Bharath Kumarasubramanian. If you’re sharing content on LinkedIn, you’re positioning yourself as a thought leader among the largest group of profe...
Nov 1, 2016
This post is the second in a series of posts that discuss some of the hard problems in stream processing. In the previous post, we explored the use of lamb...
Aug 22, 2016
We live in an age where we want to know relevant things happening around the world as soon as they happen; an age where digital content is updated instantl...
At LinkedIn, events pertaining to application and system monitoring, member behavior tracking, inter-application communication, etc.,...
Jan 26, 2016
This is an interview with Clement Fung, who interned with the Voldemort team last year and liked LinkedIn so much that he decided to come back this year fo...
LinkedIn started in 2003 with the goal of connecting to your network for better job opportunities. It had only 2,700 members the first...
The LinkedIn engineering team has developed and built Apache Kafka into a powerful open source solution for managing streams of...
Jan 29, 2015
Apache Samza is a stream processing framework that LinkedIn developed to solve some of our toughest stream processing challenges. We open sourced it in Sep...
This post originally appeared as a contributed piece on The New Stack. LinkedIn began processing “big data” on Apache Hadoop six years...
Jan 8, 2015
At LinkedIn, we use a log-centric system called Apache Kafka to move tons of data around. If you're not familiar with Kafka, you can...
It's not easy to quickly gather all the data that goes into a LinkedIn page view, particularly for something like our home page. LinkedIn benefits from a v...
Aug 18, 2014
Less than a year ago, we announced the first open source release of Apache Incubator Samza, a framework for processing big data...