By Vikram Srivastava and Marcelo Mayworm
Blog posts about Stream Processing
By Andrew Nguonly, Armando Magalhães, Obi-Ike Nwoke, Shervin Afshar, Sreyashi Das, Tongliang Liu, Wei Liu, Yucheng Zeng
Co-authors: Zihan Li, Sudarshan Vasudevan, Lei Sun, and Shirshanka Das Data analytics and AI power many business-critical use cases at...
Co-authors: Xiang Zhang and Jingyu Zhu Introduction The Lambda architecture has become a popular architectural style that promises...
Co-authors: Yixing Zhang, Bingfeng Xia, Ke Wu, and Xinyu Liu Since Beam Samza runner was developed in 2018 at LinkedIn, we now have 100+ Samza Beam jobs ru...
Co-authors: Khai Tran and Steve Weiss Batch and streaming computations are often combined together in the Lambda architecture, but carry the cost of mainta...
By Jeff Chao on behalf of the Mantis team
Editor's note: This blog has been updated. Brooklin—a distributed service for streaming data in near real-time and at scale—has been...
If Apache Kafka is the lifeblood of all nearline processing at LinkedIn, then Apache Samza is the beating heart pumping that blood...
Jun 20, 2019
The existing Lambda architecture With the evolution of big data technologies over time, two classes of computations have been...
Jan 29, 2019
We are pleased to announce today the release of Samza 1.0, a significant milestone in the history of the project. Apache Samza is a...
Nov 27, 2018
A few years ago, we announced Rest.li 2.x and a Protocol Upgrade Story. Today, we are excited to share another major milestone: the...
Nov 2, 2018
Gobblin is a distributed data integration framework that simplifies common aspects of big data integration, such as ingestion, replication, organization, a...
Jan 17, 2018
Over the last two years at LinkedIn, I’ve been working on a distributed key-value database called “Venice.” Venice is designed to be a significant improvem...
Dec 20, 2017
Co-authors: Saurabh Goyal and Janardh Bantupalli In our previous blog post introducing Brooklin, we outlined the reasons why we created our own framework f...
Nov 22, 2017
Editor's note: This blog has been updated. Near-realtime (nearline) applications drive many of the critical services within LinkedIn, such as notifications...
Oct 11, 2017
Over a decade ago, test strategies invested heavily in UI-driven tests. Backend and mid-tier services were tested using automated UI-based tests. While UI-...
Apr 27, 2017
This post is the second in a series discussing asynchronous processing and multithreading in Apache Samza. In the previous post, we...
Jan 6, 2017