Co-authors: Chris Li, Kevin Lau, and Subbu Sanka Editor’s Note: Recently, the Apache Software Foundation (ASF) announced Apache® Gobblin™ as a Top-Level Pr...
Feb 24, 2021
Co-authors: Chris Li, Kevin Lau, and Subbu Sanka Editor’s Note: Recently, the Apache Software Foundation (ASF) announced Apache® Gobblin™ as a Top-Level Pr...
Feb 24, 2021
Co-authors: Zihan Li, Sudarshan Vasudevan, Lei Sun, and Shirshanka Das Data analytics and AI power many business-critical use cases at...
Co-authors: Khai Tran and Steve Weiss Batch and streaming computations are often combined together in the Lambda architecture, but carry the cost of mainta...
Co-authors: Krishnan Raman and Joey Salacup Editor's note: This blog has been updated. Monitoring big data pipelines often equates to...
Oct 30, 2019
Gobblin is a distributed data integration framework that simplifies common aspects of big data integration, such as ingestion, replication, organization, a...
Jan 17, 2018
About a year ago, we open sourced Gobblin, a universal data ingestion framework that aimed to solve data integration challenges faced by people working on ...
We shared Gobblin with the open source community a year ago. Since then, we’ve seen increasing interest and adoption among engineers, researchers and analy...
Apr 13, 2016
Genesis Less than a year ago, we introduced Gobblin, a unified ingestion framework, to the world of Big Data. Since then, we’ve shared ongoing progress thr...
When dealing with massive amounts of data at scale, it’s important to have state of the art infrastructure and algorithms that can...
Authors: Shirshanka Das, Lin Qiao The holiday season for gobbling is upon us; and at LinkedIn, we’ve been getting better at gobbling large amounts of diffe...