Discover Best Tech Engineering Blogs

Capacity Recommendation Engine: Throughput and Utilization Based Predictive Scaling

Introduction Capacity is a key component of reliability. Uber's services require enough resources in order to handle ...

aiops

distributed systems

ai/ml

tools

January 19, 2022

Uber

Cadence Multi-Tenant Task Processing

Introduction Cadence is a multi-tenant orchestration framework that helps developers at Uber to write fault-tolerant,...

December 16, 2021

How Uber Migrated Financial Data from DynamoDB to Docstore

Introduction Each day, Uber moves millions of people around the world and delivers tens of millions of food and groce...

November 10, 2021

Introducing uGroup: Uber’s Consumer Management Framework

Background Apache Kafka® is widely used across Uber’s multiple business lines. Take the example of an Uber ride: When...

October 21, 2021

Improving HDFS I/O Utilization for Efficiency

Scaling our data infrastructure with lower hardware costs while maintaining high performance and service reliability ...

October 13, 2021

Building Uber’s Fulfillment Platform for Planet-Scale using Google Cloud Spanner

Introduction The Fulfillment Platform is a foundational Uber domain that enables the rapid scaling of new verticals...

September 29, 2021

Distributed tier merge: How LinkedIn tackles stragglers in search index build

Co-authors: Andy Li and Hongbin Wu Indexing plays the key role in modern search engines for fast and accurate informa...

September 27, 2021

Real-Time Exactly-Once Ad Event Processing with Apache Flink, Kafka, and Pinot

Uber recently launched a new capability: Ads on UberEats. With this new ability came new challenges that needed to be...

September 23, 2021

Automating Data Protection at Scale, Part 1

Part one of a series on how we provide powerful, automated, and scalable data privacy and security engineering capabi...

September 14, 2021

The exabyte club: LinkedIn’s journey of scaling the Hadoop Distributed File System

Co-authors: Konstantin V. Shvachko, Chen Liang, and Simbarashe Dzinamarira LinkedIn runs its big data analytics on Ha...

May 27, 2021

Airbnb’s Promotions and Communications Platform

OMNI is an intuitive, homegrown platform that supports message creation, processing, and distribution to engage our g...

May 11, 2021

Himeji: A Scalable Centralized System for Authorization at Airbnb

Access Control at scale for a complex product

HN Discussion

infrastructure

authorization

access control systems

distributed systems

security

April 20, 2021

Jhubbub on Helix: Stateless and elastic made easy

Co-authors: Hunter Lee and Dru Pollini LinkedIn was built to help professionals achieve more in their careers, and ev...

August 27, 2020

Dynein: Building a Distributed Delayed Job Queueing System

Learn about the background, challenges, and future of Airbnb’s distributed scheduling and queueing system.

December 10, 2019

LinkedIn NYC Tech Talk series: Engineering Excellence Meetup

We regularly play host to a series of meetups here at the LinkedIn office in the Empire State Building. Open to the c...

distributed systems

events

August 28, 2019

Improving performance and capacity for Espresso with new Netty framework

In this blog post, we’ll share how we migrated Espresso, LinkedIn’s distributed data store, to a new Netty4-based fra...

June 27, 2019

Star-tree index: Powering fast aggregations on Pinot

Pinot is an open source, scalable distributed OLAP data store that entered the Apache Incubation recently. Developed ...

June 14, 2019

Avoiding Double Payments in a Distributed Payments System

How we built a generic idempotency framework to achieve eventual consistency and correctness across our payments micr...

April 16, 2019

Blog posts about .css-ir0lpz{color:transparent;background-clip:text;-webkit-background-clip:text;background-image:linear-gradient(90deg,rgb(97,94,255),rgb(255,106,77)),linear-gradient(90deg,#615eff,#ff6a4d);}Distributed Systems

Blog posts about Distributed Systems