Discover Best Tech Engineering Blogs

Career stories: Next-gen systems, servers, and SREs

Saira joined our Bangalore site reliability engineering (SRE) team to tackle large-scale, site engineering challenges...

production infrastructure engineering

asia pacific

engineering

career stories

sre

December 16, 2022

Operating system upgrades at LinkedIn’s scale

Co-authors: Hengyang Hu, Dinesh Dhakal, Kalyanasundaram Somasundaram Introduction Completing recurring operating syst...

linkedin engineering

sre

August 31, 2022

Zalando

Operation-Based SLOs

Zalando developed a new type of SLOs to monitor the critical aspects of its business which is based on Operations.......

sre

April 28, 2022

Spike detection in Alert Correlation

Introduction LinkedIn’s stack consists of thousands of different microservices and the associated complex dependencie...

sre

December 22, 2021

Zalando

Tracing SRE’s journey in Zalando - Part III

Follow Zalando's journey to adopt SRE in its tech organization. ...

sre

October 15, 2021

Zalando

Tracing SRE’s journey in Zalando - Part II

Follow Zalando's journey to adopt SRE in its tech organization. ...

sre

September 21, 2021

Zalando

Tracing SRE’s journey in Zalando - Part I

Follow Zalando's journey to adopt SRE in its tech organization. ...

sre

September 13, 2021

Rethinking site capacity projections with Capacity Analyzer

While site outages are inevitable, it’s our job to minimize both the duration of outages and the likelihood for an ou...

performance

infrastructure

sre

March 16, 2021

Open source update: School of SRE

Co-authors: Akbar KM and Kalyanasundaram Somasundaram Site up and secure is a fundamental element of how we operate, ...

open source

sre

February 3, 2021

Fixing Linux filesystem performance regressions

As companies grow, adapt, morph, and mature, one item remains the same: the need for reinvention. Technical infrastru...

October 16, 2020

How Zalando prepares for Cyber Week

Learn how we prepare our platform for Cyber Week - the highest traffic period in the year. ...

cyber week

sre

testing

October 8, 2020

The impact of slow NFS on data systems

Espresso is LinkedIn's defacto NoSQL database solution. It is an online, distributed, fault-tolerant database that po...

performance

espresso

sre

June 23, 2020

Scaling LinkedIn’s Edge with Azure Front Door

Co-authors: Viranch Mehta, Jon Sorenson, Samir Jafferali As LinkedIn has grown to more than 690 million members, we’v...

June 16, 2020

Keeping Customers Streaming — The Centralized Site Reliability Practice at Netflix

By Hank Jacobs, Senior Site Reliability Engineer on CORE

May 27, 2020

Coding Conversations: Four teams, three tracks, two offices

Editor's Note: LinkedIn Engineering is dedicated to solving complex problems at scale to create economic opportunity ...

engineering culture

women in tech

sre

February 7, 2020

The Top 2019 LinkedIn Engineering Blogs

As the year draws to a close, we’re taking a look back at ten of our most popular 2019 articles on the LinkedIn Engin...

December 9, 2019

Eliminating toil with fully automated load testing

Introduction In 2013, when LinkedIn moved to multiple data centers across the globe, we needed a way to redirect traf...

data center

automation

sre

December 6, 2019

A look at our biggest SRE[in]con yet

Co-authors: Todd Palino, Samir Jafferali, Kurt Andersen, and Carolyn Blood LinkedIn hosted its 4th annual SRE[in]con ...

engineering culture

events

sre

November 14, 2019

Blog posts about .css-ir0lpz{color:transparent;background-clip:text;-webkit-background-clip:text;background-image:linear-gradient(90deg,rgb(97,94,255),rgb(255,106,77)),linear-gradient(90deg,#615eff,#ff6a4d);}SRE

Blog posts about SRE