Lexiang Huang | Metastable Failures in the Wild | #17 | Disseminate - bringing you the best Computer Science research.

Summary:

In this episode Lexiang Huang talks about a framework for understanding a class of failures in distributed systems called metastable failures. Lexiang tells us about his study on the prevalence of such failures in the wild and how he and his colleagues scoured over publicly available incident reports from many organizations, ranging from hyperscalers to small companies. Listen to the episode to find out about his main findings and gain a deeper understanding of metastable failures and how you can identity, prevent, and mitigate against them!

Lexiang Huang | Metastable Failures in the Wild | #17

Summary:

Links:

Related Episodes

Matt Perron | Analytical Workload Cost and Performance Stability With Elastic Pools | #57

High Impact in Databases with... Andreas Kipf

Marvin Wyrich & Justus Bogner | How Software Engineering Research Is Discussed on LinkedIn | #56