Intro to SRE
Reliability is a critical feature of most software, and maintenance rather than initial development predominates the cost of most software systems.
Co-developer: Alesia Braga
@lizthegrey
I make developers, operators, and workers as a whole more productive and empowered.
Liz is a developer advocate, labor and ethics organizer, and Site Reliability Engineer (SRE) with over two decades of experience. She is currently a Technical Fellow at honeycomb.io, and previously was an SRE working on products ranging from the Google Cloud Load Balancer to Google Flights.
She lives in Vancouver, BC with her wife Elly and partners, and in Sydney, NSW. She plays classical piano, leads an EVE Online alliance, and advocates for transgender rights.
Changing the way we approach tools, collaboration, and success metrics for managing distributed systems. Covers production stakes, observability for collaboration, Service Level Objectives, and risk-based prioritization.
QCon London 2019, Velocity San Jose (keynote), DevOpsDays Atlanta
Examining how SREs spend their time and how we can empower non-SRE engineers instead of seeking individual recognition.
Monitorama PDX 2019
Statistics can come to our rescue, enabling us to gather accurate, specific, and error-bounded data through reducing junk data, reusing data points as samples, and recycling data into counters.
SREcon Europe 2019
How engineers can ensure their work serves the public good through grassroots employee advocacy.
SREcon EMEA 2018 (joint keynote with Emily Gorcenski), Write/Speak/Code 2018, QCon NYC 2018
Reliability is a critical feature of most software, and maintenance rather than initial development predominates the cost of most software systems.
Co-developer: Alesia Braga
When using tens or hundreds of microservices to provide an application's critical functionality, diagnosing interactions becomes complex.
Co-developers: George Talbot, Adam Mckaig
Making your team safe and inclusive doesn't end with unconscious bias training.
Service level objectives and error budgets are the cornerstone of Site Reliability Engineering.
Co-developers: CRE team (Kristina Bennett, Alex Bramley, David Ferguson, Marie Cosgrove-Davies)
Planning approach addressing month-long projects, oncall rotations, and week-long technical debt initiatives.
Co-developers: John Tobin, Dave O'Connor
Guidance on handling management challenges and influencing projects as individual contributor or tech lead.
Building technical and leadership skills doesn't only happen in the workplace!
December 2025 - Present
Technical advocacy and observability innovation.
October 2022 - December 2025
Customer-facing technical leadership and strategic guidance.
February 2019 - October 2022
Developer advocacy, community building, and observability education.
2008 - 2019
Site Reliability Engineer and staff engineer positions focusing on SRE, distributed systems, and infrastructure. Products included Google Cloud Load Balancer, Google Flights, Bigtable, and GFE.
Massachusetts Institute of Technology (MIT), 2014
Senior Member of the Australian Computer Society [MACS (Snr) CP]
O'Reilly, May 2022. Co-authored with Charity Majors and George Miranda.
Connect