Search this site
Embedded Files
Skip to main content
Skip to navigation
Pragmatic SRE
pSRE
1 Foundations
SRE Introduction
Crafting A SRE Strategy
Executing the SRE Strategy
2 Reliability Engineering
Defining Reliability Requirements
SLOs, SLIs, SLAs & Error Budgets
Capacity Management
Performance Engineering
Chaos Engineering
Reducing Toil
3 Operational Excellence
Incident Management
Monitoring
Alerts & Dashboards
Logging
4 Product Engineering
SRE & Product Engineering
SRE Engagement Model
Delivery Pipelines
Zero Downtime Deployments
Microservices
Feature Toggles
5 Data
Foundations of Data Driven SRE
Data Driven Incident Management
Data Driven Capacity Management
Data Driven Automation
Challenges in Data Driven SRE
6 Culture
Organizational Culture & SRE
The Anatomy of a SRE Team
Hiring the Right SREs
Measuring Success
Conclusion
Glossary of Terms
Availability Table
SLO Examples
AI/ML for pSRE
1 Essential Use Cases
2 Advanced Use Cases
3 Assessing o11y Maturity
4 AI for Problem Management
The Art of Reliability War
01 Laying Plans
02 Waging War
03 Attack by Stratagem
04 Tactical Dispositions
05 Energy
06 Weak Points & Strong
07 Maneuvering
08 Variation in Tactics
09 The Army on the March
10 Terrain
11 The Nine Situations
12 The Attack by Fire
13 The Use of Spies
Blue Work / Red Work
References
About Me
Feedback
Pragmatic SRE
Availability Table
<
Home
>
Google Sites
Report abuse
Google Sites
Report abuse