Site Reliability Engineering (SRE) Practitioner

  • Length: 3 days
  • Price: Contact Us
  • Version: v1.3
  • Inclusions: Exam voucher

Why study this course

The Site Reliability Engineering (SRE) Practitioner℠ course introduces practices and tools for advancing SRE, combining observability, automation, platform engineering, Value Stream Management, AIOps, generative AI, incident management, organisational ways of working, and business alignment. It is tailored for those focused on large-scale service scalability and reliability.

You'll be introduced to ways to scale services economically and reliably in an organisation. You'll explore collaboration, design, automation, and smart incident response practices and tools that improve agility, cross-functional collaboration, and transparency of service health, all of which help build resiliency.

This course uses real-life scenarios and case stories to equip participants with practices, methods, and tools for guiding people across the organisation who are involved in reliability. Upon completion of the course, participants will have tangible takeaways to apply back in the office, such as implementing SRE models that fit their organisational context, establishing effective Service Level Objectives (SLOs), building advanced observability and AIOps into distributed systems, creating platforms for self-service, building resiliency by design, and developing effective incident response strategies.

This course was developed by leveraging a broad range of SRE sources, engaging with experienced transformation leaders and practitioners in the SRE space, and working with organisations embracing SRE to extract real-life best practices. It has been designed to teach the key principles and practices necessary for starting and evolving SRE practices.

After this course, you'll be well-positioned to successfully complete the SRE Practitioner certification exam.

Inclusions
  • Learner Workbook (excellent post-course reference)

  • Quick Reference Guide (QRG)

  • Unique exercises designed to apply concepts

  • Exam voucher

  • Sample documents, templates, tools, and techniques

  • Access to additional value-added resources and communities

Examination

The course price includes an exam voucher for an online proctored exam through PeopleCert. A sample exam paper will be discussed during class to assist with preparation.

  • 90 minutes

  • 40 multiple-choice questions

  • Answer 26 questions correctly (65%) to pass


What you’ll learn

Participants in this course have the following learning objectives:

  • Gain a practical view of how to successfully implement a flourishing SRE culture in your organisation

  • Understand the underlying principles of SRE, recognise what SRE is not in terms of anti-patterns, and learn how to spot and avoid them

  • Understand how to realise the organisational impact of SRE practices

  • Master SLIs and SLOs in a distributed ecosystem, and extend the use of error budgets beyond the velocity versus stability debate to ensure stakeholders' needs are consistently met

  • Build security and resilience into a distributed, zero-trust environment, by design

  • Implement full-stack observability and distributed tracing, and bring about an observability-driven development culture

  • Curate data using AI to move from reactive to proactive and predictive incident management, and use DataOps to build clean data lineage

  • Use platform engineering and Value Stream Management platforms to support self-service portals

  • Understand how generative AI can help SREs improve automation

  • Implement practical Chaos Engineering

  • Apply major incident response strategies for an SRE based on an incident command framework, with examples of the anatomy of unmanaged incidents

  • Develop SRE transformation strategies

  • Understand the SRE role and why reliability is everyone’s problem

  • Learn from SRE success stories


DevOps Institute at Lumify Work

Lumify Work is proud to be ANZ's only Platinum Partner of PeopleCert, and an Accredited Training Organisation for PeopleCert's DevOps courses and certifications.

The DevOps Institute (DOI) brings enterprise-level DevOps training courses and certifications to the IT market, setting the standard in quality.


Who is the course for?

This course is suited to professionals including the following:

  • Anyone focused on large-scale service scalability and reliability

  • Anyone interested in modern IT leadership and organisational change approaches

  • Business managers and stakeholders

  • Change agents

  • Consultants

  • DevOps practitioners

  • IT directors, managers, and team leaders

  • Product Owners

  • Quality Assurance practitioners

  • Scrum Masters

  • Security practitioners

  • Software engineers

  • Site Reliability Engineers

  • System integrators

  • Tool providers


Course subjects

Module 1: SRE Anti-Patterns

  • Recap of SRE Principles and Practices

  • SRE Myths and Anti-Patterns

  • SRE Practices for Incident Response

Module 2: SLO is a Proxy for Customer Happiness

  • What has changed with SLO?

  • Identifying System Boundaries for Setting SLIs is Critical

  • How do you use error budgets beyond the velocity versus stability debate?

Module 3: Building Secure and Reliable Systems

  • Non-Abstract, Large-Scale Design

  • Fault-Tolerant Design Patterns

  • Designing for Security

  • Designing for Resiliency, Scalability, Performance, Reliability, and Security

Module 4: Full-Stack Observability

  • The Complexity of Modern Applications

  • Observability

  • Monitoring and Telemetry

Module 5: Using Platform Engineering and AIOps

  • Taking a Platform-Centric View

  • Using AIOps to Improve Resiliency and How DataOps Can Help

  • Implementing and Measuring AIOps

Module 6: SRE and Incident Response Management

  • SRE Key Responsibilities Towards Incident Response

  • Incident Response Patterns

  • AI/ML for Better Incident Management

Module 7: Chaos Engineering

  • Modern Systems are Chaotic

  • Chaos Engineering

  • Chaos Engineering for Security

Module 8: SRE is a Form of DevOps

  • Key Principles of SRE

  • Metrics for Success

  • Transforming to SRE Practices

Post-Class Assignments/Exercises

  • Non-Abstract, Large-Scale Design (after day 1)

  • Engineering Instrumentation: Instrumenting Gremlin (after day 2)

Exam Preparation

  • Exam Requirements, Question Weighting, and Terminology List

  • Sample Exam Review


Prerequisites

It is highly recommended that participants attend the SRE Foundation course and earn the SRE Foundation certification prior to attending this SRE Practitioner course.

It is also recommended that participants understand common SRE terminology, concepts, and principles, and have related work experience.


PeopleCert Exams

All PeopleCert exams are now conducted via online proctoring, with an improved web-based platform released in February 2024. Candidates can schedule their exam for any time within the 12-month voucher validity period and take it anywhere, on any Windows or Mac computer. Live proctors will guide and invigilate the exam process. For full details, please see PeopleCert's Guidelines for Web-Based Exam Driver.

To comply with PeopleCert requirements, Lumify Work cannot provide PeopleCert training courses (ITIL®, Lean Six Sigma, DevOps Institute, MoP®, M_o_R®, MSP®, P3O®, PRINCE2®, and PRINCE2 Agile®) without the corresponding official exam.


Terms & Conditions

The supply of this course by Lumify Work is governed by the booking terms and conditions. Please read the terms and conditions carefully before enrolling in this course, as enrolment in the course is conditional on acceptance of these terms and conditions.


Personalise your schedule with Lumify USchedule

Interested in a course that we have not yet scheduled? Get in touch, and ask for your preferred date and time. We can work together to make it happen.