Course description
SRE Fundamentals and Security
Site Reliability Engineers must have the right tools and strategies to perform in a technical, fast-paced environment. IBM Cloud SRE is guided by nine competency areas that lead to the successful practice of the discipline:
- Applying Site Reliability Engineering principles
- Operations
- Monitoring and incident management
- Security and compliance
- Compute infrastructure
- Networking
- Storage and data management
- Reliability and resiliency
- Deployment automation
In this first course of the three-part Professional Certificate in Site Reliability Engineering (SRE), you will focus on the first four SRE competencies:
- Applying Site Reliability Engineering principles
- Operations
- Monitoring and incident management
- Security and compliance
Upcoming start dates
Who should attend?
Prerequisites
At least 1 year experience in SRE or technology.
Understanding of:
- DevOps practices
- Software engineering principles
- System administration
- Network and OSI model
- Incident management
- Root cause analysis
Training content
Welcome and Introduction
You will cover the following topics:
An introduction to the IBM Professional SRE role
SRE Fundamentals and Terminology
You will cover the following topics:
- Deeper dive into SRE role
- SRE principles
- Managing trade-offs between change, velocity, and reliability
- Negotiating service level objectives, service level indicators, error budgets and the user experience
- IBM Cloud tools and technology across the Software Development Life Cycle
- Applying software engineering principles to drive reliability
Operations
You will cover the following topics:
- Performing operational readiness reviews (ORR) on IBM Cloud
- Creating ORR checklist
- Employing cost-optimization strategies
- Managing backups and recoveries on IBM Cloud
Monitoring
You will cover the following topics:
- Monitoring overview
- Creating and maintaining metrics, traces, and alerts on IBM Cloud
- Collecting, analyzing, and managing logs on IBM Cloud
- Identifying key metrics for service health on IBM Cloud
- Using performance and availability metrics to measure the health of services on IBM Cloud
Incident Management
You will cover the following topics:
- Managing incidents on IBM Cloud
- Developing a balanced action plan to mitigate future incidents
- Performing the post-incident review
Security and Compliance
You will cover the following topics:
- Monitoring and managing security threats on IBM Cloud
- Implementing and managing security policies on IBM Cloud
- Implementing encryption models
- Managing role-based access control on IBM Cloud
Course delivery details
This course is offered through IBM, a partner institute of EdX.
2-3 hours per week
Costs
- Verified Track -$99
- Audit Track - Free
Contact this provider
edX
edX For Business helps leading companies upskill their labor forces by making the world’s greatest educational resources available to learners across a wide variety of in-demand fields. edX For Business delivers high-quality corporate eLearning to train and engage your employees...