Senior Site Reliability Engineer
- The opportunity to collaborate with some of the brightest and best minds in Australia
- Be part of a great team culture with a team that loves to have fun
- Surry Hills based, with a hybrid working model along with an "on call" component
We are Woolworths Group
We are Woolworths Group. 200,000+ bright minds, passionate hearts, and unique perspectives across Australia and New Zealand. Connected by a shared Purpose - 'to create better experiences together for a better tomorrow'. That Purpose fuels our ambition to explore new ideas, make brave commitments, and innovate better ways to meet the food and everyday needs of more than 24 million customers every week.
If you're excited to turn today's blue-sky thinking into a better tomorrow for future generations, you'll find yourself supported and enriched in a dynamic, inclusive, and empowering workplace. With a culture of genuine care, a flexible approach to work, and opportunities across the group to grow your career and make a meaningful impact, the possibilities for what we can achieve together are endless.
What you’ll do
As a Senior Site Reliability Engineer (SRE) you will be responsible for ensuring the reliability, availability, and performance of our critical digital services and systems. You will bridge the gap between software engineering and operations, employing a blend of coding skills and operational expertise to design and maintain scalable and resilient applications and infrastructure. You will establish and uphold service level objectives (SLOs), automate processes, conduct performance analysis, and respond swiftly to incidents, aiming to minimise downtime and enhance the overall user experience. Your focus on reliability and efficiency will ensure a stable and optimised technology environment, aligning with the organisation's business objectives.
- Work with product teams to design, implement and enhance highly available and scalable systems, ensuring reliability and performance of applications.
- Collaborate with cross-functional teams to define and establish service level objectives (SLOs) and service level agreements (SLAs) for critical applications and components.
- Maintain observability and monitoring tools, alerts, and dashboards to provide visibility into system health and performance, proactively identifying and resolving any performance bottlenecks or availability issues.
- Alongside the Incident Commander, play a lead remediation role on major incident bridges.
- Play a key role in post-incident reviews to identify root causes and implement preventive measures to avoid future incidents.
- Automate repetitive tasks and processes to improve efficiency and reduce manual intervention.
.
What you’ll bring
- Strong demonstrable experience working technical delivery teams up to 10+ people.
- Strong knowledge of cloud infrastructure and integration across enterprise platforms.
- Ability to switch context between multiple technologies.
- Experience implementing SRE practices, influencing engineers and product owners to prioritise reliability in their product roadmap.
- Ability to debug, optimize code and automate routine tasks
- 5 + years experience with SRE Practices, DevSecOps, Observability Platforms, Automation Tools & Platforms
- Strong experience with a subset of technologies in our current stack, including:
- Cloud - Microsoft Azure, Google Cloud Platform
- Code - Primarily .NET Core, C#. Knowledge in Powershell, Bash, NodeJS, GoLang a bonus
- Databases - SQL Server, NoSQL (MongoDB, CosmosDB and/or ElasticSearch), Redis
- CI - Azure DevOps, GitHub Actions
- Infrastructure - Kubernetes, Terraform
- Observability - Dynatrace, Azure Log Analytics, App Insights, Pagerduty
What you’ll experience
Our Team Members are at the heart of everything we do, and we’re always looking for ways to support your career journey and reward great work:
- A flexible hybrid working environment
- Team discounts across our range of Woolworths Group brands you know and love and a robust rewards program that celebrates and incentivises purpose-driven work.
- A global business with endless career possibilities around every corner and across every discipline – with valuable exposure to a vast and exciting business network.
- A range of programs to help you prioritise and manage your well-being, including 24/7 access to the Sonder app.
- A progressive and competitive leave policy that gives you more space for what matters to you.