Where

Site Reliability Engineer - Kalix (Australia/New Zealand)

Lightbend
Perth Full-day Full-time

Description:

Lightbend operates Kalix, a cloud platform that makes distributed systems and design patterns consumable as a service. Our mission is to take care of the complexities of running distributed systems, allowing developers to focus on their business logic while delivering resilient and scalable systems. We are taking the traditional stateless FaaS model, and turning it on its head, pushing into the uncharted territory of managing stateful application code, built on the solid foundations of tried and tested distributed computing principles that we have successfully delivered over more than a decade.

We are looking for experienced Site Reliability Engineers in Indian and European time zones to join our Cloud Services team who are excited to leverage leading SRE practices to operate highly resilient and scalable systems.

Responsibilities:

  • Develop and extend software to monitor and improve end-to-end platform performance, identify runtime deficiencies, find potential failures, and fix production issues in a fully managed multi-cloud environment.
  • Participate in on-call rotation and incident-resolution.
  • Build deep, full-stack knowledge of our platforms and applications.
  • Work to simplify and automate deployment processes, run-time operations, and provide non-disruptive releases.
  • Help create and maintain an environment that provides security and privacy for our customers' data.
  • Maintain application reliability and uptime SLAs throughout the application lifecycle using programmatic self-healing and software automation.
  • Travel occasionally to meet with the rest of Lightbend’s technical team.

Candidates can be based anywhere in Europe or in India, as this is a fully remote position. This is not a full-time firefighting role requiring super heroes. Site reliability is the entire team’s responsibility. We are looking for an operations expert to be a part of building and running our new offerings as we expand our platform.

Qualifications:

You

  • Are an SRE who understands how to operate modern distributed data systems on Kubernetes to be extremely reliable with predictable performance.
  • Have experience with (multiple) cloud service offerings, specifically from an operational perspective (we operate on Google Cloud and AWS today).
  • Have a passion for automating the complexities of orchestrating and running multi-tenant cloud application services.
  • Are accustomed to collaborating with business owners and understanding diverse business requirements.
  • Have two or more years of experience in distributed systems architecture and runtime requirements.
  • Are a voracious learner, ready to take on new technologies and techniques quickly and constantly.
  • Have excellent written and verbal communication skills in at least English.
  • Are skilful at interacting and working with people; working with a self-organized lean and agile team to mitigate project risks, manage effort and ensure quality.
  • Are dedicated to best practices such as infrastructure as code, automated testing, code reviews, CI/CD, GitOps, and testing.
  • Are biased towards action on tough problems and issues, and focused on your customer’s success.
  • Are an agent of change, constantly learning and seeking better outcomes.
  • Are familiar with many of the supporting technologies we use, including Terraform, Crossplane, FluxCD GitOps, Prometheus, Grafana, Actors, Service Mesh frameworks, etc.
  • Are experienced with complex and secure networking environments, including Encryption Keys, and TLS.

Ideally, you also...

  • Have knowledge of the Lightbend technologies and distributed systems, including Akka clustering.
  • Have supported SaaS/PaaS systems.
  • Have an awareness of Serverless/Functions-as-a-service Platforms.

What we offer:

Lightbend is a welcoming, transparent, and highly distributed company dedicated to creating high-performance systems that bring success to all who use them. With a strong focus on work-life balance, our company offers a fast-paced, collaborative environment mixed with challenging and engaging work. This combination has attracted and retained some of the brightest minds in our technology communities.

Lightbend is an Equal Opportunity Employer.

Powered by JazzHR

17 Apr 2024;   from: adzuna.com.au

Similar jobs

  • Imdex
  • Perth
Join a high-performing team and make a difference at a growing global ASX300 firm in the mining tech industry! Permanent role!
21 days ago
  • Randstad
  • Perth

Description:

About the roleOur Blue Chip mining client is seeking an experienced Reliability Engineer for a Perth based position. The Reliability Engineer will be responsible for leading tactics and strategy development for fixed plant assets using ...
11 days ago
  • Orora
  • Perth

Description:

Join our Canning Vale team as a Reliability Engineer to drive continuous improvement across the site
14 days ago
  • Randstad
  • Perth

Description:

Fantastic opportunity for a reliability engineer to join a global mining company on a Perth and site based roster
11 days ago