Site Reliability Engineer (SRE)
Job Description
[c. $120-160k Comp Package (or equivalent) | Hybrid Working]
Are you an experienced Site Reliability/DevOps Engineer looking to drive automation, scalability, and resilience in a high-performance environment? Our client, a global multi-strategy investment firm, is seeking SREs with 3-8 years of experience (depending on location) to enhance their infrastructure, streamline operations, and improve platform reliability. With a strong focus on Kubernetes, automation, and Infrastructure as Code (IaC), this role offers the chance to work closely with engineering, portfolio management, and trading teams to ensure the firm’s technology stack operates at peak performance.
Key Responsibilities
- Improve platform reliability and efficiency by implementing automation-first solutions
- Optimise Kubernetes environments, ensuring high availability and scalability
- Develop and maintain CI/CD pipelines, working with teams to integrate and enhance deployment processes
- Automate infrastructure provisioning and configuration using Terraform, Ansible, or Salt
- Enhance observability and monitoring, defining best practices and integrating with tools such as Prometheus, Grafana, or Datadog
- Support AWS-based infrastructure, optimising deployments and improving security, networking, and performance
- Collaborate with engineers, portfolio managers, and business teams, ensuring technology meets operational needs
- Lead or contribute to key projects, such as cloud migrations, proof-of-concept initiatives, and automation strategies
What You Bring
- 3-8 years of experience in Site Reliability Engineering, DevOps, or Cloud Infrastructure
- Proficiency in Python or Go, with experience developing automation tools
- Deep understanding of Kubernetes, including cluster management and optimisation
- Hands-on experience with AWS and cloud-native best practices
- Strong CI/CD expertise, with experience in Jenkins, GitHub Actions, or GitLab
- Infrastructure as Code (IaC) experience, working with Terraform, Ansible, or Salt
- Excellent communication skills, with the ability to collaborate across technical and non-technical teams
- (Preferred) Experience with Kafka or other message bus systems