Site Reliability Engineer
Job Description
Our client, a global investment management firm, aims to hire a talented engineer to join a high-performing team which reaches all parts of their infrastructure, in their offices all over the globe. As a truly technology and data-driven firm, they design and build cutting-edge systems from high-performance trading platforms to large-scale data analysis and compute farms. This is expected to be end-to-end; collaborating internally in how effectively to host on-prem infrastructure; and working with development and investment teams on best practices, defining and implementing in shared libraries.
Role Responsibilities:
- Working directly with development and investment business use-cases; review and evaluate infrastructure configurations; and provide recommendations
- Provide best practices for producer and consumer configuration to development and investment by contributing to Python libraries
- Action regular performance tests, providing expectations of healthy and unhealthy traffic
- Schedule regular chaos engineering events with external teams, building a methodology of designing for failure
Technical Experience and Qualifications Required:
- Bachelor’s Degree in Computer Science, Engineering or related subject
- Experience with Chef and Kitchen
- Experience with Python
- Experience in Docker and Kubernetes
- Solid knowledge of Linux based systems (CentOS 7), Windows a plus
- 5+ years in development, infrastructure/system engineering or application support
- 3+ years DevOps / SRE experience
- Self-motivated individual that takes the initiative and has an ownership, accountability, and collaborative mindset
- Solid experience with deployment, maintenance, and analytics of popular databases environments (Postgres, MSSQL, MySQL)
- Experience with the ELK Stack
Share this role with your network

Apply for this role
All fields marked with * are required.