Network Engineer - HPC Infrastructure
Job Description
[Up to c. $350k Comp Package (or equivalent) | Hybrid Working]
A global quantitative trading firm is searching for an experienced infrastructure-focused Network Engineer to help shape the future of its high-performance compute (HPC) environment. Operating across thousands of nodes, this is a mission-critical estate underpinning cutting-edge research and real-time trading. You'll play a leading role in designing and scaling next-gen network architecture across multiple data centres - working closely with HPC systems teams, vendors, and internal stakeholders to enhance throughput, resilience, and automation. Ideal for someone who thrives in highly technical, performance-obsessed environments and wants to see their work have immediate, global impact...
Key Responsibilities
- Design, implement, and scale high-throughput network infrastructure across large-scale HPC environments
- Oversee product evaluation and vendor coordination to ensure future-ready switching and routing
- Collaborate with HPC Systems and internal engineering teams to align network design with compute demands
- Own lifecycle project management for data centre expansion and capacity planning
- Build and maintain tooling for automation, observability, and performance tuning
- Troubleshoot complex layer 2/3 networking issues across LAN/WAN architectures
- Provide mentorship to junior team members and promote documentation best practices
- Participate in external meetings and influence long-term network strategy decisions
What You’ll Bring...
- 7+ years of hands-on experience engineering large-scale compute networks (HPC or similar environments)
- Strong grasp of routing protocols and architectures including BGP, OSPF, PIM, IGMP, spine-leaf, and VXLAN
- Track record deploying and managing Arista (EOS), Cisco (NX-OS), and SONiC switches in low-latency setups
- Proficiency with tools like tcpdump, Wireshark, and Linux command-line networking utilities
- Experience managing cloud networking across AWS, Azure, or GCP
- Programming or scripting experience in Python; exposure to GitHub, Prometheus, Grafana, or ELK
- Familiarity with configuration management (e.g. Ansible, Salt) to support zero-touch networking
- Ability to independently diagnose problems in the field and drive full-cycle resolutions
- Familiarity with capacity planning, fibre optics, and data centre-level cooling and power constraints
- (Preferred) Experience designing networks for AI, ML, or GPU-accelerated workloads
- (Preferred) Experience with network automation at scale and a passion for evolving infrastructure practices
...
Apply for this role
All fields marked with * are required.