Company Overview:
Join one of the most advanced systematic trading firms, renowned for leveraging cutting-edge technology and built on the ability to respond rapidly to market opportunities. Focused on innovation, collaboration, and a hands-on approach to managing infrastructure-critical applications, this firm offers a dynamic environment where your contributions will have a direct and meaningful impact.
Role Overview:
We are seeking a Site Reliability Engineer (SRE) to design, develop, and maintain highly reliable and scalable systems that support mission-critical trading applications. You will engage in greenfield projects and collaborate closely with development, IT, and trading teams to ensure seamless system operations within a fast-paced, high-performance setting.
Key Responsibilities:
-
Architect and deploy robust systems that underpin the firm’s trading infrastructure.
-
Proactively oversee infrastructure to maintain reliability, scalability, and optimal performance.
-
Work cross-functionally with development, IT, and trading teams to promote automation and continuous improvement.
-
Stay informed on emerging technologies and industry trends to drive system enhancements.
Required Experience:
-
Minimum 4 years in DevOps or Site Reliability Engineering roles.
-
Proficiency in one or more programming languages such as Python, Go, Ruby, or Perl.
-
Strong Linux system administration skills.
-
Practical experience with observability and monitoring tools like Prometheus, Grafana, Thanos, and the ELK stack.
-
Familiarity with container orchestration and cloud platforms including Kubernetes, Docker, AWS, and GCP.