Join Annapurna Labs (an Amazon Web Services company) as a Software Engineer specializing in distributed AI/ML systems. In this high-impact role, you’ll develop and optimize collective operations that enable AI workloads to scale efficiently across accelerators and servers. Ideal for engineers skilled in C/C++ development, Linux systems, and performance optimization, this position offers the chance to work on cutting-edge high-performance computing (HPC) and machine learning (ML) technologies that power AWS infrastructure worldwide.
Why You’ll Love This Role
-
Work at the heart of AWS infrastructure innovation within Annapurna Labs.
-
Develop technology that powers the world’s largest AI and ML workloads.
-
Competitive salary, health insurance, and retirement benefits.
-
Flexible work arrangements and career development opportunities.
-
Collaborate with world-class engineers, architects, and scientists.
Key Responsibilities
-
Design, develop, and optimize distributed collective operations for AI/ML workloads.
-
Write low-level, high-performance code in C/C++ for Linux-based systems.
-
Collaborate with hardware, RTL, and infrastructure teams to deliver scalable solutions.
-
Contribute to system architecture discussions to enhance scalability and reliability.
-
Mentor junior engineers and ensure code quality through reviews and best practices.
-
Troubleshoot complex distributed system issues and support production deployments.
-
Stay updated on emerging trends in AI/ML, HPC, and systems engineering.
Required Qualifications
-
Bachelor’s degree in Computer Science, Engineering, or equivalent experience.
-
5+ years of professional software development experience (excluding internships).
-
Proficiency in C or C++ and strong understanding of software architecture.
-
Experience with Linux systems, kernel-level programming, and performance tuning.
-
Hands-on experience with the full software development lifecycle.
-
Proven leadership or mentorship experience in technical teams.
Preferred Qualifications
-
Experience with embedded systems, high-speed networking, or HPC interconnects.
-
Background in distributed systems or machine learning infrastructure.
-
Familiarity with large-scale, low-latency systems and collective communication frameworks.
-
Bachelor’s or advanced degree in Computer Science or related field.
About the Company
Annapurna Labs (U.S.) Inc., a core part of Amazon Web Services (AWS), designs and develops advanced hardware and software that power EC2 instances globally. The company focuses on high-performance computing (HPC), machine learning (ML), and scalable infrastructure, driving the innovation that underpins AWS’s world-class cloud ecosystem.