Sr. ML Performance Engineer, AWS Neuron, Annapurna Labs at Amazon Development Centre Canada ULC
Required Skills
machine learning, high-performance computing, distributed systems, compiler optimization, performance profiling, PyTorch, TensorFlow, JAX, AWS Neuron SDK
About the Role
Senior ML Performance Engineer role at AWS Neuron/Annapurna Labs focused on optimizing machine learning workloads on AWS custom ML accelerators (Inferentia/Trainium). The position involves analyzing system-level performance across the entire technology stack, implementing compiler optimizations, and working directly with customers to enable and optimize their ML models.
Key Responsibilities
- Analyze and optimize system-level performance of machine learning models across the stack, from frameworks to runtime
- Conduct detailed performance analysis and profiling of ML workloads, identifying bottlenecks in large-scale ML systems
- Work directly with customers to enable and optimize their ML models on AWS accelerators
- Design and implement compiler optimizations, transforming manual improvements into automated compiler passes
- Collaborate across teams to develop innovative optimization techniques for AWS Neuron SDK
Required Skills & Qualifications
Must Have:
- 5+ years of non-internship professional software development experience
- 5+ years of programming experience in at least one programming language
- 5+ years of experience leading the design or architecture (design patterns, reliability, and scaling) of new and existing systems
- Experience as a mentor or tech lead, or leading an engineering team
Nice to Have:
- 5+ years of full software development life cycle experience
- Bachelor's degree in computer science or equivalent
Benefits & Perks
- Inclusive team culture with diversity and inclusion initiatives
- Work/life balance with flexible working hours
- Mentorship and career growth opportunities
- Broad mix of experience levels and knowledge sharing environment