About the role

EFA Network Sr. Software Engineer, EFA ML Software Team at Amazon Development Center U.S., Inc.

Required Skills

C, networking, high-performance computing, machine learning, cloud, AWS, Libfabric, Open MPI, software architecture

About the Role

Lead a team developing high-performance networking software for AWS's Elastic Fabric Adapter (EFA) to support Machine Learning and HPC workloads. Write optimized C code for open-source projects like Libfabric and Open MPI, and collaborate with ML infrastructure teams to scale solutions across large GPU/CPU clusters.

Key Responsibilities

  • Lead a team of networking developers focused on high-performance code
  • Write and optimize C code for EFA-related open-source projects (e.g., Libfabric, Open MPI)
  • Design new APIs for cloud networking innovations
  • Analyze customer workloads for high-bandwidth, low-latency networking
  • Provide expert support to major AI industry customers

Required Skills & Qualifications

Must Have:

  • 5+ years of professional software development experience (non-internship)
  • 5+ years leading the design or architecture of systems, with a focus on reliability and scaling
  • 5+ years of experience across the full software development life cycle (coding standards, reviews, testing, operations)
  • Experience as a mentor, tech lead, or leading an engineering team
  • 5+ years of low-level programming in C

Nice to Have:

  • Bachelor's degree in computer science or equivalent

Benefits & Perks

  • Medical, financial, and other benefits (total compensation package)
  • Equity and sign-on payments may be provided
  • Inclusive culture with accommodations for disabilities