Back to jobsJob overview

About the role

Software Engineer II (Azure HPC/AI) at Microsoft

Required Skills

azurehpcaimpigpu computinglinuxpythondistributed systemscloud services

About the Role

Software Engineer II role focused on developing and maintaining Azure HPC/AI Images and Microsoft HPC Pack. Contributes to performance-optimized OS images for AI/HPC workloads and cluster management tools. Works with technologies like MPI libraries, GPU frameworks, high-speed networking, and parallel file systems.

Key Responsibilities

  • Works with stakeholders to determine user requirements for features
  • Contributes to design documents and identifies dependencies for product areas
  • Creates and implements code for products, services, or features
  • Breaks down larger work items into smaller tasks and provides estimations
  • Acts as Designated Responsible Individual (DRI) for system monitoring and problem resolution

Required Skills & Qualifications

Must Have:

  • Bachelor's Degree in Computer Science or related field OR equivalent experience
  • Technical engineering experience with coding in languages like Bash, C, C++, C#, Python, or Go
  • Familiarity with Linux/Windows system programming, distributed systems, or cloud services
  • Hands-on experience with HPC/AI clusters, MPI libraries, GPU computing, or large-scale cluster environments

Nice to Have:

  • Master's Degree in Computer Science or related field
  • Experience with Microsoft Azure or other cloud platforms
  • Experience with kernel debugging, driver troubleshooting and performance tuning
  • Experience with container technologies (Docker, Kubernetes) and cloud-native HPC/AI workflows
  • Knowledge of high-speed networking (InfiniBand, RDMA, NVLink) and parallel file systems

Benefits & Perks

  • Industry leading healthcare