
About the role

Principal Software Engineer - Azure Core at Microsoft

Required Skills

Linux, Kubernetes, AI infrastructure, Go, Python, data center management, high-performance computing, distributed storage

About the Role

This Principal Software Engineer role in Azure Core focuses on building and managing the core platform for Microsoft's cloud services. Responsibilities include designing hyperscale infrastructure, working with AI data centers, and leading technical projects. The role requires expertise in Linux, Kubernetes, AI infrastructure, and data center management.

Key Responsibilities

  • Partners with stakeholders to determine user requirements for complex scenarios
  • Provides technical leadership for design documents and dependencies
  • Leads by example and mentors others to produce maintainable code
  • Leads project plans and work items across multiple products
  • Acts as Designated Responsible Individual (DRI) for system monitoring and mentoring

Required Skills & Qualifications

Must Have:

  • Bachelor's Degree in Computer Science or related field AND 8+ years of technical engineering experience coding in Go, Rust, Bash, or Python, OR equivalent experience
  • 5+ years of experience building and managing data centers
  • Networking and security expertise in high-performance computing, RDMA over InfiniBand or RoCE, and eBPF
  • Experience with storage and acceleration technologies for AI workloads, including distributed storage systems

Nice to Have:

  • 1+ years of experience with AI/ML job scheduling and orchestration at scale using SLURM, Ray, or Kueue
  • Model training optimization for performance and scalability
  • 1+ years of experience improving model serving and inference efficiency for low latency and high throughput

Benefits & Perks

  • Industry-leading healthcare