Back to jobsJob overview

About the role

Principal Software Engineer at Microsoft

Required Skills

pythonc#javaazurekubernetesai/mldistributed systemsmicroservicescontainers

About the Role

Principal Software Engineer role focused on enhancing Azure's reliability and resilience through distributed systems design and AI-driven automation. Responsibilities include improving observability, defining SLOs, and collaborating across teams to embed reliability into product architecture. Requires extensive experience in cloud systems, distributed software patterns, and building scalable solutions.

Key Responsibilities

  • Design and implement world-class distributed systems to support billions of users worldwide
  • Improve the reliability and resilience of key Azure products
  • Define and maintain system reliability goals through Service Level Objectives (SLOs)
  • Enhance production systems with improvements in observability, telemetry, alerting, and incident management
  • Leverage AI technologies to detect anomalies, predict incidents, and automate operational workflows at scale

Required Skills & Qualifications

Must Have:

  • Bachelor's Degree in Computer Science or related technical field AND 6+ years technical engineering experience with coding in languages including C, C++, C#, Java, JavaScript, or Python OR equivalent experience
  • 5+ years of experience analyzing modern distributed software design patterns and cloud systems architecture, including microservices, containers, load-balancing, queuing, caching
  • 5+ years of experience in building, shipping and operating reliable solutions
  • Ability to meet Microsoft, customer and/or government security screening requirements including Microsoft Cloud Background Check

Nice to Have:

  • Familiarity with modern distributed software design patterns and cloud systems architecture, including microservices, containers, load balancing, queuing, caching
  • Experience as a technical lead or engineering manager
  • Experience working on large and unfamiliar codebases (millions of lines of code)
  • Experience with open-source projects, Kubernetes, Linux and containers
  • Experience in AI adoption with tools like GitHub Copilot, Azure OpenAI and custom copilots to streamline development and reduce toil

Benefits & Perks

  • Industry leading healthcare