Back to jobsJob overview

About the role

Research Engineer Intern at AgentHub

Required Skills

pythonreinforcement learningai agentsmodel evaluationdata generationresearchsystems scalingsimulation

About the Role

Research Engineer Intern at AgentHub, working directly with founders to advance AI agent evaluation and simulation capabilities. You'll research and implement state-of-the-art methodologies for evaluating agents across safety, reliability, and efficiency. This role involves translating research into real features for customers at the frontier of AI.

Key Responsibilities

  • Design and build core methodologies for evaluating agents across instruction following, safety, groundedness, and efficiency
  • Research, experiment, and implement robust data generation capabilities
  • Tie in latest research advancements and productionalize them for customer value
  • Build scalable systems for ingesting, storing, and analyzing structured/unstructured agent outputs
  • Design evaluation pipelines capturing reasoning, safety, reliability, and edge-case behavior

Required Skills & Qualifications

Must Have:

  • Working towards Bachelors/Masters/PhD in Computer Science or related field
  • Passionate about building category-defining products
  • Previous background and experience in reinforcement learning
  • Opinionated, perpetually curious, and love having scope over problems and delivering

Nice to Have:

  • Demonstrated experience in model/agent evaluation