Back to jobsJob overview
About the role
Research Scientist Intern, Multimodal Generative AI and Robotics (PhD) at Meta
Required Skills
pythonpytorchdeep learningcomputer visiongenerative aillmsroboticsmultimodal ai
About the Role
Research Scientist Intern focused on multimodal generative AI and robotics. The role involves developing unified predictive models integrating language, vision, and human motion, and advancing state-of-the-art machine learning techniques. The intern will work with large-scale egocentric datasets to build contextual AI models.Key Responsibilities
- Plan and execute cutting-edge research to advance machine learning and large-scale training
- Collaborate with researchers and engineers to develop experiments and prototypes for contextual AI and robotic systems
- Design, setup, and run practical experiments related to large-scale sensing and machine reasoning
- Develop unified predictive models integrating language, vision, human motion, and actions
- Benchmark against state-of-the-art approaches in world modeling, video generation, and vision-language-action models
Required Skills & Qualifications
Must Have:
- Currently has or is in the process of obtaining a PhD in computer vision, computer graphics, 3D machine perception, or deep learning
- Knowledge in deep learning, computer vision, graphics, generative modeling, LLMs, and VLMs
- Hands-on experience implementing deep learning algorithms, large-scale training, benchmarking, and evaluation
- Experience working within Python environments such as PyTorch
- Experience working in a Unix environment
- Must obtain work authorization in the country of employment at the time of hire
Nice to Have:
- Preference for 24 week full time internship
- Intent to return to a degree-program after the internship
- Proven track record of significant results demonstrated by grants, fellowships, patents, or first-authored publications at top conferences
- Strong track-record of published research in LLMs, VLMs, video generation, world modeling, VLA, human motion modeling, policy learning, or generative modeling
- Strong programming experience using Python and PyTorch
- Demonstrated software engineer experience via internship, work experience, coding competitions, or open source contributions
- Experience working and communicating cross functionally in a team environment
Benefits & Perks
- $7,650/month to $12,134/month + benefits
- Compensation determined by skills, qualifications, experience, and location
- Benefits package available