[Remote] AI Research Scientist, Computer Vision - Facebook Video Intelligence

Remote, USA
Posted Jun 15, 2026
Full-time

Note: This is a remote position open to candidates in the USA. Meta is a technology company that builds platforms to connect people and foster communities. The AI Research Scientist, Computer Vision role focuses on developing advanced video generation and understanding models to enhance AI-driven video creation experiences and improve comprehension of video content.

Responsibilities

  • Build a variety of multimodal foundation models, such as text-to-video generative models, image-to-video generative models, video understanding models, and unified native video generative models
  • Design core foundation model architectures and carry out progressive pre-training
  • Post-train foundation models using techniques such as Supervised Fine-Tuning (SFT), Reinforcement Learning from Human Feedback (RLHF), Direct Preference Optimization (DPO), and Low-Rank Adaptation (LoRA)
  • Conduct research to develop SOTA GenAI models for the Facebook family of apps
  • Collaborate with colleagues from the infrastructure and product teams on launching models

Skills

  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • PhD in Computer Science, Machine Learning, or a relevant technical field
  • 1+ year of industry experience training multimodal, computer vision, LLM or related AI/ML models
  • Experience owning and/or driving complex technical projects from end-to-end
  • Publications at peer-reviewed conferences (e.g. ICLR, NeurIPS, ICML, KDD, CVPR, ICCV, ACL)
  • Programming experience in Python and hands-on experience with frameworks such as PyTorch
  • Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment
  • First-authored publications at peer-reviewed conferences (e.g. ICLR, NeurIPS, ICML, KDD, CVPR, ICCV, ACL)
  • Experience collaborating in cross-functional teams, including product, engineering, and research
  • Experience building text-to-video generative models, image-to-video generative models, video understanding models, and/or unified native video generative models

Benefits

  • Bonus
  • Equity
  • Benefits

Company Overview

  • Meta's mission is to build the future of human connection and the technology that makes it possible. Founded in 2004, the company is headquartered in Menlo Park, CA, US, and has more than 10,000 employees. Its website is https://www.metacareers.com/.
