AI Research Engineer, Media - Meta Superintelligence Labs

Meta

Posted 1 day ago

Full Time

Menlo Park, California

In Person

Smart Summary

Responsibilities include contributing to the training of next-generation multimodal foundation models, advancing their capabilities in understanding, generation, and grounding, and enabling them for downstream product use-cases. Researchers will also lead and execute research that pushes the state of the art in multimodal reasoning and generation, prioritizing directly applicable work.

We are seeking AI Researchers with experience in image and video understanding, generation, and narrative creation to join Meta Superintelligence Labs. You should have a Bachelor's degree in Computer Science or a related field, along with industry research experience in LLM/NLP, computer vision, or related AI/ML models. Proficiency in Python and experience with frameworks like PyTorch or Spark is also required.

Must Have Skills for ATS

Multimodal Models

Image Understanding

Video Understanding

Generation

Narrative Creation

LLMs

Data Curation

Data Pipelines

Multimodal Reasoning

Model Training

Inference Optimization

Media Generation

Python

PyTorch

Spark

Job Description

We are seeking AI Researchers to join the Product and Applied Research (PAR) Media group within Meta Superintelligence Labs (MSL). As a member of the PAR Media group, you will drive innovation in image and video understanding, generation, and narrative creation at an unprecedented scale. We own the research, development and deployment of cutting edge multimodal models across Meta AI, FoA, and the entire Meta creator and developer ecosystem. Our work directly powers product roadmaps with flexible, state-of-the-art solutions designed to lead, not follow. We partner closely with AI product teams across Meta to translate our research into impactful, real-world experiences. This means we’re not just building technology…we’re building the future of how people create, communicate, and connect. If you’re passionate about advancing the future of AI-driven media experiences and eager to make a tangible impact on billions of users, we invite you to join us on this journey.

Responsibilities
  • Contribute to the training of next-generation multimodal foundation models, advance their capabilities in understanding, generation, and grounding, and enable them for downstream product use-cases
  • Support creative data sourcing, high-quality pre/mid/post-training data curation, and scale and optimize data pipelines for multimodal large language models (LLMs)
  • Lead, collaborate, and execute on research that pushes forward the state of the art in multimodal reasoning and generation research, and prioritize research that can be directly applied to Meta’s product development


Minimum Qualifications
  • Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
  • 1+ year of industry research experience in LLM/NLP, computer vision, or related AI/ML models
  • Skilled in model training, data, or inference & efficiency for image, video, and/or related multimodal models
  • Proficient in media generation, understanding, and/or grounding
  • Experience owning and/or driving complex technical projects from end-to-end
  • Programming experience in Python and hands-on experience with frameworks like PyTorch or Spark


Preferred Qualifications
  • Experience working on frontier-quality/state-of-the-art Large Media Models
  • Masters degree or PhD in Computer Science, AI/ML, or a relevant technical field
  • Demonstrated significant industry influence in the field of AI and/or published research in leading peer-reviewed conferences (e.g., ACL, NeurIPS, ICML, ICLR, AAAI, KDD, CVPR, ICCV)


$74.04/hour to $217,000/year + bonus + equity + benefits

Meta

Meta's mission is to build the future of human connection and the technology that makes it possible. Our technologies help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and WhatsApp further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. To help create a safe and respectful online space, we encourage constructive conversations on this page. Please note the following: • Start with an open mind. Whether you agree or disagree, engage with empathy. • Comments violating our Community Standards will be removed or hidden. Please treat everybody with respect. • Keep it constructive. Use your interactions here to learn about and grow your understanding of others. • Our moderators are here to uphold these guidelines for the benefit of everyone, every day. • If you are seeking support for issues related to your Facebook account, please reference our Help Center (https://www.facebook.com/help) or Help Community (https://www.facebook.com/help/community). For a full listing of our jobs, visit https://www.metacareers.com

Runway Icon
Boost Your Interview Chances

With Runway

See Your Fit for This Role

1-5 min

Your Score

?

Top Applicants

90%

Your Job Search Advantage

Key Gaps & Next Steps:

Address these in your resume & Interview

Top Strengths For This Role

Highlight these in your cover letter & interview

Your Interview Guide

A Personalized Interview Strategy

Freshest Opportunities

Never Miss a Good Fit

Get notified when jobs mach your criteria