Research Scientist, Multimodal Creation
At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.
SnapshotImagine a world where anyone can effortlessly create stunning videos, limited only by their imagination. That's the future we're building at Google DeepMind. As a Research Scientist on our team, you'll be at the forefront of generative AI innovation, developing groundbreaking capabilities and tools that empower creators across the globe. You'll work alongside a world-class team of experts, pushing the boundaries of what's possible with our foundation models Gemini and Veo, and directly shaping the next generation of video creation experiences.
About usArtificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
The roleWe're seeking a highly motivated Research Scientist to spearhead the development of generative AI models that will revolutionize the way people create. You'll be at the forefront of innovation, working with cutting-edge foundation models like Gemini and Veo to design intuitive, powerful tools that unleash the creative potential of millions. Your research will directly influence the future of video content, enabling creators to bring their wildest ideas to life with ease.
Key responsibilities- Advancing video creation capabilities: Pioneer novel methods to enhance Gemini / Veo models for video creation, improving output quality, reducing latency, and enabling creator control.
- Developing GenAI-powered creator experiences: Architect innovative creation experiences powered by Gemini and Veo. Prototyping and iterating on new interactive features that empower creators throughout their creative journey.
- Partnering with Products: Collaborate deeply with teams across Google to understand user needs and seamlessly integrate video creation solutions into Google products.
In order to set you up for success as a Research Scientist at Google DeepMind, we look for the following skills and experience:
- PhD in Computer Science, Statistics or related field with equivalent practical experience.
- Expertise with generative models, such as diffusion models and GANs.
In addition, the following would be an advantage:
- Proven track record of publications in top computer vision and machine learning conferences.
- Past experience on multimodal generation, instruction tuning, and controllable generation.
- Extensive experience with deep learning frameworks (e.g. PyTorch, JAX) and large-scale model training.
The US base salary range for this full-time position is between $161,000 - $245,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.
Note: In the event your application is successful and an offer of employment is made to you, any offer of employment will be conditional on the results of a background check, performed by a third party acting on our behalf. For more information on how we handle your data, please see our Applicant and Candidate Privacy Policy.