Research Scientist, SKY Multimodal

DeepMind

Full Time

Mountain View, CA, USA

11 months ago

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

Snapshot

At Google DeepMind, we've built a unique culture and work environment where long-term ambitious research can flourish. The GDM SKY (Scaled AI) team is part of Google DeepMind’s GenAI unit, dedicated to developing and deploying scaled AI solutions that will impact Google and the world in positive ways. Within GDM SKY, the SKY Multimodal team focuses on cutting-edge research in multimodal learning and its applications to building the most powerful large language models, capable of understanding and generating information across modalities including language, image, and video.

About us

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

The role

Research Scientists at Google DeepMind lead our efforts in developing the next generation of models and algorithms that drive our progress towards increasingly capable Artificial Intelligence.

We are looking for a Research Scientist to join us at Google DeepMind to build the most powerful multimodal LLM (Gemini) that excels at both understanding and generation tasks. More specifically, your job responsibilities include:

Key responsibilities

Explore research ideas to expand and improve native multimodal understanding and generation capabilities of Gemini throughout pre-training and post-training.
Design and run model training experiments to verify ideas and land them in Gemini.
Build and extend evaluation benchmarks to evaluate the native understanding and generation capabilities of the models, and identify opportunities for improvement.
Collaborate closely with product and other research teams to apply Gemini to various multimodal use cases in the Gemini App (Bard), Cloud, Search, etc.

About you

In order to set you up for success as a Research Scientist at Google DeepMind, we look for the following skills and experience:

Self-motivated researcher and engineer who can drive ideas from conception to landing.
Hands-on research experience in computer vision or related fields.

In addition, the following would be an advantage:

Strong publication record in computer vision, generative models, or other related fields.
Experience in large language models.
Experience with JAX, TensorFlow, or other similar machine learning platforms.

The deadline for applications is Friday 12th July at 5pm BST.

The US base salary range for this full-time position is between $136,000 and $245,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.
