Research Scientist, Conversation
Our team is focused on developing advanced conversational capabilities powered by the Gemini LLM. Our primary objective is to empower multimodal conversational agents with cutting-edge speech and audio functionalities. We strive to build intelligent agents capable of orchestrating all facets of complex, real-time, multi-speaker, and multimodal conversations. This includes enabling full-duplex and expressive conversations, as well as leveraging multi-modality to deliver state-of-the-art agentic systems. Ultimately, our ambition is to create, scale, and productionize these sophisticated conversational capabilities for a diverse array of Google products and experiences.
About us
Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
The role
At Google DeepMind, our Research Scientists are at the forefront of developing innovative algorithmic architectures, with the ultimate goal of achieving Artificial General Intelligence.
As a member of this team, you will play a key role in the research and development of intelligent, voice-first conversational systems that introduce new agentic AI capabilities to Gemini. This position offers the chance to conduct groundbreaking research and see it through to production.
Key responsibilities
As a Researcher, you will:
- Partner with the Gemini/GDM teams to design, develop, and deploy novel multimodal conversational agents.
- Develop audio-first models capable of orchestrating and planning complex dialogs, including leveraging external tools like search when necessary.
- Leverage new sources of data (real and synthetic) to empower new real-time dialog capabilities.
- Work with infra teams to design models suitable for streaming bi-directional dialog, so the user experience is always fluid and low-latency.
- Rapidly prototype and evaluate new technologies.
In order to set you up for success as a Research Scientist at Google DeepMind, we look for the following skills and experience:
- PhD in Computer Science or a Machine Learning-related field.
- Experience working with LLMs.
- Demonstrated experience in data preparation, training, and evaluation of ML models.
In addition, the following would be an advantage:
- A strong record of publications in top-tier machine learning-related conferences.
- Experience in dialog and agentic systems.
- Experience with multimodal models and processing (e.g., text / video / audio).
- Research background in NLP / Generative AI.
The US base salary range for this full-time position is between $141,000 and $202,000 + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.
At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunity regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.