Research Engineer, Multimodal Generative AI, Google DeepMind

Full Time
21 hours ago

Additional job description

We are seeking highly motivated and innovative Research Engineers to join our team in Singapore and advance state-of-the-art methods for generative AI media models, with a particular focus on culturally-adapted image and video generation. This role offers an opportunity to contribute to foundational research in multimodal LLMs, with a special emphasis on challenges specific to the Asia-Pacific (APAC) region, while collaborating with world-class Google DeepMind teams around the world. If you are passionate about shaping the future of human-computer interaction through multilingual and multimodal LLMs and are eager to make a significant impact on users in the APAC region and beyond, we encourage you to apply.

At Google DeepMind, we've built a unique culture and work environment where long-term ambitious research can flourish. Our special interdisciplinary team combines the best techniques from deep learning, reinforcement learning, and systems neuroscience to build general-purpose learning algorithms. We have already made a number of high-profile breakthroughs towards building artificial general intelligence, and we have all the ingredients in place to make further significant progress over the coming year!

Job responsibilities

  • Design, rapidly implement, and rigorously evaluate cutting-edge deep learning algorithms and data curation methods for multimodal generative AI, with a particular emphasis on culturally-adapted image and video synthesis.

  • Work closely with other research scientists, engineers, and product teams across Google DeepMind, fostering a collaborative and intellectually stimulating environment.

  • Proactively identify and address technical challenges, stay up to date on the latest AI advancements, and focus on developing innovations that contribute to real-world impact.

  • Work in collaboration with our Ethics and Governance teams to ensure our advances in intelligence are developed ethically and provide broad benefits to humanity.

Minimum qualifications

  • Bachelor's degree in Computer Science, Artificial Intelligence, or a related area.

  • 2+ years of relevant experience in deep learning research and development, particularly in generative AI for image and video synthesis, including diffusion models and autoregressive generative models.

  • Strong engineering skills in Python and deep learning frameworks (e.g., JAX, TensorFlow, PyTorch), with a track record of building high-quality research prototypes and systems.

Preferred qualifications

  • Demonstrated experience in large-scale training of multimodal generative models.

  • A keen eye for visual aesthetics and detail, coupled with a passion for creating high-quality, visually compelling generative content.

  • Strong publication record in top-tier AI conferences or journals.

  • Experience working in a collaborative, cross-functional team environment, particularly across different time zones.