Technical Program Manager, Compute Planning

Full Time
London, UK
9 months ago

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportunities regardless of sex, race, religion or belief, ethnic or national origin, disability, age, citizenship, marital, domestic or civil partnership status, sexual orientation, gender identity, pregnancy, maternity or related condition (including breastfeeding) or any other basis as protected by applicable law. If you have a disability or additional need that requires accommodation, please do not hesitate to let us know.

Snapshot

The role sits within Google DeepMind's Compute Planning team. This team is the driving force behind the hardware that powers DeepMind's revolutionary advancements, across all areas of the business. In this role, you’ll help to build and manage the compute resources that power GDM's Research, architecting the infrastructure that enables our researchers to push the boundaries of machine learning.

About Us

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority!

The role

As a Technical Program Manager in the Compute Planning team you will play a key role in the building and ongoing management of Google DeepMind’s extensive ML Hardware portfolio and where relevant, resolve issues and requests related to tooling and processes that support engineering execution.

You will work alongside our Product Managers, Engineers and Researchers to help define and optimise DeepMind's overall capabilities in this space!

Key Responsibilities

  • Implementing new/adapting existing tooling and infrastructure to support GDM Compute resource management and efficiency across the fleet.
  • Partnering with Reliability and Infrastructure teams as required to help ensure a stable experimentation platform.
  • Providing best practices to researchers and engineers in areas such as resource availability, efficiency, scaling & performance.
  • Solving issues across various infrastructure platforms (in partnership with engineering teams).
  • Producing user guides, process documentation, and other forms of educational resources.
  • Assisting with new ML hardware/software pilot programs.

About you

In order to set you up for success as a Technical Program Manager at Google DeepMind, we look for the following skills and experience:

  • Prior Experience in one of the following disciplines; Software Engineering, SRE, Engineering Production, DevOps, or equivalent.
  • You can upskill independently using internal documentation and implement additions to our infrastructure by writing custom Code and/or configuration files
  • You can craft and implement novel infrastructure in at least one programming language - (Python, C++ preferred)
  • Excellent technical understanding and communication ability, with the ability to distil sophisticated technical ideas to their essence
  • Skilled at navigating, updating and defining sophisticated processes
  • Initiative to address problems with solutions that are repeatable, scalable and balanced.
  • Proven stakeholder leadership skills
  • Comfortable dealing with ambiguity and able to thrive in a dynamic environment.
  • A curiosity about Google DeepMind's mission and AI / Machine Learning
  • You are flexible, adaptable and highly responsive to the needs of the project, team and wider group.

In addition, the following would be an advantage

  • BS degree in Computer Science, Engineering, related fields or equivalent practical technical experience.
  • Knowledge of data centre infrastructure and operations.

Opening date: Friday 15th March 2024

Closing date: Thursday 28th March 2024, 12pm GMT