About the Role
- At Google DeepMind, our research team is dedicated to tackling the most complex challenges in online information quality. We strive to advance the state of the art by developing innovative solutions to detect manipulated media and misleading narratives, ensuring the integrity of digital discourse. A prominent example of our scientific discovery is Backstory. Our interdisciplinary work spans provenance analysis and the creation of tools for AI-assisted information literacy, leveraging our technologies for the widespread public benefit of a safer online environment. We thrive in a supportive environment that encourages rapid prototyping and iteration, driving our research achievements directly into Google’s flagship models, including Gemini.
- Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
- To succeed in this role, you will need to be passionate about advancing information literacy using machine learning and other computational techniques. You'll join an interdisciplinary team of domain experts, ML researchers, and engineers to research and build multimodal reasoning systems and Vision-Language Models (VLMs) to assess the trustworthiness of media (images, audio, and videos) on the internet.
Responsibilities
- Plan and perform rapid prototyping of computer vision and multimodal machine learning techniques applied to determining authenticity of media information.
- Design and train multimodal models capable of complex visual reasoning.
- Undertake exploratory analysis to inform experimentation and research directions.
- Engage with product teams to drive the development of our research.
- Implement tools, libraries, and frameworks to speed up and enable new research.
- Report and present research findings, software developments, experimental results, and data analysis clearly and efficiently.
- Collaborate with internal and external scientific domain experts.
Requirements
- PhD/Master’s degree in Computer Science, AI, ML, or equivalent practical experience.
- At least 2 years of relevant experience developing computer vision techniques or multimodal machine learning models.
- Experience in software development using Python and deep learning frameworks (e.g., Jax, TensorFlow, PyTorch), with a proven track record of building high-quality research prototypes and systems.
- Quantitative skills in math and statistics.
- Experience exploring, analysing and visualising data.
Qualifications
- Experience in training and deployment of large-scale models.
- Experience with Video Understanding
- Experience with Large Language Models, prompt engineering, few-shot learning, post-training techniques, and evaluations.
- A proven track record of research or engineering achievements, such as publications in peer-reviewed conferences or journals.
Benefits
The US base salary range for this full-time position is between 174,000 USD - 252,000 USD + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.