Research Engineer, Multimodal Reasoning For Information Literacy at Google DeepMind

About the Role

At Google DeepMind, our research team is dedicated to tackling the most complex challenges in online information quality. We strive to advance the state of the art by developing innovative solutions to detect manipulated media and misleading narratives, ensuring the integrity of digital discourse. A prominent example of our scientific discovery is Backstory. Our interdisciplinary work spans provenance analysis and the creation of tools for AI-assisted information literacy, leveraging our technologies for the widespread public benefit of a safer online environment. We thrive in a supportive environment that encourages rapid prototyping and iteration, driving our research achievements directly into Google’s flagship models, including Gemini.
Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.
To succeed in this role, you will need to be passionate about advancing information literacy using machine learning and other computational techniques. You'll join an interdisciplinary team of domain experts, ML researchers, and engineers to research and build multimodal reasoning systems and Vision-Language Models (VLMs) to assess the trustworthiness of media (images, audio, and videos) on the internet.

Responsibilities

Plan and perform rapid prototyping of computer vision and multimodal machine learning techniques applied to determining authenticity of media information.
Design and train multimodal models capable of complex visual reasoning.
Undertake exploratory analysis to inform experimentation and research directions.
Engage with product teams to drive the development of our research.
Implement tools, libraries, and frameworks to speed up and enable new research.
Report and present research findings, software developments, experimental results, and data analysis clearly and efficiently.
Collaborate with internal and external scientific domain experts.

Requirements

PhD/Master’s degree in Computer Science, AI, ML, or equivalent practical experience.
At least 2 years of relevant experience developing computer vision techniques or multimodal machine learning models.
Experience in software development using Python and deep learning frameworks (e.g., Jax, TensorFlow, PyTorch), with a proven track record of building high-quality research prototypes and systems.
Quantitative skills in math and statistics.
Experience exploring, analysing and visualising data.

Qualifications

Experience in training and deployment of large-scale models.
Experience with Video Understanding
Experience with Large Language Models, prompt engineering, few-shot learning, post-training techniques, and evaluations.
A proven track record of research or engineering achievements, such as publications in peer-reviewed conferences or journals.

Benefits

The US base salary range for this full-time position is between 174,000 USD - 252,000 USD + bonus + equity + benefits. Your recruiter can share more about the specific salary range for your targeted location during the hiring process.

Frequently Asked Questions

How do I apply for this Research Engineer, Multimodal Reasoning For Information Literacy position?

Click the "Apply Now" button on this page to be directed to the application. You will be taken to the employer's application page.

Is this position remote?

This role is based in Mountain View, California, US. Check the full description for remote or hybrid options.

What is the salary range?

The listed salary range for this position is $174,000 - $252,000. Final compensation may vary based on experience, qualifications, and location.

When was this job posted?

This position was posted 3 days ago. We recommend applying promptly as positions can fill quickly.

About the Role

At Google DeepMind, our research team is dedicated to tackling the most complex challenges in online information quality. We strive to advance the state of the art by developing innovative solutions to detect manipulated media and misleading narratives, ensuring the integrity of digital discourse. A prominent example of our scientific discovery is Backstory. Our interdisciplinary work spans provenance analysis and the creation of tools for AI-assisted information literacy, leveraging our technologies for the widespread public benefit of a safer online environment. We thrive in a supportive environment that encourages rapid prototyping and iteration, driving our research achievements directly into Google’s flagship models, including Gemini.

Artificial Intelligence could be one of humanity’s most useful inventions. At Google DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance the state of the art in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

To succeed in this role, you will need to be passionate about advancing information literacy using machine learning and other computational techniques. You'll join an interdisciplinary team of domain experts, ML researchers, and engineers to research and build multimodal reasoning systems and Vision-Language Models (VLMs) to assess the trustworthiness of media (images, audio, and videos) on the internet.

Responsibilities

Plan and perform rapid prototyping of computer vision and multimodal machine learning techniques applied to determining authenticity of media information.

Design and train multimodal models capable of complex visual reasoning.

Undertake exploratory analysis to inform experimentation and research directions.

Engage with product teams to drive the development of our research.

Implement tools, libraries, and frameworks to speed up and enable new research.

Report and present research findings, software developments, experimental results, and data analysis clearly and efficiently.

Collaborate with internal and external scientific domain experts.

Requirements

PhD/Master’s degree in Computer Science, AI, ML, or equivalent practical experience.

At least 2 years of relevant experience developing computer vision techniques or multimodal machine learning models.

Experience in software development using Python and deep learning frameworks (e.g., Jax, TensorFlow, PyTorch), with a proven track record of building high-quality research prototypes and systems.

Quantitative skills in math and statistics.

Experience exploring, analysing and visualising data.

Qualifications

Experience in training and deployment of large-scale models.

Experience with Video Understanding

Experience with Large Language Models, prompt engineering, few-shot learning, post-training techniques, and evaluations.

A proven track record of research or engineering achievements, such as publications in peer-reviewed conferences or journals.

Frequently Asked Questions

How do I apply for this Research Engineer, Multimodal Reasoning For Information Literacy position?

Click the "Apply Now" button on this page to be directed to the application. You will be taken to the employer's application page.

Is this position remote?

This role is based in Mountain View, California, US. Check the full description for remote or hybrid options.

What is the salary range?

The listed salary range for this position is $174,000 - $252,000. Final compensation may vary based on experience, qualifications, and location.

When was this job posted?

This position was posted 3 days ago. We recommend applying promptly as positions can fill quickly.

Research Engineer, Multimodal Reasoning For Information Literacy

About the Role

Responsibilities

Requirements

Qualifications

Benefits

Similar Job Alerts

Google DeepMind

Frequently Asked Questions

How do I apply for this Research Engineer, Multimodal Reasoning For Information Literacy position?

Is this position remote?

What is the salary range?

When was this job posted?

Explore More

Career Resources

Research Engineer, Multimodal Reasoning For Information Literacy

About the Role

Responsibilities

Requirements

Qualifications

Benefits

Similar Job Alerts

Google DeepMind

Frequently Asked Questions

How do I apply for this Research Engineer, Multimodal Reasoning For Information Literacy position?

Is this position remote?

What is the salary range?

When was this job posted?

Explore More

Career Resources