How do I apply for this Research Engineer, Multimodal Generative AI (Image/Video) position?

Click the "Apply Now" button on the job listing page. You will be taken to the employer's application page.

Is this position remote?

This role is based in Kirkland, Washington, US, or Seattle, Washington, US. Check the full description for remote or hybrid options.

What is the salary range?

The listed salary range for this position is $166,000 - $244,000. Final compensation may vary based on experience, qualifications, and location.

When was this job posted?

This position was posted 3 days ago. We recommend applying promptly as positions can fill quickly.

Research Engineer, Multimodal Generative AI (Image/Video) at Google DeepMind

About the Role

This Research Engineer role focuses on developing state-of-the-art methods for multimodal generative AI models, with a primary emphasis on image generation and editing. The position is part of the team behind "Nano Banana" at Google DeepMind.
Google DeepMind cultivates a unique culture and work environment where ambitious, long-term research can flourish. Its interdisciplinary team integrates the best techniques from deep learning, reinforcement learning, and systems neuroscience to build general-purpose learning algorithms. This approach has already led to significant breakthroughs toward artificial general intelligence, and the necessary elements are in place for further substantial progress.
Google DeepMind's overarching mission is predicated on the belief that Artificial Intelligence could be one of humanity’s most useful inventions. The organization comprises scientists, engineers, machine learning experts, and others who collaborate to advance AI. They apply their technologies for widespread public benefit and scientific discovery, actively partner on critical challenges, and maintain safety and ethics as their highest priorities.
Research Engineers at Google DeepMind are at the forefront of developing novel tools, infrastructure, and algorithms with the ultimate goal of achieving Artificial General Intelligence. They are expected to independently build state-of-the-art foundation models and research infrastructure, collaborate with teams on large-scale AI projects, and devise solutions to fundamental questions in machine learning and AI. Leveraging expertise from diverse disciplines such as deep learning, computer vision, language modeling, and advanced generative architectures, Research Engineers are key contributors to groundbreaking research.

Requirements

PhD in Computer Science, Artificial Intelligence, Machine Learning, Computer Vision, or equivalent practical experience.
Proven experience in deep learning research and development, particularly in generative AI for image synthesis, including diffusion models and autoregressive generative models. Experience with post-training is a plus.
Exceptional engineering skills in Python and deep learning frameworks (e.g., JAX, TensorFlow, PyTorch), with a track record of building high-quality research prototypes and systems.
Strong publication record at top-tier machine learning, computer vision, and graphics conferences (e.g., NeurIPS, ICLR, ICML, SIGGRAPH, CVPR, ICCV).

Qualifications

Demonstrated experience in multimodal generative modeling, especially combining large language models with visual generation (e.g., text-to-image/video systems, joint autoregressive and diffusion models).
A keen eye for visual aesthetics and detail, coupled with a passion for creating high-quality, visually compelling generative content.
A real passion for AI!

Benefits

Enhanced maternity, paternity, adoption, and shared parental leave.
Private medical and dental insurance for yourself and any dependents.
Flexible working options.
Excellent facilities such as healthy food, an on-site gym, faith rooms, and terraces.
Relocation assistance to Mountain View and immigration support (depending on eligibility).
Bonus, equity, and additional benefits.
