About the Role
- This Research Engineer role focuses on developing state-of-the-art methods for multimodal generative AI models, with a primary emphasis on image generation and editing. The position is part of the team behind "Nano Banana" at Google DeepMind.
- At Google DeepMind, they cultivate a unique culture and work environment where ambitious, long-term research can flourish. Their interdisciplinary team integrates the best techniques from deep learning, reinforcement learning, and systems neuroscience to build general-purpose learning algorithms. This approach has already led to significant breakthroughs toward artificial general intelligence, and the necessary elements are in place for further substantial progress.
- Google DeepMind's overarching mission is predicated on the belief that Artificial Intelligence could be one of humanity’s most useful inventions. The organization comprises scientists, engineers, machine learning experts, and others who collaborate to advance AI. They apply their technologies for widespread public benefit and scientific discovery, actively partnering on critical challenges, and maintaining safety and ethics as their highest priorities.
- Research Engineers at Google DeepMind are at the forefront of developing novel tools, infrastructure, and algorithms with the ultimate goal of achieving Artificial General Intelligence. They are expected to independently build state-of-the-art foundation models and research infrastructure, collaborate with teams on large-scale AI projects, and devise solutions to fundamental questions in machine learning and AI. Leveraging expertise from diverse disciplines such as deep learning, computer vision, language modeling, and advanced generative architectures, Research Engineers are key contributors to groundbreaking research.
Requirements
- PhD in Computer Science, Artificial Intelligence, Machine Learning, Computer Vision, or equivalent practical experience.
- Proven experience in deep learning research and development, particularly in generative AI and related to image synthesis. This includes diffusion models and autoregressive generative models. Experience with post-training is a plus.
- Exceptional engineering skills in Python and deep learning frameworks (e.g., Jax, TensorFlow, PyTorch), with a track record of building high-quality research prototypes and systems.
- Strong publication record at top-tier machine learning, computer vision, and graphics conferences (e.g., NeurIPS, ICLR, ICML, SIGGRAPH, CVPR, ICCV).
Qualifications
- Demonstrated experience in multimodal generative modeling, especially combining large language models with visual generation (e.g., text-to-image/video systems, joint autoregressive and diffusion models).
- A keen eye for visual aesthetics and detail, coupled with a passion for creating high-quality, visually compelling generative content.
- A real passion for AI!
Benefits
- Enhanced maternity, paternity, adoption, and shared parental leave.
- Private medical and dental insurance for yourself and any dependents.
- Flexible working options.
- Excellent facilities such as healthy food, an on-site gym, faith rooms, terraces etc.
- Relocation assistance to Mountain View and immigration support (depending on eligibility).
- Bonus, equity, and additional benefits.