HiredinAI LogoHiredinAI
JobsCompaniesJob Alerts
  1. Home
  2. chevron_right
  3. Multi-Modal AI Engineer
  4. chevron_right
  5. Senior Member of Technical Staff, Multimodal AI

Senior Member of Technical Staff, Multimodal AI

Cohere
C
apartmentCoherelocation_onRemote (Global, with offices in Toronto, London, Paris, San Francisco, New York, Montreal)schedulePosted 1 day ago
Full-timeMultimodal AIAI SystemsTextSpeech

About the Role

Cohere's mission is to scale intelligence to serve humanity by training and deploying frontier models for developers and enterprises building AI systems. The company focuses on magical experiences like content generation, semantic search, RAG, and agents, believing their work is instrumental to AI's widespread adoption. They are a team of passionate researchers, engineers, and designers committed to increasing model capabilities and customer value. At Cohere, there is a strong belief in the transformative power of multimodal AI to revolutionize human-technology interaction. The engineering teams are dedicated to pushing the boundaries of what is possible in this field, and this Senior Member of Technical Staff role is crucial to this endeavor. The position offers an ideal environment for exploration, innovation, and shaping the future of AI, supported by an exceptional ratio of compute resources to engineers. This role involves the design and development of cutting-edge multimodal AI systems, which includes seamlessly integrating various modalities such as text, speech, and vision. A key aspect of the role is conducting research and experiments on advanced compute infrastructure, exploring novel ideas in multimodal representation learning, transfer learning, and other related areas. The specialist will collaborate closely with world-class teams, both learning from and contributing to their collective expertise in the field. Notably, Cohere's Multimodal team introduced Command A Vision, a flagship vision-language model in July 2025, which consistently outperforms major models like Llama 4 Maverick, Mistral Medium/Pixtral Large, and GPT4.1, achieving an 83.1% average benchmark with 112B parameters running on just 2 GPUs. This demonstrates Cohere’s ability to achieve breakthrough performance with a focused team, emphasizing that breakthrough compute is not always necessary.

Responsibilities

  • Design and develop cutting-edge multimodal AI systems, integrating various modalities such as text, speech, and vision.
  • Conduct research and experiments on our advanced compute infrastructure, exploring novel ideas in multimodal representation learning, transfer learning, and more.
  • Collaborate closely with our world-class teams, learning from and contributing to their expertise in the field.

Requirements

  • Possess exceptional software engineering skills, with a proven track record of building robust and scalable systems.
  • Have a strong command of Python and are well-versed in popular deep learning frameworks like JAX, PyTorch, and TensorFlow, with an understanding of their multimodal capabilities.
  • Knowledge of distributed training strategies, especially for large-scale multimodal models.
  • Familiarity with autoregressive models, particularly their application in multimodal tasks such as image or video captioning, speech-to-text generation.

Qualifications

  • Publications in top-tier venues demonstrating your expertise in multimodal AI research.
  • Experience in writing efficient GPU kernels using CUDA, optimising performance for multimodal tasks.

Benefits

  • An open and inclusive culture and work environment
  • Work closely with a team on the cutting edge of AI research
  • Weekly lunch stipend, in-office lunches & snacks
  • Full health and dental benefits, including a separate budget to take care of your mental health
  • 100% Parental Leave top-up for up to 6 months
  • Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement
  • Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend
  • 6 weeks of vacation (30 working days!)
notifications_active

Similar Job Alerts

Get notified about new Multi-Modal AI Engineer roles.

expand_more
expand_more
Cohere
C

Cohere

View Companyarrow_forward
Multi-Modal AI EngineerRemote (Global

Frequently Asked Questions

How do I apply for this Senior Member of Technical Staff, Multimodal AI position?

Click the "Apply Now" button on this page to be directed to the application. You will be taken to the employer's application page.

Is this position remote?

Yes, this role is listed as a remote position.

When was this job posted?

This position was posted 1 day ago. We recommend applying promptly as positions can fill quickly.

Explore More

wifiBrowse all Remote AI Jobsattach_moneyAI Salary GuideschoolEntry Level AI JobscategoryMore Multi-Modal AI Engineer Jobs

Career Resources

article

How to Get an AI Job with No Experience

Career guide

article

AI Jobs Salary Guide 2026

Compensation data

article

AI-Proof Jobs: 25 Careers Safe from Automation

Career advice

article

The Complete Guide to AI Training Jobs

Industry guide

smart_toy
HiredinAI

Curated AI jobs across engineering, marketing, design, research, and more — from top companies and startups, updated daily.

alternate_emailworkcode

For Job Seekers

  • Browse Jobs
  • Job Categories
  • Companies
  • Remote AI Jobs
  • Entry Level Jobs
  • AI Salaries
  • Job Alerts
  • Career Blog

For Employers

  • Post a Job
  • Pricing
  • Employer Login
  • Dashboard

Resources

  • Blog
  • AI Glossary
  • Career Advice
  • Salary Guides
  • Industry News

AI Jobs by City

  • San Francisco
  • New York
  • London
  • Seattle
  • Toronto
  • Remote

Company

  • About Us
  • Contact
  • Privacy Policy
  • Terms of Service
  • Guidelines
  • DMCA

© 2026 HiredinAI. All rights reserved.

SitemapPrivacyTermsCookies