Senior ML Systems Engineer, Frameworks

il y a 6 jours


Paris, Île-de-France Cohere Temps plein

Who are we?

Our mission is to scale intelligence to serve humanity. We're training and deploying frontier models for developers and enterprises who are building AI systems to power magical experiences like content generation, semantic search, RAG, and agents. We believe that our work is instrumental to the widespread adoption of AI.

We obsess over what we build. Each one of us is responsible for contributing to increasing the capabilities of our models and the value they drive for our customers. We like to work hard and move fast to do what's best for our customers.

Cohere is a team of researchers, engineers, designers, and more, who are passionate about their craft. Each person is one of the best in the world at what they do. We believe that a diverse range of perspectives is a requirement for building great products.

Join us on our mission and shape the future

We're looking for a senior engineer to help build, maintain and evolve the training framework that powers our frontier-scale language models. This role sits at the intersection of large-scale training, distributed systems, and HPC infrastructure. You will design and maintain the core components that enable fast, reliable, and scalable model training — and build the tooling that connects research ideas to thousands of GPUs.

If you enjoy working across the full stack of ML systems, this role gives you the opportunity and autonomy to have massive impact.

What You'll Work On
  • Build and own the training framework responsible for large-scale LLM training.

  • Design distributed training abstractions (data/tensor/pipeline parallelism, FSDP/ZeRO strategies, memory management, checkpointing).

  • Improve training throughput and stability on multi-node clusters (e.g., GB200/300, AMD, H200/100).

  • Develop and maintain tooling for monitoring, logging, debugging, and developer ergonomics.

  • Collaborate closely with infra teams to ensure Slurm setups, container environments, and hardware configurations support high-performance training.

  • Investigate and resolve performance bottlenecks across the ML systems stack.

  • Build robust systems that ensure reproducible, debuggable, large-scale runs.

You Might Be a Good Fit If You Have
  • Strong engineering experience in large-scale distributed training or HPC systems.
    Deep familiarity with JAX internals, distributed training libraries, or custom kernels/fused ops.

  • Experience with multi-node cluster orchestration (Slurm, Ray, Kubernetes, or similar).

  • Comfort debugging performance issues across CUDA/NCCL, networking, IO, and data pipelines.

  • Experience working with containerized environments (Docker, Singularity/Apptainer).

  • A track record of building tools that increase developer velocity for ML teams.

  • Excellent judgment around trade-offs: performance vs complexity, research velocity vs maintainability.

  • Strong collaboration skills — you'll work closely with infra, research, and deployment teams.

Nice to Have
  • Experience with training LLMs or other large transformer architectures.

  • Contributions to ML frameworks (PyTorch, JAX, DeepSpeed, Megatron, xFormers, etc.).

  • Familiarity with evaluation and serving frameworks (vLLM, TensorRT-LLM, custom KV caches).

  • Experience with data pipeline optimization, sharded datasets, or caching strategies.

  • Background in performance engineering, profiling, or low-level systems.

Bonus: paper at top-tier venues (such as NeurIPS, ICML, ICLR, AIStats, MLSys, JMLR, AAAI, Nature, COLING, ACL, EMNLP).

Why Join Us
  • You'll work on some of the most challenging and consequential ML systems problems today.

  • You'll collaborate with a world-class team working fast and at scale.

  • You'll have end-to-end ownership over critical components of the training stack.

  • You'll shape the next generation of infrastructure for frontier-scale models.

  • You'll build tools and systems that directly accelerate research and model quality.

Sample Projects:

  • Build a high-performance data loading and caching pipeline.

  • Implement performance profiling across the ML systems stack

  • Develop internal metrics and monitoring for training runs.

  • Build reproducibility and regression testing infrastructure.

  • Develop a performant fault-tolerant distributed checkpointing system.

If some of the above doesn't line up perfectly with your experience, we still encourage you to apply

We value and celebrate diversity and strive to create an inclusive work environment for all. We welcome applicants from all backgrounds and are committed to providing equal opportunities. Should you require any accommodations during the recruitment process, please submit an Accommodations Request Form, and we will work together to meet your needs.

Full-Time Employees at Cohere enjoy these Perks:

An open and inclusive culture and work environment 

Work closely with a team on the cutting edge of AI research 

Weekly lunch stipend, in-office lunches & snacks

Full health and dental benefits, including a separate budget to take care of your mental health 

100% Parental Leave top-up for up to 6 months

Personal enrichment benefits towards arts and culture, fitness and well-being, quality time, and workspace improvement

Remote-flexible, offices in Toronto, New York, San Francisco, London and Paris, as well as a co-working stipend

6 weeks of vacation (30 working days)


  • ML Ops Engineer

    Il y a 17 minutes


    Paris, Île-de-France Match Group Temps plein

    At Meetic, a Match Group company, we have been pioneering online dating for over 20 years, leveraging our legacy to reinvent how people meet, connect, and build meaningful relationships across Europe and beyond. As part of Match Group, we manage a diverse portfolio of brands, from iconic names like Meetic, Match, Lexa, and LoveScout24 to tailored...

  • Consultant Senior ML Engineer

    Il y a 18 minutes


    Paris, Île-de-France Talan Temps plein 55 000 € - 70 000 €

    Company Description Talan est un groupe international de conseil et d'expertises technologiques qui accélère la transformation de ses clients par les leviers de l'innovation, la technologie et la dataDepuis plus de 20 ans, Talan conseille et accompagne les entreprises et les institutions publiques dans la mise en œuvre de leurs projets de transformation...

  • Member of Technical Staff

    il y a 6 jours


    Paris, Île-de-France Adaptive ML Temps plein

    About the teamAdaptive ML is a frontier AI startup building a Reinforcement Learning Operations (RLOps) platform that enables enterprises to specialize and deploy LLMs into production with measurable impact.We provide the core infrastructure to tune, evaluate, and serve specialized models at scale — pioneering task-specific LLM development and running...

  • Consultant ML/AI Engineer

    il y a 6 jours


    Paris, Île-de-France OnePoint Temps plein

    Contribuez aux grandes transformations des entreprises et des acteurs publics en alliant innovation technologique et expertise métier, au service de nos clients et de la société pour les faire avancer durablement.Au-delà de la RSE, nous avons développé notre propre approche, RESET, qui englobe l'ensemble de nos engagements en matière de...

  • Senior Software Engineer

    Il y a 17 minutes


    Paris, Île-de-France RevTech Temps plein

    Location: Central Paris – 2 days onsite per week Salary: €65-85k DOE Recruiting for a Series B scale up in the Cybersecurity space, with a worldwide team they're looking to expand their Paris office by hiring a number of senior engineers.They offer best in class preventative tools in Cybersecurity, they're backed by US and European top-tier VC firms so...

  • Senior ML

    il y a 2 semaines


    Paris, Île-de-France Winamax Temps plein

    Basés en plein cœur de Paris, nous faisons bouger l'industrie des jeux en ligne. Leader du poker et des paris sportifs en France avec joueurs et parieurs mensuels, nous sommes présents en Espagne, en Allemagne et bientôt en Italie et au Portugal. Nous offrons à nos joueurs une expérience exceptionnelle, à la fois technique, créative et qualitative....

  • Azure Cloud Engineer

    il y a 6 jours


    Paris, Île-de-France Emagine Consulting Temps plein

    Vous êtes un ingénieur Azure Cloud expérimenté et possédez une solide expertise dans les domaines AI Foundry et Azure Machine Learning ?emagine vous offre l'opportunité de participer au déploiement et à l'évolution d'une plateforme d'exploration IA et d'un environnement Sandbox conçus pour accélérer l'innovation en matière de ML et d'IA dans les...

  • AI Engineer

    Il y a 17 minutes


    Paris, Île-de-France Collective Temps plein

    Budget: 650€Contexte du posteNous recherchons unAI Engineerissu d'un parcoursData Engineer, Data Scientist ou Machine Learning Engineer, capable de concevoir et d'industrialiser des pipelines RAG et des solutions IA génératives end‑to‑end. Le profil interviendra au sein d'équipes Data, Cloud et Produit pour accélérer l'adoption de l'IA...

  • Senior Robotics Engineer

    Il y a 17 minutes


    Paris, Île-de-France IC Resources Temps plein

    The right to work in France without sponsorship is essential for this vacancy.An exciting opportunity for a Senior Robotics Engineer has arisen with a medtech company, based in Paris.As a Senior Robotics Software Engineer, you will play a key role in designing, developing, and improving the core robotics control software for their innovative device, enabling...

  • Senior Manager

    il y a 6 jours


    Paris, Île-de-France Stibo Systems Temps plein

    Senior Campaign Manager, United Kingdom or EuropeDrive global integrated campaigns that fuel growth and engagement This is your opportunity to lead high-impact, performance-driven campaigns in a fast-paced, innovative environment."This role is about turning integrated campaigns into a predictable, optimized growth engine for the business," says Lani Wilson,...