Gpu Engineer

il y a 3 jours


Paris, France Kog AI Temps plein

**KOG**:
Kog is a real-time AI startup, created a little over a year ago, which aims to revolutionize how AI is used in digital experiences. The goal is to make AI faster, more efficient, and more intuitive.

We creatively optimize at a very low level with our own solutions and seek out new ideas that we implement ambitiously. **We do not copy what already exists**

We've built an inference engine that optimizes AI performance at the Assembly level by bypassing traditional abstraction layers, and we've made significant advancements in several areas:

- Inter-GPU communication
- Kernel fusion
- Grid synchronization
- Memory access optimization

The inference engine offers speed improvements 3 to 10 times greater compared to the best GPU alternatives, starting with AMD MI300X.

Kog is therefore based on two axes:

- Hardcore GPU Engineering
- Stream R&D: new model architectures for speed

Our final objective is to be 10x faster on GPUs and 10x faster on model architecture, thus 100x faster total.

**POSITION**:
We wish to strengthen our world-class team with technically brilliant individuals who want to take on this challenge. Your missions will include:

- Implementing cutting-edge AI models in low-level C++ code and Assembly on high-end AMD and NVIDIA GPUs
- Reverse-engineering subtle GPU features (such as memory page mappings, memory channels, hash functions, cache behaviors, credit assignment logic, etc.)
- Leveraging this knowledge to find and implement creative optimization ideas
- Optimizing the Kog inference engine to make AI inference incredibly fast (10x compared to vLLM, SGLang, or TensorRT-LLM—we are already at 3x)

**PROFILE**:

- World-class talents with 5+ years of experience
- Proficiency in CUDA or ROCm
- Start-up mindset
- Team player attitude
- PhD or Top Engineering Schools
- Someone who has side projects or shows great passion and interest

We meet one week per month in our Paris office, and the rest of the time, you can be in full-remote or go to the office if you prefer.


  • Lead GPU Engineer

    il y a 7 heures


    Paris, Île-de-France Kog AI Temps plein

    KOG: Kog is a European VC-funded startup and real-time AI frontier lab building the world's fastest AI execution layer. As part of the 2030 French Tech cohort, we are on a mission to redefine the boundaries of artificial intelligence by enabling true real-time interaction at a scale never seen before.While the industry often settles for incremental software...

  • Software Engineer, Platform

    il y a 2 semaines


    Paris, France AeroVect Temps plein

    **Who We Are**: As a Platform Engineer at AeroVect, you will own the reliability, performance, and scalability of the software foundation that powers our autonomous ground vehicle fleet. You will be responsible for managing and optimizing our Ubuntu‑based operating system images, middleware, and device drivers that interface with a diverse multi‑sensor...

  • ML Engineer | DeepTech

    Il y a 2 minutes


    Paris, France Data Recrutement Temps plein

    L'ENTREPRISE : STARTUP ULTRA INNOVANTE, TECHNOLOGIE DE POINTEMedtech française en deeptech exploitant l'intelligence artificielle pour répondre aux besoins médicaux non satisfaits des patients vivant avec des maladies immunitaires en utilisant des données multimodales pour prédire le diagnostic, le pronostic et la réponse au traitement des patients....

  • Computer Vision Engineer

    il y a 1 semaine


    Paris, France Licorne Society Temps plein

    Licorne Society a été missionné par une startup en pleine croissance pour les aider à trouver leur Computer Vision Engineer. **Our Mission** Disrupt the world of authentication and bring trust to interactions between individuals and businesses while enhancing the user experience when it comes to authentication or recurrent ID verification. **Our...

  • Software Engineer

    il y a 6 jours


    Paris, France Mistral AI Temps plein

    **About Mistral** At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is...


  • Paris, France Reflection AI Temps plein

    **Role Overview**: As a member of the technical staff focused on infrastructure at Reflection you will lead the development of a world-class AI research platform, enabling scalable and efficient language model and reinforcement learning training and inference. You will design and implement infrastructure capable of scaling across large GPU clusters and...


  • Paris, France Data Recrutement Temps plein

    LA START-UP : SOLUTION DEEP LEARNING/IA PERMETTANT L'ANALYSE DE RECOLTE AGRICOLECette startup développe une web & mobile app SaaS B2B, qui permet aux acteurs de l'agriculture, de pouvoir analyser la qualité de leurs récoltes avec une simple photo. Ils souhaitent révolutionner le domaine de l'agriculture en proposant une solution à forte valeur...


  • Paris, France Marso Robotics Temps plein

    Senior Embedded Software Engineer – AI & Autonomous Robotics (M/F) Join the team at Marso Robotics as a Senior Embedded Software Engineer and help build the next generation of autonomous robots that seamlessly integrate artificial intelligence with hardware. As a Software & Embedded Robotics Engineer, you will play a central role in developing our robots...


  • Greater Paris Metropolitan Region, France Cherry Pick Temps plein

    Le Contexte : "L'Industrie au service de l'Académie"Le ML Lab est une équipe de pointe focalisée sur des travaux de recherche académique. Pour libérer leur productivité, ils utilisent un écosystème hybride : des ressources internes AWS et un "Neo-Cloud" spécialisé (Lambda.ai) offrant des performances de calcul (GPU) "énervées".Votre mission, au...

  • System Engineer

    il y a 6 jours


    Paris, France Faurecia Temps plein

    **We are looking for a Visualization & Surrounding 3D View - System Engineer (f/m) to join our team!** Our **Faurecia Clarion Electronics** Business Group is looking for a **Visualization & Surrounding 3D View System Engineer **for its **Automated Driving Product Division** based in our R&D Center in Paris (12ème arrondissement - line 14 metro Cour St...