Gpu Engineer

il y a 1 jour


Paris, France Kog AI Temps plein

**KOG**:

- An early-stage startup was created a little over a year ago
- Real-time AI platform aimed at revolutionizing how artificial intelligence is used in digital experiences
- The goal is to make AI faster, more efficient, and more intuitive
- Creative optimization at a very low level with our solutions
- Seek out new ideas and implement them ambitiously
- Do not copy what already exists
- The inference engine optimizes AI performance at the assembly level
- Bypassing traditional abstraction layers
- Significant advancements in several areas: inter-GPU communication, kernel fusion, grid synchronization, and memory access optimization
- The inference engine offers speed improvements 3 to 10 times greater compared to the best GPU alternatives starting with AMD MI300X
- The link between GPUs and the product is real-time AI
- Creation of dynamic interfaces thanks to real-time AI
- Kog is therefore based on two axes:

- Hardcore GPU Engineering
- Stream R&D: new model architectures optimized for speed
- Final objective: 10x on GPUs and 10x on model architecture, thus 100x in total

**POSITION**:
We wish to strengthen our world-class team with technically brilliant individuals who want to take on this challenge. Your missions will include:

- Implementing cutting-edge AI models in low-level C++ code and Assembly on high-end AMD and NVIDIA GPUs
- Reverse-engineering subtle GPU features (such as memory page mappings, memory channels, hash functions, cache behaviors, credit assignment logic, etc.)
- Leveraging this knowledge to find and implement creative optimization ideas
- Optimizing the Kog inference engine to make AI inference incredibly fast (10x compared to vLLM, SGLang, or TensorRT-LLM—we are already at 3x)


  • GPU Engineer

    il y a 2 semaines


    Paris, Île-de-France Kog AI Temps plein

    KOG:Kog is a French deeptech company building an ultra-fast AI execution layer for real-time AI.We target up to 10x gains through GPU optimization and, crucially, up to 10x gains through model and training architecture design.We start on AMD GPUs and will expand to other accelerators.Our aim is a modular, real-time AI platform where developers and users can...

  • Lead GPU Engineer

    il y a 5 jours


    Paris, Île-de-France Kog AI Temps plein

    KOG: Kog is a Paris-based deeptech company building the world's fastest AI execution layer.We are not just optimizing existing libraries; we are bypassing inefficient abstraction layers to rewrite the rules of AI inference. By coding at the Assembly level on high-end GPUs (starting with the AMD MI300X), we unlock raw performance that standard stacks leave on...

  • Lead GPU Engineer

    il y a 4 jours


    Paris, France Kog AI Temps plein

    KOG: Kog is a European VC-funded startup and real-time AI frontier lab building the world’s fastest AI execution layer. As part of the 2030 French Tech cohort, we are on a mission to redefine the boundaries of artificial intelligence by enabling true real-time interaction at a scale never seen before. While the industry often settles for incremental...

  • Lead GPU Engineer

    il y a 3 jours


    Paris, Île-de-France Kog AI Temps plein

    KOG:Kog is a European VC-funded startup and real-time AI frontier lab building the world's fastest AI execution layer, part of the 2030 French Tech cohort.We are not just optimizing existing libraries; we are bypassing inefficient abstraction layers to rewrite the rules of AI inference. By coding at the Assembly level on high-end GPUs (starting with the AMD...

  • Lead GPU Engineer

    il y a 5 jours


    Paris, Île-de-France Kog AI Temps plein

    KOG: Kog is a European VC-funded startup and real-time AI frontier lab building the world's fastest AI execution layer. As part of the 2030 French Tech cohort, we are on a mission to redefine the boundaries of artificial intelligence by enabling true real-time interaction at a scale never seen before.While the industry often settles for incremental software...


  • Paris, France Pathway Temps plein

    A pioneering AI startup is seeking a Senior ML Infrastructure / DevOps Engineer who loves Linux and scaling GPU clusters. You will be responsible for the infrastructure powering ML workloads across multiple cloud providers. The role involves designing and automating ML platforms, managing GPU/CPU clusters, and working closely with R&D. Ideal candidates will...


  • Paris, France Kog AI Temps plein

    A leading AI technology firm located in Paris is seeking a Lead GPU Engineer to drive the technical strategy and lead a team focused on high-performance AI solutions. You will bridge the gap between architectural vision and execution, helping optimize their proprietary inference engine to achieve industry-leading speeds. This role offers a competitive...

  • Research Engineer

    il y a 1 jour


    Paris, France Kog AI Temps plein

    Open to freelancing or to a permanent position ! **KOG**: - An early-stage startup was created a little over a year ago - Real-time AI platform aimed at revolutionizing how artificial intelligence is used in digital experiences - The goal is to make AI faster, more efficient, and more intuitive - Creative optimization at a very low level with our...

  • Mlop's Engineer

    il y a 6 jours


    Paris, France UCASE CONSULTING Temps plein

    **Contexte de la mission**: Notre client cherche à se renforcer avec un profil MLops Engineer pour le déploiement d'un modèle Custom Gen AI. Ce modèle inclut des technologies telles que les LLM (Large Language Models) et des solutions Open Source. Le profil recherché devra apporter une expertise dans l'optimisation et le déploiement de modèles sur des...


  • Paris, France Codezys Temps plein

    **Métiers et Fonctions**: Data Management Machine Learning Engineer **Spécialités technologiques**: Cloud Big Data Machine Learning Simulation Deep Learning **Type de facturation**: Assistance Technique (facturation au taux journalier) **Compétences clés**: **Technologies et outils**: AWS (5 ans), Docker, Pytorch, GIT, CI/CD, Xarray, Python (5...