GPU Engineer

1 week ago


Paris, Île-de-France | Kog AI | Full-time

KOG:

Kog is a French deeptech company building an ultra-fast AI execution layer for real-time AI.

We target up to 10x gains through GPU optimization and, crucially, up to 10x gains through model and training architecture design.

We start on AMD GPUs and will expand to other accelerators.

Our aim is a modular, real-time AI platform where developers and users can generate, customize, and operate AI agents and applications collaboratively, with a strong focus on European sovereignty, efficiency, and user control.

KOG:

  • An early-stage startup, created a little over a year ago
  • Real-time AI platform aimed at revolutionizing how artificial intelligence is used in digital experiences
  • The goal is to make AI faster, more efficient, and more intuitive
  • Our solutions rely on creative optimization at a very low level
  • Seek out new ideas and implement them ambitiously
  • Do not copy what already exists
  • The inference engine optimizes AI performance at the assembly level
  • Bypassing traditional abstraction layers
  • Significant advancements in several areas: inter-GPU communication, kernel fusion, grid synchronization, and memory access optimization
  • The inference engine runs 3 to 10 times faster than the best GPU alternatives, starting with the AMD MI300X
  • The goal is to unlock uses such as collaboration for complex creation (video games, films, music, applications, software, etc.)
  • The link between GPUs and the product is real-time AI
  • Creation of dynamic interfaces thanks to real-time AI
  • Kog is therefore based on two axes:
      • Hardcore GPU engineering
      • Stream R&D: new model architectures optimized for speed
  • Final objective: 10x on GPUs and 10x on model architecture, for a combined 100x

POSITION:

We want to strengthen our world-class team with technically brilliant individuals who want to take on this challenge. Your responsibilities will include:

  • Implementing cutting-edge AI models in low-level C++ code and Assembly on high-end AMD and NVIDIA GPUs
  • Reverse-engineering subtle GPU features (such as memory page mappings, memory channels, hash functions, cache behaviors, credit assignment logic, etc.)
  • Leveraging this knowledge to find and implement creative optimization ideas
  • Optimizing the Kog inference engine to make AI inference incredibly fast (target: 10x compared to vLLM, SGLang, or TensorRT-LLM; we are already at 3x)

What we offer:

  • Competitive salary
  • Equity (BSPCE)
  • Elite technical challenges
  • World-class team (9 engineers, including 3 PhDs)
  • A creative environment where your goal is to push the limits
  • The equipment you need to perform
  • WeWork offices in the 13th arrondissement of Paris (near Station F)
  • After-work events during our Paris week

You can apply right below if you feel you are up to the task.

