GPU Engineer
il y a 2 semaines
KOG:
Kog is a French deeptech company building an ultra-fast AI execution layer for real-time AI.
We target up to 10x gains through GPU optimization and, crucially, up to 10x gains through model and training architecture design.
We start on AMD GPUs and will expand to other accelerators.
Our aim is a modular, real-time AI platform where developers and users can generate, customize, and operate AI agents and applications collaboratively, with a strong focus on European sovereignty, efficiency, and user control.
KOG:
- An early-stage startup was created a little over a year ago
- Real-time AI platform aimed at revolutionizing how artificial intelligence is used in digital experiences
- The goal is to make AI faster, more efficient, and more intuitive
- Creative optimization at a very low level with our solutions
- Seek out new ideas and implement them ambitiously
- Do not copy what already exists
- The inference engine optimizes AI performance at the assembly level
- Bypassing traditional abstraction layers
- Significant advancements in several areas: inter-GPU communication, kernel fusion, grid synchronization, and memory access optimization
- The inference engine offers speed improvements 3 to 10 times greater compared to the best GPU alternatives starting with AMD MI300X
- The goal is to unlock uses such as collaboration for complex creation (video games, films, music, applications, software, etc.)
- The link between GPUs and the product is real-time AI
- Creation of dynamic interfaces thanks to real-time AI
Kog is therefore based on two axes:
Hardcore GPU Engineering
- Stream R&D: new model architectures optimized for speed
- Final objective: 10x on GPUs and 10x on model architecture, thus 100x in total
POSITION:
We wish to strengthen our world-class team with technically brilliant individuals who want to take on this challenge. Your missions will include:
- Implementing cutting-edge AI models in low-level C++ code and Assembly on high-end AMD and NVIDIA GPUs
- Reverse-engineering subtle GPU features (such as memory page mappings, memory channels, hash functions, cache behaviors, credit assignment logic, etc.)
- Leveraging this knowledge to find and implement creative optimization ideas
- Optimizing the Kog inference engine to make AI inference incredibly fast (10x compared to vLLM, SGLang, or TensorRT-LLM—we are already at 3x)
What we offer:
- Competitive salary
- Equities (BSPCE)
- Elite technical challenges
- World-class team (9 engineers, including 3 PhD)
- A creative environment where your goal is to push back the limits
- Equipment you'll need to perform
- WeWork offices in the 13th district of Paris (near Station F)
- Afterworks during our Paris week
You can apply right below if you feel that you're up to the task
-
GPU Engineer
il y a 2 jours
Paris, Île-de-France Kog AI Temps pleinKOG: Kog is a European VC-funded startup and real-time AI frontier lab building the world's fastest AI execution layer, part of the 2030 French Tech cohort.We are not just optimizing existing libraries; we are bypassing inefficient abstraction layers to rewrite the rules of AI inference. By coding at the Assembly level on high-end GPUs (starting with the AMD...
-
Lead GPU Engineer
il y a 4 jours
Paris, Île-de-France Kog AI Temps pleinKOG: Kog is a Paris-based deeptech company building the world's fastest AI execution layer.We are not just optimizing existing libraries; we are bypassing inefficient abstraction layers to rewrite the rules of AI inference. By coding at the Assembly level on high-end GPUs (starting with the AMD MI300X), we unlock raw performance that standard stacks leave on...
-
Lead GPU Engineer
il y a 2 jours
Paris, Île-de-France Kog AI Temps pleinKOG:Kog is a European VC-funded startup and real-time AI frontier lab building the world's fastest AI execution layer, part of the 2030 French Tech cohort.We are not just optimizing existing libraries; we are bypassing inefficient abstraction layers to rewrite the rules of AI inference. By coding at the Assembly level on high-end GPUs (starting with the AMD...
-
AI Engineer
il y a 2 semaines
Paris, Île-de-France Experis Temps pleinSoftware Engineer – AI Infrastructure for Agents (Paris)We're seeking aSoftware Engineerwith deep expertise inPython, PyTorch, CUDA, and C++to join our AI research team inParis. This role is focused on building high-performance infrastructure for training and deploying large-scale AI agents.Important:Candidates will be asked to complete atimed coding...
-
Pre-Training Engineer
il y a 2 jours
Paris, Île-de-France Kog AI Temps pleinKOG: Kog is a French deeptech company building an ultra-fast AI execution layer for real-time AI. We target up to 10x gains through GPU optimization and, crucially, up to 10x gains through model and training architecture design. We start on AMD GPUs and will expand to other accelerators. Our aim is a modular, real-time AI platform where developers and users...
-
Senior DevOps Engineer
il y a 7 jours
Paris, Île-de-France Wirk Temps pleinWIRK — Senior DevOps Engineer (Kubernetes, Terraform, LLM & ELK) — Paris (On-site)À propos de WIRKWIRK est une entreprise innovante spécialisée dans l'extraction documentaire grace à des plateformes scalables et intelligentes qui répondent aux défis techniques modernes.Chez WIRK, nous plaçons lafiabilité, la performance et l'innovation au cœur...
-
Site Reliability Engineer
il y a 4 jours
Paris, Île-de-France Criteo Temps pleinWhat You'll Do:At Criteo, our Platform Core group builds the foundational services that power our global advertising platform. We design and operate scalable, resilient systems that support real-time decision-making and data processing at massive scale.As we expand our capabilities in high-performance inference and distributed computing, we're forming a new...
-
Imaging Software Engineer
il y a 2 jours
Paris, Île-de-France Harmattan AI Temps pleinAbout UsAt Harmattan AI, we are a next-generation defense prime building autonomous and scalable defense systems. Driven by rigorous engineering developments of new defense products based on recent robotics and AI developments, we are on a steep growth trajectory. If you are interested in a career in a highly technical environment, thrive on pushing...
-
Research Engineer
il y a 2 jours
Paris, Île-de-France Kog AI Temps pleinKOG: Kog is a European VC-funded startup and real-time AI frontier lab building the world's fastest AI execution layer, part of the 2030 French Tech cohort.We are not just optimizing existing libraries; we are bypassing inefficient abstraction layers to rewrite the rules of AI inference. By coding at the Assembly level on high-end GPUs (starting with the AMD...
-
Imaging Software Engineer
il y a 4 jours
Paris, Île-de-France Harmattan AI Temps pleinAbout UsAt Harmattan AI, we are a next-generation defense prime building autonomous and scalable defense systems. Driven by rigorous engineering developments of new defense products based on recent robotics and AI developments, we are on a steep growth trajectory. If you are interested in a career in a highly technical environment, thrive on pushing...