GPU Engineer

il y a 1 semaine

Paris, Île-de-France Kog AI Temps plein

KOG:

Kog is a French deeptech company building an ultra-fast AI execution layer for real-time AI.

We target up to 10x gains through GPU optimization and, crucially, up to 10x gains through model and training architecture design.

We start on AMD GPUs and will expand to other accelerators.

Our aim is a modular, real-time AI platform where developers and users can generate, customize, and operate AI agents and applications collaboratively, with a strong focus on European sovereignty, efficiency, and user control.

KOG:

An early-stage startup was created a little over a year ago
Real-time AI platform aimed at revolutionizing how artificial intelligence is used in digital experiences
The goal is to make AI faster, more efficient, and more intuitive
Creative optimization at a very low level with our solutions
Seek out new ideas and implement them ambitiously
Do not copy what already exists
The inference engine optimizes AI performance at the assembly level
Bypassing traditional abstraction layers
Significant advancements in several areas: inter-GPU communication, kernel fusion, grid synchronization, and memory access optimization
The inference engine offers speed improvements 3 to 10 times greater compared to the best GPU alternatives starting with AMD MI300X
The goal is to unlock uses such as collaboration for complex creation (video games, films, music, applications, software, etc.)
The link between GPUs and the product is real-time AI
Creation of dynamic interfaces thanks to real-time AI
Kog is therefore based on two axes:
Hardcore GPU Engineering
Stream R&D: new model architectures optimized for speed
Final objective: 10x on GPUs and 10x on model architecture, thus 100x in total

POSITION:

We wish to strengthen our world-class team with technically brilliant individuals who want to take on this challenge. Your missions will include:

Implementing cutting-edge AI models in low-level C++ code and Assembly on high-end AMD and NVIDIA GPUs
Reverse-engineering subtle GPU features (such as memory page mappings, memory channels, hash functions, cache behaviors, credit assignment logic, etc.)
Leveraging this knowledge to find and implement creative optimization ideas
Optimizing the Kog inference engine to make AI inference incredibly fast (10x compared to vLLM, SGLang, or TensorRT-LLM—we are already at 3x)

What we offer:

Competitive salary
Equities (BSPCE)
Elite technical challenges
World-class team (9 engineers, including 3 PhD)
A creative environment where your goal is to push back the limits
Equipment you'll need to perform
WeWork offices in the 13th district of Paris (near Station F)
Afterworks during our Paris week

You can apply right below if you feel that you're up to the task

Lead GPU Engineer

il y a 3 jours

Paris, Île-de-France Kog AI Temps plein

KOG: Kog is a Paris-based deeptech company building the world's fastest AI execution layer.We are not just optimizing existing libraries; we are bypassing inefficient abstraction layers to rewrite the rules of AI inference. By coding at the Assembly level on high-end GPUs (starting with the AMD MI300X), we unlock raw performance that standard stacks leave on...
Lead GPU Engineer

il y a 1 jour

Paris, Île-de-France Kog AI Temps plein

KOG:Kog is a European VC-funded startup and real-time AI frontier lab building the world's fastest AI execution layer, part of the 2030 French Tech cohort.We are not just optimizing existing libraries; we are bypassing inefficient abstraction layers to rewrite the rules of AI inference. By coding at the Assembly level on high-end GPUs (starting with the AMD...
Lead GPU Engineer

il y a 3 jours

Paris, Île-de-France Kog AI Temps plein

KOG: Kog is a European VC-funded startup and real-time AI frontier lab building the world's fastest AI execution layer. As part of the 2030 French Tech cohort, we are on a mission to redefine the boundaries of artificial intelligence by enabling true real-time interaction at a scale never seen before.While the industry often settles for incremental software...
Senior DevOps Engineer

il y a 6 jours

Paris, Île-de-France Wirk Temps plein

WIRK — Senior DevOps Engineer (Kubernetes, Terraform, LLM & ELK) — Paris (On-site)À propos de WIRKWIRK est une entreprise innovante spécialisée dans l'extraction documentaire grace à des plateformes scalables et intelligentes qui répondent aux défis techniques modernes.Chez WIRK, nous plaçons lafiabilité, la performance et l'innovation au cœur...
Site Reliability Engineer

il y a 3 jours

Paris, Île-de-France Criteo Temps plein

What You'll Do:At Criteo, our Platform Core group builds the foundational services that power our global advertising platform. We design and operate scalable, resilient systems that support real-time decision-making and data processing at massive scale.As we expand our capabilities in high-performance inference and distributed computing, we're forming a new...
Systems Engineer

il y a 2 semaines

Paris, Île-de-France Collective Temps plein

Systems Engineer / DevOps Engineer (Operating Systems, Kubernetes, Web Services)Location : Sud Ile de FranceCDI ou contratAbout the RoleWe are seeking a skilled and forward-thinking Systems Engineer with deep expertise in Linux-based operating systems, RedHat OpenShift, Kubernetes and cloud-native web services. The ideal candidate will have hands-on...
Systems Engineer

il y a 2 semaines

Paris, Île-de-France a-82ac-4f4c-a6b2-5eb6545b4923 Temps plein

Systems Engineer / DevOps Engineer (Operating Systems, Kubernetes, Web Services)TJM: 600€We are also hiring for this post (CDI)Location : Sud Ile de france, proche SaclayAbout the RoleWe are seeking a skilled and forward-thinking Systems Engineer with deep expertise in Linux-based operating systems, RedHat OpenShift, Kubernetes and cloud-native web...
Imaging Software Engineer

il y a 1 jour

Paris, Île-de-France Harmattan AI Temps plein

About UsAt Harmattan AI, we are a next-generation defense prime building autonomous and scalable defense systems. Driven by rigorous engineering developments of new defense products based on recent robotics and AI developments, we are on a steep growth trajectory. If you are interested in a career in a highly technical environment, thrive on pushing...
Site Reliability Engineer

il y a 3 jours

Paris, Île-de-France OVHcloud Temps plein

Site Reliability Engineer - AI Core H/F/N H/F/NAu sein de votre équipe #OneTeamVous rejoindrez l'équipe pluri-disciplinaire AI Core responsable du développement des produits d'intelligence artificielle d'OVHcloud et de leur continuité de service..Dans le cadre des produits IA, vous maintiendrez et accompagnerez les évolutions de infrastructure pour...
Backend Engineer

il y a 1 jour

Paris, Île-de-France TxxxxxxAI Temps plein

de l'entreprise : À mesure que les systèmes d'IA opèrent de plus en plus dans le monde physique — robots, infrastructures intelligentes, systèmes autonomes — un problème critique demeure non résolu : nous savons faire des prédictions, mais nous avons encore du mal à quantifier, en temps réel, la fiabilité réelle de ces...

Amériques

Europe

Asie / Océanie

Afrique

GPU Engineer