GPU Performance Engineer
2 months ago
**About the team**:
**Adaptive is helping companies build singular generative AI experiences by democratizing the use of reinforcement learning**. We are building the foundational technologies, tools, and products required for models to learn directly from users' interactions and for models to self-critique and self-improve from simple written guidelines. Our tightly-knit team was previously involved in the creation of state-of-the-art open-access large language models such as Falcon-180B. We have closed a $20M seed with Index & ICONIQ, and are looking forward to shipping a first version of our platform, Adaptive Engine, in early 2024.
Our Technical Staff is responsible for building the foundational technology powering Adaptive, in line with requests and requirements identified by our Product and Commercial Staff. We strive to build excellent, robust, and efficient technology, and to conduct at-scale, honest research with high impact on our roadmap and customers.
**About the role**:
As a GPU Performance Engineer in our Technical Staff, **you will help ensure that our LLM stack (Adaptive Harmony) delivers state-of-the-art performance across a wide variety of settings**, from latency-bound regimes where serving requests with sub-second response times is key, to throughput-bound regimes during training and offline inference. **You will help build the foundational technology powering Adaptive** by delivering performance improvements directly to our clients as well as to our internal workloads.
Some examples of tasks you will encounter during your work:
- Profile and iterate on GPU inference kernels in Triton or CUDA, identifying memory bottlenecks and optimizing latency, and decide how to adequately benchmark an inference service;
- Systematically identify and eliminate synchronization points between the CPU and GPU, enabling asynchronous communication of results from Python workers to our Rust backend;
- Work with quantization methods to minimize the memory footprint of our models;
- Modify existing kernel implementations to support requested features, and efficiently implement novel operations entirely from scratch.
We are looking for self-driven, intense individuals who value **technical excellence, honesty, and growth**.
**Your responsibilities**:
Generally,
- **Contribute to our product roadmap**, by identifying promising trends that can improve performance;
- Report clearly on your work to a distributed collaborative team, with a **bias for asynchronous written communication**.
On the engineering side,
- **Write high-quality software in CUDA and/or Triton**, with a focus on performance and robustness;
- **Profile dedicated GPU kernels in CUDA or Triton**, optimizing across latency/compute-bound regimes for complex workloads.
**Your (ideal) background**:
- **An M.Sc./Ph.D. in computer science, or demonstrated experience in software engineering**, preferably with a focus on GPU optimization;
- **Strong programming skills**, preferably with a focus on systems and general-purpose GPU programming;
- **Contributions to relevant open-source projects**, such as CUTLASS, Triton and MLIR;
- **A track record of writing high-performance kernels**, preferably with a demonstrated ability to reach state-of-the-art performance on well-defined tasks;
- **A passion for the future of generative AI**, and eagerness to build foundational technology to help machines deliver more singular experiences.
**Benefits**:
- Comprehensive medical (health, dental, and vision) insurance;
- 401(k) plan with 4% matching (or equivalent);
- Unlimited PTO — we strongly encourage at least 5 weeks each year;
- Mental health, wellness, and personal development stipends;
- Visa sponsorship if you wish to relocate to New York or Paris.
-
Senior High Performance Computing Software Engineer
1 month ago
Paris, France Zendar Full-time **Who We Are**: Zendar is creating a high-resolution radar imaging system that has resolution similar to lidar, allowing cars to see in inclement weather. We want to change the role of radar in the autonomous driving sensor stack and demonstrate that radars can take on many of the functions of lidar at a much lower cost and in all weather conditions....
-
Senior Software Engineer
1 month ago
Paris, France Coders Connect Full-time Coders Connect is thrilled to be partnering with an EMEA leader in decision-making AI products for the Enterprise, with headquarters in London, and offices in Paris, Berlin, Tunis, Lagos, Dubai, Cape Town and the USA. The company has been named among the Top 100 global AI startups for three consecutive years by CB Insights, as well as one of the 100 most...
-
Research Engineer
3 days ago
Paris, France Meta Full-time **Research Engineer Responsibilities**: - Define use cases and develop methodology and benchmarks to evaluate different approaches - Push the boundaries of the capabilities of code generation models beyond the current state of the art **Minimum Qualifications**: - BS, MS or Ph.D. degree in Computer Science or related quantitative field - Industry...
-
Senior Software Engineer
3 weeks ago
Paris, France NextTech Recruitment Full-time Senior Computer Graphics Engineer Full-time Location: Paris office and remote (hybrid) The opportunity to join a fast-growing startup in the field of Computer Graphics and AI working on cutting edge technologies like NeRF, reconstruction and rendering for imaging and vision. Required skills for the Senior Computer Graphics Engineer: As a Senior Computer...
-
High Performance Computing Project Officer and
1 month ago
Paris, France Kicklox Full-time **The offer**: **Sectors** Aeronautics **Tasks to be carried out** Your main tasks are: 1/ Provide technical support to the department's "numerical" units in developing their High Performance Computing capabilities: contribute to porting software to new machine architectures & to their...
-
Senior Software Engineer
3 weeks ago
Paris, France NextTech Recruitment Full-time Senior Computer Graphics Engineer Full-time Location: Paris office and remote (hybrid) The opportunity to join a fast-growing startup in the field of Computer Graphics and AI working on cutting edge technologies like NeRF, reconstruction and rendering for imaging and vision. Required skills for the Senior Computer Graphics Engineer: As a Senior Computer Graphics...
-
Senior Software Engineer
3 weeks ago
Paris, Ile-de-France NextTech Recruitment Full-time Senior Computer Graphics Engineer Full-time Location: Paris office and remote (hybrid) The opportunity to join a fast-growing startup in the field of Computer Graphics and AI working on cutting edge technologies like NeRF, reconstruction and rendering for imaging and vision. Required skills for the Senior Computer Graphics Engineer: As a Senior Computer Graphics...
-
Generative AI Engineer
4 days ago
Paris, France Devoteam Full-time Devoteam is a leader in digital strategy consulting, technology platforms, and cybersecurity. By combining technology, creativity, and data, Devoteam supports its clients in the digital transformation of their business to unlock their full potential. With 10,000 employees in Europe, the Middle East, and Africa, Devoteam...
-
Generative AI Engineer
4 days ago
Paris, Ile-de-France Devoteam Full-time Devoteam is a leader in digital strategy consulting, technology platforms, and cybersecurity. By combining technology, creativity, and data, Devoteam supports its clients in the digital transformation of their business to unlock their full potential. With 10,000 employees in Europe, the Middle East, and Africa, Devoteam...
-
Research Engineer
4 days ago
Paris, France Meta Full-time **Research Engineer - FAIR Responsibilities**: - Optimize, profile, and improve large language models for research and for deployment - Define use cases and develop methodology and benchmarks to evaluate different approaches - Push the boundaries of the capabilities of code generation models beyond the current state of the art **Minimum...
-
Software Research Engineer
1 month ago
Paris, France Zendar Full-time Zendar is hiring a software engineer with a strong mathematical background. **About Zendar**: Zendar is creating a high-resolution radar imaging system that has resolution similar to lidar, allowing cars to see in inclement weather. We are a team of electrical, mechanical, software engineers and researchers developing the next generation radar technology....
-
Senior ML Software Engineer
2 weeks ago
Paris, France InstaDeep Full-time Our research team publishes advanced research on reinforcement learning in top AI conferences such as NeurIPS and collaborates with world-leading researchers and companies. From Q2 2022, InstaDeep is expanding in the United States. With this aim, InstaDeep is looking for an ML Software/DevOps Engineer to support the development of its US activities. In...
-
Performance Engineer
1 week ago
Paris, Île-de-France COGNIZANT FRANCE Full-time What does the role involve? We are looking for a performance testing and monitoring engineer. You will join our team in Paris, which provides quality and assurance services for a major French industrial client. The Qe&A team, which you will be part of, is responsible in particular for application performance activities. Mission -...
-
Software Engineer, ML Frameworks
1 week ago
Paris, France InstaDeep Full-time Our research team publishes advanced research on reinforcement learning in top AI conferences such as NeurIPS and collaborates with world-leading researchers and companies. In this role at InstaDeep, you will report to the Research Lead. You will design and create the algorithms capable of learning and making predictions that define machine...
-
Software Engineer, ML Frameworks
1 week ago
Paris, Ile-de-France InstaDeep Full-time Our research team publishes advanced research on reinforcement learning in top AI conferences such as NeurIPS and collaborates with world-leading researchers and companies. In this role at InstaDeep, you will report to the Research Lead. You will design and create the algorithms capable of learning and making predictions that define machine...
-
Senior Machine Learning Software Engineer
1 week ago
Paris, France Inara Full-time Senior ML Software Engineer Location: Hybrid / Paris Salary: 75-85,000 Euros Sector: AI Products & Solutions NO SPONSORSHIP AVAILABLE, SORRY This is a leading AI product and solutions company who operate globally and work at the forefront of technology, across software, cloud and AI. They partner with well known companies and vendors meaning you get to work on...
-
Senior Machine Learning Software Engineer
1 week ago
Paris, Ile-de-France Inara Full-time Senior ML Software Engineer Location: Hybrid / Paris Salary: 75-85,000 Euros Sector: AI Products & Solutions NO SPONSORSHIP AVAILABLE, SORRY This is a leading AI product and solutions company who operate globally and work at the forefront of technology, across software, cloud and AI. They partner with well known companies and vendors meaning you get to work on...