Gpu Performance Engineer

Il y a 2 mois


Paris, France Adaptive ML Temps plein

**About the team**:
**Adaptive is helping companies build singular generative AI experiences by democratizing the use of reinforcement learning**. We are building the foundational technologies, tools, and products required for models to learn directly from users' interactions and for models to self-critique and self-improve from simple written guidelines. Our tightly-knit team was previously involved in the creation of state-of-the-art open-access large language models such as Falcon-180B. We have closed a $20M seed with Index & ICONIQ, and are looking forward to shipping a first version of our platform, Adaptive Engine, in early 2024.

Our Technical Staff is responsible for building the foundational technology powering Adaptive, in line with requests and requirements identified by our Product and Commercial Staff. We strive to build excellent, robust, and efficient technology, and to conduct at-scale, honest research with high-impact for our roadmap and customers.

**About the role**:
As a GPU Performance Engineer in our Technical Staff, **you will help ensure that our LLM stack (**Adaptive Harmony**) delivers state of the art performance across a wide variety of settings**; including in latency-bound regimes where serving requests with sub-second response times is key, to throughput-bound regimes during training and offline inference. **You will help build the foundational technology powering Adaptive** by delivering performance improvements directly to our clients as well as to our internal workloads.

Some examples of tasks you will encounter during your work:

- Profile and iterate GPU inference kernels in Triton or CUDA, identifying memory bottlenecks and optimizing latency—and decide how to adequately benchmark an inference service;
- Systematically identify and eliminate synchronization points between the CPU and GPU, enabling asynchronous communication of results from Python workers to our Rust backend;
- Work with quantization methods to minimize the memory footprint of our models;
- Modify existing implementation of kernels to support requested features, and efficiently implement novel operations entirely from scratch.

We are looking for self-driven, intense individuals, who value **technical excellency, honesty, and growth**.

**Your responsibilities**:
Generally,
- ** Contribute to our product roadmap**, by identifying promising trends that can improve performance;
- Report clearly on your work to a distributed collaborative team, with a **bias for asynchronous written communication**.

On the engineering side,
- ** Write high-quality software in CUDA and/or Triton**, with a focus on performance and robustness;
- ** Profile dedicated GPU kernels in CUDA or Triton**, optimizing across latency/compute-bound regimes for complex workloads.

**Your (ideal) background**:

- ** A **M.Sc**./Ph.D. in computer science, or demonstrated experience in software engineering**, preferably with a focus on GPU-optimization;
- ** Strong programming skills**, preferably with a focus on systems and general purpose GPU programming;
- ** Contributions to relevant open-source projects**, such as CUTLASS, Triton and MLIR;
- ** A track record of writing high performance kernels, **preferably demonstrated ability to reach state of the art performance on well defined tasks;
- ** Passionate about the future of generative AI, **and eager to build foundational technology to help machines deliver more singular experiences.

**Benefits**:

- Comprehensive medical (health, dental, and vision) insurance;
- 401(k) plan with 4% matching (or equivalent);
- Unlimited PTO — we strongly encourage at least 5 weeks each year;
- Mental health, wellness, and personal development stipends;
- Visa sponsorship if you wish to relocate to New York or Paris.



  • Paris, France Zendar Temps plein

    **Who We Are**: Zendar is creating a high-resolution radar imaging system that has resolution similar to lidar, allowing cars to see in inclement weather. We want to change the role of radar in the autonomous driving sensor stack and demonstrate that radars can take on many of the functions of lidar at a much lower cost and in all weather conditions....

  • Senior Software Engineer

    il y a 1 mois


    Paris, France Coders Connect Temps plein

    Coders Connect is thrilled to be partnering with an EMEA leader in decision-making AI products for the Enterprise, with headquarters in London, and offices in Paris, Berlin, Tunis, Lagos, Dubai, Cape Town and the USA. The company has been named among the Top 100 global AI startups for three consecutive years by CB Insights, as well as one of the 100 most...

  • Research Engineer

    il y a 3 jours


    Paris, France Meta Temps plein

    **Research Engineer Responsibilities**: - Define use cases and develop methodology and benchmarks to evaluate different approaches - Push the boundaries of the capabilities of code generation models beyond the current state of the art **Minimum Qualifications**: - BS, MS or Ph.D. degree in Computer Science or related quantitative field - Industry...

  • Senior Software Engineer

    il y a 3 semaines


    Paris, France NextTech Recruitment Temps plein

    Senior Computer Graphics Engineer Full-time Location: Paris office and remote (hybrid) The opportunity to join a fast-growing startup in the field of Computer Graphics and AI working on cutting edge technologies like NeRF, reconstruction and rendering for imaging and vision. Required skills for the Senior Computer Graphics Engineer: As a Senior Computer...


  • Paris, France Kicklox Temps plein

    **L'offre**: **Secteurs** Aéronautique **Missions à réaliser** Vos missions principales sont : 1/ Apporter un soutien technique aux unités « numérique » du département dans le développement de leurs capacités dans le domaine du Calcul Haute Performance : Contribuer au portage des logiciels sur les nouvelles architectures des machines & à leur...

  • Senior Software Engineer

    il y a 3 semaines


    Paris, France NextTech Recruitment Temps plein

    Senior Computer Graphics EngineerFull-timeLocation: Paris office and remote (hybrid)The opportunity to join a fast-growing startup in the field of Computer Graphics and AI working on cutting edge technologies like NeRF, reconstruction and rendering for imaging and vision.Required skills for the Senior Computer Graphics Engineer:As a Senior Computer Graphics...

  • Senior Software Engineer

    il y a 3 semaines


    Paris, France NextTech Recruitment Temps plein

    Senior Computer Graphics EngineerFull-timeLocation: Paris office and remote (hybrid)The opportunity to join a fast-growing startup in the field of Computer Graphics and AI working on cutting edge technologies like NeRF, reconstruction and rendering for imaging and vision.Required skills for the Senior Computer Graphics Engineer:As a Senior Computer Graphics...

  • Senior Software Engineer

    il y a 3 semaines


    Paris, Ile-de-France NextTech Recruitment Temps plein

    Senior Computer Graphics EngineerFull-timeLocation: Paris office and remote (hybrid)The opportunity to join a fast-growing startup in the field of Computer Graphics and AI working on cutting edge technologies like NeRF, reconstruction and rendering for imaging and vision.Required skills for the Senior Computer Graphics Engineer:As a Senior Computer Graphics...

  • Generative AI Engineer

    il y a 4 jours


    Paris, France Devoteam Temps plein

    Devoteam est un leader du conseil en stratégie numérique, plateformes technologiques et cybersécurité. En alliant technologie, créativité et data, Devoteam accompagne ses clients dans la transformation numérique de leur activité afin de libérer leur plein potentiel. Avec 10 000 collaborateurs en Europe, au Moyen-Orient et oken Afrique, Devoteam...

  • Generative AI Engineer

    il y a 4 jours


    Paris, France Devoteam Temps plein

    Devoteam est un leader du conseil en stratégie numérique, plateformes technologiques et cybersécurité. En alliant technologie, créativité et data, Devoteam accompagne ses clients dans la transformation numérique de leur activité afin de libérer leur plein potentiel. Avec 10 000 collaborateurs en Europe, au Moyen-Orient et oken Afrique, Devoteam...

  • Generative AI Engineer

    il y a 4 jours


    Paris, Ile-de-France Devoteam Temps plein

    Devoteam est un leader du conseil en stratégie numérique, plateformes technologiques et cybersécurité. En alliant technologie, créativité et data, Devoteam accompagne ses clients dans la transformation numérique de leur activité afin de libérer leur plein potentiel. Avec 10 000 collaborateurs en Europe, au Moyen-Orient et oken Afrique, Devoteam...

  • Research Engineer

    il y a 4 jours


    Paris, France Meta Temps plein

    **Research Engineer - FAIR Responsibilities**: - Optimize, profile, and improve large language models for research and for deployment - Define use cases and develop methodology and benchmarks to evaluate different approaches - Push the boundaries of the capabilities of code generation models beyond the current state of the art **Minimum...


  • Paris, France Zendar Temps plein

    Zendar is hiring a software engineer with a strong mathematical background. **About Zendar**: Zendar is creating a high-resolution radar imaging system that has resolution similar to lidar, allowing cars to see in inclement weather. We are a team of electrical, mechanical, software engineers and researchers developing the next generation radar technology....

  • Senior Ml Software Engineer

    il y a 2 semaines


    Paris, France InstaDeep Temps plein

    Our research team publishes advanced research on reinforcement learning in top AI conferences such as NeurIPS and collaborates with world-leading researchers and companies. From Q2 2022, InstaDeep is expanding in the United States. With this aim, InstaDeep is looking for a ML Software/DevOps Engineer to support the development of its US activities. J In...

  • Performance Engineer

    il y a 1 semaine


    Paris, Île-de-France COGNIZANT FRANCE Temps plein

    Quelles sont les missions ?ous recherchons un.e Ingénieur en tests de performance et monitoring. Vous rejoignez notre équipe à Paris qui fournit des services de qualité et d'assurance pour un grand compte français de l'industrie. L'équipe Qe&A, dont vous ferez partie, est responsable des activités de performance des applications notamment. Mission -...


  • Paris, France InstaDeep Temps plein

    Our research team publishes advanced research on reinforcement learning in top AI conferences such as NeurIPS and collaborates with world-leading researchers and companies..In this role at InstaDeep, you will report to the Research Lead. You will design and create the algorithms capable of learning and making predictions that define machine...


  • Paris, France InstaDeep Temps plein

    Our research team publishes advanced research on reinforcement learning in top AI conferences such as NeurIPS and collaborates with world-leading researchers and companies..In this role at InstaDeep, you will report to the Research Lead. You will design and create the algorithms capable of learning and making predictions that define machine...


  • Paris, Ile-de-France InstaDeep Temps plein

    Our research team publishes advanced research on reinforcement learning in top AI conferences such as NeurIPS and collaborates with world-leading researchers and companies..In this role at InstaDeep, you will report to the Research Lead. You will design and create the algorithms capable of learning and making predictions that define machine...


  • Paris, France Inara Temps plein

    Senior ML Software EngineerLocation: Hybrid / ParisSalary: 75-85,000 Euros Sector: AI Products & SolutionsNO SPONSORSHIP AVAILABLE, SORRYThis is a leading AI product and solutions company who operate globally and work at the forefront of technology, across software, cloud and AI. They partner with well known companies and vendors meaning you get to work on...


  • Paris, Ile-de-France Inara Temps plein

    Senior ML Software EngineerLocation: Hybrid / ParisSalary: 75-85,000 Euros Sector: AI Products & SolutionsNO SPONSORSHIP AVAILABLE, SORRYThis is a leading AI product and solutions company who operate globally and work at the forefront of technology, across software, cloud and AI. They partner with well known companies and vendors meaning you get to work on...