Site Reliability Engineer
il y a 6 jours
What You'll Do:
At Criteo, our Platform Core group builds the foundational services that power our global advertising platform. We design and operate scalable, resilient systems that support real-time decision-making and data processing at massive scale.
As we expand our capabilities in high-performance inference and distributed computing, we're forming a new team focused on GPU-powered services and cutting-edge ML serving technologies.
What You'll Do
As a Site Reliability Engineer in this new team, you'll be at the forefront of building and operating GPU-powered services for machine learning workloads.
Your mission will be to ensure the reliability, scalability, and performance of our systems that leverage:
- Ray: You'll manage on-demand provisioning of Ray clusters on Kubernetes, enabling scalable distributed computing as a service for ML training and inference. You'll design, maintain, and monitor these ray-as-a-service systems, and deliver these capabilities as robust, self-service platform offerings.
- Nvidia Triton Inference Server: You'll optimize and operate high-performance inference services using Triton, ensuring low-latency and high-throughput serving of deep learning models.
You'll work closely with ML engineers, data scientists, and other infrastructure teams to deliver production-grade services that accelerate innovation across Criteo.
Who You Are:
- Master's or PhD in Computer Science (or equivalent experience).
- 5+ years in backend engineering, SRE or DevOps.
- Strong experience with Kubernetes, especially in dynamic provisioning and custom operators.
- Hands-on experience with GPU workloads, ideally in ML training or inference contexts.
- Solid programming skills in C#, Python, Go, or similar languages.
- Passion for automation, observability, and building reliable services.
Bonus Points
- Familiarity with Ray or other distributed computing frameworks.
- Knowledge of Nvidia Triton, TensorRT, or similar inference serving technologies.
- Familiarity with cloud-native GPU orchestration (e.g., GKE, EKS, or on-prem equivalents).
We acknowledge that many candidates may not meet every single role requirement listed above. If your experience looks a little different from our requirements but you believe that you can still bring value to the role, we'd love to see your application
Who We Are:
Criteo is a leader in commerce media, helping brands, agencies, and publishers create meaningful consumer connections through AI-powered advertising solutions. We're shaping a more open and sustainable digital future for advertising.
At Criteo, our culture is as unique as it is diverse. From our offices across the globe or from the comfort of home, our 3,600 Criteos collaborate together to build an open, impactful, and forward-thinking environment.
We foster a workplace where everyone is valued, and employment decisions are based solely on skills, qualifications, and business needs—never on non-job-related factors or legally protected characteristics.
What We Offer:
Ways of working – Our hybrid model blends home with in-office experiences, making space for both.
Grow with us – Learning, mentorship & career development programs.
Your wellbeing matters – Health benefits, wellness perks & mental health support.
A team that cares – Diverse, inclusive, and globally connected.
Fair pay & perks – Attractive salary, with performance-based rewards and family-friendly policies, plus the potential for equity depending on role and level.
Additional benefits may vary depending on the country where you work and the nature of your employment with Criteo.
-
Site Reliability Engineer
il y a 5 jours
Paris, Île-de-France OVHcloud Temps pleinSite Reliability Engineer - Network Observability H/F/NAu sein de votre équipe #OneTeamVous rejoindrez l'équipe Network Observability, en charge de la conception des produits d'observability pour une infrastructure composée de plus de serveurs, 5 millions d'adresses IP publiques et équipements réseau ; le maintien en condition opérationnel et...
-
Site Reliability Engineer
il y a 6 jours
Paris, Île-de-France Blackfluo Temps pleinJob DescriptionLocation: Full remote, EU timezone (CET +/- 2 hours)Start Date: As soon as possibleLanguages: English requiredWe are looking for a skilled Site Reliability Engineer (SRE) with deep expertise in AWS to help us scale and secure our infrastructure. As an SRE, you will be instrumental in ensuring the reliability, performance, and scalability of...
-
Senior Site Reliability Engineer
il y a 5 jours
Paris, Île-de-France Swile Temps pleinAt Swile, we believe that good products can help reduce friction in daily professional life and boost employee satisfaction. Today, we provide innovative solutions in various areas such as Fintech, Travel, HR, and Employee Benefits to more than 5.5 million users in 85,000 companies in France and Brazil. Your role as a Senior Site Reliability Engineer (SRE)...
-
Site Reliability Engineer
il y a 5 jours
Paris, Île-de-France Mistral Ai Temps pleinAbout Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed...
-
Site Reliability Engineer
il y a 2 semaines
Paris, Île-de-France Criteo Temps pleinWhat You'll Do:About the TeamThe Platform Core group at Criteo is composed of seven agile, human-sized teams providing the foundational platform and systems powering all Criteo products.Within this group, the Analytics Infrastructure team builds and operates the distributed, multi-datacenter analytic data stores and platforms enabling interactive querying,...
-
Site Reliability Engineer
il y a 1 semaine
Paris, Île-de-France Welcome to the Jungle France Temps pleinAs our Site Reliability Engineer you are responsible forimplementing and maintaining scalable infrastructure and systems that ensure the reliability,performance, and security of our production environments.This hands-on position bridges the gap between development and operations, applying software engineering principles to infrastructure and operational...
-
site reliability engineer
il y a 6 jours
Paris, Île-de-France STATION F Temps pleinAboutAt Welcome to the Jungle, we believe working is good. But thriving with the right people is better. We provide a suite of tools, content, and experiences that make recruitment more transparent, authentic, and human.We help companies build their recruitment strategy by sharing their story through employer branding, enabling them to attract, engage, and...
-
Lead Site Reliability Engineer
il y a 13 heures
Paris, Île-de-France Mistral AI Temps pleinAbout Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is...
-
Lead Site Reliability Engineer
il y a 3 heures
Paris, Île-de-France Mistral Ai Temps pleinAbout Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed...
-
Site Reliability Engineer
il y a 7 jours
Paris, Île-de-France AKUR8 Temps pleinAkur8 is a young, dynamic, fast growing Insurtech scale-up that is transforming insurance pricing and reserving with transparent machine learning.Our SaaS platform leverages the power of transparent machine learning and predictive analytics to inject game-changing speed, performance and reliability into insurers' pricing and reserving processes.Powered by...