Site Reliability Engineer

il y a 9 heures


Paris, Île-de-France OVHcloud Temps plein

Site Reliability Engineer - AI Core H/F/N H/F/N

Au sein de votre équipe #OneTeam

  • Vous rejoindrez l'équipe pluri-disciplinaire AI Core responsable du développement des produits d'intelligence artificielle d'OVHcloud et de leur continuité de service..
  • Dans le cadre des produits IA, vous maintiendrez et accompagnerez les évolutions de infrastructure pour l'intégration de nouveaux matériels, les évolutions de la plateforme ainsi que le perfectionnement de nos méthodes de déploiement.
  • En tant que Site Reliability Engineer, vous contribuerez à l'évolution des produits existants en termes de fonctionnalités, de stabilité et de performances pour créer une expérience IA complète sur l'infrastructure d'OVHcloud.

Vos principales responsabilités

  • Gestion de l'infrastructure basée sur Kubernetes et GPU
  • Contribuer à l'évolution de l'infrastructure en lien avec l'état de l'art
  • Opérez la plate-forme sous-jacente 24h / 24 et 7j / 7 dans plusieurs centres de données dans le monde
  • Contribuer à la vision OVHcloud AI et à la feuille de route de l'équipe

Votre futur impact

Dans 6 mois

  • Co-développer des évolutions du backend de le plateforme IA.
  • Participer au maintien en opération de la plateforme en jours ouvrés.

Et dans 1 an

  • Contribution active aux produits IA existants pour améliorer l'expérience client.
  • Participations au développement logiciel du control-plane IA.
  • Participation à la rotation des astreintes 24/7.

_

Compétences requises :

  • Expérience en administration ou ingénierie système.
  • Maîtrise Kubernetes / Docker et la philosophie Cloud Native.
  • Expérience sur des sujets de CI/CD.
  • Connaissances en infrastructure-as-code, en particulier Terraform et Ansible.

C'est un +

  • Intérêt particulier pour le domaine des données et de l'IA.
  • Appétence et/ou compétence dans le développement, en particulier en Golang.

  • Site Reliability Engineer

    il y a 8 heures


    Paris, Île-de-France Blackfluo Temps plein

    Job DescriptionLocation: Full remote, EU timezone (CET +/- 2 hours)Start Date: As soon as possibleLanguages: English requiredWe are looking for a skilled Site Reliability Engineer (SRE) with deep expertise in AWS to help us scale and secure our infrastructure. As an SRE, you will be instrumental in ensuring the reliability, performance, and scalability of...

  • Site Reliability Engineer

    il y a 2 jours


    Paris, Île-de-France Mistral Ai Temps plein

    About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed...

  • Site Reliability Engineer

    il y a 8 heures


    Paris, Île-de-France OVHcloud Temps plein

    Site Reliability Engineer - Network Observability H/F/NAu sein de votre équipe #OneTeamVous rejoindrez l'équipe Network Observability, en charge de la conception des produits d'observability pour une infrastructure composée de plus de serveurs, 5 millions d'adresses IP publiques et équipements réseau ; le maintien en condition opérationnel et...

  • Site Reliability Engineer

    il y a 1 semaine


    Paris, Île-de-France Criteo Temps plein

    What You'll Do:About the TeamThe Platform Core group at Criteo is composed of seven agile, human-sized teams providing the foundational platform and systems powering all Criteo products.Within this group, the Analytics Infrastructure team builds and operates the distributed, multi-datacenter analytic data stores and platforms enabling interactive querying,...

  • Site Reliability Engineer II

    il y a 2 semaines


    Paris, Île-de-France Doctolib Temps plein

    What We DoDoctolib's Engineering environment is rich and we are building innovative products and features aiming each day to ease doctors' and patient life. We are looking for aSite Reliability Engineer IIto keep Doctolib production systems running smoothly. You will also be a key-player to support the exponential growth of Doctolib services.What You Will...

  • Site Reliability Engineer II

    il y a 1 semaine


    Paris, Île-de-France Doctolib Temps plein

    What we do Doctolib's Engineering environment is rich and we are building innovative products and features aiming each day to ease doctors' and patient life. We are looking for a Site Reliability Engineer II to keep Doctolib production systems running smoothly. You will also be a key-player to support the exponential growth of Doctolib services. ...

  • Site Reliability Engineer

    il y a 4 jours


    Paris, Île-de-France Welcome to the Jungle France Temps plein

    As our Site Reliability Engineer you are responsible forimplementing and maintaining scalable infrastructure and systems that ensure the reliability,performance, and security of our production environments.This hands-on position bridges the gap between development and operations, applying software engineering principles to infrastructure and operational...

  • Site Reliability Engineer

    il y a 1 semaine


    Paris, Île-de-France Criteo Temps plein

    What You'll Do: About the TeamThe Platform Core group at Criteo is composed of seven agile, human-sized teams providing the foundational platform and systems powering all Criteo products.Within this group, the Analytics Infrastructure team builds and operates the distributed, multi-datacenter analytic data stores and platforms enabling interactive...

  • Site Reliability Engineer

    il y a 8 heures


    Paris, Île-de-France Welcome to the Jungle France Temps plein

    As our Site Reliability Engineer you are responsible forimplementing and maintaining scalable infrastructure and systems that ensure the reliability,performance, and security of our production environments.This hands-on position bridges the gap between development and operations, applying software engineering principles to infrastructure and operational...

  • site reliability engineer

    il y a 7 heures


    Paris, Île-de-France STATION F Temps plein

    AboutAt Welcome to the Jungle, we believe working is good. But thriving with the right people is better. We provide a suite of tools, content, and experiences that make recruitment more transparent, authentic, and human.We help companies build their recruitment strategy by sharing their story through employer branding, enabling them to attract, engage, and...