Site Reliability Engineer

1 day ago


Lille, France Groupe iliad Full-time

The position

Founded in 1999, Scaleway is the cloud subsidiary of the Iliad group, one of Europe's leading telecommunications companies. Our mission is to foster a more responsible digital industry by helping developers and businesses build, deploy, and scale applications on any infrastructure.

From our offices in Paris and Lille, we refine the Scaleway cloud ecosystem every day, and we are its first users.

Our roughly 25,000 customers choose us for our multi-AZ redundancy, our smooth user experience, our carbon-neutral datacenters, and our native tools for managing multi-cloud architectures. Our products include fully managed solutions for bare metal, containerization, and serverless architectures, offering a responsible choice in cloud computing.

Join our dynamic team of nearly 600 employees from diverse backgrounds, in a stimulating, international environment that combines technical excellence, creativity, and sharing.

About the job

Scaleway is looking for a Site Reliability Engineer to join our teams.

Reporting to a Lead SRE, you will be responsible for ensuring that we can reliably serve our products to users around the world. We expect you to have a strong background in development and system administration. Our systems evolve constantly, and the tools needed to observe them and act to ensure their resilience need to evolve accordingly.

Profile sought

• Previous experience as a developer in Go, Python or Rust
• Experience in system programming with the usual scripting languages (bash, Python)
• Demonstrated ability to troubleshoot production system failures
• A great attitude and a desire to work with a team
• Passion for incremental improvements to tooling and a love of all things automation
• Experience with Linux systems (Ubuntu / Debian)
• Experience with cloud environment architecture (bare metal, virtual machines, containers, orchestrators)
• Good understanding of computer networks: TCP/IP, DNS, load balancing, IPv6, BGP and network virtualisation
• Understanding of written and spoken English, the ability to write technical documentation in English, and the ability to speak English if needed
• Experience with infrastructure as code and continuous deployment
• Experience dealing with physical hardware automation
• Experience with monitoring and logging systems
• Experience administering relational databases
• Knowledge of one cloud platform and related use cases
• Take the initiative to propose new solutions and defend them
• Team player, willing to share knowledge and opinions and to participate in regular team rituals
• Good communication and coaching skills

• Create or optimize tools and documentation that help identify, diagnose and remediate production incidents, automating as much as possible
• Troubleshoot high-impact issues, working with multiple engineering teams
• Take on-call responsibilities, mitigate issues encountered in production and secure the best real-time answer for our customers
• Ensure a high quality of service for our customers by leveraging observability and monitoring technologies
• Manage the lifecycle of products in production
• Help implement best practices in stability, resiliency, scalability, security and performance across our systems

• Python, Go, Rust
• RabbitMQ
• PostgreSQL
• HAProxy, Nginx, REST APIs / Flask
• S3 API
• Sentry, Prometheus, Grafana, Elasticsearch, Fluentd, Kibana
• Ansible, AWX, Foreman, Salt
• GitLab, Nexus
• Ubuntu, Debian, CentOS
• Jira, Confluence, Slack, GSuite

Location

This position is based in our offices in Paris or Lille (France).

If you don't see yourself ticking every box, don't hesitate to apply anyway. Don't limit yourself to a job description; you never know.


  • Site Reliability Engineer

    2 weeks ago


    Lille, Hauts-de-France Scaleway Full-time

    OUR STORY: Join Scaleway and shape the sovereign cloud of tomorrow. Since 1999, we have been designing secure, sustainable infrastructures aimed at supporting the most ambitious companies. Historically known for our dedicated servers (Dedibox), we made a strategic shift to cloud computing in 2015. Staying true to our principles of simplicity, flexibility, and...

  • Site Reliability Engineer

    4 hours ago


    Lille, Hauts-de-France Scaleway Full-time

    OUR DNA: Join Scaleway to build the European sovereign cloud. Founded in 1999, Scaleway is the cloud subsidiary of the Iliad group, one of Europe's leading telecommunications companies. Our mission? To bring about a more responsible digital industry by helping developers and businesses build, deploy, and scale their...

  • Site Reliability Engineer

    1 week ago


    Paris / Bordeaux / Lille / Lyon / Toulouse / Rennes / Rouen, France Scaleway Full-time

    OUR STORY: Join Scaleway and shape the sovereign cloud of tomorrow. Since 1999, we have been designing secure, sustainable infrastructures aimed at supporting the most ambitious companies. Historically known for our dedicated servers (Dedibox), we made a strategic shift to cloud computing in 2015. Staying true to our principles of simplicity, flexibility,...


  • Lille, France Scaleway Full-time

    A leading cloud infrastructure provider in Lille is looking for a Site Reliability Engineer to enhance the reliability, performance, and scalability of its storage platforms. The role involves developing automation tools, maintaining CI/CD pipelines, and collaborating with diverse teams. Candidates should have strong experience in Infrastructure as Code,...


  • Lille, France ESENCA Full-time

    We are looking for someone to take on an Ops / Site Reliability Engineer (SRE) assignment and round out the Operations team of the Global Tech & Data Platform. Your experience with cloud technical architectures, security, networking, and CI/CD practices will enable you to: - develop solutions to design, build, and operate...

  • Site Reliability Engineer

    1 week ago


    Paris / Lille / Toulouse / Bordeaux / Lyon, France Scaleway Full-time

    OUR STORY: Join Scaleway and shape the sovereign cloud of tomorrow. Since 1999, we have been designing secure, sustainable infrastructures aimed at supporting the most ambitious companies. Historically known for our dedicated servers (Dedibox), we made a strategic shift to cloud computing in 2015. Staying true to our principles of simplicity, flexibility,...


  • Lille, France Canonical Full-time

    Join to apply for the Site Reliability Engineering Manager role at Canonical. Canonical is a leading provider of open‑source software and operating systems for global enterprise and technology markets. Our platform, Ubuntu, is widely used in public cloud, data science, AI, engineering innovation and IoT. We support the world's leading public cloud and...


  • Lille, France Canonical Full-time

    A leading open-source software provider is seeking a Site Reliability Engineering Manager to lead a devops team in delivering quality managed services. This role involves mentoring engineers, coordinating projects, and representing the team to stakeholders. Applicants should have proven experience in infrastructure as code and managing devops teams. The...

  • Site Reliability Engineer

    2 weeks ago


    Paris / Bordeaux / Lille / Lyon / Rennes / Rouen / Toulouse, France Scaleway Full-time

    OUR STORY: Join Scaleway and shape the sovereign cloud of tomorrow. Since 1999, we have been designing secure, sustainable infrastructures aimed at supporting the most ambitious companies. Historically known for our dedicated servers (Dedibox), we made a strategic shift to cloud computing in 2015. Staying true to our principles of simplicity, flexibility,...

  • Site Reliability Engineer

    7 days ago


    Lille, Hauts-de-France Free-Work Full-time

    Assignment context: We are strengthening our Operations team and are looking for an SRE to support us on key topics: system reliability, incident management, and operations automation. Main responsibilities: Ensure the availability, performance, and scalability of the platform. Manage incidents end to end (analysis...