Site Reliability Engineer, IaaS

il y a 2 semaines


Paris, Île-de-France Algolia Temps plein

Algolia is set to enable every company to create world-class Search and Discovery experiences with an API-first approach. Performance and Scalability is at the heart of our mission: we power trillion searches a year, for 10K+ customers all over the world.

If you're a problem solver, able to think outside the box and eager to nurture others and learn from them, then this is your challenge

The Team

The Infrastructure as a Service (IaaS) team aims at upholding the reliability and scalability we expect from Algolia's infrastructure for its critical systems and products. Our focus is on enabling teams across Algolia to leverage this infrastructure while keeping it under control through an always increasing level of automation.

The Opportunity

The Site Reliability Engineer position within the Infrastructure As a Service team provides a dynamic opportunity for a professional with foundational experience in maintaining and optimizing scalable infrastructures. This role specifically concentrates on three key areas: server and container hosting, cloud and network expertise and flawless observability.

As a member of the Infrastructure As a Service team, you will play a key role in supporting the reliability and scalability of Algolia's Search products and core internal services. Your responsibilities will include operating components or features, ensuring proper monitoring and alerting are in place, and assisting in the transition from legacy systems. You will work on planning and accountability for the next quarter, demonstrating independence in problem-solving and minimal reliance on managers and senior team members.

Your role will consist of:

Kubernetes and Cloud Services Management : Help maintain and optimize a fleet of Kubernetes-based architectures and cloud services, enhancing fault tolerance and resource utilization. System Management and Configuration : Continuously improve and refine the infrastructure code and automation that manage our Fleet of several thousand servers, keeping it safe, efficient and reliable. Maintain and Extend our Control Plane : Go beyond our current control plane and turn it into a platform that everyone at Algolia can leverage to build performant, reliable and scalable products. Observability Implementation : Support the development and deployment of observability solutions, providing your team and others with actionable insights to track and enhance system reliability. Collaboration and Problem Solving : Work collaboratively with team members to identify and solve problems, reducing dependence on senior staff for guidance. Process Improvement : Contribute to establishing engineering processes and best practices to ensure high-quality, reliable, and scalable systems.

You might be a fit if you have:

Programming Skills : Basic to intermediate knowledge of programming languages such as Python, Ruby or Golang, with an understanding of software craftsmanship. Experience with Linux and Kubernetes : Experience in setting up and managing fleets of Linux servers and Kubernetes-based architectures. Knowledge of Distributed Systems : Exposure to operating distributed systems and understanding their challenges at a basic level. Public Cloud Experience : Familiarity with public cloud providers such as Microsoft Azure, AWS, or GCP. Problem-Solving Skills : Ability to independently identify and solve problems, demonstrating initiative and minimal reliance on senior team members. Communication and Organization Skills : Strong communication and organizational skills to effectively collaborate with team members and stakeholders. 3 years or more of related work experience.

We're looking for someone who can live our values:

GRIT - Problem-solving and perseverance capability in an ever-changing and growing environment TRUST - Willingness to trust our co-workers and to take ownership CANDOR - Ability to receive and give constructive feedback. CARE - Genuine care about other team members, our clients and the decisions we make in the company. HUMILITY- Aptitude for learning from others, putting ego aside.

#LI-Remote

REMOTE STRATEGY:

Algolia's flexible workplace model is designed to empower all Algolians to fulfill our mission to power search and discovery with ease. We place an emphasis on an individual's impact, contribution, and output, over their physical location. Algolia is a high-trust environment and our team members have the autonomy to choose where they want to work and when. We know community comes in many forms and strive to create opportunities for intentional in-person connection in our offices and virtually for our remote colleagues around the world.

We have a global presence with physical offices in San Francisco, NYC, Paris, London, Sydney and Bucharest.



  • Paris, Île-de-France Ibanfirst Sa Temps plein

    Site Reliability Engineer Collaborate with fellow Site Reliability Engineers, Incident Manager, and Software Engineers to troubleshoot incidents and implement measures to enhance our services continuously. Bonus Points: Experience with technologies such as Go, Java, PHP, ReactJS, and Python. Infrastructure: Working knowledge of Proxmox VE, Linux...

  • Lead Site Reliability Engineer

    il y a 2 semaines


    Paris, Île-de-France Aurélie Bagot ( Consultante recrutement indépendante) Temps plein

    Je recherche pour un client start-up en plein essort un **Lead Site Reliability Engineer **dont les principales missions seront de:- Concevoir les architectures de qualité pour nos clients- Accompagner les juniors SRE dans leur quotidien et les faire monter en compétences- Garantir le delivery : avoir un plan pour réussir le projetVoici quelques exemples...

  • Site Reliability Engineer

    il y a 4 semaines


    Paris, Île-de-France Criteo Temps plein

    What You'll Do:Our Network Edge team plays a pivotal role by establishing essential connectivity among our data centers and Internet/Cloud providers. We engineer and manage the backbone network that guarantees smooth data transmission, enabling our platform to thrive globally.Responsibilities: Contribute to enhancing the reliability, performance, and...


  • Paris, Île-de-France Criteo Temps plein

    What You'll Do:Our Network Edge team plays a pivotal role by establishing essential connectivity among our data centers and Internet/Cloud providers. We engineer and manage the backbone network that guarantees smooth data transmission, enabling our platform to thrive globally.Responsibilities: Contribute to enhancing the reliability, performance, and...

  • Site Reliability Engineer

    il y a 2 semaines


    Paris, Île-de-France Symaps Temps plein

    The company is fully remote-enabled, allowing the 160 "BotBusters" spread around the world (and they are actively recruiting) Made up of six subteams (Dashboard, Engine, Infrastructure, Integrations, Threat Research & Security), the DataDome tech team is spread across Europe and the US. We are present in more than 25 data centers around the world, deployed...


  • Paris, Île-de-France Adobe Temps plein

    Join Adobe Stock Team as a Site Reliability Engineer (SRE)At Adobe, we are on a mission to revolutionize digital experiences. We equip individuals and businesses with the tools needed to create exceptional digital experiences, whether you are a budding artist or a global brand. Our focus is on enabling people to produce stunning images, videos, and apps,...

  • Site Reliability Engineer

    il y a 2 semaines


    Paris, Île-de-France Algolia Temps plein

    Algolia is set to enable every company to create world-class Search and Discovery experiences with an API-first approach.Performance and Scalability is at the heart of our mission: we power 1.5 trillion searches a year, for 10K+ customers all over the world.If you're a problem solver, able to think outside the box and eager to nurture others and learn from...

  • Site Reliability Engineer

    il y a 2 semaines


    Paris, Île-de-France DataDome Temps plein

    Made up of six subteams (Dashboard, Engine, Infrastructure, Integrations, Threat Research & Security), the DataDome tech team is spread across Europe and the US. We handle over 2 000 billion events per day giving responses within 3ms (99p). We are present in more than 25 data centers around the world, deployed using Docker. We deploy on AWS, Scaleway,...

  • Site Reliability Engineer Sre

    il y a 2 semaines


    Paris 01 Louvre, Île-de-France Havana IT & Apps Temps plein

    Contexte :Dans le cadre de notre expansion et pour répondre aux besoins croissants de nos clients en termes de fiabilité et de performance des systèmes, nous cherchons à intégrer un Site Reliability Engineer (SRE) possédant une expertise approfondie en Python.Ce professionnel aura pour mission de concevoir, développer et optimiser des infrastructures...

  • Site Reliability Engineer

    il y a 2 semaines


    Paris 10 Entrepôt, Île-de-France UNLCK Temps plein

    Vous êtes à la recherche d'une nouvelle expérience dans une entreprise qui évolue dans environnement international ?**Contexte**Créée en 2014, cette entreprise est basée en plein cœur de Paris. Elle accompagne les entreprises du e-commerce de plus de 40 pays dans l'analyse de leurs données. Elle propose un outil d'aide à la décision afin de mieux...

  • Site Reliability Engineer

    il y a 4 semaines


    Paris, Île-de-France Adobe Temps plein

    Our CompanyChanging the world through digital experiences is what Adobe's all about. We give everyone—from emerging artists to global brands—everything they need to design and deliver exceptional digital experiences We're passionate about empowering people to create beautiful and powerful images, videos, and apps, and transform how companies interact...

  • Site Reliability Engineer

    il y a 2 semaines


    Paris, Île-de-France KatchMe Temps plein

    Site Reliability Engineer (SRE) Contexte Blockchain & NFT Startup à taille humaine (20p+) Phase de scaling coming Société Start-up à la pointe de la technologie spécialisée dans le domaine des bots notamment sur Discord. Elle aide les propriétaires de plus de 4 millions de communautés actives mensuellement à créer, gérer et développer leurs...

  • Site Reliability Engineer

    il y a 2 semaines


    Paris, Île-de-France Sekoia SAS Temps plein

    rethinks cybersecurity to make it ever more relevant, effective and accessible. One of the main challenges we address is to constantly analyze and understand emerging threats in order to define appropriate strategies and have the capacity to execute them on a large scale. By combining technology and a multi-disciplinary team of Threat Intelligence...


  • Paris, Île-de-France VISIAN Temps plein

    Le SRE / Site Reliability Engineer est un profil visant à garantir l'agilité nécessaire aux contributeurs d'un projet, tout en garantissant la fiabilité et la stabilité des produits. Il agit donc comme élément principal de la mise en place des piliers du reliability engineering : Définition et suivi des mesures de fiabilité. Automatisation des...


  • Paris, Île-de-France VISIAN Temps plein

    Le SRE / Site Reliability Engineer est un profil visant à garantir l'agilité nécessaire aux contributeurs d'un projet, tout en garantissant la fiabilité et la stabilité des produits. Il agit donc comme élément principal de la mise en place des piliers du reliability engineering : Définition et suivi des mesures de fiabilité. Automatisation des...


  • Paris, Île-de-France Ledger Enterprise Temps plein

    We're making the world of digital assets accessible and secure for everyone.Founded in 2014, Ledger is the global platform for digital assets and Web3. Headquartered in Paris and Vierzon, with offices in UK, US, Switzerland and Singapore, Ledger has a team of more than 700 professionals developing a variety of products and services to enable individuals and...

  • Lead Site Reliability Engineer

    il y a 4 semaines


    Paris, Île-de-France AB Tasty Temps plein

    AB Tasty is a global leader in AI-powered experience optimization solutions empowering brands using personalization, experimentation, recommendations, and search to build better experiences on their websites and apps. Integrated into a single platform, AB Tasty offers web and API-based solutions that provide companies with a unified approach to creating...

  • Lead Site Reliability Engineer

    il y a 4 semaines


    Paris, Île-de-France AB Tasty Temps plein

    AB Tasty is a global leader in AI-powered experience optimization solutions empowering brands using personalization, experimentation, recommendations, and search to build better experiences on their websites and apps. Integrated into a single platform, AB Tasty offers web and API-based solutions that provide companies with a unified approach to creating...


  • Paris, Île-de-France AB Tasty Temps plein

    AB Tasty is a global leader in AI-powered experience optimization solutions empowering brands using personalization, experimentation, recommendations, and search to build better experiences on their websites and apps. Integrated into a single platform, AB Tasty offers web and API-based solutions that provide companies with a unified approach to creating...


  • Paris, Île-de-France AB Tasty Temps plein

    AB Tasty is a global leader in AI-powered experience optimization solutions empowering brands using personalization, experimentation, recommendations, and search to build better experiences on their websites and apps. Integrated into a single platform, AB Tasty offers web and API-based solutions that provide companies with a unified approach to creating...