Site Reliability Engineer

il y a 1 semaine


Paris, France Algolia Temps plein

Algolia is set to enable every company to create world-class Search and Discovery experiences with an API-first approach. Performance and Scalability is at the heart of our mission: we power 1.5 trillion searches a year, for 10K+ customers all over the world.

If you're a problem solver, able to think outside the box and eager to nurture others and learn from them, then this is your challenge

**The Team**:
The Infrastructure as a Service (IaaS) team aims at upholding the reliability and scalability we expect from Algolia's infrastructure for its critical systems and products. Our focus is on enabling teams across Algolia to leverage this infrastructure while keeping it under control through an always increasing level of automation.

**The Opportunity**:
The Site Reliability Engineer position within the Infrastructure As a Service team provides a dynamic opportunity for a professional with foundational experience in maintaining and optimizing scalable infrastructures. This role specifically concentrates on three key areas: server and container hosting, cloud and network expertise and flawless observability.

As a member of the Infrastructure As a Service team, you will play a key role in supporting the reliability and scalability of Algolia's Search products and core internal services. Your responsibilities will include operating components or features, ensuring proper monitoring and alerting are in place, and assisting in the transition from legacy systems. You will work on planning and accountability for the next quarter, demonstrating independence in problem-solving and mínimal reliance on managers and senior team members.

**Your role will consist of**:

- **Kubernetes and Cloud Services Management**: Help maintain and optimize a fleet of Kubernetes-based architectures and cloud services, enhancing fault tolerance and resource utilization.
- **System Management and Configuration**: Continuously improve and refine the infrastructure code and automation that manage our Fleet of several thousand servers, keeping it safe, efficient and reliable.
- **Observability Implementation**: Support the development and deployment of observability solutions, providing your team and others with actionable insights to track and enhance system reliability.
- **Collaboration and Problem Solving**: Work collaboratively with team members to identify and solve problems, reducing dependence on senior staff for guidance.
- **Process Improvement**: Contribute to establishing engineering processes and best practices to ensure high-quality, reliable, and scalable systems.

**You might be a fit if you have**:

- **Programming Skills**: Basic to intermediate knowledge of programming languages such as Python, Ruby or Golang, with an understanding of software craftsmanship.
- **Experience with Linux and Kubernetes**: Experience in setting up and managing fleets of Linux servers and Kubernetes-based architectures.
- **Knowledge of Distributed Systems**: Exposure to operating distributed systems and understanding their challenges at a basic level.
- **Public Cloud Experience**: Familiarity with public cloud providers such as Microsoft Azure, AWS, or GCP.
- **Problem-Solving Skills**: Ability to independently identify and solve problems, demonstrating initiative and mínimal reliance on senior team members.
- **Communication and Organization Skills**: Strong communication and organizational skills to effectively collaborate with team members and stakeholders.

**We're looking for someone who can live our values**:

- GRIT - Problem-solving and perseverance capability in an ever-changing and growing environment
- TRUST - Willingness to trust our co-workers and to take ownership
- CANDOR - Ability to receive and give constructive feedback.
- CARE - Genuine care about other team members, our clients and the decisions we make in the company.
- HUMILITY- Aptitude for learning from others, putting ego aside.

REMOTE STRATEGY:
Algolia's flexible workplace model is designed to empower all Algolians to fulfill our mission to power search and discovery with ease. We place an emphasis on an individual's impact, contribution, and output, over their physical location. Algolia is a high-trust environment and our team members have the autonomy to choose where they want to work and when. We know community comes in many forms and strive to create opportunities for intentional in-person connection in our offices and virtually for our remote colleagues around the world.

We have a global presence with physical offices in San Francisco, NYC, Paris, London, Sydney and Bucharest.

ABOUT US:
Algolia prides itself on being a pioneer and market leader offering an AI Search solution that empowers 17,000+ businesses to compose customer experiences at internet scale that predict what their users want with blazing fast search and web browse experience. Algolia powers more than 30 billion search requests a week - four times more than Microsoft Bing, Yahoo, Baidu, Y


  • Site Reliability Engineer

    il y a 2 semaines


    Paris, France Welcome to the Jungle France Temps plein

    Site Reliability Engineer Join to apply for the Site Reliability Engineer role at Welcome to the Jungle France As our Site Reliability Engineer you are responsible for implementing and maintaining scalable infrastructure and systems that ensure the reliability, performance, and security of our production environments. This hands‑on position bridges the gap...

  • Site Reliability Engineer

    il y a 24 heures


    Paris, Île-de-France Mistral Ai Temps plein

    About Mistral At Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and creativity. Our technology is designed to integrate seamlessly into daily working life. We democratize AI through high-performance, optimized, open-source and cutting-edge models, products and solutions. Our comprehensive AI platform is designed...


  • Paris, France Groupe iliad Temps plein

    Une société de technologies recherche un Site Reliability Engineer pour renforcer la fiabilité de ses services cloud. Vous aurez l'opportunité de travailler sur des systèmes complexes, avec des responsabilités d'astreinte et de collaboration avec différentes équipes. Le poste nécessite des compétences en développement (Go, Python ou Rust) et une...

  • Site Reliability Engineer

    il y a 5 jours


    Paris, France OVHcloud Temps plein

    CDI - IT, Technologie & Produit - PARIS, FR, 75017CESSON-SEVIGNE, FR, 35510NANTES, FR, 44000BREST, FR, 29200 - Hybride PROCESS DE RECRUTEMENT **1. Échange dans les 2 à 4 semaines avec notre hiring officer**:Arthur **2. Entretien avec le manager**:Raphael **3. Rencontre possible avec l'équipe ou un pair** REJOINDRE L’AVENTURE OVHcloud OVHcloud...

  • Site Reliability Engineer

    il y a 6 jours


    Paris, Île-de-France Criteo Temps plein

    What You'll Do:About the TeamThe Platform Core group at Criteo is composed of seven agile, human-sized teams providing the foundational platform and systems powering all Criteo products.Within this group, the Analytics Infrastructure team builds and operates the distributed, multi-datacenter analytic data stores and platforms enabling interactive querying,...

  • Site Reliability Engineer II

    il y a 2 semaines


    Paris, Île-de-France Doctolib Temps plein

    What We DoDoctolib's Engineering environment is rich and we are building innovative products and features aiming each day to ease doctors' and patient life. We are looking for aSite Reliability Engineer IIto keep Doctolib production systems running smoothly. You will also be a key-player to support the exponential growth of Doctolib services.What You Will...


  • Paris, Île-de-France Doctolib Temps plein

    What we do Doctolib's Engineering environment is rich and we are building innovative products and features aiming each day to ease doctors' and patient life. We are looking for a Site Reliability Engineer II to keep Doctolib production systems running smoothly. You will also be a key-player to support the exponential growth of Doctolib services. ...

  • Site Reliability Engineer

    il y a 3 jours


    Paris, Île-de-France Welcome to the Jungle France Temps plein

    As our Site Reliability Engineer you are responsible forimplementing and maintaining scalable infrastructure and systems that ensure the reliability,performance, and security of our production environments.This hands-on position bridges the gap between development and operations, applying software engineering principles to infrastructure and operational...

  • Site Reliability Engineer

    il y a 1 semaine


    Paris, Île-de-France Criteo Temps plein

    What You'll Do: About the TeamThe Platform Core group at Criteo is composed of seven agile, human-sized teams providing the foundational platform and systems powering all Criteo products.Within this group, the Analytics Infrastructure team builds and operates the distributed, multi-datacenter analytic data stores and platforms enabling interactive...


  • Paris, France Doctolib Temps plein

    Join a team of passionate and hardworking entrepreneurs to transform healthcare! Working in the tech team at Doctolib involves building innovative products and features to improve the daily lives of care teams and patients. We work in feature teams in an agile environment, while collaborating with engineering, design, and business teams. Doctolib is...