Manager - Site Reliability Engineering

il y a 1 semaine


Paris, France Algolia Temps plein

As Manager Site Reliability Engineer in Production Engineering team of Algolia, you will lead the PaaS (Platform as a Service) team of Site Reliability Engineers responsible for ensuring the reliability, availability, and scalability of multiple services which have an impact on all Algolia’s products.

Your team will focus on Engineering Productivity, providing a Dev Experience and Toolings to increase the velocity of the whole organization.

You will be supported by experienced Individual Contributors to automate and secure services such as:

- CI/CD for multiple products and environments
- Observability (alerting, monitoring, log management)
- Hosting Services (Cloud and Kubernetes based)
- Data Services
- Identity Services

**YOUR ROLE WILL CONSIST OF**:

- Collaborating with senior leadership to define the overall technical direction and strategy for the organization, and ensure that the SRE team's goals and initiatives are aligned with this strategy.
- As well as building and maintaining strong relationships with stakeholders across the organization, as you represent the SRE organization in cross-functional meetings.
- You also stay close to product and design teams to ensure that the user experience is always top of mind.
- You are expected to provide leadership, guidance and mentorship to your team members, helping them to develop their technical skills and knowledge of best practices in site reliability engineering. You continuously evaluate and improve the performance of the SRE team, and you identify and implement initiatives to drive operational excellence and improve overall service reliability.
- Establishing and enforcing engineering processes and best practices that ensure high-quality, reliable, and scalable systems, as well as working with other teams to promote the adoption of these processes and practices across the organization.
- You will be responsible for defining and maintaining service level agreements (SLAs) and key performance indicators (KPIs) for your team's services, and you work with other teams to ensure that these SLAs and KPIs are being met. As well as leading cross-functional efforts to resolve complex technical issues and mitigate operational risks across multiple teams and domains.
- Along with your team you will help design and implement monitoring, alerting, and metrics systems to ensure the availability, performance, and reliability of your team's services, and you continuously refine and improve these systems.
- Collaborating with other technical teams to identify opportunities to automate processes, and design and implement automated tools and systems to support these processes.
- As manager, you also manage the budget for your team, ensuring that resources are being used effectively and efficiently.
- Finally, you are responsible for documenting your team's projects and processes, and you ensure that this documentation is up-to-date and accessible to all stakeholders.

**YOU MIGHT BE A FIT IF**:

- _4+ years of engineering management experience_
- _You are fluent in Agile methodology and can lead a project from the idea to Production_
- _You are comfortable managing a large team regrouping all seniority levels, and accompanying Individual Contributors in their growth and development_
- _You are knowledgeable in DevOps principles, CI/CD pipelines, Kubernetes (Administration and Utilisation)_
- _You are knowledgeable in Infrastructure as Code such as Terraform deployed to multiple cloud environments_
- _You are knowledgeable of at least one programming language (Python, Golang, Ruby.)_
- _Full professional English proficiency_
- _Ability to make decisions and take ownership for them_

**WE'RE LOOKING FOR SOMEONE WHO CAN LIVE OUR VALUES**:
GRIT - Problem-solving and perseverance capability in an ever-changing and growing environment.

TRUST - Willingness to trust our co-workers and to take ownership.

CANDOR - Ability to receive and give constructive feedback.

CARE - Genuine care about other team members, our clients and the decisions we make in the company.

HUMILITY- Aptitude for learning from others, putting ego aside.

LI-Hybrid #LI-Remote

REMOTE STRATEGY:
Algolia’s workplace strategy, **Hybrid Remote**, is designed to harness the power of the opportunities that remote work offers both employees and the company, while also providing an engaging in-office experience for the times when an employee is in an office. Our workplace approach reflects the belief that an employee’s impact, contribution, and output are more important than their physical location.

The majority of employees will be able to choose if, and when, they come into an office on a regular basis. There will be times when our people are asked to come into an office for “moments that matter:” activities like critical planning meetings and team social gatherings. Beyond those events, 80% of our workforce may choose the location from where they work in the country in which they were hired.

A



  • Paris, France Algolia Temps plein

    Algolia is set to enable every company to create world-class Search and Discovery experiences with an API-first approach. Performance and Scalability is at the heart of our mission: we power 1.5 trillion searches a year, for 10K+ customers all over the world. As Manager Site Reliability Engineer in the Production Engineering team of Algolia, **you will lead...


  • Paris, France Broadridge Financial Solutions Temps plein

    A leading financial services firm in Paris is seeking a Manager for Site Reliability Engineering. This pivotal role involves leading a team across multiple countries to ensure system reliability, performance, and scalability. Responsibilities include managing SRE practices, overseeing technical strategy, and collaborating with stakeholders to drive...


  • Paris, France Canonical Temps plein

    Join to apply for the Site Reliability Engineering Manager role at CanonicalCanonical is a leading provider of open-source software and operating systems for global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. Our...


  • Paris, France Broadridge Financial Solutions Temps plein

    At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you’re passionate about developing your career, while helping others along the way, come join the Broadridge team.## **Role Overview**Broadridge Trading & Connectivity Solutions (BTCS) is seeking a highly skilled **Manager, Site Reliability Engineering**...


  • Paris, France LunaLogic Temps plein

    SRE (Site Reliability Engineering) Confirmé Contexte Notre client est une banque d'investissement internationale de premier plan. Profil recherché **Profil SRE confirmé (3 à 7 ans d'expérience)** - Connaissance d'au moins un langage de scripting tel que Python (pas forcément Python) - Bonne connaissance de l'environnement Linux - Connaissance de GIT...

  • Site Reliability Engineer

    il y a 18 heures


    Paris, France Scaleway Temps plein

    1 day ago Be among the first 25 applicantsWHY WE NEED YOU ?Our growth is driving us to strengthen our Engineering Enablers team to ensure the high reliability, performance, and scalability and to support and scale our production environments.WHY WE NEED YOU ?Our growth is driving us to strengthen our Engineering Enablers team to ensure the high reliability,...


  • Paris, Île-de-France Broadridge Temps plein

    At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewBroadridge Trading & Connectivity Solutions (BTCS) is seeking a highly skilled Manager, Site Reliability Engineering to lead and...


  • Paris, Île-de-France Broadridge Trading & Connectivity Solutions Temps plein

    At Broadridge, we've built a culture where the highest goal is to empower others to accomplish more. If you're passionate about developing your career, while helping others along the way, come join the Broadridge team.Role OverviewBroadridge Trading & Connectivity Solutions (BTCS) is seeking a highly skilled Manager, Site Reliability Engineering to lead and...

  • Site Reliability Engineer

    il y a 17 heures


    Paris, France Blackfluo.ai Temps plein

    About the job Site Reliability Engineer (SRE)Job DescriptionLocation: Full remote, EU timezone (CET +/- 2 hours)Start Date: As soon as possibleLanguages: English requiredWe are looking for a skilled Site Reliability Engineer (SRE) with deep expertise in AWS to help us scale and secure our infrastructure. As an SRE, you will be instrumental in ensuring the...

  • Site Reliability Engineer

    il y a 20 heures


    Paris, France EdTechFrance Temps plein

    # Site Reliability Engineer* Paris* Full-Time* We help companies build their recruitment strategy by sharing their story through employer branding, enabling them to attract, engage, and retain talent who share their values.* We guide candidates to their future teams through immersive job listings and support them throughout their job search with a...