Manager, Site Reliability Engineering
il y a 3 jours
Algolia is set to enable every company to create world-class Search and Discovery experiences with an API-first approach. Performance and Scalability is at the heart of our mission: we power 1.5 trillion searches a year, for 10K+ customers all over the world.As a Site Reliability Engineering Manager in the Production Engineering team of Algolia, you will lead the Fleet team of Site Reliability Engineers responsible for the provisioning and the global reliability of the Search Products at scale.Your team will focus on creating pragmatic solutions to optimize the Search Products availability and costs at scale, depending on the needs of the customer, the Product teams, and the different engineering teams that deliver a unique Search Experience to our customers.You will manage a team of experienced Individual Contributors who are responsible for:Operating and scaling the entire Search fleet, ensuring global performance and reliability.Reducing and maintaining the level of incidents through actionable KPIs and well-defined SLOs, while coaching and delegating Tier 3 support responsibilities.Running and continuously improving our in-house Edge Load Balancer.Building, operating, and enhancing a robust backup and restore system to ensure compliance with our SLAs.FinOps responsibilities, including monitoring infrastructure costs at scale and identifying optimization opportunities.YOUR ROLE WILL CONSIST OF:Collaborating with senior leadership to define the overall technical direction and strategy for the organization, and ensure that the SRE team's goals and initiatives are aligned with this strategy.Building and maintaining strong relationships with stakeholders across the organization, as you represent the SRE organization in cross-functional meetings.You will also stay close to product and design teams to ensure that the user experience is always top of mind.Providing leadership, guidance and mentorship to your team members, helping them to develop their technical skills and knowledge of best practices in site reliability engineering.Establishing and enforcing engineering processes and best practices that ensure high-quality, reliable, and scalable systems.You will be responsible for defining and maintaining service level agreements (SLAs) and key performance indicators (KPIs) for your team's services.Helping design and implement monitoring, alerting, and metrics systems to ensure the availability, performance, and reliability of your team's services.Collaborating with other technical teams to identify opportunities to automate processes.Managing the budget for your team, ensuring that resources are being used efficiently.Documenting your team's projects and processes, and ensuring that this documentation is up-to-date and accessible to all stakeholders.YOU MIGHT BE A FIT IF YOU HAVE:4+ years of engineering management experienceYou are fluent in Agile methodology and can lead a project from idea to productionYou are an excellent communicator, collaborating with Product managers, Technical Program Managers, and Individual ContributorsYou are comfortable managing a large team of various seniority levelsYou know how to deploy an application from laptop to production and are comfortable with production requirementsYou are knowledgeable in DevOps principles and CI/CD pipelinesYou are knowledgeable in Configuration Management and Infrastructure as Code such as Chef and TerraformYou are knowledgeable in at least one programming language (Python, Golang, Ruby) and are familiar with software craftsmanshipFull professional English proficiencyAbility to make decisions and take ownership for themWE'RE LOOKING FOR SOMEONE WHO CAN LIVE OUR VALUES:GRIT - Problem-solving and perseverance capability in an ever-changing and growing environment.TRUST - Willingness to trust our co-workers and to take ownership.CANDOR - Ability to receive and give constructive feedback.CARE - Genuine care about other team members, our clients and the decisions we make in the company.HUMILITY - Aptitude for learning from others, putting ego aside.FLEXIBLE WORKPLACE STRATEGY:Algolia’s flexible workplace model is designed to empower all Algolians to fulfill our mission to power search and discovery with ease. We emphasize an individual’s impact, contribution, and output over their physical location. Algolia is a high-trust environment, and many of our team members have the autonomy to choose where they want to work and when.While we have a global presence with physical offices in Paris, NYC, London, Sydney, and Bucharest, we also offer many of our team members the option to work remotely either as fully remote or hybrid-remote employees. Please note that positions listed as 'Remote' are only available for remote work within the specified country.ABOUT US:Algolia prides itself on being a pioneer and market leader offering an AI Search solution that empowers 17,000+ businesses to compose customer experiences at internet scale that predict what their users want with blazing fast search and web browse experience.Algolia is part of a cadre of innovative new companies that are driving the next generation of software development, creating APIs that make developers’ lives easier.In 2021, the company closed $150 million in series D funding and quadrupled its post-money valuation of $2.25 billion.WHO WE'RE LOOKING FOR:We’re looking for talented, passionate people to build the world’s best search & discovery technology. As an ownership-driven company, we seek team members who thrive within an environment based on autonomy and diversity.READY TO APPLY?If you share our values and our enthusiasm for building the world’s best search & discovery technology, we’d love to review your application #J-18808-Ljbffr
-
Site Reliability Engineering Manager
il y a 2 semaines
Paris, France Canonical Temps pleinJoin to apply for the Site Reliability Engineering Manager role at CanonicalCanonical is a leading provider of open-source software and operating systems for global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation and IoT. Our...
-
Site Reliability Engineer
il y a 3 jours
Paris, France Scaleway Temps plein1 day ago Be among the first 25 applicantsWHY WE NEED YOU ?Our growth is driving us to strengthen our Engineering Enablers team to ensure the high reliability, performance, and scalability and to support and scale our production environments.WHY WE NEED YOU ?Our growth is driving us to strengthen our Engineering Enablers team to ensure the high reliability,...
-
Engineering Manager
il y a 5 heures
Paris, France Doctolib Temps pleinWe are looking for an Engineering Manager to join the OREO (Observability Reliability Engineering Obsession) team in Platform Engineering. As an Engineering Manager, your mission will be to lead the Reliability & Observability team and drive the evolution of Doctolib’s observability platform, supporting the exponential growth of Doctolib services while...
-
Site Reliability Engineer
il y a 3 jours
Paris, France Blackfluo.ai Temps pleinAbout the job Site Reliability Engineer (SRE)Job DescriptionLocation: Full remote, EU timezone (CET +/- 2 hours)Start Date: As soon as possibleLanguages: English requiredWe are looking for a skilled Site Reliability Engineer (SRE) with deep expertise in AWS to help us scale and secure our infrastructure. As an SRE, you will be instrumental in ensuring the...
-
Site Reliability Engineer
il y a 3 jours
Paris, France EdTechFrance Temps plein# Site Reliability Engineer* Paris* Full-Time* We help companies build their recruitment strategy by sharing their story through employer branding, enabling them to attract, engage, and retain talent who share their values.* We guide candidates to their future teams through immersive job listings and support them throughout their job search with a...
-
Site Reliability Engineer
il y a 4 jours
Paris, Île-de-France Blackfluo Temps pleinJob DescriptionLocation: Full remote, EU timezone (CET +/- 2 hours)Start Date: As soon as possibleLanguages: English requiredWe are looking for a skilled Site Reliability Engineer (SRE) with deep expertise in AWS to help us scale and secure our infrastructure. As an SRE, you will be instrumental in ensuring the reliability, performance, and scalability of...
-
Site Reliability Engineer
il y a 3 jours
Paris, France RAISE France Temps plein# Site Reliability Engineer* Paris* Full-Time* We help companies build their recruitment strategy by sharing their story through employer branding, enabling them to attract, engage, and retain talent who share their values.* We guide candidates to their future teams through immersive job listings and support them throughout their job search with a...
-
SITE RELIABILITY ENGINEER
il y a 3 jours
Paris, France STATION F Temps plein1 day ago Be among the first 25 applicantsGet AI-powered advice on this job and more exclusive features.AboutAt Welcome to the Jungle, we believe working is good. But thriving with the right people is better. We provide a suite of tools, content, and experiences that make recruitment more transparent, authentic, and human.We help companies build their...
-
Site Reliability Engineer
il y a 3 jours
Paris, France Mistral AI Temps pleinOverviewJoin to apply for the Site Reliability Engineer role at Mistral AI1 week ago Be among the first 25 applicantsJoin to apply for the Site Reliability Engineer role at Mistral AIGet AI-powered advice on this job and more exclusive features. About MistralAt Mistral AI, we believe in the power of AI to simplify tasks, save time, and enhance learning and...
-
Senior Director of Site Reliability Engineering
il y a 4 jours
Paris, France Dataiku Temps pleinHeadquartered in New York City, Dataiku was founded in Paris in 2013 and achieved unicorn status in 2019. Now, more than 1,000+ employees work across the globe in our offices and remotely. Backed by a renowned set of investors and partners including CapitalG, Tiger Global, and ICONIQ Growth, we've set out to build the future of AI. We are seeking a highly...