Site Reliability Engineer

il y a 6 jours

Rue Isabelle Eberhardt Montpellier France France Sweep Temps plein

Sweep is hiring a Site Reliability Engineer (SRE), to join our SRE & infrastructure team and help us ensure the reliability, scalability, and performance of our systems.

This role is ideal for someone with solid DevOps background, expertise in AWS and high-traffic systems, and a commitment to fostering a collaborative, inclusive environment within our SRE guild and broader engineering culture.

Climate change is the defining issue of our time. By empowering companies with technology that helps them manage their climate impact, we believe Sweep can make a meaningful contribution to a better future for all of us.

Ok, sounds promising. What will I be doing?

As a key player in our Engineering team, you will collaborate with engineering teams to design and implement cutting-edge, automated infrastructure to support both our core platform and AI-driven solutions.

1. Contribute to the team's ownership of technical infrastructure

Design, implement, and maintain highly available, scalable, and secure cloud infrastructure for the Sweep Data platform and AI workloads using Infrastructure as Code practices.
Improve and expand our observability strategy, working with Datadog to enhance metrics, dashboards, and alerting across our Rails application and AI workloads.
Develop scalable infrastructure to support the machine learning model training, deployment, and monitoring.
Participate in incident response and post-mortem reviews as part of the Oak team

2. Support critical scaling initiatives

Support critical infrastructure scaling projects.
Contribute to high-traffic systems design.
Help establish team processes including runbooks, workflows, and documentation.

3. Facilitate collaboration

Work closely with engineers who have elevated infrastructure privileges within our DevOps culture.
Collaborate within the SRE guild and contribute to best practices across the engineering team.
Collaborate with AI/ML teams to optimize and scale AI/ML pipelines and workloads.

4. Manage day-to-day operations

Manage day-to-day operations including on-call duties, capacity planning, and proactive system health monitoring.
Implement security measures and data protection protocols.
Support enterprise customer security requirements including BYOK implementation and data sovereignty compliance.
Maintain and contribute to Sweep strong level of compliance including SOC 2 Type 2, ISO 27001 and more.

5. Continuously improve and learn

Use a proactive approach to problem-solving and a commitment to building fault-tolerant systems.
Stay up-to-date with the latest industry trends and technologies to ensure we're always building on solid foundations.

That sounds just right for me. What do I need to bring?

Glad you asked. This is who we're looking for:

Qualifications

Engineering degree in computer science or 3+ years of DevOps/SRE experience, with strong candidates at 5+ years preferred
Good knowledge of AWS (including ECS/Fargate), Docker, Terraform, PostgreSQL at scale (experience with sharding, clustering, or high-volume scenarios preferred)
Datadog expertise strongly preferred
Experience with continuous integration and continuous deployment
Experience with high-traffic, multi-tenant systems and database scaling strategies
Knowledge and experience in data modeling, database design, and data management
Strong operational mindset with experience in day-to-day production operations
Experience with on-call rotations and production incident management
Experience improving observability and monitoring systems
Understanding of clean code and clean infrastructure practices
You speak English fluently, French is a plus

Technical bonuses

Ruby on Rails experience is a plus
Snowflake experience is a plus
Change Data Capture and data pipeline experience is valuable
Familiarity with high-traffic systems
ARC (Actions Runner Controller) and Kubernetes is a plus

Qualities

Autonomous and self-structured
Willing to imagine and implement processes to ease developers' lives
Passionate about solving problems and developing solutions
A team player who values collaboration and feedback

Copy that. And what's in it for me?

By joining Sweep, you'll be part of an exciting startup with a vision to change the world. We're ready to hit the ground running, and joining us at this early stage allows you the unique opportunity to help shape our journey.

Our flexible work model allows you to balance personal and professional commitments while staying connected with your global colleagues. Even though our hubs are in Paris, London, and Montpellier, we're committed to fostering a connected and engaged remote work culture.

As a B Corporation, we're dedicated to creating successful businesses that benefit everyone, including society and the planet.

Ready for the most exciting chapter of your career? Come join us on this extraordinary ride

Site Reliability Engineer

il y a 2 semaines

Rue des Minimes, Grenoble, France Hyperweb Temps plein

Nous recherchons un Site Reliability Engineer expérimenté, capable de garantir la disponibilité, la performance et la résilience de la plateforme.Votre rôle est d'appliquer une approche ingénierie à l'exploitation : automatiser au maximum, réduire le toil, améliorer la fiabilité du système et accompagner les équipes produit dans un delivery sûr...
Remote Site Reliability Engineer — Cloud Infra

il y a 2 semaines

France Overstory Temps plein

A mission-driven technology company based in France seeks a Site Reliability Engineer to enhance GCP infrastructure and DevOps practices. The role involves designing cloud systems, building automation tools, and championing observability within the organization. Ideal candidates have strong cloud management skills and a proactive, collaborative mindset....
Site Reliability Engineer

il y a 1 semaine

France Kiln Temps plein

Full time - Paris/London hybrid or Remote from EU/UKKiln is now part of the prestigious French Government program #FT120 from La French Tech As a Site Reliability Engineer - Platform at Kiln, you'll join our Infrastructure Team to build robust and scalable cloud infrastructure. You'll collaborate with Protocol, Smart Contract and Software Engineering teams...
Site Reliability Engineer

il y a 2 semaines

Montpellier, France SWEEP Temps plein

Join to apply for the Site Reliability Engineer role at SWEEP6 days ago Be among the first 25 applicantsGet AI-powered advice on this job and more exclusive features.This range is provided by SWEEP. Your actual pay will be based on your skills and experience — talk with your recruiter to learn more.OverviewSweep is hiring a Site Reliability Engineer (SRE),...
Site Reliability Engineer

il y a 1 semaine

France Kiln Temps plein

Full time - Paris/London hybrid or Remote from EU/UKKiln is now part of the prestigious French Government program #FT120 from La French Tech As a Site Reliability Engineer - Staking at Kiln, you'll join our Infrastructure team to build the future of Kiln Validators.You will collaborate closely with Product, GTM and other Engineering teams to support the...
DevOps / Site Reliability Engineer

il y a 1 semaine

Rue du Sentier, Paris, France Zama Temps plein

Zama recently unveiled the Zama Confidential Blockchain Protocol, which enables confidential smart contracts on top of any blockchain L1 or L2 using Fully Homomorphic Encryption (FHE). The Blockchain division is working on our managed relayer that lets internal teams and external developers use the open-source Zama Protocol through simple, authenticated...
Site Reliability Engineer GCP

il y a 2 jours

Rue du Faubourg Montmartre, Paris, France Supervizor Temps plein

Supervizor est en pleine transformation de son infrastructure avec une migration stratégique vers Google Cloud Platform. Notre équipe Ops pilote actuellement la migration de nos workloads depuis Azure vers GCP, et nous construisons parallèlement une plateforme cloud-native moderne.Nous recherchons un·e Site Reliability Engineer en Fully Remote,...
Site Reliability Engineer

il y a 2 semaines

Rue de Grenelle, Paris, France Molotov Temps plein

As a member of the SRE / DevOps team, composed of French and international engineers, you'll play a key role in designing, operating, and scaling Molotov's infrastructure across our web platform, connected devices, and Smart TVs.You'll be part of a broader technical community at Molotov, collaborating closely with developers, product teams, and engineers...
Stage - Site Reliability Engineer (F/H)

il y a 3 jours

Rue d'Alsace-Lorraine, Toulouse, France OpenAirlines Temps plein

Contexte du stage :Nous recherchons un stagiaire SRE (Site Reliability Engineer) motivé et curieux techniquement pour nous aider à étudier, concevoir et mettre en œuvre une première itération d'une plateforme interne pour développeurs (Internal Developer Platform – IDP).Ce stage est une excellente opportunité d'acquérir une expérience pratique en...
Site Reliability Engineer

il y a 1 jour

Montpellier, France Synopsys Temps plein

Synopsys is looking for a motivated individual to work in our Silicon Lifecycle Analytics Operations team. As Site Reliability Engineer (SRE), you will play a key role in designing and developing programmatic infrastructure and automation to build, maintain and improve the Synopsys Silicon Lifecycle Management (SLM) solutions. **Responsibilities**: -...

Amériques

Europe

Asie / Océanie

Afrique

Site Reliability Engineer