Site Reliability Engineer

Il y a 15 minutes


Montpellier, France SWEEP Temps plein

OverviewSweep is hiring a Site Reliability Engineer (SRE), to join our SRE & infrastructure team and help us ensure the reliability, scalability, and performance of our systems.This role is ideal for someone with solid DevOps background, expertise in AWS and high-traffic systems, and a commitment to fostering a collaborative, inclusive environment within our SRE guild and broader engineering culture.Climate change is the defining issue of our time. By empowering companies with technology that helps them manage their climate impact, we believe Sweep can make a meaningful contribution to a better future for all of us.Ok, sounds promising. What will I be doing?ResponsibilitiesContribute to the team\'s ownership of technical infrastructure 🛠️Design, implement, and maintain highly available, scalable, and secure cloud infrastructure for the Sweep Data platform and AI workloads using Infrastructure as Code practices.Improve and expand our observability strategy, working with Datadog to enhance metrics, dashboards, and alerting across our Rails application and AI workloads.Develop scalable infrastructure to support the machine learning model training, deployment, and monitoring.Participate in incident response and post-mortem reviews as part of the Oak teamSupport critical scaling initiatives 📈Support critical infrastructure scaling projects.Contribute to high-traffic systems design.Help establish team processes including runbooks, workflows, and documentation.Facilitate collaboration 🤝Work closely with engineers who have elevated infrastructure privileges within our DevOps culture.Collaborate within the SRE guild and contribute to best practices across the engineering team.Collaborate with AI/ML teams to optimize and scale AI/ML pipelines and workloads.Manage day-to-day operations 🔧Manage day-to-day operations including on-call duties, capacity planning, and proactive system health monitoring.Implement security measures and data protection protocols.Support enterprise customer security requirements including BYOK implementation and data sovereignty compliance.Maintain and contribute to Sweep strong level of compliance including SOC 2 Type 2, ISO 27001 and more.Continuously improve and learn 🚀Use a proactive approach to problem-solving and a commitment to building fault-tolerant systems.Stay up-to-date with the latest industry trends and technologies to ensure we\'re always building on solid foundations.QualificationsEngineering degree in computer science or 3+ years of DevOps/SRE experience, with strong candidates at 5+ years preferredGood knowledge of AWS (including ECS/Fargate), Docker, Terraform, PostgreSQL at scale (experience with sharding, clustering, or high-volume scenarios preferred)Datadog expertise strongly preferredExperience with continuous integration and continuous deploymentExperience with high-traffic, multi-tenant systems and database scaling strategiesKnowledge and experience in data modeling, database design, and data managementStrong operational mindset with experience in day-to-day production operationsExperience with on-call rotations and production incident managementExperience improving observability and monitoring systemsUnderstanding of clean code and clean infrastructure practicesYou speak English fluently, French is a plusTechnical bonusesRuby on Rails experience is a plusSnowflake experience is a plusChange Data Capture and data pipeline experience is valuableFamiliarity with high-traffic systemsARC (Actions Runner Controller) and Kubernetes is a plusQualitiesAutonomous and self-structuredWilling to imagine and implement processes to ease developers\' livesPassionate about solving problems and developing solutionsA team player who values collaboration and feedbackWhat’s in it for you?By joining Sweep, you\'ll be part of an exciting startup with a vision to change the world. We\'re ready to hit the ground running, and joining us at this early stage allows you the unique opportunity to help shape our journey.Our flexible work model allows you to balance personal and professional commitments while staying connected with your global colleagues. Even though our hubs are in France, the UK and the US, we\'re committed to fostering a connected and engaged remote work culture.As a B Corporation, we\'re dedicated to creating successful businesses that benefit everyone, including society and the planet.Ready for the most exciting chapter of your career? Come join us on this extraordinary ride #J-18808-Ljbffr


  • Site Reliability Engineer

    il y a 1 semaine


    Montpellier, France Synopsys Temps plein

    Synopsys is looking for a motivated individual to work in our Silicon Lifecycle Analytics Operations team. As Site Reliability Engineer (SRE), you will play a key role in designing and developing programmatic infrastructure and automation to build, maintain and improve the Synopsys Silicon Lifecycle Management (SLM) solutions. **Responsibilities**: -...

  • Site Reliability Engineer

    il y a 2 semaines


    Rue Isabelle Eberhardt Montpellier, France, France Sweep Temps plein

    Sweep is hiring a Site Reliability Engineer (SRE), to join our SRE & infrastructure team and help us ensure the reliability, scalability, and performance of our systems. This role is ideal for someone with solid DevOps background, expertise in AWS and high-traffic systems, and a commitment to fostering a collaborative, inclusive environment within our SRE...


  • Montpellier, France Ikighia Temps plein

    Offre d'Emploi : CDI uniquementLieu : MontpellierTélétravail : 2 jours par semaineType de contrat : CDI statut cadre - convention collective SyntecDémarrage : ASAPExpérience : minimum 5 ansDans le cadre du renforcement de la résilience de son Système d'Information, notre client recherche un Site Reliability Engineer (SRE) pour intervenir sur des...


  • Montpellier, France GE Vernova Temps plein

    **Key Responsibilities** - Perform reliability modeling and statistical analysis of energy management and protection/control products to predict field performance and act as authority for reliability design decisions. - Develop and execute reliability test plans for hardware and software, including hardware-in-the-loop (HIL), environmental, and stress...


  • Montpellier, France Ikighia Temps plein

    Offre d'Emploi : CDI uniquementLieu : MontpellierTélétravail : Full remote - Présence en France requise Type de contrat : CDI statut cadre - convention collective SyntecDémarrage : ASAPExpérience : minimum 5 ansDans le cadre du renforcement de la résilience de son Système d'Information, notre client recherche un Site Reliability Engineer (SRE) pour...


  • Montpellier, France Alan Temps plein

    A tech-focused insurance company is seeking a Platform Engineer to support tech foundations and enable product crews. The role involves tackling impactful projects, ensuring infrastructure reliability, improving system observability, and mentoring other engineers. Candidates should have over 3 years of platform engineering experience, familiarity with AWS,...

  • Platform Engineer

    Il y a 12 minutes


    Montpellier, France Alan Temps plein

    Platform Engineer (x/f/m) - Tech Foundations Join Alan as a Platform Engineer to support our tech foundations and enable product crews. What you'll work on Infrastructure enablement for product crews (hosting improvements, CI/CD, scalability, multi‑cloud architecture) Security and compliance facilitation (authentication, encryption, threat protection)...

  • DevOps Engineer

    il y a 4 jours


    Montpellier, France CompuGroup Medical Temps plein

    Senior Engineer - DevOps Become ALL IN! for Health as a Senior Engineer DevOps (M/F/d) incl. remote We are expanding our European R&D and Product Management Center and look for creative minds and team players who understand agile development, love technical challenges, can work cross-functionally, and strive to grow professionally and contribute to a...

  • Senior DevOps Engineer

    il y a 1 semaine


    Montpellier, France CompuGroup Medical Temps plein

    Senior Engineer - DevOps Become ALL IN! for Health as a Senior Engineer DevOps (M/F/d) incl. remote We are expanding our European R&D and Product Management Center and look for creative minds and team players who understand agile development, love technical challenges, can work cross-functionally, and strive to grow professionally and contribute to a...

  • Conducteur D'engins

    il y a 20 heures


    Montpellier, France ISA Développement Temps plein

    Dans le cadre de son développement l'agence ISA INTERIM de Sète recherche un conducteur d'engins H/F CACES 2 4. Sous la responsabilité du responsable de site vous aurez en charge le déchargement de ciment en VRAC et stockage Contrôle et entretien de l'engin Respect des consignes de sécurité Dans le cadre de son développement l'agence ISA INTERIM...