Doctorant F/H Private and Byzantine-Robust Federated Learning

Il y a 6 mois


Montpellier, France INRIA Temps plein

Contexte et atouts du poste

This PhD position is a collaboration between two Inria research teams: ,
and

Mission confiée

Context

The increasing size of data generated by smartphones and IoT devices motivated the development of Federated Learning (FL) (Kairouz et al. 2021), a decentralized learning framework for on-device collaborative training of machine learning models. FL algorithms like FedAvg (McMahan et al. 2017) allow clients to train a common global model without sharing their personal data. FL reduces data collection costs and can help to mitigate data privacy issues, making it possible to train models on large datasets that would otherwise be inaccessible. FL is currently used by many big tech companies (e.g., Google, Apple, Facebook) for learning on their users' data, but the research community envisions also promising applications to learning across large data-silos, like hospitals that cannot share their patients' data (Rieke et al. 2020).

While they mitigate privacy concerns by not exchanging raw data, FL does not in itself offer rigorous privacy guarantees, and FL algorithms can be attacked by malicious participants. For these reasons, in recent years a large body of the literature has focused on the design of decentralized algorithms that are more privacy-preserving, using different variants of differential privacy (Noble et al. 2022; Cyffers et al. 2022). On the other hand, another line of research has focused on decentralized learning algorithms that are robust to the presence of malicious individuals in the system (Byzantine agents) (Farhadkhani et al. 2022, 2023; Guerraoui et al. 2023). Nevertheless, the design and analysis of algorithms that are both robust and privacy-preserving is far less considered and understood. Recently, it has been shown that in the case where the server is honest-but-curious, the combination of differential privacy and robustness induces an additional error term, making them at odds with each other (Allouah et al. 2023). Specifically, we face a utility-privacy-robustness trilemma (on top of the conventional privacy-utility and robustness-utility trade-offs). Conversely, in the case of a trusted server, some studies (Hopkins et al. 2023) have shown that privacy and robustness can actually be mutually beneficial. A key question then arises: in what contexts are these two notions really good for each other?

Research Objectives

The main goal of this PhD is to answer the previous question on the basis of new theoretical analyses, and to design decentralized algorithms that are both robust and differentially private. Several lines of research could be investigated.

A natural direction to seek better trade-offs between privacy, robustness and utility is to relax the notions of privacy and/or robustness. One may consider the general framework of Pufferfish privacy (Kifer & Machanavajjhala, 2014; Pierquin et al., 2023), which allows to relax differential privacy by considering more specific *secrets* to protect and by constraining the prior belief that the adversary may have about the data. Similarly, while Byzantine robustness has been shown to be at odds with local differential privacy (Allouah et al. 2023(a)), it is possible to consider weaker threat models, such as the hidden state model (Ye & Shokri, 2022), the shuffle model (Cheu et al., 2019) or the network model (Cyffers et al., 2022). Regarding robustness, current approaches are designed to ensure protection against Byzantine users that can misbehave arbitrarily (Guerraoui et al., 2023). However, such robustness is too stringent and leads to conservative learning performance in practice when no user is fully adversarial. For example, in the case of medical applications it is safe to assume that all the users, usually hospitals, clinics or pharmacies, are honest by intention, but misbehavior could occur due to mistakes like *mislabelling* (Allen-Zhu et al., 2020). Refining (or designing new) Byzantine-robust schemes to weaker adversaries is crucial to fully realize the benefits of robust decentralized learning in real-world applications.

Another line of investigation is to reconsider the notion of utility. In the majority of the aformentioned work, the key quantity to control (utility) is the optimization error of the empirical risk. However, in (decentralized) machine learning one is often interested in also controling the generalization error (Bassily et al., 2020; Le Bars et al., 2024), namely the error that will be made on unobserved data points. In this case, it will be interesting to study how robustness and privacy can jointly improve algorithm stability and thus help generalization, e.g., by studying the connections between *gradient coherence* (Chatterjee, 2019) and *robust aggregation* (Yin et al., 2018; Allouah et al., 2023(b).

A last direction of research is to consider model update *compression* or *sparsification* techniques (Stich et al., 2018) that have been independently shown to help privacy (e.g, see Rui et al., 2023). Whether the benefits of these scheme hold true when aiming for robustness along with privacy remains unclear. Some technical challenges are as follows. (i) While sparsification (in decentralized learning) improves the overall privacy-utility trade-off, the same need not be true for the privacy-robustness trade-off. (ii) The compression noise can be amplified in the presence of malicious clients in the system (Rammal et al., 2024).

References

- Kairouz, P. et al. Advances and open problems in federated learning. Foundations and Trends in Machine Learning, 14(1-2), pp. 1-210, 2021.

- McMahan, B., Moore, E., Ramage, D., Hampson, S. and Aguera y Arcas, B. Communication efficient learning of deep networks from decentralized data. AISTATS 2017.

- Rieke, N., Hancox, J., Li, W. et al. The future of digital health with federated learning. npj Digit. Med. 3, 119, 2020.

- Noble, M., Bellet, A., and Dieuleveut, A. Differentially private federated learning on heterogeneous data. AISTATS 2022.

- Cyffers, Bellet, A. Privacy amplification by decentralization. AISTATS 2022

- Farhadkhani, S., Guerraoui, R., Gupta, N., Hoang, L. N., Pinot, R., & Stephan, J. Robust Collaborative Learning with Linear Gradient Overhead. ICML 2023.

- Guerraoui, R., Nirupam G., and Rafael P. Byzantine Machine Learning: A Primer. ACM Computing Surveys, 2023.

- Farhadkhani, S., Guerraoui, R., Gupta, N., Pinot, R., and Stephan, J. Byzantine machine learning made easy by resilient averaging of momentums. ICML 2022.

- Allouah, Y., Guerraoui, R., Gupta, N., Pinot, R., & Stephan, J. On the privacy-robustness-utility trilemma in distributed learning. ICML 2023(a).

- Allen-Zhu, Z., Ebrahimianghazani, F., Li, J., & Alistarh, D. Byzantine-Resilient Non-Convex Stochastic Gradient Descent. ICML 2020.

- Hopkins, S. B., Kamath, G., Majid, M., & Narayanan, S. Robustness implies privacy in statistical estimation. STOC 2023.

- Bassily, R., Feldman, V., Guzmán, C., & Talwar, K. Stability of stochastic gradient descent on nonsmooth convex losses. NeurIPS 2020.

- Le Bars, B., Bellet, A., Tommasi M., Scaman K., Neglia, G. Improved Stability and Generalization Guarantees of the Decentralized SGD Algorithm. ICML 2024.

- Yin, D., Chen, Y., Kannan, R., & Bartlett, P. Byzantine-robust distributed learning: Towards optimal statistical rates. ICML 2018

- Chatterjee, Satrajit. Coherent Gradients: An Approach to Understanding Generalization in Gradient Descent-based Optimization. ICML 2019.

- Allouah, Y., Farhadkhani, S., Guerraoui, R., Gupta, N., Pinot, R., & Stephan, J. Fixing by Mixing: A Recipe for Optimal Byzantine ML Under Heterogeneity. AISTATS 2023(b).

- Kifer & Machanavajjhala. Pufferfish: A Framework for Mathematical Privacy Definitions. ACM Transactions on Database System, 2014.

- Pierquin et al. Rényi Pufferfish Privacy: General Additive Noise Mechanisms and Privacy Amplification by Iteration. ICML 2024.

- Ye & Shokri. Differentially Private Learning Needs Hidden State (Or Much Faster Convergence). NeurIPS 2022.

- Cheu et al. Distributed Differential Privacy via Shuffling. Eurocrypt 2019.

- Stich, Sebastian U., Jean-Baptiste Cordonnier, and Martin Jaggi. Sparsified SGD with Memory. NeurIPS 2018.

- Rui, H., Yuanxiong Guo, and Yanmin Gong. Federated Learning with Sparsified Model Perturbation: Improving Accuracy Under Client-level Differential Privacy. IEEE Transactions on Mobile Computing, 2023.

- Rammal, A., Gruntkowska, K., Fedin, N., Gorbunov, E. and Richtárik, P. Communication Compression for Byzantine Robust Learning: New Efficient Algorithms and Improved Rates. AISTATS 2024

Principales activités

Research

Compétences

The applicant is expected to have studied machine learning and/or optimization, and to have good mathematical skills. Some knowledge in distributed algorithms and broad interest for the topic of trustworthy AI is a plus.

Avantages

Restauration subventionnée Transports publics remboursés partiellement Congés: 7 semaines de congés annuels + 10 jours de RTT (base temps plein) + possibilité d'autorisations d'absence exceptionnelle (ex : enfants malades, déménagement) Possibilité de télétravail (après 6 mois d'ancienneté) et aménagement du temps de travail Équipements professionnels à disposition (visioconférence, prêts de matériels informatiques, etc.) Prestations sociales, culturelles et sportives (Association de gestion des œuvres sociales d'Inria) Accès à la formation professionnelle Sécurité sociale

Rémunération

Gross Salary per month: 2100€ brut per month (year 1 & 2) and 2190€ brut per month (year 3)



  • Montpellier, Occitanie, France INRIA Temps plein

    About the PositionThis PhD research position is a collaboration between two Inria research teams. The successful candidate will work on designing decentralized algorithms that are both robust and differentially private.Research ObjectivesThe main goal of this PhD is to answer the question of when privacy and robustness are mutually beneficial in...


  • Montpellier, Occitanie, France INRIA Temps plein

    Context and Advantages of the PositionThis PhD research position is a collaboration between two Inria research teams, offering a unique opportunity to work on cutting-edge topics in decentralized machine learning and trustworthy AI.Research ObjectivesThe main goal of this PhD is to investigate the intersection of privacy, robustness, and utility in...

  • Doctorant (H/F)

    il y a 3 semaines


    Montpellier, France CNRS Temps plein

    Cette offre est disponible dans les langues suivantes: - Français - Anglais Date Limite Candidature : mercredi 30 octobre 2024 23:59:00 heure de Paris **Informations générales**: **Intitulé de l'offre **:Doctorant (H/F) - Modélisation hybride et optimisation opérationnelle des réseaux couplés d'électricité et de chaleur** Référence :...


  • Montpellier, Occitanie, France WORLDFISH Temps plein

    About WorldFishWorldFish is a leading international research organization working to transform aquatic food systems to reduce hunger, malnutrition, and poverty. It collaborates with international, regional, and national partners to co-develop and deliver scientific innovations, evidence for policy, and knowledge to enable equitable and inclusive impact for...


  • Montpellier, France CNRS Temps plein

    Cette offre est disponible dans les langues suivantes: - Français - Anglais Date Limite Candidature : mercredi 27 novembre 2024 23:59:00 heure de Paris **Informations générales**: **Intitulé de l'offre **:Doctorant/doctorante en Modélisation multi-échelle du processus de broyage des résidus végétaux (H/F)** Référence : UMR5508-FRARAD-001 Nombre...


  • Montpellier, France WORLDFISH Temps plein

    About WorldFish “WorldFish is a leading international research organization working to transform aquatic food systems to reduce hunger, malnutrition, and poverty. It collaborates with international, regional, and national partners to co-develop and deliver scientific innovations, evidence for policy, and knowledge to enable equitable and inclusive...


  • Montpellier, France Inria Temps plein

    A propos du centre ou de la direction fonctionnelleThe Inria center at Université Côte d'Azur includes 42 research teams and 9 support services. The center's staff (about 500 people) is made up of scientists of different nationalities, engineers, technicians and administrative staff. The teams are mainly located on the university campuses of Sophia...

  • Doctorant(E) (H/F)

    il y a 4 semaines


    Montpellier, France CNRS Temps plein

    Cette offre est disponible dans les langues suivantes: - Français - Anglais Date Limite Candidature : mercredi 20 novembre 2024 23:59:00 heure de Paris **Informations générales**: **Intitulé de l'offre **:Doctorant(e) (H/F) en robotique souple et gonflable** Référence : UMR5506-CEDGIR-001 Nombre de Postes : 1 Lieu de travail : MONTPELLIER Date de...


  • Montpellier, France CNRS Temps plein

    Cette offre est disponible dans les langues suivantes: - Français - Anglais Date Limite Candidature : mercredi 23 octobre 2024 23:59:00 heure de Paris **Informations générales**: **Intitulé de l'offre **:H/F doctorant sur les interactions plantes-pollinisateurs dans le gradient rural urbain** Référence : UMR5175-PIECHE-002 Nombre de Postes : 1 Lieu...


  • Montpellier, Occitanie, France Alfa Interim Montpellier Temps plein

    About the RoleWe are seeking an experienced software professional with a specialization in AI and machine learning technologies to join our team. As a Software Engineer, you will be responsible for designing, developing, and implementing AI and machine learning solutions to drive business growth and improve customer experiences.Key ResponsibilitiesDesign and...


  • Montpellier, France Inria Temps plein

    Le descriptif de l’offre ci-dessous est en Anglais_ **Type de contrat **:CDD **Contrat renouvelable **:Oui **Niveau de diplôme exigé **:Bac + 5 ou équivalent **Fonction **:Ingénieur scientifique contractuel **Niveau d'expérience souhaité **:De 3 à 5 ans **A propos du centre ou de la direction fonctionnelle**: The Inria center at Université...


  • Montpellier, France CNRS Temps plein

    Cette offre est disponible dans les langues suivantes: - Français - Anglais Date Limite Candidature : mardi 24 septembre 2024 23:59:00 heure de Paris **Informations générales**: **Intitulé de l'offre **:Doctorant en neuro-immunologie (H/F)** Référence : UMR5203-MIRWEI-002 Nombre de Postes : 1 Lieu de travail : MONTPELLIER Date de publication : mardi...


  • Montpellier, France CNRS Temps plein

    Cette offre est disponible dans les langues suivantes: - Français - Anglais Date Limite Candidature : vendredi 13 décembre 2024 23:59:00 heure de Paris **Informations générales**: **Intitulé de l'offre **:Post-doctorant en humanités environnementales sur les flux de gènes assistés (H/F)** Référence : UMR5175-VIRMAR-005 Nombre de Postes : 1 Lieu...


  • Montpellier, Occitanie, France CIMMYT Temps plein

    CIMMYT is a cutting-edge, non-profit, international organization dedicated to solving tomorrow's problems today. It is entrusted with fostering improved quantity, quality, and dependability of production systems and basic cereals such as maize, wheat, triticale, sorghum, millets, and associated crops through applied agricultural science, particularly in the...

  • Cdd Doctorant

    Il y a 3 mois


    Montpellier, France CNRS Temps plein

    Cette offre est disponible dans les langues suivantes: - Français - Anglais Date Limite Candidature : mardi 17 septembre 2024 23:59:00 heure de Paris **Informations générales**: **Intitulé de l'offre **:CDD doctorant (H/F) - Optimisation des verres chalcogénures polarisés thermiquement pour la détection sélective dans le moyen...

  • French Professor

    Il y a 5 mois


    Montpellier, France HERE AND NOW - The French Institute Temps plein

    **About us**: **Roles & Responsibilities**: 1. Facilitate efficient learning through interactive and collaborative sessions. 2. Act as a source of accurate and reliable information to students. 3. Developing curriculum and lesson plans that align with the institute's standards, with an emphasis on Business French language skills 4. Create a collaborative...


  • Montpellier, France INRIA Temps plein

    Post-Doctoral Research Visit F/M Policy learning under distributional shifts Le descriptif de l’offre ci-dessous est en Anglais Type de contrat : CDD Niveau de diplôme exigé : Thèse ou équivalent Fonction : Post-Doctorant A propos du centre ou de la direction fonctionnelle The Inria centre at Université Côte d'Azur...


  • Montpellier, Occitanie, France APSH 34 Temps plein

    Assume the role of a Data Scientist at {company} and contribute to the development of innovative machine learning models. Utilize your expertise in data analysis and statistical modeling to drive business growth and improve decision-making processes.Collaborate with cross-functional teams to design and implement data-driven solutionsDevelop and deploy...

  • French Professor

    Il y a 5 mois


    Montpellier, France HERE AND NOW - The French Institute Temps plein

    **About us**: **Eligibility criteria**: **2. THIS IS NOT AN ONLINE/WFH job** **Roles & Responsibilities**: 1. Facilitate efficient learning through interactive and collaborative sessions. 2. Act as a source of accurate and reliable information to students. 3. Developing curriculum and lesson plans that align with the institute's standards, with an...

  • Doctorant en Biologie

    il y a 1 mois


    Montpellier, France CNRS Temps plein

    Cette offre est disponible dans les langues suivantes: - Français - Anglais Date Limite Candidature : vendredi 8 novembre 2024 23:59:00 heure de Paris **Informations générales**: **Intitulé de l'offre **:Doctorant en biologie (H/F)** Référence : UMR5535-SARADE-064 Nombre de Postes : 1 Lieu de travail : MONTPELLIER Date de publication : vendredi 18...