Post-Doctoral Research Visit F/M Interfacing Crop Models with Reinforcement Learning
il y a 7 jours
Le descriptif de l'offre ci-dessous est en Anglais
Type de contrat : CDD
Niveau de diplôme exigé : Thèse ou équivalent
Fonction : Post-Doctorant
Niveau d'expérience souhaité : Jeune diplômé
A propos du centre ou de la direction fonctionnelleCreated in 2008, the Inria center at the University of Lille employs 360 people, including 305 scientists in 16 research teams. Recognized for its strong involvement in the socio-economic development of the Hauts-De-France region, the Inria center at the University of Lille maintains a close relationship with large companies and SMEs. By fostering synergies between researchers and industry, Inria contributes to the transfer of skills and expertise in the field of digital technologies, and provides access to the best of European and international research for the benefit of innovation and businesses, particularly in the region.
For over 10 years, the Inria center at the University of Lille has been at the heart of Lille's university and scientific ecosystem, as well as at the heart of Frenchtech, with a technology showroom based on avenue de Bretagne in Lille, on the EuraTechnologies site of economic excellence dedicated to information and communication technologies (ICT).
Contexte et atouts du posteSupervision: The Postdoctoral researcher will be advised by Odalric-Ambrym Maillard from Inria team-project
Scool and Cyrille Midingoyi from CIRAD/PERSYST AIDA Unit.
Place: This position will be primarily held at the research center Inria Lille – Nord Europe, Villeneuve d'Ascq,
in the Inria team-project Scool (Sequential, Continual and Online Learning), with strong regular interactions with
CIRAD AIDA unit in Montpellier.
Keywords: Multi-armed bandits, Sequential statistics, Societal challenge.
INRIA The postdoctoral researcher will be hosted at Centre Inria de l'Université de Lille, in the Scool team. Scool (Sequential COntinual and Online Learning) is an Inria team-project. It was created on November 1st, 2020 as the follow-up of the team SequeL. In a nutshell, the research topic of Scool is the study of the sequential decision making problem under uncertainty. Most of our activities are related to either bandit problems, or reinforcement learning problems. Through collaborations, we are working on their application in various fields including health, agriculture and ecology, sustainable development. More information, please visit
Odalric-Ambrym Maillard is a permanent researcher at Inria. He has worked for over a decade on advancing the theoretical foundations of reinforcement learning, using a combination of tools from statistics, optimization and control, in order to build more efficient algorithms able to provide decision making in uncertain environments.
He was PI of several projects, including ANR-JCJC project BADASS (BAnDits Against non-Stationarity and Structure), Inria Action Exploratoire SR4SG (Sequential Recommendation for Sustainable Gardening) and Inria-Japan Associate team RELIANT (Reliable Bandit strategies). His goal is to push forward key fundamental and applied questions related to the grand-challenge of making reinforcement learning applicable in real-life societal applications.
Context The project is part of the AgroecologIcaL decision making and Optimization with REinforcement learning (Agrilore) project from ANR-TSIA 2025 initiative. This project brings together an interdisciplinary board of researchers from INRIA, CIRAD, and INRAE.
Agroecological intensification is a key response to the current challenges of food security and climate
change \cite{vikas2024agroecological}. Among the agroecological levers, crop diversification, especially intercropping (i.e. growing least two different crop species in the same field), offers major agronomic potential. However, their implementation is still based on limited knowledge. As the production of experimental references is cumbersome and costly, process-based (mechanistic) modeling has emerged as an effective alternative. However, many standard Process-Based crop Models (PBM) including STICS or DSSAT were initially built for monocultures, and while extensions exist, they still struggle to represent all the complex interactions inherent in intercropping systems. In parallel, the crop modeling community is increasingly focusing on issues of model modularity and interoperability, as illustrated by the Crop2ML framework devloped as part of Agricultural Model Exchange Initiative (AMEI).
This project aims to overcome these limitations by integrating artificial intelligence (AI) approaches,
notably reinforcement learning (RL), to optimize decision-making under conditions of uncertainty.
Building on a proven interdisciplinary collaboration in simpler intercropping contexts, we leverage this dynamic to tackle a major challenge: adapting the RL-Agro coupling to the emblematic case of intercropping.
Beyond its agronomic importance, this research also contributes to the rapidly growing intersection between AI and environmental modeling. Reinforcement learning environments inspired by complex natural systems are gaining traction in the machine learning community as challenging, high-dimensional testbeds. Notably, the recently developed WOFOSTGym simulator \cite{solow2025wofostgym}, bridging crop modeling and RL, received the Outstanding Paper Award at the 2024 Reinforcement Learning Conference, highlighting the community's enthusiasm for scientifically grounded RL environments.
Developing a generic Gym-PBM, therefore, not only supports sustainable agriculture but also provides the RL community with a novel, physically grounded benchmark characterized by temporal dependencies, partial observability, and uncertainty—features often missing in standard RL benchmarks. The system's modular design will enable broader methodological experimentation and open the door for AI researchers to engage with real-world, sustainability-driven challenges.
By positioning the project at the interface between agronomy, AI, and applied mathematics, this research contributes to the emergence of a new interdisciplinary field where model-based reasoning and data-driven learning co-evolve. This synergy is expected to foster new collaborations between RL researchers and agricultural scientists, accelerating innovation in both domains.
[1] Vikas and Rajiv Ranjan. Agroecological approaches to sustainable development. Frontiers in Sustainable Food Systems, 8:1405409, 2024.
[2] Cyrille Ahmed Midingoyi, Christophe Pradal, Andreas Enders, Davide Fumagalli, Patrice Lecharpentier, H´elene Raynal, Marcello Donatelli, Davide Fanchini, Ioannis N. Athanasiadis, Cheryl Porter, Gerrit Hoogenboom, F.A.A. Oliveira, Dean Holzworth, and Pierre Martre. Crop modeling frameworks interoperability through bidirectional source code transformation. Environ. Model. Softw., 168(C), October
[3] Pierre Martre, Donatelli Marcello, Christophe Pradal, Andreas Enders, Cyrille Ahmed Midingoyi, Ioannis Athanasiadis, Davide Fumagalli, Dean P. Holzworth, Gerrit Hoogenboom, Cheryl Porter, H´elene Raynal, Andrea Emilio Rizzoli, and P Thorburn. The agricultural model exchange initiative. In IICA, editor, 7th AgMIP Global Workshop, San Jos´e, Costa Rica, 2018.
[4] William Solow, Sandhya Saisubramanian, and Alan Fern. Wofostgym: A crop simulator for learning annual and perennial crop management strategies. arXiv preprint arXiv: , 2025
Objectives
The goal of the postdoc project is to develop a robust and flexible interface between crop models and reinforcement learning (RL) to enable decision-making algorithms to interact with crop simulations. This requires bridging a significant conceptual and technical gap between monolithic crop models, in which user-defined management actions are treated as predefined input variables, and RL, which relies on state-action-reward loops to enable adaptive decision-making under uncertainty.
The core challenge, therefore, is to design a modular and generalizable methodology that embeds STICS within a standard RL framework. This will pave the way for adaptive, data-driven decision-making in agronomy and open new opportunities to optimize crop management strategies in complex, uncertain environments.
Methodology
The methodological pathway follows a generic-to-specific progression: defining a model-agnostic formalism to couple black-box PBM with reinforcement learning (RL), then implementing this abstraction in a reusable software interface, and finally instantiate and evaluate it on STICS or other PBM.
- O1- Define a model-agnostic formalism for RL–PBM coupling. The first step is to develop an abstract
representation of how an RL agent can interact with a PBM. It will focus on the formalization of how to
access state variables or other indicators from crop model run, how to change recommendations or manage-
ment decisions (actions) through controllable input levers (e.g. sowing date, fertilization, irrigation, cultivar
choice), and how to define reward signals that encode agronomic and environmental objectives (yield, risk,
resource use, emissions, etc.).This formalism will be expressed as a generic state–action–reward–transition
schema, compatible with standard RL frameworks, but enriched with agronomic structure (time step, crop
management, etc.). This model-agnostic abstraction will serve as a conceptual template for any mechanistic
crop model.
- O2 - Implement a generic RL–crop interface library. Building on the abstract formalism defined in O1, the
second objective is to implement a generic software interface that operationalizes this coupling, in the form
of an OpenAI Gymnasium–compatible environment layer (a "Gym-Agro" or "Gym-PBM abstraction). This
layer will expose a standardized API (reset, step, observe, reward, done) to RL agents, manage simulation
calls and time-stepping, allow the mapping between model-specific I/O and the abstract state–action–reward
schema to be specified declaratively,etc. The result will be a model-independent environment layer into
which different crop models can be plugged without changing the core RL code, simply by providing a
suitable adapter specification.
- O3 - Instantiate and evaluate the formalism on STICS The third objective is to specialize and validate the
generic methodology on STICS. Using the formalism and the "Gym-Agro" interface from O1–O2, we will
develop a STICS-specific adapter that maps STICS input variables and management options to actions,
extracts relevant biophysical and management indicators as states, defines appropriate reward functions in
line with agronomic objectives. We will demonstrate the flexibility and scalability of the approach for
diverse cropping systems and objectives.
English (mandatory), French (bonus)
Excellent writing and presentation skills
Excellent organisation and communication skills due to interdisciplinary context.
Avantages- Subsidized meals
- Partial reimbursement of public transport costs
- Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
- Possibility of teleworking and flexible organization of working hours
- Professional equipment available (videoconferencing, loan of computer equipment, etc.)
- Social, cultural and sports events and activities
- Access to vocational training
- Social security coverage
Monthly gross salary: 2788 €
Informations générales- Thème/Domaine : Optimisation, apprentissage et méthodes statistiques
Statistiques (Big data) (BAP E)
- Ville : Villeneuve d'Ascq
- Centre Inria : Centre Inria de l'Université de Lille
- Date de prise de fonction souhaitée :
- Durée de contrat : 1 an, 6 mois
- Date limite pour postuler :
Attention: Les candidatures doivent être déposées en ligne sur le site Inria. Le traitement des candidatures adressées par d'autres canaux n'est pas garanti.
Consignes pour postulerPlease send your CV and cover letter
Sécurité défense :
Ce poste est susceptible d'être affecté dans une zone à régime restrictif (ZRR), telle que définie dans le décret n° relatif à la protection du potentiel scientifique et technique de la nation (PPST). L'autorisation d'accès à une zone est délivrée par le chef d'établissement, après avis ministériel favorable, tel que défini dans l'arrêté du 03 juillet 2012, relatif à la PPST. Un avis ministériel défavorable pour un poste affecté dans une ZRR aurait pour conséquence l'annulation du recrutement.
Politique de recrutement :
Dans le cadre de sa politique diversité, tous les postes Inria sont accessibles aux personnes en situation de handicap.
Contacts- Équipe Inria : SCOOL
- Recruteur :
Maillard Odalric-ambrym /
L'essentiel pour réussirDesired Postdoctoral Profile and Scientific Challenges
The postdoctoral position requires both a strong mathematical foundation and a deep interest in applied interdisciplinary research. The coupling of RL with mechanistic crop models entails challenges beyond standard decision-making frameworks: calibration, inference, and optimization must operate over high-dimensional, continuous, and structured variable spaces.
In this context, expertise in probabilistic graphical models and non-parametric inference methods will be crucial to learn structural dependencies and perform efficient inference under data scarcity. The project also opens perspectives for applying and extending continuous Bayesian networks and bandit-based experimental design approaches to agricultural systems.
Because of the computational demands of crop simulations, software optimization and scalability are also central. The candidate should ideally have experience implementing large-scale inference or optimization methods, and a strong interest in reinforcement learning and adaptive decision algorithms. This position thus offers a rare opportunity to combine advanced mathematical modeling with real-world impact in agroecology.
A propos d'InriaInria est l'institut national de recherche dédié aux sciences et technologies du numérique. Il emploie 2600 personnes. Ses 215 équipes-projets agiles, en général communes avec des partenaires académiques, impliquent plus de 3900 scientifiques pour relever les défis du numérique, souvent à l'interface d'autres disciplines. L'institut fait appel à de nombreux talents dans plus d'une quarantaine de métiers différents. 900 personnels d'appui à la recherche et à l'innovation contribuent à faire émerger et grandir des projets scientifiques ou entrepreneuriaux qui impactent le monde. Inria travaille avec de nombreuses entreprises et a accompagné la création de plus de 200 start-up. L'institut s'efforce ainsi de répondre aux enjeux de la transformation numérique de la science, de la société et de l'économie.
-
Villeneuve-d'Ascq, Hauts-de-France Inria Temps pleinLe descriptif de l'offre ci-dessous est en AnglaisType de contrat : CDDNiveau de diplôme exigé : Bac + 5 ou équivalentFonction : DoctorantA propos du centre ou de la direction fonctionnelleThe Inria University of Lille centre, created in 2008, employs 360 people including 305 scientists in 16 research teams. Recognised for its strong involvement in the...
-
Villeneuve-d'Ascq, Hauts-de-France Université Gustave Eiffel Temps pleinÀ propos de nousL'Université Gustave Eiffel, modèle innovant d'université rassemblant le triptyque université, écoles et organisme de recherche, dispose de plusieurs campus de formation et de recherche implantés sur le territoire national.L'établissement compte plus de 15000 étudiants et plus de 3000 personnels enseignant (e)s-chercheur(e)s,...
-
Villeneuve-d'Ascq, Hauts-de-France Université Gustave Eiffel Temps pleinÀ propos de nousL'Université Gustave Eiffel, modèle innovant d'université rassemblant le triptyque université, écoles et organisme de recherche, dispose de plusieurs campus de formation et de recherche implantés sur le territoire national.L'établissement compte plus de 15000 étudiants et plus de 3000 personnels enseignant (e)s-chercheur(e)s,...
-
Post-doc en Sciences Cognitives
il y a 19 heures
Villeneuve-d'Ascq, Hauts-de-France Université de Lille Temps pleinÀ propos de nousAmbitieuse et innovante, l'Université de Lille réunit facultés, instituts et écoles de renom.Notre objectif : Construire une université internationale, rayonnante et attractive, ancrée dans son territoire et moteur du développement économique et socio-culturel.Faire le choix de l'Université de Lille, c'est rejoindre une communauté...
-
Secrétaire Médical en alternance F/H
il y a 7 jours
Villeneuve-d'Ascq, Hauts-de-France Walter Learning Temps pleinL'école Walter Learning conçoit, produit et dispense des formations en ligne à destination des professionnels et des personnes qui cherchent à construire leur avenir professionnel.Alliant création de contenus pédagogiques et expertise technologique, nous produisons nos formations en interne avec des spécialistes reconnus, et les diffusons sur notre...
-
Secrétaire Médical en alternance F/H
il y a 1 semaine
Villeneuve-d'Ascq, Hauts-de-France Walter Learning Temps pleinL'école Walter Learning conçoit, produit et dispense des formations en ligne à destination des professionnels et des personnes qui cherchent à construire leur avenir professionnel.Alliant création de contenus pédagogiques et expertise technologique, nous produisons nos formations en interne avec des spécialistes reconnus, et les diffusons sur notre...
-
Post-doctorat en optique quantique
il y a 7 jours
Villeneuve-d'Ascq, Hauts-de-France Choisir le Service Public Temps pleinInformations générales Organisme de rattachement CNRS Référence UMR8523-GIUPAT-002 Date de début de diffusion /12/2025 Date de parution /12/2025 Date de fin de diffusion /01/2026 VersantFonction Publique de l'Etat CatégorieCatégorie A (cadre) Nature de l'emploiEmploi ouvert uniquement aux contractuels Domaine / MétierRecherche -...
-
Villeneuve-d'Ascq, Hauts-de-France Choisir le Service Public Temps pleinInformations générales Organisme de rattachement CNRS Référence UMR8523-LAUHEL-009 Date de début de diffusion /11/2025 Date de parution /12/2025 Date de fin de diffusion /02/2026 VersantFonction Publique de l'Etat CatégorieCatégorie A (cadre) Nature de l'emploiEmploi ouvert uniquement aux contractuels Domaine / MétierRecherche -...
-
Conseiller commercial en alternance F/H
il y a 5 jours
Villeneuve-Saint-Georges, Île-de-France Walter Learning Temps pleinL'école Walter Learning conçoit, produit et dispense des formations en ligne à destination des professionnels et des personnes qui cherchent à construire leur avenir professionnel.Alliant création de contenus pédagogiques et expertise technologique, nous produisons nos formations en interne avec des spécialistes reconnus, et les diffusons sur notre...
-
Sr Manager R&D
il y a 7 jours
Villeneuve-d'Ascq, Hauts-de-France McCain Alimentaire SAS Temps pleinPosition Title: Sr Manager R&D - Regulatory Affairs CE & GB Position Type: Regular - Full-Time Requisition ID: 37501 The Senior R&D Regulatory Affairs Manager is responsible for product composition, claims and label regulatory compliance for finished goods manufactured and sold in Europe, the Middle East and North Africa and United Kingdom.The...