Phd Position F/m a Multi-modal Language Model for
il y a 4 semaines
Le descriptif de l’offre ci-dessous est en Anglais_
**Type de contrat **:CDD
**Niveau de diplôme exigé **:Bac + 5 ou équivalent
**Fonction **:Doctorant
**A propos du centre ou de la direction fonctionnelle**:
Inria is a national research institute dedicated to digital sciences that promotes scientific excellence and transfer. Inria employs 2,400 collaborators organised in research project teams, usually in collaboration with its academic partners.
This agility allows its scientists, from the best universities in the world, to meet the challenges of computer science and mathematics, either through multidisciplinarity or with industrial partners.
A precursor to the creation of Deep Tech companies, Inria has also supported the creation of more than 150 start-ups from its research teams. Inria effectively faces the challenges of the digital transformation of science, society and the economy.
**Contexte et atouts du poste**:
This PhD offer is funded by the GEO-ReSeT ANR project, representing a collaboration between Inria (team EVERGREEN, Montpellier) and Université de Paris Cité (team LIPADE, Paris).
Leveraging the large amounts of available geo-spatial data from different sources, the GEO-ReSeT (Generalized Earth Observation with Remote Sensing and Text) project has the objective to learn a rich representation of any geo-spatial location and convey a semantic representation of the information, by improving on existing models and providing a better experience to the end users. By using location on the Earth's surface as the common link between different modalities, a geo-spatial foundation model would be able to incorporate a variety of data sources, including remote sensing imagery, textual descriptions of places, and other generic features.
By leveraging several data modalities, this foundation model could provide a comprehensive and accurate understanding of the Earth's surface, enabling informed decisions and actions. This will be particularly valuable for new potential users in sectors such as journalism, social sciences or environmental monitoring, who may not have the resources or expertise to collect their own training datasets and develop their own methods, thus moving beyond open Earth observation data and democratizing the access to Earth observation information.
**Mission confiée**:
The work to be conducted during the proposed PhD thesis will contribute to the ambition of the GEO-ReSeT ANR project by linking textual descriptions of places (e.g., collected from heterogeneous online sources, such as news articles or search engine results), to their approximate geo-location, a task known as geoparsing.
This text-location link will then be used in combination with other geospatial data modalities, with a focus on remote sensing data from sensors such as Sentinel-1 and -2, in order to train multi-modal models that are aware about the way in which people describe locations.
This will be done by first combining information stemming from different databases containing geographic named entities, such as Open Street Map, Wikipedia and gazetteers, such that geographic points or polygons can be linked to each named entity.
In a second step, a Natural Language Processing (NLP) pipeline will be developed to obtain the most likely geographic named entities that are referred to in any piece of text that describes a place.
With respect to existing Named Entity Recognition (NER) methodologies, in order to avoid restricting us to cases where entities' names appear exactly as in the databases or gazetteers, we will leverage pre-trained Large Language Models (LLM) to resolve ambiguities and gather evidence towards the most likely entities that are being described in the text. Such an approach will be trained and validated by using the cases that do match the names in the gazetteer.
We will then move on, in collaboration with the rest of the GEO-ReSeT consortium, to train a multi-modal large language model (MMLLM) that will serve as a foundation model for Earth observation tasks.
This model will finally be evaluated on several agro-environmental tasks.
**Principales activités**:
- Description of the state-of-the-art in unstructured text geoparsing, with a focus on approaches leveraging LLMs.
- Collection of a database of geographic named entities linked to their geographic footprint (e.g. point or polygon). Collection of a database of unstructured online text that is likely to contain a reference to a geographic location.
- Development of an NLP pipeline to link each piece of geographic text to its likely geographic footprint.
- Participate in the design and training of a multi-modal large language model (MMLLM) using remote sensing and geoparsed text.
- Evaluation of the final model on two of the following case studies at a national or continental scale: ecosystem type mapping, crop type mapping or land-use mapping.
**Compétences**:
- Python programming.
- Deep Learning with Python (preferably
-
Phd Position F/m a Multi-modal Language Model for
il y a 5 jours
Montpellier, Occitanie, France Inria Temps pleinLe descriptif de l'offre ci-dessous est en Anglais_Type de contrat :CDDNiveau de diplôme exigé :Bac + 5 ou équivalentFonction :DoctorantA propos du centre ou de la direction fonctionnelle:Inria is a national research institute dedicated to digital sciences that promotes scientific excellence and transfer. Inria employs 2,400 collaborators organised in...
-
PhD Position F/M A multi-modal language model for Earth observation
il y a 4 semaines
Montpellier, France INRIA Temps pleinContexte et atouts du poste This PhD offer is funded by the GEO-ReSeT ANR project, representing a collaboration between Inria (team EVERGREEN, Montpellier) and Université de Paris Cité (team LIPADE, Paris). Leveraging the large amounts of available geo-spatial data from different sources, the GEO-ReSeT (Generalized Earth Observation with Remote...
-
PhD Position F/M A multi-modal language model for Earth observation
il y a 3 semaines
Montpellier, France INRIA Temps pleinContexte et atouts du poste This PhD offer is funded by the GEO-ReSeT ANR project, representing a collaboration between Inria (team EVERGREEN, Montpellier) and Université de Paris Cité (team LIPADE, Paris). Leveraging the large amounts of available geo-spatial data from different sources, the GEO-ReSeT (Generalized Earth Observation with Remote...
-
Doctorant / Doctorante (Phd) - Développement
il y a 1 semaine
Montpellier, France CNRS Temps pleinCette offre est disponible dans les langues suivantes: - Français - Anglais Date Limite Candidature : lundi 15 avril 2024 **Informations générales**: **Intitulé de l'offre **:Doctorant / Doctorante (PhD) - Développement d'outils d'analyses pour batteries tout solide (H/F)** Référence : UMR5253-LORSTI0-002 Nombre de Postes : 1 Lieu de travail :...
-
24-291 Actionable Knowledge for Fair Remote Sensing
il y a 6 jours
Montpellier, France CNES - Centre National d'Etudes Spatiales Temps pleinDoctorat, 36 mois - Temps plein - Aucune expérience exigée - Maitrise, IEP, IUP, Bac+4 - Tele-epidemiology Risks **Mission**: Actionable Knowledge for FAIR Remote Sensing Processes, Application to One Health Risk Indicators **Scientific objective** **Use cases** **Related works** On the one hand, a variety of tools exist such as [1, 2, 3] which offer...
-
Quantifications Généralisées: Preuves, Modèles
il y a 5 jours
Montpellier, Occitanie, France Université de Montpellier Temps plein**Quantifications généralisées: preuves, modèles et expression en langage naturel // Generalized quantifications: proofs, models and expression in natural language**:- Réf **ABG-122063****ADUM-55856**- Sujet de Thèse- 30/03/2024- Contrat doctoral- Université de Montpellier- Lieu de travail- Montpellier cedex 5 - France- Intitulé du sujet-...
-
Quantifications Généralisées: Preuves, Modèles
il y a 4 jours
Montpellier, France Université de Montpellier Temps plein**Quantifications généralisées: preuves, modèles et expression en langage naturel // Generalized quantifications: proofs, models and expression in natural language**: - Réf **ABG-122063** **ADUM-55856** - Sujet de Thèse- 30/03/2024- Contrat doctoral- Université de Montpellier- Lieu de travail- Montpellier cedex 5 - France- Intitulé du sujet-...
-
Postdoctoral Reasearcher in Llm
il y a 2 semaines
Montpellier, France CEA Temps pleinDescription du poste **Domaine**: - Mathématiques, information scientifique, logiciel **Contrat**: - Post-doctorat **Intitulé de l'offre**: - Postdoctoral reasearcher in LLM - H/F **Sujet de stage**: - Towards Tokamak operations Conversational AI Interface Using Multimodal Large Language Models **Durée du contrat (en mois)**: - 12 **Description...
-
Phd Subject
Il y a 2 mois
Montpellier, France Centre de Recherche en Biologie cellulaire de Montpellier (CRBM) Temps plein**PHD SUBJECT : Contribution of topographical constraints to kidney pathophysiological organization**: - Réf **ABG-123537** - Sujet de Thèse- 30/04/2024- Autre financement public- Centre de Recherche en Biologie cellulaire de Montpellier (CRBM)- Lieu de travail- Montpellier - Occitanie - France- Intitulé du sujet- PHD SUBJECT : Contribution of...
-
Phd Subject
il y a 15 heures
Montpellier, France Centre de Recherche en Biologie cellulaire de Montpellier (CRBM) Temps plein**PHD SUBJECT : Contribution of topographical constraints to kidney pathophysiological organization**: - Réf **ABG-123537** - Sujet de Thèse- 30/04/2024- Autre financement public- Centre de Recherche en Biologie cellulaire de Montpellier (CRBM)- Lieu de travail- Montpellier - Occitanie - France- Intitulé du sujet- PHD SUBJECT : Contribution of...
-
Responsable Communication
il y a 2 semaines
Montpellier, France OPEN MODAL Temps pleinLe Groupe Open Modal (300 personnes - 90 millions de CA) est spécialiste du Transport Combiné Rail Route depuis plus de 15 ans. Notre structure familiale est à taille humaine avec quatre filiales : TAB RAIL ROAD, T3M, COMBI RAIL et BTM. Notre réussite repose sur notre qualité de service et notre volonté d’innover tout en maintenant un très bon...
-
Assistant Sirh
il y a 6 jours
Montpellier, France Groupe Open Modal Temps pleinRejoignez le **Groupe Open Modal, **spécialiste du Transport Combiné Rail Route, transport qui allie les avantages du rail sur la longue distance à ceux de la route sur les premiers et derniers kilomètres. Fondé en 2013, le Groupe Open Modal propose un transport longue distance décarboné grâce à une stratégie unique d’intégration des maillons...
-
Assistant Qse
il y a 7 jours
Montpellier, France Groupe Open Modal Temps pleinLe **Groupe Open Modal **est spécialiste du Transport Combiné Rail Route depuis plus de 15 ans. Notre structure est à taille humaine avec quatre filiales : TAB Rail Road (Transporteur Routier), T3M (Opérateur Ferroviaire), BTM (Opérateur de Terminal) et Combi rail (Entreprise Ferroviaire), intégrant ainsi la totalité de la chaîne du Transport...
-
Assistant Communication en Alternance
il y a 6 jours
Montpellier, France Groupe Open Modal Temps pleinRejoignez le **Groupe Open Modal, **spécialiste du Transport Combiné Rail Route, transport qui allie les avantages du rail sur la longue distance à ceux de la route sur les premiers et derniers kilomètres. Fondé en 2013, le Groupe Open Modal propose un transport longue distance décarboné grâce à une stratégie unique d’intégration des maillons...
-
Responsable Communication
il y a 5 jours
Montpellier, Occitanie, France OPEN MODAL Temps pleinLe Groupe Open Modal (300 personnes - 90 millions de CA) est spécialiste du Transport Combiné Rail Route depuis plus de 15 ans.Notre structure familiale est à taille humaine avec quatre filiales : TAB RAIL ROAD, T3M, COMBI RAIL et BTM.Notre réussite repose sur notre qualité de service et notre volonté d'innover tout en maintenant un très bon...
-
Assistant Qse
il y a 5 jours
Montpellier, Occitanie, France Groupe Open Modal Temps pleinLe **Groupe Open Modal **est spécialiste du Transport Combiné Rail Route depuis plus de 15 ans. Notre structure est à taille humaine avec quatre filiales : TAB Rail Road (Transporteur Routier), T3M (Opérateur Ferroviaire), BTM (Opérateur de Terminal) et Combi rail (Entreprise Ferroviaire), intégrant ainsi la totalité de la chaîne du Transport...
-
Chargé de Mission RH
Il y a 2 mois
Montpellier, France Groupe Open Modal Temps plein**Le Transport Durable recrute !** Le Groupe Open Modal (315 personnes) est spécialisé dans le Transport Combiné Rail Route depuis plusieurs années. **Descriptif du poste**: Sous la responsabilité du Responsable RH et en vue d’intégrer une équipe de 4 personnes, votre activité principale sera d’effectuer les tâches courantes de gestion RH et...
-
Montpellier, France CIP Temps pleinThe International Potato Center (CIP)is looking for a highly motivated and experienced Regional Director for LatinAmerica and Country Representative for Peru. The position: This positionplays a dual role as Country Representative for Peru and Regional Director forLatin America, with the goal to safeguard and further develop CIP’s essentialrelationship...
-
Regional Director for Latin America and Country Representative for Peru
il y a 3 semaines
Montpellier, France CIP Temps pleinThe International Potato Center (CIP)is looking for a highly motivated and experienced Regional Director for LatinAmerica and Country Representative for Peru. The position: This positionplays a dual role as Country Representative for Peru and Regional Director forLatin America, with the goal to safeguard and further develop CIP’s essentialrelationship...
-
23-087 Modelling and Test Methods for Single-event
il y a 1 semaine
Montpellier, France CNES - Centre National d'Etudes Spatiales Temps pleinDoctorat, 36 mois - Temps plein - Moins de 2 ans d’expérience - Master, DESS, DEA, Bac+5 - Space environment and effects **Mission**: For power systems, the occurrence of a destructive SEE is a major threat that requires extensive testing for parts selection and qualification, typically using rare and expensive particles accelerators beams, as well as...