Emplois actuels liés à Post-Doctorant F/H Editing and Conditional Generation with Text-to-Video Generation Models - MontbonnotSaintMartin, Auvergne-Rhône-Alpes - Inria

Post-Doctorant F/H Postdoc Position: Editing and Conditional Generation with Text-to-Video Generation Models

il y a 3 jours

Montbonnot-Saint-Martin, Auvergne-Rhône-Alpes, France Inria Temps plein

Type de contrat : CDDNiveau de diplôme exigé : Thèse ou équivalentFonction : Post-DoctorantA propos du centre ou de la direction fonctionnelleLe Centre Inria de l'Université Grenoble Alpes, regroupe un peu moins de 600 personnes réparties au sein de 22 équipes de recherche et 7 services support à la recherche.Son effectif est distribué sur 3 campus...
Chercheur contractuel: Text-to-Video Generation and Editing for Realistic Human-Centric Scene Synthesis

il y a 3 jours

Montbonnot-Saint-Martin, Auvergne-Rhône-Alpes, France Inria Temps plein

Type de contrat : CDDNiveau de diplôme exigé : Bac + 5 ou équivalentFonction : Chercheur contractuelA propos du centre ou de la direction fonctionnelleLe Centre Inria de l'Université Grenoble Alpes, regroupe un peu moins de 600 personnes réparties au sein de 22 équipes de recherche et 7 services support à la recherche.Son effectif est distribué sur 3...
Post-Doctoral Research Visit F/M Post-Doc on Neuro-Symbolic Systems

il y a 2 jours

Montbonnot-Saint-Martin, Auvergne-Rhône-Alpes, France Inria Temps plein

Le descriptif de l'offre ci-dessous est en AnglaisType de contrat : CDDContrat renouvelable : OuiNiveau de diplôme exigé : Thèse ou équivalentFonction : Post-DoctorantA propos du centre ou de la direction fonctionnelleThe Centre Inria de l'Université de Grenoble groups together almost 600 people in 22 research teams and 8 research support...
PhD Position F/M Modelling of curly hair

il y a 3 jours

Montbonnot-Saint-Martin, Auvergne-Rhône-Alpes, France Inria Temps plein

Le descriptif de l'offre ci-dessous est en AnglaisType de contrat : CDDNiveau de diplôme exigé : Bac + 5 ou équivalentFonction : DoctorantNiveau d'expérience souhaité : De 3 à 5 ansA propos du centre ou de la direction fonctionnelleThe Centre Inria de l'Université de Grenoble groups together almost 600 people in 26 research teams and 9 research...
Research Engineer F/M Differential Privacy and Fairness-Aware AI

il y a 3 jours

Montbonnot-Saint-Martin, Auvergne-Rhône-Alpes, France Inria Temps plein

Le descriptif de l'offre ci-dessous est en AnglaisType de contrat : CDDNiveau de diplôme exigé : Bac + 5 ou équivalentFonction : Ingénieur scientifique contractuelNiveau d'expérience souhaité : De 3 à 5 ansA propos du centre ou de la direction fonctionnelleThe Inria Grenoble research center groups together almost 600 people in 27 research teams and 8...
INTERNSHIP - Optics - Closed-loop caracterization and control of ultrafast tip/tilt platforms

il y a 1 semaine

Montbonnot-Saint-Martin, Auvergne-Rhône-Alpes, France Bertin Alpao Temps plein

BERTIN ALPAO, a subsidiary of the BERTIN TECHNOLOGIES group, is a high-tech company renowned for its innovation and expertise in adaptive optics (AO).A world leader in this field, we design and produce a wide range of deformable mirrors (DM), wavefront sensors (WFS) and customized systems, specially designed for demanding applications such as space,...
_Automaticienne / Automaticien

il y a 3 jours

Montbonnot-Saint-Martin, Auvergne-Rhône-Alpes, France Capgemini Engineering Temps plein

Capgemini Engineering, leader mondial des services d'ingénierie , rassemble des équipes d'ingénieurs , de scientifiques et d'architectes pour aider les entreprises les plus innovantes dans le monde à libérer leur potentiel . Des voitures autonomes aux robots qui sauvent des vies, nos experts en technologies digitales et logicielles sortent des sentiers...
Application Engineer f/h

il y a 3 jours

Montbonnot-Saint-Martin, Auvergne-Rhône-Alpes, France Merck Electronics Temps plein

Exprimez votre talent avec nous Vous voulez explorer, franchir des obstacles, faire des découvertes ? Nous savons que vos projets sont ambitieux. Les nôtres aussi Dans le monde entier, nos collègues ont la passion de l'innovation scientifique et technologique qui enrichit les vies humaines grâce à nos solutions dans les domaines Healthcare, Life...
Firmware Engineer Intern

il y a 3 jours

Montbonnot-Saint-Martin, Auvergne-Rhône-Alpes, France Arturia Temps plein

Arturia conçoit des logiciels et instruments de musique pour les musiciens et producteurs, professionnels comme amateurs. Sa mission est de rendre la création musicale accessible à tous grâce à la technologie, et d'offrir l'expérience la plus intuitive et agréable possible.Arturia commence son épopée en 1999 avec la création de synthétiseurs...
Engineering _ Ingénieure_Ingénieur méthode industrialisation

il y a 3 jours

Montbonnot-Saint-Martin, Auvergne-Rhône-Alpes, France Capgemini Temps plein

Capgemini Engineering, leader mondial des services d'ingénierie , rassemble des équipes d'ingénieurs , de scientifiques et d'architectes pour aider les entreprises les plus innovantes dans le monde à libérer leur potentiel . Des voitures autonomes aux robots qui sauvent des vies, nos experts en technologies digitales et logicielles sortent des sentiers...

Post-Doctorant F/H Editing and Conditional Generation with Text-to-Video Generation Models

Il y a 42 minutes

MontbonnotSaintMartin, Auvergne-Rhône-Alpes, France Inria Temps plein

Type de contrat : CDD

Niveau de diplôme exigé : Thèse ou équivalent

Fonction : Post-Doctorant

A propos du centre ou de la direction fonctionnelle

Le centre de recherche Inria de l'Université Grenoble Alpes regroupe un peu moins de 600 personnes réparties au sein de 27 équipes de recherche et 8 services support à la recherche.

Son effectif est distribué sur 3 campus à Grenoble, en lien étroit avec les laboratoires et les établissements de recherche et d'enseignement supérieur (Université Grenoble Alpes, CNRS, CEA, INRAE, …), mais aussi avec les acteurs économiques du territoire.

Présent dans les domaines du calcul et grands systèmes distribués, logiciels sûrs et systèmes embarqués, la modélisation de l'environnement à différentes échelles et la science des données et intelligence artificielle, Inria Grenoble - Rhône-Alpes participe au meilleur niveau à la vie scientifique internationale par les résultats obtenus et les collaborations tant en Europe que dans le reste du monde.

Contexte et atouts du poste

Titre : Editing and Conditional Generation with Text-to-Video Generation Models

Supervision : Dr Stéphane Lathuilière (INRIA-UGA)

Funding : BPI contract

Contexte :Background and Motivation

Recent advancements in generative AI, and in particular diffusion models [1,2], have significantly enhanced the capabilities of text-to-video (T2V) models[3,4], allowing users to produce richly varied and imaginative scenes from natural language descriptions. These systems demonstrate strong scene diversity and flexibility, making them attractive for applications in entertainment, simulation, and human–computer interaction. However, a persistent limitation lies in their inability to enforce fine-grained conditioning. For example, while a T2V model can generate a "person walking in a park," it cannot ensure that the person is wearing a specific garment or that the garment adapts convincingly to body shape, pose, and interaction with the environment. In contrast, virtual try-on (VTON) systems are highly specialized in clothing transfer tasks~[5], excelling at fine-grained conditioning of garments on target individuals. They can adapt clothing to morphology, pose, and texture details with remarkable realism. Yet, they lack the scene diversity and broader contextual awareness that T2V models offer. Current VTON approaches generally operate in isolation, focusing on clothing alignment rather than situating the dressed person within dynamic, complex environments. Bridging these two paradigms offers a powerful opportunity: to synthesize realistic humans dressed in controllable garments, embedded within richly described environments, and interacting with objects and other people. This integration could transform applications in e-commerce (immersive virtual try-on experiences), creative industries (fashion films, digital avatars), and simulation (training data for human–AI interaction).

Mission confiée

Research Objectives :

The primary mission of the Postdoctoral Research Fellow will be to advance the state-of-the-art in controllable and editable Text-to-Video (T2V) generation. The successful candidate will design, implement, and evaluate novel deep generative models and methodologies that address the current limitations of existing T2V systems. A core focus will be on achieving fine-grained conditional generation, allowing users to specify complex temporal, spatial, and stylistic constraints, as well as enabling intuitive and high-fidelity post-generation editing of the video content. The research will aim to produce models that are not only photorealistic but also exhibit high semantic fidelity, temporal coherence, and practical usability in creative and industrial applications.

Principales activités

Main Tasks (Plain Text)

The Postdoctoral Research Fellow will be responsible for the following main tasks. They will engage in Model Design and Development by designing and implementing novel architectures (e.g., Diffusion Models, Transformers, VAEs) specifically tailored for high-resolution, temporally consistent, and controllable video generation. A key focus is to develop conditional generation techniques to guide the Text-to-Video process using various complex inputs beyond a simple text prompt, such as image references, motion skeletons, semantic masks, or detailed scene descriptions. They will extensively research Video Editing and Manipulation, developing methods for high-fidelity post-generation video editing, allowing for non-destructive modification of generated videos (e.g., object replacement, style transfer, background alteration) while maintaining strong temporal consistency. Furthermore, they will investigate in-context editing mechanisms that enable precise changes to specific segments or objects within a generated video based on new text or image prompts. A core part of the role is Addressing Key T2V Challenges. This includes tackling the fundamental challenge of temporal coherence and consistency, ensuring that generated videos do not suffer from "flickering" or object identity changes across frames, and developing strategies to improve semantic fidelity, resolving issues where models misinterpret complex text prompts. They will also explore methods for efficient training and inference to manage the significant computational cost associated with high-resolution, long-duration video generation, and address the difficulties of data scarcity and bias through techniques like data augmentation or cross-modal transfer learning. Finally, they will perform Evaluation and Benchmarking, establishing rigorous quantitative and qualitative metrics to assess the quality, editability, and controllability of the developed models. The fellow is expected to prioritize Dissemination and Collaboration, which involves documenting research findings and publishing high-quality papers in top-tier machine learning and computer vision venues, actively participating in departmental seminars, and contributing to collaborative projects.

Compétences

Compétences techniques et niveau requis :We are seeking a motivated PhD candidate with a strong background in one or more the following areas :

speech processing, computer vision, machine learning,
solid programmming skills
interest in connecting AI with human cognition Prior experience with LLM, SpeechLMs, RL algorithms, or robotic platforms is a plus, but not mandatory

Langues : Anglais

Avantages

Restauration subventionnée
Transports publics remboursés partiellement
Congés: 7 semaines de congés annuels + 10 jours de RTT (base temps plein) + possibilité d'autorisations d'absence exceptionnelle (ex : enfants malades, déménagement)
Possibilité de télétravail 90 jours/an fixes ou flottants et aménagement du temps de travail
Équipements professionnels à disposition (visioconférence, prêts de matériels informatiques, etc.)
Prestations sociales, culturelles et sportives (Association de gestion des œuvres sociales d'Inria)
Accès à la formation professionnelle
Participation Protection Sociale Complémentaire sous conditions

Rémunération

2788€ gross salary / month

Informations générales

Thème/Domaine : Vision, perception et interprétation multimedia

Statistiques (Big data) (BAP E)
- Ville : Montbonnot
- Centre Inria : Centre Inria de l'Université Grenoble Alpes
- Date de prise de fonction souhaitée :
- Durée de contrat : 2 ans
- Date limite pour postuler :

Attention: Les candidatures doivent être déposées en ligne sur le site Inria. Le traitement des candidatures adressées par d'autres canaux n'est pas garanti.

Consignes pour postuler

Les candidatures doivent être déposées en ligne sur le site Inria.

Le traitement des candidatures adressées par d'autres canaux n'est pas garanti.

Sécurité défense :

Ce poste est susceptible d'être affecté dans une zone à régime restrictif (ZRR), telle que définie dans le décret n° relatif à la protection du potentiel scientifique et technique de la nation (PPST). L'autorisation d'accès à une zone est délivrée par le chef d'établissement, après avis ministériel favorable, tel que défini dans l'arrêté du 03 juillet 2012, relatif à la PPST. Un avis ministériel défavorable pour un poste affecté dans une ZRR aurait pour conséquence l'annulation du recrutement.

Politique de recrutement :

Dans le cadre de sa politique diversité, tous les postes Inria sont accessibles aux personnes en situation de handicap.

Contacts

Équipe Inria : ROBOTLEARN
Recruteur :

Lathuiliere Stephane /

A propos d'Inria

Inria est l'institut national de recherche dédié aux sciences et technologies du numérique. Il emploie 2600 personnes. Ses 215 équipes-projets agiles, en général communes avec des partenaires académiques, impliquent plus de 3900 scientifiques pour relever les défis du numérique, souvent à l'interface d'autres disciplines. L'institut fait appel à de nombreux talents dans plus d'une quarantaine de métiers différents. 900 personnels d'appui à la recherche et à l'innovation contribuent à faire émerger et grandir des projets scientifiques ou entrepreneuriaux qui impactent le monde. Inria travaille avec de nombreuses entreprises et a accompagné la création de plus de 200 start-up. L'institut s'efforce ainsi de répondre aux enjeux de la transformation numérique de la science, de la société et de l'économie.

Amériques

Europe

Asie / Océanie

Afrique

Emplois actuels liés à Post-Doctorant F/H Editing and Conditional Generation with Text-to-Video Generation Models - MontbonnotSaintMartin, Auvergne-Rhône-Alpes - Inria

Post-Doctorant F/H Editing and Conditional Generation with Text-to-Video Generation Models