PhD Position F/M Explainable and frugal audio scene description

7 months ago

Paris, France INRIA Full-time

Context and advantages of the position

Inria Défense & Sécurité (Inria D&S) was created in 2020 to federate Inria's actions for the benefit of military forces. The PhD will be carried out within the audio processing research team of Inria D&S. It will be supervised by Jean-François Bonastre and co-supervised by Raphaël Duroselle.

The automatic audio scene description task consists of presenting operators with a summary of the information present in the scene, in the form of augmented text. This text provides a visual summary of the most important information while efficiently structuring access to specific details. Here is an illustrative example of a summary: « This five-minute recording features three different speakers. Speaker A corresponds to a known identity in the database and speaks French with a strong Monawa accent. Speakers B and C are unknown in the database; they speak English in their interactions with A and use an unidentified language when talking to each other. The voices of B and C show strong similarities with speakers from the Eastern Quabar region. The main theme of the recording concerns a transfer of goods between the cities of Orienta and Flagrance. The date July 8, 2023 is mentioned three times. » Clicking on A gives the operator information about A and details of the voice identification performed. There will be direct access to the time segments during which A spoke and to their transcription. The transcription will highlight named entities such as names of people, places, and dates.
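As a rough illustration of what such augmented text could rest on, the sketch below models a scene summary as a small data structure in Python. All class and field names (`Segment`, `Speaker`, `SceneSummary`, `segments_for`) are hypothetical and chosen for this example only; the posting does not specify any data model.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Segment:
    """A time span in which one speaker talks, with its transcription."""
    start_s: float
    end_s: float
    transcript: str
    entities: List[str] = field(default_factory=list)  # named entities to highlight


@dataclass
class Speaker:
    label: str                     # e.g. "A"
    known_identity: Optional[str]  # None when the speaker is not in the database
    languages: List[str]
    segments: List[Segment] = field(default_factory=list)

    def speaking_time(self) -> float:
        """Total duration, in seconds, of this speaker's segments."""
        return sum(seg.end_s - seg.start_s for seg in self.segments)


@dataclass
class SceneSummary:
    duration_s: float
    speakers: List[Speaker]
    theme: str

    def render(self) -> str:
        """Produce the short text shown to the operator."""
        minutes = round(self.duration_s / 60)
        known = [s.label for s in self.speakers if s.known_identity is not None]
        unknown = [s.label for s in self.speakers if s.known_identity is None]
        return (
            f"This {minutes}-minute recording features {len(self.speakers)} speakers. "
            f"Known: {', '.join(known) or 'none'}. "
            f"Unknown: {', '.join(unknown) or 'none'}. "
            f"Main theme: {self.theme}."
        )

    def segments_for(self, label: str) -> List[Segment]:
        """What 'clicking on a speaker' would retrieve: that speaker's segments."""
        for speaker in self.speakers:
            if speaker.label == label:
                return speaker.segments
        return []


# Hypothetical data loosely mirroring the illustrative summary above.
speaker_a = Speaker(
    label="A",
    known_identity="identity-in-database",
    languages=["French"],
    segments=[
        Segment(0.0, 12.5, "(transcript of first turn)", entities=["Orienta"]),
        Segment(40.0, 47.5, "(transcript of second turn)", entities=["July 8, 2023"]),
    ],
)
speaker_b = Speaker(label="B", known_identity=None, languages=["English"])
summary = SceneSummary(300.0, [speaker_a, speaker_b], "a transfer of goods")
print(summary.render())
```

Keeping the rendered text separate from the underlying segments is one way to let the summary stay short while each element remains traceable to its time spans and transcription.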

Assigned mission

Goal

The aim of this thesis is to propose a general framework for processing audio recordings for intelligence purposes. It consists of defining a high-level application adapted to the needs of end users, one that presents a recording as a summary report highlighting its salient points.

Approach

This approach is inspired both by textual description of video scenes [1] and by dialogue systems based on audio-visual scenes [2]. The system will be based on the extraction of speech signal representations at different scales (frame, speech segment or sound event, complete recording), possibly dedicated to different tasks. These representations, which feed the various building blocks of the system, will be embeddings extracted from deep neural networks, either generic [3] or dedicated to each task. The fusion of the different levels of information can be achieved with an architecture inspired by the multi-stream "Encoder-Decoder" scheme [4], with several encoders producing sequences of representations and one or more decoders performing the tasks or sub-tasks required by the system. One of these decoders will produce a textual summary of the scene.
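To make the multi-stream idea concrete, here is a minimal toy sketch in pure Python, not the architecture of the thesis. It assumes each encoder has already produced a sequence of embeddings of the same dimension, and shows one possible fusion step: a decoder query attends over each stream with dot-product attention, and the per-stream contexts are averaged. The function names (`attend`, `fuse_streams`), the stream names, and the averaging rule are assumptions made for illustration; a real system would use trained deep networks and a text-generating decoder.

```python
import math
from typing import Dict, List

Vector = List[float]


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def dot(a: Vector, b: Vector) -> float:
    return sum(x * y for x, y in zip(a, b))


def attend(query: Vector, memory: List[Vector]) -> Vector:
    """Scaled dot-product attention: a weighted average of the memory vectors."""
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, m) / scale for m in memory])
    dim = len(memory[0])
    return [sum(w * m[i] for w, m in zip(weights, memory)) for i in range(dim)]


def fuse_streams(query: Vector, streams: Dict[str, List[Vector]]) -> Vector:
    """Attend within each stream, then average the per-stream context vectors."""
    contexts = [attend(query, sequence) for sequence in streams.values()]
    dim = len(query)
    return [sum(c[i] for c in contexts) / len(contexts) for i in range(dim)]


# Hypothetical 2-dimensional embeddings at three scales (values are made up).
streams = {
    "frame": [[0.1, 0.9], [0.2, 0.8], [0.3, 0.7], [0.4, 0.6]],
    "segment": [[0.5, 0.5], [0.9, 0.1]],
    "recording": [[0.6, 0.4]],
}
decoder_query = [1.0, 0.0]  # stands in for a decoder state at one output step
fused = fuse_streams(decoder_query, streams)
print(fused)
```

Each stream keeps its own time resolution, so the frame-level sequence can be much longer than the segment-level one without any resampling; only the attended context vectors need to share a dimension.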

Potential research directions, which aim to go beyond an audio scene description system built by assembling existing components, can be discussed and refined with the candidate.

Main activities

Bibliography, development, and evaluation of deep learning systems

Definition of a new task, a corpus, and an evaluation protocol

Work on the alignment between self-supervised representations of the speech signal and large language models

Weakly supervised system training

System evaluation

Skills

Master's level in computer science, mathematics, or phonetics.

Strong interest in applied research.

Written and spoken English

Signal processing

Machine learning and deep learning

Experience with deep learning toolkits such as PyTorch or Keras

Speech processing experience, knowledge of open source toolkits such as Kaldi or SpeechBrain

References

[1] Aafaq, N., Mian, A., Liu, W., Gilani, S. Z., & Shah, M. Video description: A survey of methods, datasets, and evaluation metrics. ACM Computing Surveys (CSUR), 52, 1-37.

[2] Hori, C., Alamri, H., Wang, J., Wichern, G., Hori, T., Cherian, A., Marks, T. K., et al. (2019). End-to-end audio visual scene-aware dialog using multimodal attention-based video features. In ICASSP 2019 - 2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) (pp. 2352-2356). Brighton, United Kingdom: IEEE.

[3] Zhang, C., & Tian, Y. (2016, December). Automatic video description generation via LSTM with joint two-stream encoding. In 2016 23rd International Conference on Pattern Recognition (ICPR) (pp. 2924-2929). IEEE.

[4] Pratap, V., Tjandra, A., Shi, B., Tomasello, P., Babu, A., Kundu, S., Elkahky, A., et al. (2023). Scaling speech technology to 1,000+ languages. arXiv.

Benefits

Subsidized meals

Partial reimbursement of public transport costs

Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)

Possibility of teleworking and flexible organization of working hours

Professional equipment available (videoconferencing, loan of computer equipment, etc.)

Social, cultural, and sports events and activities

Remuneration

1st and 2nd year: 2082 € gross/month

3rd year: 2190 € gross/month