Engineer Position: Automatic Speech Recognition for Non-native Speakers in a Noisy Environment

2 days ago


Villers-lès-Nancy, France, Inria, full-time

The job description below is in English.

**Contract type**: Fixed-term contract (CDD)

**Required degree level**: Master's degree (Bac + 5) or equivalent

**Position**: Contract research engineer

**Desired experience level**: Recent graduate

**Context and advantages of the position**:
The work will be carried out in the MULTISPEECH team at Inria-LORIA, Nancy.

MULTISPEECH is a joint research team of the Université de Lorraine, Inria, and CNRS. It is part of department D4 “Natural language and knowledge processing” of LORIA.

Its research focuses on speech processing, with particular emphasis on multisource (source separation, robust speech recognition), multilingual (computer-assisted language learning), and multimodal aspects.

**Assignment**:
**Context**
- When a person's hands are busy with a task such as driving a car or piloting an airplane, voice is a fast and efficient means of interaction. In aeronautical communications, English is most often compulsory. Unfortunately, many pilots are not native English speakers: they speak with an accent that depends on their native language and is shaped by its pronunciation mechanisms. Inside an aircraft cockpit, the pilots' non-native speech and the surrounding noise are the most difficult challenges to overcome for efficient automatic speech recognition (ASR). The problems of non-native speech are numerous: incorrect or approximate pronunciations, errors of agreement in gender and number, use of non-existent words, missing articles, grammatically incorrect sentences, etc. The acoustic environment adds a disturbing component to the speech signal. Much of the success of speech recognition relies on the ability to account for different accents and ambient noise in the models used for ASR.
- Automatic speech recognition has advanced considerably thanks to the rapid development of deep learning. In recent years, end-to-end ASR, which directly optimizes the probability of the output character sequence given the input acoustic features, has made great progress [Chan et al., 2016; Baevski et al., 2020; Gulati et al., 2020].

**Objectives**
- The recruited person will have to develop methodologies and tools to obtain high-performance non-native automatic speech recognition in the aeronautical context and more specifically in a (noisy) aircraft cockpit.
- This project will be based on an end-to-end automatic speech recognition system.

**References**
- [Baevski et al., 2020] A. Baevski, H. Zhou, A. Mohamed, and M. Auli. Wav2vec 2.0: A framework for self-supervised learning of speech representations. 34th Conference on Neural Information Processing Systems (NeurIPS), 2020.
- [Chan et al., 2016] W. Chan, N. Jaitly, Q. Le, and O. Vinyals. Listen, attend and spell: A neural network for large vocabulary conversational speech recognition. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 4960-4964, 2016.
- [Chorowski et al., 2017] J. Chorowski and N. Jaitly. Towards better decoding and language model integration in sequence to sequence models. Interspeech, 2017.
- [Houlsby et al., 2019] N. Houlsby, A. Giurgiu, S. Jastrzebski, B. Morrone, Q. De Laroussilhe, A. Gesmundo, M. Attariyan, S. Gelly. Parameter-efficient transfer learning for NLP. International Conference on Machine Learning, PMLR, pp. 2790-2799, 2019.
- [Gulati et al., 2020] A. Gulati, J. Qin, C.C. Chiu, N. Parmar, Y. Zhang, J. Yu, W. Han, S. Wang, Z. Zhang, Y. Wu, and R. Pang. Conformer: Convolution-augmented transformer for speech recognition. Interspeech, 2020.
- [Shi et al., 2021] X. Shi, F. Yu, Y. Lu, Y. Liang, Q. Feng, D. Wang, Y. Qian, and L. Xie. The accented English speech recognition challenge 2020: open datasets, tracks, baselines, results and methods. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6918-6922, 2021.

**Main activities**:
The main activities are those typical of an engineer. They include: literature reading, scientific development, programming and simulation, data processing, reporting and presentation, paper writing, and collaboration with the team, the supervisors, and other scientific partners.

**Duration**: 8-10 months

**Skills**:

- M.Sc. or engineering degree in speech/audio processing, computer vision, machine learning, or a related field,
- ability to work independently as well as in a team,
- solid programming skills (Python, PyTorch), and deep learning knowledge,
- good level of written and spoken English.

**Benefits**:

- Subsidized meals
- Partial reimbursement of public transport costs
- Leave: 7 weeks of annual leave + 10 extra days off due to RTT (statutory reduction in working hours) + possibility of exceptional leave (sick children, moving home, etc.)
- Possibility of teleworking (after 6 months of employment) and flexible organization of working hours
- Professional equipment availa