This position is no longer available.

R&D Internship - Voice activity detection in a multitask approach

Internship
Villeneuve-d'Ascq
Salary: Not specified
Starting date: June 30, 2019
Occasional remote
Experience: < 6 months
Education: Master's Degree

SteelSeries France
SteelSeries France

Interested in this job?

Questions and answers about the job

The position

Job description

Voice activity detection, more commonly known as VAD, is a speech processing technique used to detect the presence or absence of human speech in an audio signal.
The main applications of VADs are in speech coding and recognition, but it can also be used to disable some processes during the non-voice part of an audio session. This feature would thus reduce the CPU load of our algorithms.
In addition, voice activity can be jointly estimated together with other tasks such as noise reduction or other classification. Examples of multi-tasking approaches can be found in.

Subject
The first part of the internship will focus on benchmarking several state-of-the-art techniques (classical signal processing, deep learning and the adaptation of one of these techniques to the needs of A-Volute, the latency and computation cost properties being more important than accuracy for our application. Thus, knowledges in hardware or embedded software would be a plus.
In a second part and if the student has a particular aspiration for machine learning, it will be possible to work on a multi-task approach based on internal work that for the moment focuses on multi-task treatment of music.


Preferred experience

Who are we looking for ?
Preparing an engineering degree or master’s degree, or even a PhD (3 month visit), you preferably have knowledge in the development and implementation of advanced algorithms fordigital audio signal processing. In addition, advanced notions in the following various fields would be highly appreciated :

  • Audio,acoustics and psychoacoustics
  • Audio effects in general : compression, equalization, etc.
  • Machine learning and artificial neural networks.
  • Statistics, probabilist approaches, optimization.
  • Programming language : Matlab, Python.

Recruitment process

And experiences in the following areas would be a plus :

  • Sound spatialization effects : binaural synthesis,ambisonics, artificial reverberation.
  • Voice recognition, voice command.
  • Voice processing effects : noise reduction, echo cancellation, antenna processing.
  • Virtual, augmented and mixed reality.
  • Computer programming and development : Max/MSP, C/C+++/C#.
  • Video game engines : Unity, Unreal Engine, Wwise, FMod, etc.
  • Audio editing software : Audacity, Adobe Audition, etc.
  • Scientific publications and patent applications.
  • Fluent in English and French.
  • Demonstrate intellectual curiosity.

Want to know more?