Main Page

From ManyEars_Wiki

Jump to: navigation, search

Contents

The ManyEars Project: Microphone Array-Based Audition for Mobile Robots

It took many years and many man-years to use many ears on a mobile robot

About

Intelligent systems would benefit from being able to localize and track sound sources in real life settings. Such a capability can help localizing a person or an interesting event in the environment, and also provides enhanced processing for other capabilities such as speech recognition. To give this capability to a computing system, the challenge is not only to localize simultaneous sound sources, but to track them over time. The ManyEars project propose a robust sound source localization and tracking method using an array of eight microphones. The method is based on a frequency-domain implementation of a steered beamformer along with a particle filter-based tracking algorithm. Tests on a mobile robot show that the algorithm can localize and track in real-time multiple moving sources of different types over a range of 7 meters. These new capabilities allowed the robot to interact using more natural means with people in real life settings.

The ManyEars project was setup to provide source code for AUDIBLE project. It provides FlowDesigner building-blocks for microphone array processing. This includes sound source localisation, tracking and separation.

Authors

Click here for the list of authors.

News

Project Information

Getting Started

Please refer to the Installation Instructions.

Hardware Requirements

Hardware requirements are available here.

License

ManyEars is distributed under the GNU General Public License (GPL) and a patent license is provided for use in GPL software. See the full license for details.

Download

Click here to access the download section and release informations.

Publications

Most of the publications about the method are available on Jean-Marc Valin's website. The most relevant ones are given below:


  1. J.-M. Valin, S. Yamamoto, J. Rouat, F. Michaud, K. Nakadai, H. G. Okuno, Robust Recognition of Simultaneous Speech By a Mobile Robot, IEEE Transactions on Robotics, Vol. 23, No. 4, pp. 742-752, 2007.</li>
  2. J.-M. Valin, F. Michaud, J. Rouat, Robust Localization and Tracking of Simultaneous Moving Sound Sources Using Beamforming and Particle Filtering. Robotics and Autonomous Systems Journal (Elsevier), Vol. 55, No. 3, pp. 216-228, 2007.
  3. F. Michaud, C. Cote, D. Letourneau, Y. Brosseau, J.-M. Valin, E. Beaudry, C. Raievsky, A. Ponchon, P. Moisan, P. Lepage, Y. Morin, F. Gagnon, P. Giguere, M.-A. Roux, S. Caron, P. Frenette, F. Kabanza, Spartacus attending the 2005 AAAI conference. Autonomous Robots (Springer), Vol. 22, No. 4, pp. 369-383, 2007.
  4. S. Yamamoto, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, K. Komatani, T. Ogata, H. G. Okuno, Simultaneous Speech Recognition based on Automatic Missing-Feature Mask Generation integrated with Sound Source Separation (in Japanese). Journal of Robotic Society of Japan, Vol. 25, No. 1, 2007.
  5. S. Yamamoto, K. Nakadai, M. Nakano, H. Tsujino, J.-M. Valin, R. Takeda, K. Komatani, T. Ogata, H. G. Okuno, Improving Location-Based Speech Recognition of Simultaneous Speech Signals by Parameter Optimization with Genetic Algorithm (in Japanese). Human Interface, Vol.8, No.2, pp. 203-212, 2006.
Personal tools