Back to Search
Start Over
Swarm Intelligence Based Particle Filter for Alternating Talker Localization and Tracking Using Microphone Arrays
- Source :
- IEEE/ACM Transactions on Audio, Speech, and Language Processing. 25:1384-1397
- Publication Year :
- 2017
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2017.
-
Abstract
- We address the problem of localizing and tracking alternating (moving or stationary) talkers using microphone arrays in a room environment. One of the main challenges is the frequent (and possibly abrupt) change of talker positions which requires the algorithm to capture the active talker rapidly. In addition, the presence of interference, background noise and room reverberation degrades the tracking performance. We propose a new algorithm that jointly exploits the advantages of the particle filter (PF) and particle swarm intelligence. The PF is used as a general tracking framework which incorporates a proposed alternating source-dynamic model for recursive estimation of talker position. Unlike the conventional PF where particles operate independently in the particle sampling stage, the use of swarm intelligence allows particles to interact with each other, thereby improving convergence toward the active talker location. In addition, the memory mechanism in swarm intelligence allows particles to remain at their previous best-fit state estimate when signals are corrupted by interference, noise and/or reverberation. Simulations and experiments were conducted to demonstrate the effectiveness of the proposed algorithm. Accepted version
- Subjects :
- Reverberation
Acoustics and Ultrasonics
Microphone
Computer science
Speech recognition
Talker localization and tracking
02 engineering and technology
Tracking (particle physics)
Interference (wave propagation)
Swarm intelligence
Background noise
030507 speech-language pathology & audiology
03 medical and health sciences
0202 electrical engineering, electronic engineering, information engineering
Computer Science (miscellaneous)
Computer vision
Electrical and Electronic Engineering
business.industry
020206 networking & telecommunications
Microphone arrays
Computational Mathematics
Noise
Artificial intelligence
0305 other medical science
business
Particle filter
Subjects
Details
- ISSN :
- 23299304 and 23299290
- Volume :
- 25
- Database :
- OpenAIRE
- Journal :
- IEEE/ACM Transactions on Audio, Speech, and Language Processing
- Accession number :
- edsair.doi.dedup.....e245333f8271328f4888156c798844e8
- Full Text :
- https://doi.org/10.1109/taslp.2017.2693566