Catalogo Articoli (Spogli Riviste)

OPAC HELP

Titolo:
Audio-visual speaker recognition for video broadcast news
Autore:
Maison, B; Neti, C; Senior, A;
Indirizzi:
IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA IBM Corp Yorktown Heights NY USA 10598 tr, Yorktown Heights, NY 10598 USA
Titolo Testata:
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY
fascicolo: 1-2, volume: 29, anno: 2001,
pagine: 71 - 79
SICI:
1387-5485(200108)29:1-2<71:ASRFVB>2.0.ZU;2-X
Fonte:
ISI
Lingua:
ENG
Keywords:
speaker identification; face recognition; multimodal; fusion; broadcast news;
Tipo documento:
Article
Natura:
Periodico
Settore Disciplinare:
Engineering, Computing & Technology
Citazioni:
15
Recensione:
Indirizzi per estratti:
Indirizzo: Maison, B IBM Corp, Thomas J Watson Res Ctr, POB 218, Yorktown Heights, NY10598 USA IBM Corp POB 218 Yorktown Heights NY USA 10598 hts, NY 10598 USA
Citazione:
B. Maison et al., "Audio-visual speaker recognition for video broadcast news", J VLSI S P, 29(1-2), 2001, pp. 71-79

Abstract

Audio-based speaker identification degrades severely when there is a mismatch between training and test conditions due either to channel or to noise. In this paper, we explore various techniques to combine video based speaker identification with audio-based speaker identification to improve the performance under mismatched conditions. Specifically, we explore techniques to optimally determine the relative weights of the independent decisions based on audio and video to achieve the best combination. Experiments on videobroadcast news data show that significant improvements can be achieved by the fusion in acoustically degraded conditions.

ASDD Area Sistemi Dipartimentali e Documentali, Università di Bologna, Catalogo delle riviste ed altri periodici
Documento generato il 20/01/20 alle ore 10:27:28