Article Catalog (Journal Indexing)

Title:
Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure
Authors:
Liu, DJ; Lin, CT;
Addresses:
Natl Chiao Tung Univ, Dept Elect & Control Engn, Hsinchu, Taiwan
Journal Title:
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
issue: 6, volume: 9, year: 2001,
pages: 609 - 621
SICI:
1063-6676(200109)9:6<609:FFEBOT>2.0.ZU;2-Q
Source:
ISI
Language:
ENG
Subject:
TONE RECOGNITION; SPEECH; CLASSIFICATION;
Keywords:
adaptive representation; harmonic structure; partial FFT; pitch contour; pitch measure; spectrum analysis;
Document type:
Article
Publication type:
Periodical
Subject Area:
Engineering, Computing & Technology
Citations:
24
Review:
Reprint addresses:
Address: Liu, DJ Natl Chiao Tung Univ, Dept Elect & Control Engn, Hsinchu, Taiwan
Citation:
D.J. Liu and C.T. Lin, "Fundamental frequency estimation based on the joint time-frequency analysis of harmonic spectral structure", IEEE SPEECH, 9(6), 2001, pp. 609-621

Abstract

In this paper, we propose a new scheme to analyze the spectral structure of speech signals for fundamental frequency estimation. First, we propose a pitch measure to detect the harmonic characteristics of voiced sounds on the spectrum of a speech signal. This measure utilizes two properties: there are distinct impulses located at the positions of the fundamental frequency and its harmonics, and the energy of a voiced sound is dominated by the energy of these distinct harmonic impulses. The spectrum can be obtained by the fast Fourier transform (FFT); however, it may be destroyed when the speech is interfered with by additive noise. To enhance the robustness of the proposed scheme in noisy environments, we apply the joint time-frequency analysis (JTFA) technique to obtain the adaptive representation of the spectrum of speech signals. The adaptive representation can accurately extract the important harmonic structure of noisy speech signals, at the expense of a high computational cost. To solve this problem, we further propose a fast adaptive representation (FAR) algorithm, which reduces the computational complexity of the original algorithm by 50%. The performance of the proposed fundamental-frequency estimation scheme is evaluated on a large database with and without additive noise, and is compared to that of other approaches on the same database. The experimental results show that the proposed scheme performs well on clean speech and is robust in noisy environments.
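
The two spectral properties named in the abstract (impulses at the fundamental and its harmonics, and voiced energy dominated by those impulses) can be illustrated with a small sketch. This is not the authors' pitch measure, and it does not implement the adaptive representation or FAR: it is a generic harmonic-energy ratio computed on a plain DFT, under the assumption that candidate fundamentals are restricted to DFT bins. All names (`power_spectrum`, `pitch_measure`, `estimate_f0`, `synth`), the 80-400 Hz search range, and the 0.3 voicing threshold are hypothetical choices made for the illustration.

```python
import cmath
import math
import random


def power_spectrum(frame):
    """Naive DFT power |X[m]|^2 for bins 1 .. N/2 - 1 (DC and Nyquist skipped)."""
    n = len(frame)
    twiddle = [cmath.exp(-2j * math.pi * k / n) for k in range(n)]
    power = {}
    for m in range(1, n // 2):
        acc = sum(x * twiddle[(m * i) % n] for i, x in enumerate(frame))
        power[m] = abs(acc) ** 2
    return power


def pitch_measure(power, b):
    """Share of energy on the harmonics of bin b, minus the share that a
    flat (noise-like) spectrum would place on the same number of bins."""
    harmonics = range(b, max(power) + 1, b)
    share = sum(power[m] for m in harmonics) / sum(power.values())
    return share - len(harmonics) / len(power)


def estimate_f0(frame, fs, f_min=80.0, f_max=400.0, voiced_threshold=0.3):
    """Return (f0 in Hz or None if judged unvoiced, best score).
    Assumption: the threshold and search range are illustrative only."""
    n = len(frame)
    power = power_spectrum(frame)
    lo = math.ceil(f_min * n / fs)
    hi = math.floor(f_max * n / fs)
    best = max(range(lo, hi + 1), key=lambda b: pitch_measure(power, b))
    score = pitch_measure(power, best)
    f0 = best * fs / n if score >= voiced_threshold else None
    return f0, score


def synth(fs, n, f0, noise_sigma, seed=0):
    """Synthetic 'voiced' frame: five harmonics with 1/k amplitudes plus
    white Gaussian noise (a toy stand-in for speech, not real data)."""
    rng = random.Random(seed)
    return [
        sum(math.sin(2 * math.pi * k * f0 * i / fs) / k for k in range(1, 6))
        + rng.gauss(0.0, noise_sigma)
        for i in range(n)
    ]


if __name__ == "__main__":
    fs, n = 8000, 400  # 20 Hz bin spacing, so 200 Hz falls exactly on bin 10
    for sigma in (0.0, 0.3):
        f0, score = estimate_f0(synth(fs, n, 200.0, sigma), fs)
        print(f"noise sigma={sigma}: f0={f0}, score={score:.3f}")
```

On the clean synthetic frame the search lands on bin 10, that is, 200 Hz. Subtracting the flat-spectrum share is what keeps the sub-octave candidate (100 Hz), whose harmonic comb also covers every 200 Hz harmonic, from tying with the true fundamental. The frequency resolution is limited to fs/N, and real speech would additionally need windowing and frame-by-frame tracking.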

ASDD Area Sistemi Dipartimentali e Documentali, Università di Bologna, Catalog of journals and other periodicals
Document generated on 05/04/20 at 06:09:13