Catalogo Articoli (Spogli Riviste)

OPAC HELP

Titolo:
A Japanese TTS system based on multiform units and a speech modification algorithm with harmonics reconstruction
Autore:
Takano, S; Tanaka, K; Mizuno, H; Abe, M; Nakajima, S;
Indirizzi:
Nippon Telegraph & Tel Publ Corp, Cyber Space Labs, Kanagawa 2390847, Japan Nippon Telegraph & Tel Publ Corp Kanagawa Japan 2390847 a 2390847, Japan
Titolo Testata:
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING
fascicolo: 1, volume: 9, anno: 2001,
pagine: 3 - 10
SICI:
1063-6676(200101)9:1<3:AJTSBO>2.0.ZU;2-6
Fonte:
ISI
Lingua:
ENG
Keywords:
evaluation of synthesis technique and systems; segmental units and adjustment rules; spectral analysis; synthesis structure and systems;
Tipo documento:
Article
Natura:
Periodico
Settore Disciplinare:
Engineering, Computing & Technology
Citazioni:
23
Recensione:
Indirizzi per estratti:
Indirizzo: Takano, S Nippon Telegraph & Tel Publ Corp, Cyber Space Labs, Kanagawa 2390847, Japan Nippon Telegraph & Tel Publ Corp Kanagawa Japan 2390847 Japan
Citazione:
S. Takano et al., "A Japanese TTS system based on multiform units and a speech modification algorithm with harmonics reconstruction", IEEE SPEECH, 9(1), 2001, pp. 3-10

Abstract

This paper proposes a new text to-speech (TTS) system that utilizes large numbers of speech segments to produce very natural and intelligible synthetic speech. There are two innovations; new multiform synthesis units and a new speech modification algorithm based on a vocoder that offers harmonics reconstruction. The multiform units make it possible to reduce acoustic discontinuities at concatenation points and unnatural sound by preparing synthesis units with various lengths and various F-0 contours. The new speech modification algorithm, on the other hand, improves the quality of prosody modified speech. This algorithm is extremely effective in synthesizing speech whose prosodic parameters are quite different from those of synthesis units. Listening tests confirm that the new synthesis units yield speech with high intelligibility and naturalness, and that the new speech modification algorithm is superior to all other conventional vocoders and waveform domain algorithms including TD-PSOLA, especially when modifying the F-0 frequency upward.

ASDD Area Sistemi Dipartimentali e Documentali, Università di Bologna, Catalogo delle riviste ed altri periodici
Documento generato il 06/04/20 alle ore 08:10:40