Catalogo Articoli (Spogli Riviste)

OPAC HELP

Titolo:
Chinese document indexing based on a new partitioned signature file: Modeland evaluation
Autore:
Lam, W; Wong, KF; Wong, CY;
Indirizzi:
Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Shatin, Hong Kong, Peoples R China Chinese Univ Hong Kong Shatin Hong Kong Peoples R China Peoples R China
Titolo Testata:
JOURNAL OF THE AMERICAN SOCIETY FOR INFORMATION SCIENCE AND TECHNOLOGY
fascicolo: 7, volume: 52, anno: 2001,
pagine: 584 - 597
SICI:
1532-2882(200105)52:7<584:CDIBOA>2.0.ZU;2-Q
Fonte:
ISI
Lingua:
ENG
Soggetto:
OPTIMAL WEIGHT ASSIGNMENT; PARTIAL-MATCH RETRIEVAL; EXTRACTION;
Tipo documento:
Article
Natura:
Periodico
Settore Disciplinare:
Social & Behavioral Sciences
Citazioni:
24
Recensione:
Indirizzi per estratti:
Indirizzo: Lam, W Chinese Univ Hong Kong, Dept Syst Engn & Engn Management, Shatin, Hong Kong, Peoples R China Chinese Univ Hong Kong Shatin Hong Kong Peoples R China s R China
Citazione:
W. Lam et al., "Chinese document indexing based on a new partitioned signature file: Modeland evaluation", J AM SOC IN, 52(7), 2001, pp. 584-597

Abstract

In this article we investigate the use of signature files in Chinese information retrieval system and propose a new partitioning method for Chinese signature file based on the characteristic of Chinese words. Our partitioning method, called Partitioned Signature File for Chinese (PSFC), offers faster search efficiency than the traditional single signature file approach. We devise a general scheme for controlling the trade-off between the false drop and storage overhead while maintaining the search space reduction in PSFC. An analytical study is presented to support the claims of our method. We also propose two new hashing methods for Chinese signature files so that the signature file will be more suitable for dynamic environment while the retrieval performance is maintained. Furthermore, we have implemented PSFC and the new hashing methods, and we evaluated them using a large-scale real-world Chinese document corpus, namely, the TREC-5 (Text REtrieval Conference) Chinese collection. The experimental results confirm the features of PSFC and demonstrate its superiority over the traditional single signature file method.

ASDD Area Sistemi Dipartimentali e Documentali, Università di Bologna, Catalogo delle riviste ed altri periodici
Documento generato il 07/04/20 alle ore 23:10:18