Article Catalog (Journal Indexing)

Title:
On analyzing errors in a selectivity estimation method based on dynamic maintenance of data distribution
Authors:
Kim, SW; Whang, WK;
Addresses:
Kangwon Natl Univ, Div Comp Informat & Commun Engn, Chunchon 200701, Kangwon, South Korea
Journal Title:
INFORMATION AND SOFTWARE TECHNOLOGY
issue: 4, volume: 43, year: 2001,
pages: 265 - 274
SICI:
0950-5849(20010315)43:4<265:OAEIAS>2.0.ZU;2-W
Source:
ISI
Language:
ENG
Keywords:
selectivity estimation; data distribution; error analysis; multilevel grid files; performance evaluation; multidimensional file structure;
Document type:
Article
Nature:
Periodical
Subject area:
Engineering, Computing & Technology
Citations:
14
Review:
Reprint addresses:
Address: Kim, SW, Kangwon Natl Univ, Div Comp Informat & Commun Engn, 192-1, Hyoja 2 Dong, Chunchon 200701, Kangwon, South Korea
Citation:
S.W. Kim and W.K. Whang, "On analyzing errors in a selectivity estimation method based on dynamic maintenance of data distribution", INF SOFTW T, 43(4), 2001, pp. 265-274

Abstract

In this paper, we discuss the errors in selectivity estimation using the multilevel grid file (MLGF), a multidimensional file structure. We first demonstrate that estimation errors stem from the uniformity assumption that records are uniformly distributed in their belonging region represented by an entry in a level of an MLGF directory. Based on this demonstration, we then investigate five factors affecting the accuracy of estimation: (1) the data distribution in a region, (2) the number of records stored in an MLGF, (3) the page size, (4) the query region size, and (5) the level of an MLGF directory. Next, we present the tendency of estimation errors according to the change of values for each factor through extensive experiments. The results show that the errors decrease when: (1) the distribution of records in a region becomes closer to the uniform one; (2) the number of records in an MLGF increases; (3) the page size decreases; (4) the query region size increases; and (5) the level of an MLGF directory employed as data distribution information becomes lower. After defining the Granule Ratio, the core formula representing the basic relationship between the estimation errors and the above factors, we finally examine the change of estimation errors according to the change of the values for the Granule Ratio through experiments. The results indicate that errors tend to be similar depending on the values for the Granule Ratio regardless of the various changes of the values for the factors. (C) 2001 Elsevier Science B.V. All rights reserved.
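
The uniformity assumption named in the abstract can be illustrated with a minimal sketch. This is not the authors' MLGF implementation: the `Region` class, the function names, and the flat list of regions are hypothetical stand-ins for directory entries, and the estimator shown is the generic "record count times overlapped fraction of the region" rule that such an assumption implies. The Granule Ratio is not reproduced here because the record does not give its formula.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple

Interval = Tuple[float, float]  # [low, high) along one dimension


@dataclass
class Region:
    """Hypothetical stand-in for one directory entry: a box and its record count."""
    bounds: Sequence[Interval]
    count: int


def overlap_fraction(region: Region, query: Sequence[Interval]) -> float:
    """Fraction of the region's volume that the query box covers."""
    fraction = 1.0
    for (r_lo, r_hi), (q_lo, q_hi) in zip(region.bounds, query):
        width = r_hi - r_lo
        overlap = min(r_hi, q_hi) - max(r_lo, q_lo)
        if width <= 0 or overlap <= 0:
            return 0.0  # degenerate region or no intersection in this dimension
        fraction *= overlap / width
    return fraction


def estimate_selectivity(regions: Sequence[Region], query: Sequence[Interval]) -> float:
    """Estimated fraction of all records falling inside the query box.

    Uniformity assumption: records are spread evenly inside each region, so a
    region contributes count * (overlapped share of its volume).
    """
    total = sum(r.count for r in regions)
    if total == 0:
        return 0.0
    estimated = sum(r.count * overlap_fraction(r, query) for r in regions)
    return estimated / total


# Two adjacent unit squares holding 100 and 300 records.
regions = [Region([(0.0, 1.0), (0.0, 1.0)], 100),
           Region([(1.0, 2.0), (0.0, 1.0)], 300)]
straddling = estimate_selectivity(regions, [(0.5, 1.5), (0.0, 1.0)])
lower_left = estimate_selectivity(regions, [(0.0, 1.0), (0.0, 0.5)])
```

If the 100 records of the first square were in fact clustered in its upper half, the second query would return none of them while the estimate still counts 50, which is the kind of error the abstract attributes to non-uniform data within a region.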

ASDD Area Sistemi Dipartimentali e Documentali, Università di Bologna, Catalog of journals and other periodicals
Document generated on 23/09/20 at 06:28:21