Data node splitting policies for improved range query efficiency in k-dimensional point data indexes

Apothesis

 


dc.contributor.author Outsios, E.
dc.contributor.author Evangelidis, G.
dc.date.accessioned 2015-07-06T12:39:15Z
dc.date.available 2015-07-06T12:39:15Z
dc.date.issued 2011-09
dc.identifier.other http://ieeexplore.ieee.org/xpl/articleDetails.jsp?arnumber=6065062&abstractAccess=no&userType=inst el
dc.identifier.uri http://apothesis.teicm.gr/xmlui/handle/123456789/1610
dc.description.abstract High-dimensional vectors (points) are very common in image and video classification, time series data mining, and many modern data mining applications. One of the most popular classification methods on such data is k-Nearest Neighbor (kNN) searching. Unfortunately, all proposed and state-of-the-art multi-attribute indexes fall short in terms of usability as dimensionality increases. This is attributed to the "dimensionality curse" problem, according to which range searching above 10 dimensions is no more efficient than a sequential scan of the entire database. Thus, kNN searching, as a special case of range searching, stands to benefit considerably if we find ways to increase the performance of indexes in high dimensions. In this paper, we deal with space partitioning indexes and propose six data node splitting techniques. We examine their performance in terms of data node storage utilization and quality of space partitioning. These two conflicting goals are both essential for good range query performance. Our experiments with uniform and skewed data demonstrate that certain splitting techniques can perform satisfactorily. en
dc.format.extent 5 el
dc.language.iso en el
dc.rights Attribution-NonCommercial-NoDerivatives 4.0 International *
dc.rights.uri http://creativecommons.org/licenses/by-nc-nd/4.0/ *
dc.title Data node splitting policies for improved range query efficiency in k-dimensional point data indexes en
dc.type Conference paper el
dc.conference.information Kastoria, September 30-October 2, 2011 el
dc.conference.name 15th Panhellenic Conference on Informatics (PCI) el
dc.conference.publisher The Greek Computer Society (ΕΠΥ), the Department of Informatics and Telecommunications Engineering, University of Western Macedonia, and the Department of Informatics and Computer Technology, Technological Educational Institute of Western Macedonia el
dc.identifier.doi 10.1109/PCI.2011.46
dc.publication.category Publication prohibited - Bibliographic reference el
dc.subject.keyword Average storage utilization el
dc.subject.keyword Multi-attribute point data indexes el
dc.subject.keyword Range query performance el
dc.subject.keyword Space partitioning quality el


Files in this item

There are no files associated with this item.

The following licenses are associated with this item:


Except where otherwise noted, this item's license is described as Attribution-NonCommercial-NoDerivatives 4.0 International.