Abstract
Large amounts of spatio-temporal data are continuously collected through the use of location devices or sensor technologies. One of the techniques usually used to obtain a first insight on data is clustering. The Shared Nearest Neighbour (SNN) is a clustering algorithm that finds clusters with different densities, shapes and sizes, and also identifies noise in data, making it a good candidate to deal with spatial data. However, its time complexity is, in the worst case, O(n(2)), compromising its scalability. This paper presents the use of a metric data structure, the kd-Tree, to index spatial data and support the SNN in querying for the k-nearest neighbours, improving the time complexity in the average case of the algorithm, when dealing with low dimensional data, to at most O(n x log n). The proposed algorithm, the kd-SNN, was evaluated in terms of performance, showing huge improvements over existing approaches, allowing the identification of the main traffic routes by completely clustering a maritime data set.
Original language | English |
---|---|
Title of host publication | COMPUTATIONAL SCIENCE AND ITS APPLICATIONS - ICCSA 2014, PT 1 |
Editors | B Murgante, S Misra, AMAC Rocha, C Torre, JG Rocha, MI Falcao, D Taniar, BO Apduhan, O Gervasi |
Publisher | SPRINGER-VERLAG BERLIN |
Pages | 312-327 |
Number of pages | 16 |
ISBN (Print) | 978-3-319-09143-3 |
Publication status | Published - 2014 |
Event | 14th International Conference on Computational Science and Its Applications (ICCSA) - University of Minho, Guimaraes, Portugal Duration: 30 Jun 2014 → 3 Jul 2014 http://2014.iccsa.org/ |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | SPRINGER-VERLAG BERLIN |
Volume | 8579 |
ISSN (Print) | 0302-9743 |
Conference
Conference | 14th International Conference on Computational Science and Its Applications (ICCSA) |
---|---|
Abbreviated title | ICCSA 2014 |
Country/Territory | Portugal |
City | Guimaraes |
Period | 30/06/14 → 3/07/14 |
Internet address |
Keywords
- kd-tree
- SNN
- spatial data
- movement data
- route identification