Browse our site
About
People
Research Areas
Projects
Publications
Books
Book chapters
Journal articles
In proceedings
M. Sc. Dissertations
Ph. D. Dissertations
Technical reports
Seminars
News
You are here:
Home
Publications
View
Publication details
Go back
Publication details
Main information
Title:
Implementation for Spatial Data of the Shared Nearest Neighbour with Metric Data Structures
Publication date:
November 2012
Citation:
bfaustino
Abstract:
The Shared Nearest Neighbour (SNN) is a data clustering algorithm that identifies noise in data and finds clusters with different densities, shapes and sizes. making the SNN a good candidate to deal with spatial data. The SNN time complexity can be a bottleneck in spatial data clustering, since it has a time complexity in the worst case evaluated in O(n2). In this thesis, it is proposed to use metric data structures to index spatial data and support the SNN in querying for the k-nearest neighbours. When dealing with spatial data, the time complexity in the average case of the SNN, using a metric data structure in primary storage (kd-Tree), is improved to at most O(n × log n). Furthermore, using a strategy to reuse the k-nearest neighbours between consecutive runs, it is possible to obtain a time complexity in the worst case of O(n). The experimental results were done using the kd-Tree and the DF-Tree, which work in primary and secondary storage, respectively.
M. Sc. dissertation
Authors:
Bruno Faustino
Supervisors:
João Moura Pires
School:
UNL
Note:
-
Url address:
-
Publication files
File #1:
- click here to download -
pdf 5242 KB
Export formats
Plain text:
Bruno Faustino, Implementation for Spatial Data of the Shared Nearest Neighbour with Metric Data Structures, João Moura Pires (superv.), UNL, November 2012.
HTML:
<b>Bruno Faustino</b>, <u>Implementation for Spatial Data of the Shared Nearest Neighbour with Metric Data Structures</u>, <a href="/people/members/view.php?code=542b14e1830dcf7566974fd36b6fccc7" class="supervisor">João Moura Pires</a> (superv.), UNL, November 2012.
BibTeX:
@mastersthesis {bfaustino, author = {Bruno Faustino}, title = {Implementation for Spatial Data of the Shared Nearest Neighbour with Metric Data Structures}, school = {UNL}, note = {Jo{\~a}o Moura Pires (superv.); }, abstract = {The Shared Nearest Neighbour (SNN) is a data clustering algorithm that identifies noise in data and finds clusters with different densities, shapes and sizes. making the SNN a good candidate to deal with spatial data. The SNN time complexity can be a bottleneck in spatial data clustering, since it has a time complexity in the worst case evaluated in O(n2). In this thesis, it is proposed to use metric data structures to index spatial data and support the SNN in querying for the k-nearest neighbours. When dealing with spatial data, the time complexity in the average case of the SNN, using a metric data structure in primary storage (kd-Tree), is improved to at most O(n × log n). Furthermore, using a strategy to reuse the k-nearest neighbours between consecutive runs, it is possible to obtain a time complexity in the worst case of O(n). The experimental results were done using the kd-Tree and the DF-Tree, which work in primary and secondary storage, respectively.}, keywords = {df-tree,kd-tree,sharednearestneighbour,snn,spatialdata}, month = {November}, year = {2012}, }
Publication's urls
Full url:
/publications/view.php?code=4ad9d35f8453cb17d6455049d54ac481
Friendly url:
/publications/view.php?code=bfaustino
Go back
Departamento de Informática, FCT/UNL
Quinta da Torre 2829-516 CAPARICA - Portugal
Tel. (+351) 21 294 8536 FAX (+351) 21 294 8541