Clustering Algorithm (Dbscan) : Vishal Bharti Computer Science Dept. GC, Cuny
Clustering Algorithm (Dbscan) : Vishal Bharti Computer Science Dept. GC, Cuny
(DBSCAN)
VISHAL BHARTI
Computer Science Dept.
GC, CUNY
Clustering Algorithm
▪ The benefit of using the indexed structure is that the neighborhood queries
have O(1) complexity.
▪ Cell Neighborhood : The cell neighborhood NCell (c) of a given cell c denotes
all cells d from the space of all available grid cells C that have a Chebychev
distance distChebychev of zero or one to c, i.e., NCell (c) = { d | d ∈ C ∧
distChebychev (c, d) ≤ 1 }.
▪ To get all neighborhood points within an assigned subspace, the processor
need an additional one cell-thick layer of redundant data items. This is known
as halos or ghost cells. These are transferred during the redistribution phase.
▪ After the redistribution phase, a local DBSCAN algorithm is run locally at
each of the processors.
▪ To ensure a balanced data space division, they use a cost heuristic.
Cost Heuristic
Local DBSCAN
Local DBSCAN