Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Clustering Using Grid-based Method, Lecture notes of Data Mining

Tribhuvan University Kathmandu Data Mining

STING: Statistical Information Grid

Typology: Lecture notes

2018/2019

Uploaded on 04/17/2019

2018006347.tara 🇳🇵

1 document

1 / 10

This page cannot be seen from the preview

Don't miss anything!

A presentation on

STING: Statistical Information

Grid (Grid–Based Methods )

By:

Binko Toure

Favour Iwoni

Tara Chandra

Shrestha

Discover Lecture notes of Data Mining Tribhuvan University Kathmandu

Partial preview of the text

Download Clustering Using Grid-based Method and more Lecture notes Data Mining in PDF only on Docsity!

A presentation on

STING: Statistical Information

Grid (Grid–Based Methods )

By:

Binko Toure

Favour Iwoni

Tara Chandra

Shrestha

Grid-Based Clustering Methods

(^) The clustering methods discussed so far are data driven: they partition the set of objects and adopt to the distribution of the objects in the embedding space.
(^) Algorithms are query dependent. They are built for one query and generally no use for other query. We need a separate scan for each query, hence computation complexity at least O(n).
(^) This method takes a space-driven approach by partitioning the embedding space into cells independent of the distribution of the input objects.
(^) Uses multi-resolution grid data structure
(^) Quantizes the object space into a finite number of cells that form a grid structure on which all of the operation for clustering are performed.
(^) Develop hierarchical structure out of a given data and answer various queries efficiently. Every level of Hierarchy consists of cells.

Features & Challenges of a typical grid-based algorithm

(^) Efficiency & Scalability : # of cells << # of data points
(^) Uniformity: Uniform, hard to handle highly irregular data distributions
(^) Locality: Limited by predefined cell sizes, borders, and the density threshold
(^) Curse of dimensionality: Hard to cluster high-dimensional data

Advantages of Grid-based Clustering Algorithms

(^) Fast:  (^) No distance computations  (^) Clustering is performed on summaries and not individual objects; complexity is usually O(#- populated-grid-cells) and not O(# data objects)  (^) Easy to determine which clusters are neighboring
(^) Shapes are limited to union of grid-cells 5

STING: Algorithm (2) 7 The summarized pseudocodes for the STING algorithm are as follows:

STING: Query Processing(3) Used a top-down approach to answer spatial data queries

Start from a pre-selected layer—typically with a small number of cells
From the pre-selected layer until you reach the bottom layer do the following:

(^) For each cell in the current level compute the

confidence interval indicating a cell’s relevance to a

given query;

(^) If it is relevant, include the cell in a cluster
(^) If it irrelevant, remove cell from further consideration
(^) otherwise, look for relevant cells at the next lower layer

Combine relevant cells into relevant regions (based on grid-neighborhood) and return the so obtained clusters as your answers.

Clustering Using Grid-based Method, Lecture notes of Data Mining

Related documents

Partial preview of the text

Download Clustering Using Grid-based Method and more Lecture notes Data Mining in PDF only on Docsity!

A presentation on

STING: Statistical Information

Grid (Grid–Based Methods )

By:

Binko Toure

Favour Iwoni

Tara Chandra

Shrestha

Grid-Based Clustering Methods

confidence interval indicating a cell’s relevance to a

given query;