data mining approaches | Essays (high school) Computer science

IJCSI International Journal of Computer Science Issues, Vol. 7, Issue 5, September 2010

ISSN (Online): 1694-0814

www.IJCSI.org

181

A New Approach for Evaluation of Data Mining Techniques

Moawia Elfaki Yahia1, Murtada El-mukashfi El-taher2

1College of Computer Science and IT

King Faisal University

Saudi Arabia, Alhasa 31982

2Faculty of Mathematical Sciences

University of Khartoum

Sudan, Khartoum 11115

Abstract

This paper tries to put a new direction for the evaluation of some

techniques for solving data mining tasks such as: Statistics,

Visualization, Clustering, Decision Trees, Association Rules and

Neural Networks. The new approach has succeed in defining

some new criteria for the evaluation process, and it has obtained

valuable results based on what the technique is, the environment

of using each techniques, the advantages and disadvantages of

each technique, the consequences of choosing any of these

techniques to extract hidden predictive information from large

databases, and the methods of implementation of each technique.

Finally, the paper has presented some valuable recommendations

in this field.

Keywords:Data Mining Evaluation, Statistics,

Visualization, Clustering, Decision Trees, Association

Rules, Neural Networks.

1. Introduction

Extracting useful information from data is very far easier

from collecting them. Therefore many sophisticated

techniques, such as those developed in the multi-

disciplinary field data mining are applied to the analysis of

the datasets. One of the most difficult tasks in data mining

is determining which of the multitude of available data

mining technique is best suited to a given problem.

Clearly, a more generalized approach to information

extraction would improve the accuracy and cost

effectiveness of using data mining techniques. Therefore,

this paper proposes a new direction based on evaluation

techniques for solving data mining tasks, by using six

techniques: Statistics, Visualization, Clustering, Decision

Tree, Association Rule and Neural Networks. The aim of

this new approach is to study those techniques and their

processes and to evaluate data mining techniques on the

basis of: the suitability to a given problem, the advantages

and disadvantages, the consequences of choosing any

technique, and the methods of implementation [5].

2. Data Mining Overview

Data mining, the extraction of hidden predictive

information from large databases, is a powerful new

technology with great potential to help companies focus

on the most important information in their data

warehouses [6]. Data mining tools predict future trends

and behaviors allowing businesses to make proactive

knowledge driven decisions. Data mining tools can answer

business question that traditionally were too time

consuming to resolve. They scour database for hidden

patterns, finding predictive information that experts may

miss because it lies outside their expectations.

3. Review of Selected Techniques

A large number of modeling techniques are labeled "data

mining" techniques [7]. This section provides a short

review of a selected number of these techniques. Our

choice was guided the focus on the most currently used

models. The review in this section only highlights some of

the features of different techniques and how they

influence, and benefit from. We do not present a complete

exposition of the mathematical details of the algorithms, or

their implementations. Although various different

techniques are used for different purposes those that are of

interest in the present context [4]. Data mining techniques

which are selected are Statistics, Visualization, Clustering,

Decision Tree, Association Rules and Neural Networks.

3.1 Statistical Techniques

By strict definition "statistics" or statistical techniques are

not data mining. They were being used long before the

term data mining was coined. However, statistical

techniques are driven by the data and are used to discover

patterns and build predictive models. Today people have

data mining approaches, Essays (high school) of Computer science

Related documents

Partial preview of the text

Download data mining approaches and more Essays (high school) Computer science in PDF only on Docsity!

A New Approach for Evaluation of Data Mining Techniques

1. Introduction

2. Data Mining Overview

3. Review of Selected Techniques

3.1 Statistical Techniques

3.2 Visualization Techniques

3.3 Clustering Techniques

3.4 Induction Decision Tree Techniques

3.5 Association Rule Techniques

4.2.2 The Environment of using Visualization

Technique

4.2.3 The Advantages of Visualization Technique

4.2.4 The Disadvantages of Visualization Technique

4.2.5 Consequences of choosing of Visualization

Technique

4.2.6 Implementation of Statistical Visualization

process

4.3 Clustering Technique

4.3.1 Identification of Clustering

4.3.2 The Environment of using Clustering

Technique

4.3.3 The Advantages of Clustering Technique

4.3.4 The Disadvantages of Clustering Technique

4.3.5 Consequences of choosing of Clustering

Technique

4.3.6 Implementation of Clustering Technique

process

4.4 Decision Trees Technique

4.4.1 Identification of Decision Trees

4.4.2 The Environment of using Decision Trees

Technique

4.4.4 The Disadvantages of Decision Trees

Technique

4.4.5 Consequences of choosing of Decision Trees

Technique

4.4.6 Implementation of Decision Trees Technique

process

4.5 Association Rule Technique

4.5.1 Identification of Association Rule

4.5.2 The Environment of using Association Rule

Technique

4.5.3 The Advantages of Association Rule Technique

4.5.4 The Disadvantages of Association Rule

Technique

4.5.5 Consequences of choosing of Association Rule

Technique

4.4.6 Implementation of Association Rule Technique

process

4.3 Neural Networks Technique

4.3.1 Identification of Neural Network

4.3.2 The Environment of using Neural Networks

Technique