Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
Community
Ask the community for help and clear up your study doubts
Discover the best universities in your country according to Docsity users
Free resources
Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors
Explore intelligent data analysis (ida), an interdisciplinary field focused on effective data analysis for extracting valuable information and knowledge from large datasets. Learn about ida concepts, tools like see5, cubist, illm, and magnum opus, and techniques for discovering rules hidden in data.
Typology: Slides
1 / 31
Decision making is asking forinformation and knowledge Data processing can give them Multidimensionality of problems islooking for methods for adequate anddeep data processing and analysis
To understand the concept of the IDA To meet web-sites and literature on IDA To meet some tools for IDA To learn how to use IDA tools and tovalidate the IDA results
Recognize problems asking for IDA Preparing data and making analysis Validating and interpreting results of IDA
… an interdisciplinary studyconcerned with the effectiveanalysis of data;… used for extracting usefulinformation from large quantitiesof online data; extracting desirableknowledge or interesting patternsfrom existing databases;
Data mining Knowledge acquisition from data Genetic algorithm-based rule discovery Knowledge discovery Learning classifier system Machine learning etc.
^ the distillation of information that has beencollected, classified, organized, integrated,abstracted and value-added; ^ at a level of abstraction higher than the data,and information on which it is based and canbe used to deduce new information and newknowledge; ^ usually in the context of human expertiseused in solving problems.
See
Cubist
Illustration of IDA by using See
application.
names
classes
to
which cases may belong and the attributes
used to describe each case.
Attributes are of two types:
discrete
attributes have a value drawn from a setof possibilities, and
continuous
attributes have numeric values.
application.
data
on the
training
cases from which See
will extract patterns. The entry for each case consists of oneor more lines that give the values for allattributes.
application.
test
on the
test
cases (used for evaluation of
results). The entry for each case consists of oneor more lines that give the values for allattributes.
1 – they were healthy2 – they were ill (drug treatment, positive clinicaland laboratory findings)
names
Goal.gender:M,Factivity:1,2,3age: continuoussmoking: No,Yes… Goal:1,2…
data
M,1,59,Yes,0,0,0,0,119,73,103,86,247,87,15979,?,?,?,1,73,2.5M,1,66,Yes,0,0,0,0,132,81,183,239,?,783,14403,27221,19153,23187,1,73,2.6M,1,61,No,0,0,0,0,130,79,148,86,209,115,21719,12324,10593,11458,1,74,2.5…^
…
Rule 1: (cover 26)
gender = MSBP > 111oil_fat > 2.9 ->^
class 1
[0.929]