Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Docsity AI

Log in Sign up

Prepare for your exams

Study with the several resources on Docsity

Find documents

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Docsity AINEW

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

20 Points

For each uploaded document

Answer questions

5 Points

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Data_mining_techniques_and_applications, Essays (high school) of Computer science

Computer science

hii this is usefull for student

Typology: Essays (high school)

2020/2021

Uploaded on 10/26/2022

saajith 🇱🇰

1 document

1 / 5

This page cannot be seen from the preview

Don't miss anything!

Bharati M. Ramageri / Indian Journal of Computer Science and Engineering

Vol. 1 No. 4 301-305

DATA MINING TECHNIQUES AND APPLICATIONS

Mrs. Bharati M. Ramageri, Lecturer

Modern Institute of Information Technology and Research,

Department of Computer Application, Yamunanagar, Nigdi

Pune, Maharashtra, India-411044.

Abstract

Data mining is a process which finds useful patterns from large amount of data. The paper discusses few of the data mining

techniques, algorithms and some of the organizations which have adapted data mining technology to improve their businesses and

found excellent results.

Keywords: Data mining Techniques; Data mining algorithms; Data mining applications.

1. Overview of Data Mining

The development of Information Technology has generated large amount of databases and huge data in

various areas. The research in databases and information technology has given rise to an approach to store

and manipulate this precious data for further decision making. Data mining is a process of extraction of

useful information and patterns from huge data. It is also called as knowledge discovery process,

knowledge mining from data, knowledge extraction or data /pattern analysis.

Figure 1. Knowledge discovery Process

Data mining is a logical process that is used to search through large amount of data in order to find

useful data. The goal of this technique is to find patterns that were previously unknown. Once these

patterns are found they can further be used to make certain decisions for development of their businesses.

Three steps involved are

 Exploration

 Pattern identification

 Deployment

Exploration: In the first step of data exploration data is cleaned and transformed into another form, and

important variables and then nature of data based on the problem are determined.

ISSN : 0976-5166

301

Partial preview of the text

Download Data_mining_techniques_and_applications and more Essays (high school) Computer science in PDF only on Docsity!

Vol. 1 No. 4 301-

DATA MINING TECHNIQUES AND APPLICATIONS Mrs. Bharati M. Ramageri, Lecturer Modern Institute of Information Technology and Research, Department of Computer Application, Yamunanagar, Nigdi Pune, Maharashtra, India-411044.

Abstract

Data mining is a process which finds useful patterns from large amount of data. The paper discusses few of the data mining techniques, algorithms and some of the organizations which have adapted data mining technology to improve their businesses and found excellent results.

Keywords: Data mining Techniques; Data mining algorithms; Data mining applications.

1. Overview of Data Mining

The development of Information Technology has generated large amount of databases and huge data in various areas. The research in databases and information technology has given rise to an approach to store and manipulate this precious data for further decision making. Data mining is a process of extraction of useful information and patterns from huge data. It is also called as knowledge discovery process, knowledge mining from data, knowledge extraction or data /pattern analysis.

Figure 1. Knowledge discovery Process

Data mining is a logical process that is used to search through large amount of data in order to find useful data. The goal of this technique is to find patterns that were previously unknown. Once these patterns are found they can further be used to make certain decisions for development of their businesses.

Three steps involved are

 Exploration

 Pattern identification

 Deployment

Exploration: In the first step of data exploration data is cleaned and transformed into another form, and important variables and then nature of data based on the problem are determined.

Vol. 1 No. 4 301-

Pattern Identification: Once data is explored, refined and defined for the specific variables the second step is to form pattern identification. Identify and choose the patterns which make the best prediction.

Deployment: Patterns are deployed for desired outcome.

2. Data Mining Algorithms and Techniques

Various algorithms and techniques like Classification, Clustering, Regression, Artificial Intelligence, Neural Networks, Association Rules, Decision Trees, Genetic Algorithm, Nearest Neighbor method etc., are used for knowledge discovery from databases.

2.1. Classification

Classification is the most commonly applied data mining technique, which employs a set of pre-classified examples to develop a model that can classify the population of records at large. Fraud detection and credit- risk applications are particularly well suited to this type of analysis. This approach frequently employs decision tree or neural network-based classification algorithms. The data classification process involves learning and classification. In Learning the training data are analyzed by classification algorithm. In classification test data are used to estimate the accuracy of the classification rules. If the accuracy is acceptable the rules can be applied to the new data tuples. For a fraud detection application, this would include complete records of both fraudulent and valid activities determined on a record-by-record basis. The classifier-training algorithm uses these pre-classified examples to determine the set of parameters required for proper discrimination. The algorithm then encodes these parameters into a model called a classifier.

Types of classification models:

 Classification by decision tree induction

 Bayesian Classification

 Neural Networks

 Support Vector Machines (SVM)

 Classification Based on Associations

2.2. Clustering

Clustering can be said as identification of similar classes of objects. By using clustering techniques we can further identify dense and sparse regions in object space and can discover overall distribution pattern and correlations among data attributes. Classification approach can also be used for effective means of distinguishing groups or classes of object but it becomes costly so clustering can be used as preprocessing approach for attribute subset selection and classification. For example, to form group of customers based on purchasing patterns, to categories genes with similar functionality.

Types of clustering methods

 Partitioning Methods

 Hierarchical Agglomerative (divisive) methods

 Density based methods

 Grid-based methods

 Model-based methods

Vol. 1 No. 4 301-

3. Data Mining Applications

Data mining is a relatively new technology that has not fully matured. Despite this, there are a number of industries that are already using it on a regular basis. Some of these organizations include retail stores, hospitals, banks, and insurance companies. Many of these organizations are combining data mining with such things as statistics, pattern recognition, and other important tools. Data mining can be used to find patterns and connections that would otherwise be difficult to find. This technology is popular with many businesses because it allows them to learn more about their customers and make smart marketing decisions. Here is overview of business problems and solutions found using data mining technology.

3.1. FBTO Dutch Insurance Company

Challenges

 To reduce direct mail costs.

 Increase efficiency of marketing campaigns.

 Increase cross-selling to existing customers, using inbound channels such as the company’s sell center and the internet a one year test of the solution’s effectiveness.

Results

 Provided the marketing team with the ability to predict the effectiveness of its campaigns.

 Increased the efficiency of marketing campaign creation, optimization, and execution.

 Decreased mailing costs by 35 percent.

 Increased conversion rates by 40 percent.

3.2. ECtel Ltd., Israel

Challenges

 Fraudulent activity in telecommunication services.

Results

 Significantly reduced telecommunications fraud for more than 150 telecommunication companies worldwide.

 Saved money by enabling real-time fraud detection.

3.3. Provident Financial’s Home credit Division, United Kingdom

Challenges

 No system to detect and prevent fraud.

Results

 Reduced frequency and magnitude of agent and customer fraud.

 Saved money through early fraud detection.

 Saved investigator’s time and increased prosecution rate.

3.4. Standard Life Mutual Financial Services Companies

Challenges

 Identify the key attributes of clients attracted to their mortgage offer.

 Cross sell Standard Life Bank products to the clients of other Standard Life companies.

 Develop a remortgage model which could be deployed on the group Web site to examine the profitability of the mortgage business being accepted by Standard Life Bank.

Vol. 1 No. 4 301-

Results

 Built a propensity model for the Standard Life Bank mortgage offer identifying key customer types that can be applied across the whole group prospect pool.

 Discovered the key drivers for purchasing a remortgage product.

 Achieved, with the model, a nine times greater response than that achieved by the control group.

 Secured £33million (approx. $47 million) worth of mortgage application revenue.

3.5. Shenandoah Life insurance company United States.

Challenges

 Policy approval process was paper based and cumbersome.

 Routing of these paper copies to various departments, there was delays in approval.

Results

 Empowered management with current information on pending policies.

 Reduced the time required to issue certain policies by 20 percent.

 Improved underwriting and employee performance review processes.

3.6. Soft map Company Ltd., Tokyo

Challenges

 Customers had difficulty making hardware and software purchasing decisions, which was hindering online sales.

Results

 Page views increased 67 percent per month after the recommendation engine went live.

 Profits tripled in 2001, as sales increased 18 percent versus the same period in the previous year.

4. Conclusion

Data mining has importance regarding finding the patterns, forecasting, discovery of knowledge etc., in different business domains. Data mining techniques and algorithms such as classification, clustering etc., helps in finding the patterns to decide upon the future trends in businesses to grow. Data mining has wide application domain almost in every industry where the data is generated that’s why data mining is considered one of the most important frontiers in database and information systems and one of the most promising interdisciplinary developments in Information Technology.

5. References

Jiawei Han and Micheline Kamber (2006), Data Mining Concepts and Techniques, published by Morgan Kauffman, 2nd ed.
Dr. Gary Parker, vol 7, 2004, Data Mining: Modules in emerging fields, CD-ROM.
Crisp-DM 1.0 Step by step Data Mining guide from http://www.crisp-dm.org/CRISPWP-0800.pdf.
Customer Successes in your industry from http://www.spss.com/success/?source=homepage&hpzone=nav_bar.
https://www.allbusiness.com/Technology /computer-software-data-management/ 633425-1.html, last retrieved on 15th Aug 2010.
http://www.kdnuggets.com/.

Data_mining_techniques_and_applications, Essays (high school) of Computer science

Related documents

Partial preview of the text

Download Data_mining_techniques_and_applications and more Essays (high school) Computer science in PDF only on Docsity!