Prepare for your exams
Get points
Guidelines and tips
Sell on Docsity
Docsity AI

Prepare for your exams

Study with the several resources on Docsity

Earn points to download

Earn points by helping other students or get them with a premium plan

Guidelines and tips

Sell on Docsity

Prepare for your exams

Study with the several resources on Docsity

Prepare for your exams with the study notes shared by other students like you on Docsity

Search for your university

Find the specific documents for your university's exams

Summarize your documents, ask them questions, convert them into quizzes and concept maps

Explore questions

Clear up your doubts by reading the answers to questions asked by your fellow students

Earn points to download

Earn points by helping other students or get them with a premium plan

Share documents

For each uploaded document

Answer questions

For each given answer (max 1 per day)

All the ways to get free points

Get points immediately

Choose a premium plan with all the points you need

Study Opportunities

Choose your next study program

Get in touch with the best universities in the world. Search through thousands of universities and official partners

Community

Ask the community

Ask the community for help and clear up your study doubts

Free resources

Our save-the-student-ebooks!

Download our free guides on studying techniques, anxiety management strategies, and thesis advice from Docsity tutors

Data Mining & Machine Learning: Quiz on Search Procedures and Optimization, Lecture notes of Artificial Intelligence

Purdue University Artificial Intelligence

Various topics related to data mining and machine learning, including Naive Bayes Classifier challenges, optimization techniques for smooth and non-smooth functions, search algorithms for discrete spaces, heuristic search techniques, and convex optimization problems. The document also includes a Greedy tree search algorithm. likely to be useful as study notes or lecture notes for university students studying data mining and machine learning.

Typology: Lecture notes

2021/2022

Uploaded on 05/11/2023

ameen 🇺🇸

4.6

(5)

236 documents

1 / 48

This page cannot be seen from the preview

Don't miss anything!

bg1

Data Mining & Machine Learning

CS37300!

Purdue University"

September 13, 2022

pf3

pf4

pf5

pf8

pf9

pfa

pfd

pfe

pff

pf12

pf13

pf14

pf15

pf16

pf17

pf18

pf19

pf1a

pf1b

pf1c

pf1d

pf1e

pf1f

pf20

pf21

pf22

pf23

pf24

pf25

pf26

pf27

pf28

pf29

pf2a

pf2b

pf2c

pf2d

pf2e

pf2f

pf30

Discover Lecture notes of Artificial Intelligence Purdue University

Related documents

Heuristics - Introduction to Artificial Intelligence - Solved Exams

Heuristics - Introduction to Artificial Intelligence - Exams

Data Mining & Machine Learning at Purdue University

Data Mining: Predictive Modeling and Search Algorithms - Prof. Jennifer L. Neville

Data Mining: An Example from the Financial Industry - Prof. Jennifer L. Neville

Understanding the Basics of Internet, Convergence, and Web Design - Prof. William B. Colli

APES MINING,GEOLOGY & DIVERSITY REVIEW WITH COMPLETE SOLUTIONS NEW UPDATE!!

Lecture Notes on What is Mining - Data Mining | CS 57300

Review Problems for Final Exam - Operations Research - Optimization | IE 33500

Midterm Exam with Solutions - Data Mining - Spring 2009 | CS 57300

(2)

Midterm Review for Data Mining | CS 57300

Robust validation of network designs under uncertain demands and failures

Partial preview of the text

Download Data Mining & Machine Learning: Quiz on Search Procedures and Optimization and more Lecture notes Artificial Intelligence in PDF only on Docsity!

Data Mining & Machine Learning

CS

Purdue University September 13, 2022

Quiz

Naive Bayes Classifier challenges:
- (a) Over-counting evidence, leads to overly confident probability estimate
- (b) Zero counts are a problem because it affects the conditional probability and has no known solution

Combinatorial optimization

Optimization

Non-smooth functions:
- If the function is discrete , then traditional optimization methods that rely on smoothness are not applicable (e.g., gradient descent needs the derivative). Instead we need to use combinatorial optimization - Example: Choosing what features (structure) to add to a decision tree

Heuristic search

Typically, there is an exponential number of models in the (discrete) search space, making it intractable to exhaustively search the space - Thus, it is generally impossible to return a model that is guaranteed to have the best score
Instead, we have to resort to heuristic search techniques
- Methods are evaluated experimentally and shown to have good performance on average
- Greedy search: Given a current model M, look for other models near M and move to the best of these (if any have a score better than M)

Greedy search

Choose an initial state M^0 corresponding to a particular model structure (e.g., an empty tree)
Let Mi^ be the model structure location at the i-th iteration
For each iteration i
- Construct all possible models {Mj1, ..., Mjk} adjacent to Mi^ (as defined by search operators)
- Evaluate scores for all models {Mj1, ..., Mjk}
- Choose to move to the adjacent model with best score: Mi+1^ = Mj.best
- Repeat until there is no possible further improvement in the score

Greedy tree search algorithm

Input:
- Initial state? State space?
- Set of actions?
- How to choose next state?
- Goal test?
Output: -?

Root feature

Root feature, Root feature,

X
X
X
- X1,X
- X2,X
- X3,X
- X1,X left child
- X2,X
- X3,X
  - X1,X2,X
  - X2,X3,X
  - X3,X1,X
  - X1,X3,X
  - X2,X1,X
  - X3,X2,X
  - X1,X2,X left child, right child
  - X1,X3,X
  - X2,X1,X
  - X2,X3,X
  - X3,X1,X
  - X3,X2,X

Questions to ask about search procedures

Is the search exhaustive?
- I.e., does it either explicitly or implicitly consider all models in the space?
Is the search optimal?
- I.e., is it guaranteed to return the model with the best score?
- Global vs. local optimum?

Smooth optimization

Convex optimization problems

Where f is a convex score function C is a convex set ( constraints on model parameters or structure ) are the parameters variable and x is the data
For convex optimization problems, all locally optimal points are globally optimal
Example algorithms: Quadratic programming (SVMs), least squares estimation, maximum likelihood estimation

minimize f ( x ; θ )

subject to θ ∈ C

Convex optimization

Convex functions

In graph of convex function, the line connecting two points must lie above the function f (↵x + (1 ↵)y)  ↵f (x) + (1 ↵)f (y) for all 0  ↵  1 A function f is convex if:

Concave vs convex

Maximizing a concave function is equivalent to minimizing a convex function