














Study with the several resources on Docsity
Earn points by helping other students or get them with a premium plan
Prepare for your exams
Study with the several resources on Docsity
Earn points to download
Earn points by helping other students or get them with a premium plan
ISYE 6501 - Midterm 2 Study Guide
Typology: Study Guides, Projects, Research
1 / 22
This page cannot be seen from the preview
Don't miss anything!















when might overfitting occurs
What is elastic net?
Do you have to decide the sample size ahead of time for A/B tests
When should you not use imputation? - When more than 5% of the data is moving per factor
first and more later on What do q-q plots help visual - if two data sets follow the same distribution.
why are q-q plots sometimes better than statistical tests - sometimes the statistical test will lead us in the wrong direction because most points might match but may be bad matches at the ends what is the memoryless property - it doesn't matter what's happened in the past, all that matters is where we are now If the data fits exponential distribution is it memoryless? - Yes If a data is memoryless is it exponential - yes Which distributions are memoryless - poisson and exponential Can a distribution not be memoryless and still be exponential - no what are deterministic simulations - same inputs give the same outputs what are stochastic simulations? - when there is randomness what are continuous-time simulations? - When changes happen continuously. Example: chemical processes, propagations What are discrete-event simulatIons - changes happen at discrete time points. Example: call center simulations someone calls worker finishes talking to someone. what are the elements of simulation model? - entities, modules, actions, resources, decisions point, and statistical tracking what are entities - things that move through the simulation (bags, people, etc) what are modules - parts of process (queues, storage, etc) what are replications - number of runs of a simulation Why is it important to validate a simulation by comparing to real data as much as possible? - If the simulation isn't a good reflection of reality, then any insights we gain from studying the simulation might not be applicable in reality what do prescriptive simulations answer - what-if questions what's an example of heuristic optimization - what's the best buffer size to have at each step in the process
quality of a set of values for the variables, which we're trying to maximize or minimize what is the solution - values for each variable what is a feasible solution - variable values that satisfy all constraints
what is an optimal solution - feasible solution with the best objective value What do binary variables do - They allow for more-complex models what is the objective function in linear regression - trying to minimize the squared error what are the statistical variables and constants in linear regression - the data; the coefficients what are the variables and constants in optimization model for linear regression - the data is the constant and the coefficients are the variables What is the objective function in logistic regression - to minimize the prediction error What are the variables in logistic regression - the coefficients in soft and hard classification SVMs what are the variables - the coefficients what is the constraint in hard classification for SVMS - each observation has to be on the right side of the line What is the objective function for hard classification - maximize the margin for soft classification what is the objective function - minimize classification error and maximize the margin what is the objective function for a time series model - minimize prediction error what are the variables in k-means clustering - coordinate of cluster centers and if a point is part of certain cluster What are the constraints in k-means clustering - each data point is assigned to a cluster What is the objective function in k-means - minimize total distance from data points to their cluster centers What are the order of fastest to slowest optimization problems - linear programs, convex quadratic programs, convex programs, integer programs, general non-convex programs are convex optimization problems guaranteed to find optimal solution - yes
what is a linear program? - f(x) is a linear function; constraint set X is defined by linear equations and inequalities what is convex quadratic program - f(x) is a convex quadratic function. Minimize f(x) or Maximize -f(x). constraint set X is defined by linear equations and inequalities what is constraint set X defined by in linear programs - linear equations and inequalities what is a convex optimization problem - objective f(x) is concave (if maximizing) or convex (if minimizing). Constraint set X is a convex set what is constraint set X in a convex optimization program - a convex set what is a integer program - linear program plus some (or all) variables restricted to take only integer values; variables could be binary (either 0 or
what are the basic steps to solve an optimization problem - 1) Initialization: pick values for all the variables (they may be simple, bad and not satisfy all of the constraints) 2.) find an improving direction t and make a change in that direction of some amount called the step size (theta) 3.) repeat using the the old solution plus the improving direction times the step size What should you do if your problem is too hard - Use a heuristic: rule of thumb process. It is ually gives good solutions what are some common network models? - shortest path model - find quickest/shortest route from one place to another; assignment model - which worker gets which job to maximize workforce efficiency; maximum flow model - how much oil can flow through complex network of pipes what type of models are network models - linear program what are the constraints in network models - flow into node = flow out of node; flow on arc between min and max allowable what do we get without constraints in a network model if data is all integers - an optimal solution where all the variables have integer values True or false: Requiring some variables in a linear program to take integer values can make it take a lot longer to solve. - True Adding integer variables moves the model from a linear program, which usually solves very quickly, to an integer program, which sometimes takes a long time to solve.
Do optimization implicitly assume that we know all of the values of the input data exactly? - Optimization models treat all of the data as known exactly
what is a non-zero sum game - it's possible for the total benefit for everyone to be higher or lower depending on the decisions made. what is a zero sum game - whatever one side gets, the other side looses what is stable equilibrium - Neither station has incentive to change what is pure strategy - just one choice (gas station) What is mixed strategy - randomize decisions according to probabilities (rock, paper, scissors); If your opponent knows you're going to pick rock she'll pick paper and beat you every time. Instead, a randomized or mixed strategy, where you randomly pick each one with probability 1/3 is your best bet. what is perfect information - know all about everyone else's situation what is imperfect information - Neither one really knows the others profit margins exactly. And in still other situation, some people have more information than others, so it's not symmetric what do descriptive, predictive and prescriptive models assume - that the systems do not react what is the idea of deep learning - to train a system to react to whatever patterns our human brain is reacting to without knowing what it's reacting to. what are some areas where deep learning is useful - recognizing speech and writing, natural language processing, and image recognition. what are the down sides of neural networks? - often do't give the best results. They require lots of data to rain and it's often hard to chose and tune the learning algorithm so the weights don't change too slowly but also don't change so quickly that they jump all over the place. What are the three levels of neurons - input level, hidden level, and output level What is the process of neural networks? - * Each input neuron accepts a single piece of information