Machine Learning, 46, 225–254, 2002

© 2002 Kluwer Academic Publishers. Manufactured in The Netherlands.

Linear Programming Boosting via Column Generation

AYHAN DEMIRIZ

Department of Decision Sciences and Eng. Systems, Rensselaer Polytechnic Institute, Troy, NY 12180, USA

KRISTIN P. BENNETT

Department of Mathematical Sciences, Rensselaer Polytechnic Institute, Troy, NY 12180, USA, while visiting Microsoft Research, Redmond, WA, USA

JOHN SHAWE-TAYLOR

Department of Computer Science, Royal Holloway, University of London, Egham, Surrey TW20 0EX, UK

Editor: Nello Cristianini

Abstract. We examine linear program (LP) approaches to boosting and demonstrate their efficient solution using LPBoost, a column generation based simplex method. We formulate the problem as if all possible weak hypotheses had already been generated. The labels produced by the weak hypotheses become the new feature space of the problem. The boosting task then becomes constructing a learning function in the label space that minimizes misclassification error and maximizes the soft margin. We prove that for classification, minimizing the 1-norm soft margin error function directly optimizes a generalization error bound. The equivalent linear program can be efficiently solved using column generation techniques developed for large-scale optimization problems. The resulting LPBoost algorithm can be used to solve any LP boosting formulation by iteratively optimizing the dual misclassification costs in a restricted LP and dynamically generating weak hypotheses to make new LP columns. We provide algorithms for soft margin classification, confidence-rated, and regression boosting problems. Unlike gradient boosting algorithms, which may converge in the limit only, LPBoost converges in a finite number of iterations to a global solution satisfying mathematically well-defined optimality conditions. The optimal solutions of LPBoost are very sparse, in contrast with those of gradient-based methods. Computationally, LPBoost is competitive with AdaBoost in both solution quality and computational cost.

Keywords: ensemble learning, boosting, linear programming, sparseness, soft margin

1. Introduction

Recent papers (Schapire et al., 1998) have shown that boosting, arcing, and related ensemble methods (hereafter summarized as boosting) can be viewed as margin maximization in function space. By changing the cost function, different boosting methods such as AdaBoost can be viewed as gradient descent to minimize this cost function. Some authors have noted the possibility of choosing cost functions that can be formulated as linear programs (LP) but then dismiss the approach as intractable using standard LP algorithms (Rätsch et al., 2000a; Breiman, 1999).
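
To make the column generation idea from the abstract concrete, the following is a minimal sketch rather than the authors' implementation. It rests on several assumptions that are not stated in this section: the weak learners are decision stumps, the soft margin LP takes the common form "maximize rho - D * sum(xi) subject to y_i * sum_j a_j h_j(x_i) + xi_i >= rho, sum_j a_j = 1, a >= 0, xi >= 0", the cost bound is set as D = 1 / (n * nu) with a hypothetical parameter `nu`, and SciPy's `linprog` with the HiGHS backend stands in for a simplex solver. The function names `best_stump`, `lpboost`, and `predict` are illustrative.

```python
import numpy as np
from scipy.optimize import linprog


def best_stump(X, y, u):
    """Weak learner (assumed): the stump h maximizing sum_i u_i * y_i * h(x_i)."""
    best, best_edge = None, -np.inf
    for f in range(X.shape[1]):
        for t in np.unique(X[:, f]):
            h = np.where(X[:, f] > t, 1.0, -1.0)
            edge = float(np.dot(u * y, h))
            s = 1.0 if edge >= 0 else -1.0  # flip the stump if its edge is negative
            if abs(edge) > best_edge:
                best, best_edge = (f, t, s, s * h), abs(edge)
    return best, best_edge


def lpboost(X, y, nu=0.1, tol=1e-6, max_iter=100):
    """Column generation sketch; y holds +1/-1 labels, nu is an assumed parameter."""
    n = len(y)
    D = 1.0 / (n * nu)                    # upper bound on each misclassification cost
    u, beta = np.full(n, 1.0 / n), 0.0    # initial dual costs and edge bound
    stumps, cols = [], []
    for _ in range(max_iter):
        (f, t, s, h), edge = best_stump(X, y, u)
        if edge <= beta + tol:            # no violated dual constraint: stop
            break
        stumps.append((f, t, s))
        cols.append(h)                    # new LP column
        H = np.column_stack(cols)
        m = H.shape[1]
        # Restricted dual: min beta  s.t.  sum_i u_i y_i h_j(x_i) <= beta for each j,
        #                  sum_i u_i = 1,  0 <= u_i <= D.
        res = linprog(
            np.r_[np.zeros(n), 1.0],
            A_ub=np.hstack([(y[:, None] * H).T, -np.ones((m, 1))]),
            b_ub=np.zeros(m),
            A_eq=np.r_[np.ones(n), 0.0][None, :],
            b_eq=[1.0],
            bounds=[(0.0, D)] * n + [(None, None)],
            method="highs",
        )
        u, beta = res.x[:n], res.x[n]
    if not cols:
        raise ValueError("no weak hypothesis performs better than chance")
    # Recover hypothesis weights a from the restricted primal over [a, xi, rho].
    H = np.column_stack(cols)
    m = H.shape[1]
    res = linprog(
        np.r_[np.zeros(m), np.full(n, D), -1.0],
        A_ub=np.hstack([-(y[:, None] * H), -np.eye(n), np.ones((n, 1))]),
        b_ub=np.zeros(n),
        A_eq=np.r_[np.ones(m), np.zeros(n + 1)][None, :],
        b_eq=[1.0],
        bounds=[(0.0, None)] * (m + n) + [(None, None)],
        method="highs",
    )
    return stumps, res.x[:m]


def predict(stumps, a, X):
    """Sign of the weighted vote of the selected stumps."""
    score = sum(w * s * np.where(X[:, f] > t, 1.0, -1.0)
                for w, (f, t, s) in zip(a, stumps))
    return np.sign(score)
```

Calling `lpboost(X, y)` with a two-dimensional float array `X` and a float vector `y` of +1/-1 labels returns the selected stumps and their weights. Only stumps that violate a dual constraint are ever added, which is one way to see why the resulting ensemble tends to be sparse.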
