Description

Logical Analysis of Data (LAD) is a machine learning and data mining methodology that combines ideas from Boolean function theory, optimization, and logic. In this thesis, we focus on the description and application of novel optimization models for constructing improved and/or simplified LAD models of data. We address the construction of LAD classification models, proposing two alternative ways of generating patterns, or rules. First, we show how to construct LAD models based on patterns of maximum coverage. Through a series of computational experiments, we show that such models are as good as, if not better than, those obtained with the standard LAD implementation and with other machine learning methods, while requiring much simpler calibration for optimal performance. Second, we formulate the problem of finding the most suitable LAD model as a large linear program and show how to solve it using column generation. For the subproblem phase, we describe a branch-and-bound algorithm whose performance is significantly superior to that of a commercial integer programming solver. The LAD models produced by this algorithm are virtually parameter-free and practically as accurate as the calibrated models obtained with other machine learning methods. Finally, we propose a novel regression algorithm that extends the LAD methodology to the case of a numerical outcome, and we show that it constitutes an attractive alternative to other regression methods in terms of performance and flexibility of use.
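Although the abstract stays at a high level, the notion of pattern coverage at the heart of the first contribution can be sketched in a few lines. The toy data and helper functions below are illustrative assumptions for exposition only, not the thesis's actual implementation: a pattern is modeled as a conjunction of conditions on binarized features, and its coverage is the number of observations it matches.

```python
# Illustrative sketch of a LAD "pattern": a conjunction of conditions on
# binarized (0/1) features that covers observations of one class while
# covering none of the other. Data and names are hypothetical.

# Binarized dataset: each observation is a tuple of 0/1 features.
positives = [(1, 1, 0), (1, 1, 1), (1, 0, 1)]
negatives = [(0, 1, 0), (0, 0, 1), (1, 0, 0)]

def covers(pattern, obs):
    """A pattern is a dict {feature_index: required_value}; it covers an
    observation if every listed condition holds."""
    return all(obs[i] == v for i, v in pattern.items())

def coverage(pattern, observations):
    """Number of observations in the list covered by the pattern."""
    return sum(covers(pattern, obs) for obs in observations)

# A candidate positive pattern: feature 0 == 1 AND feature 1 == 1.
pattern = {0: 1, 1: 1}

pos_cov = coverage(pattern, positives)  # covers 2 of the 3 positives
neg_cov = coverage(pattern, negatives)  # covers 0 negatives: a "pure" pattern
print(pos_cov, neg_cov)  # → 2 0
```

In this simplified view, a maximum-coverage pattern is a pure pattern (zero coverage on the opposite class) whose coverage on its own class is as large as possible; the thesis obtains such patterns via optimization models rather than enumeration.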