Abstract
Support vector machines (SVMs) are a powerful tool for classification problems. SVMs have only been developed in the last 20 years with the availability of cheap and abundant computing power. SVMs are a non-statistical approach and make no assumptions about the distribution of the data. Here support vector machines are applied to a classic data set from the machine learning literature and the out-of-sample misclassification rates are compared to other classification methods. Finally, an algorithm for using support vector machines to address the difficulty in imputing missing categorical data is proposed and its performance is demonstrated under three different scenarios using data from the 1997 National Labor Survey.
Degree
MS
College and Department
Physical and Mathematical Sciences; Statistics
Rights
http://lib.byu.edu/about/copyright/
BYU ScholarsArchive Citation
Rogers, Spencer David, "Support Vector Machines for Classification and Imputation" (2012). Theses and Dissertations. 3215.
https://scholarsarchive.byu.edu/etd/3215
Date Submitted
2012-05-16
Document Type
Selected Project
Handle
http://hdl.lib.byu.edu/1877/etd5236
Keywords
support vector machines, SVM, imputation, binary classification, handwritten digit recognition, EM algorithm, NLSY97
Language
English