Abstract

Millions of people in the United States alone suffer from undiagnosed or late-diagnosed chronic diseases such as Chronic Kidney Disease and Type II Diabetes. Catching these diseases earlier facilitates preventive healthcare interventions, which in turn can lead to substantial cost savings and improved health outcomes. We develop algorithms for predicting disease occurrence by drawing on ideas and techniques from machine learning. We explore standard classification methods such as logistic regression and random forests, as well as more sophisticated sequence models, including recurrent neural networks. We focus especially on the use of medical code data for disease prediction and explore different ways of representing such data in our prediction algorithms.

Degree

MS

College and Department

Physical and Mathematical Sciences; Mathematics

Rights

http://lib.byu.edu/about/copyright/

Date Submitted

2016-06-01

Document Type

Thesis

Handle

http://hdl.lib.byu.edu/1877/etd8867

Keywords

preventive healthcare, disease prediction, chronic diseases, machine learning, sequence classification, recurrent neural networks, ICD-9 codes, survival analysis

Language

English

Included in

Mathematics Commons
