Paper Title
Harnessing Mass Spectra Data using KNN Principle: Diagnosing Alzheimer’s Disease

A high level of expertise, rigorous algorithms and methods are needed to adequately mine and harness Mass Spectrometer generated data due to its unique nature and structure. Hitherto, peptide ions are matched with theoretical results and/or public databases in order to identify expressed proteins in analyzed protein source samples, but this is done on a spectrum by spectrum basis. In this study, we present a mechanism that extends the principle of K-nearest neighbor algorithm for mining pools of mass spectrometer saliva data towards discovering and characterizing patterns for diagnosing Alzheimer’s disease. The methodology discusses feature selection by correlation matrix, matrix to vector decomposition, an extension of Euclidean distance formula, and successfully classifies donor samples into the three stages of Alzheimer’s disease with over 85% accuracy without collaborating clinical records. Keywords- Alzheimer’s Disease Diagnosis; Feature matrix; Jackknifing; Mass Spectra Data