Winterburn, Julie L and Voineskos, Aristotle N and Devenyi, Gabriel A and Plitman, Eric and de la Fuente-Sandoval, Camilo and Bhagwat, Nikhil and Graff-Guerrero, Ariel and Knight, Jo and Chakravarty, M Mallar (2019) Can we accurately classify schizophrenia patients from healthy controls using magnetic resonance imaging and machine learning? : A multi-method and multi-dataset study. Schizophrenia Research, 214. pp. 3-10. ISSN 0920-9964
SCHRES_D_17_00102R1_281_29.pdf - Accepted Version
Available under License Creative Commons Attribution-NonCommercial-NoDerivs.
Abstract
Machine learning is a powerful tool that has previously been used to distinguish schizophrenia (SZ) patients from healthy controls (HC) using magnetic resonance images. Each study, however, uses different datasets, classification algorithms, and validation techniques. Here, we perform a critical appraisal of the accuracy of machine learning methodologies used in SZ/HC classification studies by comparing three machine learning algorithms (logistic regression [LR], support vector machines [SVMs], and linear discriminant analysis [LDA]) on three independent datasets (435 subjects total) using two tissue density estimates and cortical thickness (CT). Performance is assessed using 10-fold cross-validation, as well as a held-out validation set. Classification using CT outperformed classification using tissue densities, but there was no clear effect of dataset. LR, SVMs, and LDA each yielded the highest accuracies for a different feature set and validation paradigm, but most accuracies were between 55 and 70%, well below previously reported values. The highest accuracy achieved was 73.5% using CT data and an SVM. Taken together, these results illustrate some of the obstacles to constructing effective disease classifiers, and suggest that tissue densities and CT may not be sufficiently sensitive for SZ/HC classification given currently available methodologies and sample sizes.
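The two validation paradigms named in the abstract (10-fold cross-validation and a held-out validation set) can be illustrated with a minimal Python sketch. It is not the authors' pipeline. The data are synthetic Gaussian features, not MRI measures. The classifier is a nearest-centroid placeholder standing in for LR, SVM, or LDA. All function names, the 20% held-out fraction, and the sample sizes are assumptions made for illustration.

```python
import random
from statistics import mean


def make_synthetic(n_per_class=60, n_features=5, shift=0.8, seed=0):
    """Synthetic stand-in data (assumption): label 0 = HC, label 1 = SZ."""
    rng = random.Random(seed)
    X, y = [], []
    for label in (0, 1):
        for _ in range(n_per_class):
            X.append([rng.gauss(label * shift, 1.0) for _ in range(n_features)])
            y.append(label)
    return X, y


def stratified_folds(y, k, seed=0):
    """Split sample indices into k folds with similar class proportions."""
    rng = random.Random(seed)
    folds = [[] for _ in range(k)]
    for label in sorted(set(y)):
        idx = [i for i, lab in enumerate(y) if lab == label]
        rng.shuffle(idx)
        for pos, i in enumerate(idx):
            folds[pos % k].append(i)
    return folds


def fit_centroids(X, y):
    """Placeholder classifier (not LR/SVM/LDA): one mean vector per class."""
    centroids = {}
    for label in set(y):
        rows = [x for x, lab in zip(X, y) if lab == label]
        centroids[label] = [mean(col) for col in zip(*rows)]
    return centroids


def predict(centroids, x):
    """Return the label whose centroid is closest in squared distance."""
    def dist2(c):
        return sum((a - b) ** 2 for a, b in zip(x, c))
    return min(centroids, key=lambda lab: dist2(centroids[lab]))


def accuracy(centroids, X, y):
    """Fraction of samples whose predicted label matches the true label."""
    return mean([1.0 if predict(centroids, x) == lab else 0.0
                 for x, lab in zip(X, y)])


def cross_validate(X, y, k=10, seed=0):
    """Train on k-1 folds, test on the remaining fold, for each fold."""
    scores = []
    for test_idx in stratified_folds(y, k, seed):
        held = set(test_idx)
        train_idx = [i for i in range(len(y)) if i not in held]
        model = fit_centroids([X[i] for i in train_idx],
                              [y[i] for i in train_idx])
        scores.append(accuracy(model, [X[i] for i in test_idx],
                               [y[i] for i in test_idx]))
    return scores


X, y = make_synthetic()

# Set aside one fifth of the subjects as a held-out validation set.
parts = stratified_folds(y, 5, seed=1)
held_idx = parts[0]
dev_idx = [i for part in parts[1:] for i in part]
X_dev, y_dev = [X[i] for i in dev_idx], [y[i] for i in dev_idx]
X_held, y_held = [X[i] for i in held_idx], [y[i] for i in held_idx]

# 10-fold cross-validation on the development subjects only.
cv_scores = cross_validate(X_dev, y_dev, k=10)

# Refit on all development subjects, then score once on the held-out set.
held_out_accuracy = accuracy(fit_centroids(X_dev, y_dev), X_held, y_held)

print(f"10-fold CV mean accuracy: {mean(cv_scores):.3f}")
print(f"Held-out accuracy: {held_out_accuracy:.3f}")
```

The held-out subjects are never seen during cross-validation, so the two numbers give separate estimates of generalization. In a real analysis, the placeholder classifier would be replaced by library implementations of LR, SVM, or LDA.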