PERFORMANCE COMPARISON OF MACHINE LEARNING MODELS TO REDUCE MISDIAGNOSIS RATES IN PSYCHIATRIC DISORDER USING EEG DATASET
DOI:
https://doi.org/10.59407/jdaics.v1i3.967Keywords:
Machine Learning, Psychiatric Disorder, LightGBM, CatBoost, Logistic Regression, Diagnostic Accuracy.Abstract
This paper aims at comparing the suitability of three machine learning models: LightGBM, CatBoost, and Logistic Regression, to lower misdiagnosis rates for psychiatric disorders. Misdiagnosis in mental health may mean improper treatment and, hence, poor outcomes for patients. Our research aims to determine the most accurate predictive model for mental health condition diagnosis that will lead to improved clinical outcomes. We trained and tested these models on an EEG dataset with patient records that have psychiatric diagnoses labeled. For all the models, evaluation and comparison are made using key performance metrics such as Accuracy, Precision, Recall, and F1-Score. Through the use of these methods, it was shown that LightGBM performed better than CatBoost and Logistic Regression, having achieved higher accuracy and F1 scores, indicating more power to make a difference among different psychiatric disorders. These results suggest that machine learning techniques, especially LightGBM, can greatly increase diagnostic accuracy and reduce misdiagnosis in psychiatric contextual systems.
Keywords Machine Learning, Psychiatric Disorder, LightGBM, CatBoost, Logistic Regression.
References
Alturki, F. A., AlSharabi, K., Abdurraqeeb, A. M., & Aljalal, M. (2020). EEG Signal Analysis for Diagnosing Neurological Disorders Using Discrete Wavelet Transform and Intelligent Techniques. Sensors, 20(9), 2505. https://doi.org/10.3390/s20092505
Awad, F. H., Hamad, M. M., & Alzubaidi, L. (2023). Robust Classification and Detection of Big Medical Data Using Advanced Parallel K-Means Clustering, YOLOv4, and Logistic Regression. Life, 13(3), 691. https://doi.org/10.3390/life13030691
Cai, H., Yuan, Z., Gao, Y., Sun, S., Li, N., Tian, F., Xiao, H., Li, J., Yang, Z., Li, X., Zhao, Q., Liu, Z., Yao, Z., Yang, M., Peng, H., Zhu, J., Zhang, X., Gao, G., Zheng, F., … Hu, B. (2022). A multi-modal open dataset for mental-disorder analysis. Scientific Data, 9(1), 178. https://doi.org/10.1038/s41597-022-01211-x
Chehreh Chelgani, S., Nasiri, H., Tohry, A., & Heidari, H. R. (2023). Modeling industrial hydrocyclone operational variables by SHAP-CatBoost - A “conscious lab” approach. Powder Technology, 420, 118416. https://doi.org/10.1016/j.powtec.2023.118416
Chung, J., & Teo, J. (2022). Mental Health Prediction Using Machine Learning: Taxonomy, Applications, and Challenges. Applied Computational Intelligence and Soft Computing, 2022, 1–19. https://doi.org/10.1155/2022/9970363
Demir, S., & Sahin, E. K. (2023). Predicting occurrence of liquefaction-induced lateral spreading using gradient boosting algorithms integrated with particle swarm optimization: PSO-XGBoost, PSO-LightGBM, and PSO-CatBoost. Acta Geotechnica, 18(6), 3403–3419. https://doi.org/10.1007/s11440-022-01777-1
Guo, X., Gui, X., Xiong, H., Hu, X., Li, Y., Cui, H., Qiu, Y., & Ma, C. (2023). Critical role of climate factors for groundwater potential mapping in arid regions: Insights from random forest, XGBoost, and LightGBM algorithms. Journal of Hydrology, 621, 129599. https://doi.org/10.1016/j.jhydrol.2023.129599
Jang, K.-I., Kim, S., Kim, S. Y., Lee, C., & Chae, J.-H. (2021). Machine Learning-Based Electroencephalographic Phenotypes of Schizophrenia and Major Depressive Disorder. Frontiers in Psychiatry, 12. https://doi.org/10.3389/fpsyt.2021.745458
Jia, H., Yin, S., Liu, L., Yi, C., Xue, K., Li, F., Yao, D., Xu, P., & Zhang, T. (2023). A Model Combining Multi Branch Spectral-Temporal CNN, Efficient Channel Attention, and LightGBM for MI-BCI Classification. IEEE.
Kida, T., Kaneda, T., & Nishihira, Y. (2012). Dual-task repetition alters event-related brain potentials and task performance. Clinical Neurophysiology, 123(6), 1123–1130. https://doi.org/10.1016/j.clinph.2011.10.001
Maitin, A. M., Romero Muñoz, J. P., & García-Tejedor, Á. J. (2022). Survey of Machine Learning Techniques in the Analysis of EEG Signals for Parkinson’s Disease: A Systematic Review. Applied Sciences, 12(14), 6967. https://doi.org/10.3390/app12146967
Marquand, A. F., Kia, S. M., Zabihi, M., Wolfers, T., Buitelaar, J. K., & Beckmann, C. F. (2019). Conceptualizing mental disorders as deviations from normative functioning. Molecular Psychiatry, 24(10), 1415–1424. https://doi.org/10.1038/s41380-019-0441-1
Noviandy, T. R., Maulana, A., Idroes, G. M., Maulydia, N. B., Patwekar, M., Suhendra, R., & Idroes, R. (2023). Integrating Genetic Algorithm and LightGBM for QSAR Modeling of Acetylcholinesterase Inhibitors in Alzheimer’s Disease Drug Discovery. Malacca Pharmaceutics, 1(2), 48–54. https://doi.org/10.60084/mp.v1i2.60
Qian, L., Chen, Z., Huang, Y., & Stanford, R. J. (2023). Employing categorical boosting (CatBoost) and meta-heuristic algorithms for predicting the urban gas consumption. Urban Climate, 51, 101647. https://doi.org/10.1016/j.uclim.2023.101647
Qiu, R., Lin, H., Jiang, H., Shen, J., He, J., & Fu, J. (2024). Association of major depression, schizophrenia and bipolar disorder with thyroid cancer: a bidirectional two-sample mendelian randomized study. BMC Psychiatry, 24(1), 261. https://doi.org/10.1186/s12888-024-05682-7
Rivera, M. J., Teruel, M. A., Maté, A., & Trujillo, J. (2022). Diagnosis and prognosis of mental disorders by means of EEG and deep learning: a systematic mapping study. Artificial Intelligence Review, 55(2), 1209–1251. https://doi.org/10.1007/s10462-021-09986-y
Shanarova, N., Pronina, M., Lipkovich, M., Ponomarev, V., Müller, A., & Kropotov, J. (2023). Application of Machine Learning to Diagnostics of Schizophrenia Patients Based on Event-Related Potentials. Diagnostics, 13(3), 509. https://doi.org/10.3390/diagnostics13030509
Sun, D., Wu, X., Wen, H., & Gu, Q. (2023). A LightGBM-based landslide susceptibility model considering the uncertainty of non-landslide samples. Geomatics, Natural Hazards and Risk, 14(1). https://doi.org/10.1080/19475705.2023.2213807
Tao, W., Li, C., Song, R., Cheng, J., Liu, Y., Wan, F., & Chen, X. (2023). EEG-Based Emotion Recognition via Channel-Wise Attention and Self Attention. IEEE Transactions on Affective Computing, 14(1), 382–393. https://doi.org/10.1109/TAFFC.2020.3025777
Tarailis, P., Koenig, T., Michel, C. M., & Griškova-Bulanova, I. (2024). The Functional Aspects of Resting EEG Microstates: A Systematic Review. Brain Topography, 37(2), 181–217. https://doi.org/10.1007/s10548-023-00958-9
Tushar, Patel, R. K., Aggarwal, E., Solanki, K., Dahiya, O., & Yadav, S. A. (2023). A Logistic Regression and Decision Tree Based Hybrid Approach to Predict Alzheimer’s Disease. 2023 International Conference on Computational Intelligence and Sustainable Engineering Solutions (CISES), 722–726. https://doi.org/10.1109/CISES58720.2023.10183466
van den Goorbergh, R., van Smeden, M., Timmerman, D., & van Calster, B. (2022). The harm of class imbalance corrections for risk prediction models: illustration and simulation using logistic regression. Journal of the American Medical Informatics Association, 29(9), 1525–1534. https://doi.org/10.1093/jamia/ocac093
Varshney, R. D., Gadhia, R. K., Ritvik, & Bhat, A. (2023). BCI-based Emotion recognition by Combining Kernel Classifiers and EEG Feature Selection. 2023 7th International Conference on Intelligent Computing and Control Systems (ICICCS), 1809–1815. https://doi.org/10.1109/ICICCS56967.2023.10142682
Wei, X., Rao, C., Xiao, X., Chen, L., & Goh, M. (2023). Risk assessment of cardiovascular disease based on SOLSSA-CatBoost model. Expert Systems with Applications, 219, 119648. https://doi.org/10.1016/j.eswa.2023.119648
Wong, S., Simmons, A., Rivera‐Villicana, J., Barnett, S., Sivathamboo, S., Perucca, P., Ge, Z., Kwan, P., Kuhlmann, L., Vasa, R., Mouzakis, K., & O’Brien, T. J. (2023). EEG datasets for seizure detection and prediction— A review. Epilepsia Open, 8(2), 252–267. https://doi.org/10.1002/epi4.12704
Wu, Y., Cao, H., Baranova, A., Huang, H., Li, S., Cai, L., Rao, S., Dai, M., Xie, M., Dou, Y., Hao, Q., Zhu, L., Zhang, X., Yao, Y., Zhang, F., Xu, M., & Wang, Q. (2020). Multi-trait analysis for genome-wide association study of five psychiatric disorders. Translational Psychiatry, 10(1), 209. https://doi.org/10.1038/s41398-020-00902-6
Yao, D., Qin, Y., & Zhang, Y. (2022). From psychosomatic medicine, brain–computer interface to brain–apparatus communication. Brain-Apparatus Communication: A Journal of Bacomics, 1(1), 66–88. https://doi.org/10.1080/27706710.2022.2120775
Zhang, J., Xiang, J., Luo, L., & Shui, R. (2022). Editorial: EEG/MEG based diagnosis for psychiatric disorders. Frontiers in Human Neuroscience, 16. https://doi.org/10.3389/fnhum.2022.1061176
















