PERFORMANCE COMPARISON OF MACHINE LEARNING MODELS TO REDUCE MISDIAGNOSIS RATES IN PSYCHIATRIC DISORDER USING EEG DATASET

Authors

  • Wina Munada Albukhary International University
  • Desy Khalida Maharani School of Computing and Informatics, Albukhary International University
  • Anis Sofiana School of Computing and Informatics, Albukhary International University

DOI:

https://doi.org/10.59407/jdaics.v1i3.967

Keywords:

Machine Learning, Psychiatric Disorder, LightGBM, CatBoost, Logistic Regression, Diagnostic Accuracy.

Abstract

This paper aims at comparing the suitability of three machine learning models: LightGBM, CatBoost, and Logistic Regression, to lower misdiagnosis rates for psychiatric disorders. Misdiagnosis in mental health may mean improper treatment and, hence, poor outcomes for patients. Our research aims to determine the most accurate predictive model for mental health condition diagnosis that will lead to improved clinical outcomes. We trained and tested these models on an EEG dataset with patient records that have psychiatric diagnoses labeled. For all the models, evaluation and comparison are made using key performance metrics such as Accuracy, Precision, Recall, and F1-Score. Through the use of these methods, it was shown that LightGBM performed better than CatBoost and Logistic Regression, having achieved higher accuracy and F1 scores, indicating more power to make a difference among different psychiatric disorders. These results suggest that machine learning techniques, especially LightGBM, can greatly increase diagnostic accuracy and reduce misdiagnosis in psychiatric contextual systems.

Keywords Machine Learning, Psychiatric Disorder, LightGBM, CatBoost, Logistic Regression.

References

Alturki, F. A., AlSharabi, K., Abdurraqeeb, A. M., & Aljalal, M. (2020). EEG Signal Analysis for Diagnosing Neurological Disorders Using Discrete Wavelet Transform and Intelligent Techniques. Sensors, 20(9), 2505. https://doi.org/10.3390/s20092505

Awad, F. H., Hamad, M. M., & Alzubaidi, L. (2023). Robust Classification and Detection of Big Medical Data Using Advanced Parallel K-Means Clustering, YOLOv4, and Logistic Regression. Life, 13(3), 691. https://doi.org/10.3390/life13030691

Cai, H., Yuan, Z., Gao, Y., Sun, S., Li, N., Tian, F., Xiao, H., Li, J., Yang, Z., Li, X., Zhao, Q., Liu, Z., Yao, Z., Yang, M., Peng, H., Zhu, J., Zhang, X., Gao, G., Zheng, F., … Hu, B. (2022). A multi-modal open dataset for mental-disorder analysis. Scientific Data, 9(1), 178. https://doi.org/10.1038/s41597-022-01211-x

Chehreh Chelgani, S., Nasiri, H., Tohry, A., & Heidari, H. R. (2023). Modeling industrial hydrocyclone operational variables by SHAP-CatBoost - A “conscious lab” approach. Powder Technology, 420, 118416. https://doi.org/10.1016/j.powtec.2023.118416

Chung, J., & Teo, J. (2022). Mental Health Prediction Using Machine Learning: Taxonomy, Applications, and Challenges. Applied Computational Intelligence and Soft Computing, 2022, 1–19. https://doi.org/10.1155/2022/9970363

Demir, S., & Sahin, E. K. (2023). Predicting occurrence of liquefaction-induced lateral spreading using gradient boosting algorithms integrated with particle swarm optimization: PSO-XGBoost, PSO-LightGBM, and PSO-CatBoost. Acta Geotechnica, 18(6), 3403–3419. https://doi.org/10.1007/s11440-022-01777-1

Guo, X., Gui, X., Xiong, H., Hu, X., Li, Y., Cui, H., Qiu, Y., & Ma, C. (2023). Critical role of climate factors for groundwater potential mapping in arid regions: Insights from random forest, XGBoost, and LightGBM algorithms. Journal of Hydrology, 621, 129599. https://doi.org/10.1016/j.jhydrol.2023.129599

Jang, K.-I., Kim, S., Kim, S. Y., Lee, C., & Chae, J.-H. (2021). Machine Learning-Based Electroencephalographic Phenotypes of Schizophrenia and Major Depressive Disorder. Frontiers in Psychiatry, 12. https://doi.org/10.3389/fpsyt.2021.745458

Jia, H., Yin, S., Liu, L., Yi, C., Xue, K., Li, F., Yao, D., Xu, P., & Zhang, T. (2023). A Model Combining Multi Branch Spectral-Temporal CNN, Efficient Channel Attention, and LightGBM for MI-BCI Classification. IEEE.

Kida, T., Kaneda, T., & Nishihira, Y. (2012). Dual-task repetition alters event-related brain potentials and task performance. Clinical Neurophysiology, 123(6), 1123–1130. https://doi.org/10.1016/j.clinph.2011.10.001

Maitin, A. M., Romero Muñoz, J. P., & García-Tejedor, Á. J. (2022). Survey of Machine Learning Techniques in the Analysis of EEG Signals for Parkinson’s Disease: A Systematic Review. Applied Sciences, 12(14), 6967. https://doi.org/10.3390/app12146967

Marquand, A. F., Kia, S. M., Zabihi, M., Wolfers, T., Buitelaar, J. K., & Beckmann, C. F. (2019). Conceptualizing mental disorders as deviations from normative functioning. Molecular Psychiatry, 24(10), 1415–1424. https://doi.org/10.1038/s41380-019-0441-1

Noviandy, T. R., Maulana, A., Idroes, G. M., Maulydia, N. B., Patwekar, M., Suhendra, R., & Idroes, R. (2023). Integrating Genetic Algorithm and LightGBM for QSAR Modeling of Acetylcholinesterase Inhibitors in Alzheimer’s Disease Drug Discovery. Malacca Pharmaceutics, 1(2), 48–54. https://doi.org/10.60084/mp.v1i2.60

Qian, L., Chen, Z., Huang, Y., & Stanford, R. J. (2023). Employing categorical boosting (CatBoost) and meta-heuristic algorithms for predicting the urban gas consumption. Urban Climate, 51, 101647. https://doi.org/10.1016/j.uclim.2023.101647

Qiu, R., Lin, H., Jiang, H., Shen, J., He, J., & Fu, J. (2024). Association of major depression, schizophrenia and bipolar disorder with thyroid cancer: a bidirectional two-sample mendelian randomized study. BMC Psychiatry, 24(1), 261. https://doi.org/10.1186/s12888-024-05682-7

Rivera, M. J., Teruel, M. A., Maté, A., & Trujillo, J. (2022). Diagnosis and prognosis of mental disorders by means of EEG and deep learning: a systematic mapping study. Artificial Intelligence Review, 55(2), 1209–1251. https://doi.org/10.1007/s10462-021-09986-y

Shanarova, N., Pronina, M., Lipkovich, M., Ponomarev, V., Müller, A., & Kropotov, J. (2023). Application of Machine Learning to Diagnostics of Schizophrenia Patients Based on Event-Related Potentials. Diagnostics, 13(3), 509. https://doi.org/10.3390/diagnostics13030509

Sun, D., Wu, X., Wen, H., & Gu, Q. (2023). A LightGBM-based landslide susceptibility model considering the uncertainty of non-landslide samples. Geomatics, Natural Hazards and Risk, 14(1). https://doi.org/10.1080/19475705.2023.2213807

Tao, W., Li, C., Song, R., Cheng, J., Liu, Y., Wan, F., & Chen, X. (2023). EEG-Based Emotion Recognition via Channel-Wise Attention and Self Attention. IEEE Transactions on Affective Computing, 14(1), 382–393. https://doi.org/10.1109/TAFFC.2020.3025777

Tarailis, P., Koenig, T., Michel, C. M., & Griškova-Bulanova, I. (2024). The Functional Aspects of Resting EEG Microstates: A Systematic Review. Brain Topography, 37(2), 181–217. https://doi.org/10.1007/s10548-023-00958-9

Tushar, Patel, R. K., Aggarwal, E., Solanki, K., Dahiya, O., & Yadav, S. A. (2023). A Logistic Regression and Decision Tree Based Hybrid Approach to Predict Alzheimer’s Disease. 2023 International Conference on Computational Intelligence and Sustainable Engineering Solutions (CISES), 722–726. https://doi.org/10.1109/CISES58720.2023.10183466

van den Goorbergh, R., van Smeden, M., Timmerman, D., & van Calster, B. (2022). The harm of class imbalance corrections for risk prediction models: illustration and simulation using logistic regression. Journal of the American Medical Informatics Association, 29(9), 1525–1534. https://doi.org/10.1093/jamia/ocac093

Varshney, R. D., Gadhia, R. K., Ritvik, & Bhat, A. (2023). BCI-based Emotion recognition by Combining Kernel Classifiers and EEG Feature Selection. 2023 7th International Conference on Intelligent Computing and Control Systems (ICICCS), 1809–1815. https://doi.org/10.1109/ICICCS56967.2023.10142682

Wei, X., Rao, C., Xiao, X., Chen, L., & Goh, M. (2023). Risk assessment of cardiovascular disease based on SOLSSA-CatBoost model. Expert Systems with Applications, 219, 119648. https://doi.org/10.1016/j.eswa.2023.119648

Wong, S., Simmons, A., Rivera‐Villicana, J., Barnett, S., Sivathamboo, S., Perucca, P., Ge, Z., Kwan, P., Kuhlmann, L., Vasa, R., Mouzakis, K., & O’Brien, T. J. (2023). EEG datasets for seizure detection and prediction— A review. Epilepsia Open, 8(2), 252–267. https://doi.org/10.1002/epi4.12704

Wu, Y., Cao, H., Baranova, A., Huang, H., Li, S., Cai, L., Rao, S., Dai, M., Xie, M., Dou, Y., Hao, Q., Zhu, L., Zhang, X., Yao, Y., Zhang, F., Xu, M., & Wang, Q. (2020). Multi-trait analysis for genome-wide association study of five psychiatric disorders. Translational Psychiatry, 10(1), 209. https://doi.org/10.1038/s41398-020-00902-6

Yao, D., Qin, Y., & Zhang, Y. (2022). From psychosomatic medicine, brain–computer interface to brain–apparatus communication. Brain-Apparatus Communication: A Journal of Bacomics, 1(1), 66–88. https://doi.org/10.1080/27706710.2022.2120775

Zhang, J., Xiang, J., Luo, L., & Shui, R. (2022). Editorial: EEG/MEG based diagnosis for psychiatric disorders. Frontiers in Human Neuroscience, 16. https://doi.org/10.3389/fnhum.2022.1061176

Downloads

Published

2024-07-26

How to Cite

Munada, W., Maharani, D. K., & Sofiana, A. (2024). PERFORMANCE COMPARISON OF MACHINE LEARNING MODELS TO REDUCE MISDIAGNOSIS RATES IN PSYCHIATRIC DISORDER USING EEG DATASET. Journal of Data Analytics, Information, and Computer Science, 1(3), 114–127. https://doi.org/10.59407/jdaics.v1i3.967

Issue

Section

Articles