AI-Driven Early Detection of Amyotrophic Lateral Sclerosis (ALS): A Machine Learning Approach Using Acoustic Biomarkers
DOI:
https://doi.org/10.32628/IJSRST25126249Keywords:
motor neuron disease, speech, acoustic biomarkers, amyotrophic lateral sclerosis, artificial intelligence, dysarthria, computational neurobiologyAbstract
Introduction: Amyotrophic Lateral Sclerosis (ALS) is a progressive neurodegenerative disease that affects the motor nervous system, specifically the motor neurons responsible for voluntary muscle movement. One of the main affected areas in the brain is the bulbar region which controls speech. The motor neuron degeneration in the bulbar region causes speech impairment dysarthria (slurred speech) which is also linked to shorter survival. Speech impairment is often one of the earliest manifestations of ALS; however, the articulatory movements involved are so subtle, rapid, variable, and minute that identifying abnormalities demands substantial clinical expertise. Consequently, accurate diagnosis is frequently delayed until the symptoms become more pronounced. While early detection is critical, it remains challenging due to the inherent limitations of conventional diagnostic tools. In this context the advancements in machine learning and AI coupled with high quality audio systems are playing a pivotal role in finding subtle patterns in patient speech data to help neurologists in early detection of ALS symptoms. Aim: The aim is to demonstrate the application of artificial intelligence and machine learning in developing a prototype tool for the early and accurate detection of ALS. Method: The proposed tool analyzes patients’ acoustic data, identifies characteristic speech patterns associated with dysarthria, and predicts the likelihood of current or potential ALS manifestation with high accuracy. In this paper I present acoustic analysis on the VOC-ALS dataset containing voice recordings of ALS patients and healthy individuals. From the collected vocal features, a machine learning model was developed to classify ALS patients and correlate them to specific speech disorder voice characteristics. The study then led to the building of a model that was able to classify healthy subjects from ALS patients. Results: Considering a small number of the patients in the data set, the model achieved good results with a classification accuracy of 81% and demonstrated strong sensitivity (recall = 0.67) and precision (0.93) for ALS detection. Conclusion: This study demonstrates the potential of machine learning in supporting early ALS diagnosis through speech analysis by identifying subtle articulatory changes that often go unnoticed in clinical settings.
Downloads
References
Hardiman, O., Al-Chalabi, A., Chio, A., Corr, E. M., Logroscino, G., Robberecht, W., Shaw, P. J., Simmons, Z., & van den Berg, L. H. (2017). Amyotrophic lateral sclerosis. Nature reviews. Disease primers, 3, 17071. https://doi.org/10.1038/nrdp.2017.71
Kiernan, M. C., Vucic, S., Cheah, B. C., Turner, M. R., Eisen, A., Hardiman, O., Burrell, J. R., & Zoing, M. C. (2011). Amyotrophic lateral sclerosis. Lancet (London, England), 377(9769), 942–955. https://doi.org/10.1016/S0140-6736(10)61156-7
Feldman, E. L., Goutman, S. A., Petri, S., Mazzini, L., Savelieff, M. G., Shaw, P. J., & Sobue, G. (2022). Amyotrophic lateral sclerosis. The Lancet, 400(10360), 1363–1380. https://doi.org/10.1016/S0140-6736(22)01272-7
Cedarbaum, J. M., Stambler, N., Malta, E., Fuller, C., Hilt, D., Thurmond, B., & Nakanishi, A. (1999). The ALSFRS-R: A revised ALS functional rating scale that incorporates assessments of respiratory function. Journal of the Neurological Sciences, 169(1-2), 13–21. https://doi.org/10.1016/S0022-510X(99)00210-5
Duffy, J. R. (2012). Motor speech disorders: Substrates, differential diagnosis, and management (3rd ed.). Elsevier Health Sciences.
Dubbioso, R., Spisto, M., Verde, L., Iuzzolino, V. V., Senerchia, G., De Pietro, G., De Falco, I., & Sannino, G. (2024). Precision medicine in ALS: Identification of new acoustic markers for dysarthria severity assessment. Biomedical Signal Processing and Control, 89, 105706. https://doi.org/10.1016/j.bspc.2023.105706
Dubbioso, R., Spisto, M., Verde, L., Iuzzolino, V. V., Senerchia, G., Salvatore, E., De Pietro, G., De Falco, I., & Sannino, G. (2024). Voice signals database of ALS patients with different dysarthria severity and healthy controls. Scientific Data, 11(800). https://doi.org/10.1038/s41597-024-03597-2
Jadoul, Y., & Boersma, P. (2018). Introducing Parselmouth: A Python interface to Praat. Journal of Phonetics, 71, 1–15. https://doi.org/10.1016/j.wocn.2018.01.002
Vashkevich, M., Petrovsky, A., & Rushkevich, Y. (2020). Bulbar ALS detection based on analysis of voice perturbation and vibrato. Proceedings of the IEEE 2019 Signal Processing: Algorithms, Architectures, Arrangements, and Applications (SPA), 267–272. https://doi.org/10.23919/spa.2019.8936657
GeeksforGeeks. (2025, September 5). XGBoost. https://www.geeksforgeeks.org/machine-learning/xgboost/
NumPy Developers. (n.d.). NumPy. Retrieved October 15, 2025, from https://numpy.org/
The pandas development team. (n.d.). pandas – Python Data Analysis Library. Retrieved October 15, 2025, from https://pandas.pydata.org/
scikit-learn developers. (2025). sklearn.preprocessing.StandardScaler. Retrieved October 15, 2025, from https://scikit-learn.org/stable/modules/generated/sklearn.preprocessing.StandardScaler.html
scikit-learn developers. (2025). sklearn.model_selection.train_test_split. Retrieved October 15, 2025, from https://scikit-learn.org/stable/modules/generated/sklearn.model_selection.train_test_split.html
Lemaitre, G., Nogueira, F., & Aridas, C. K. (2025). imblearn.over_sampling.SMOTE. imbalanced-learn 0.14.0. Retrieved October 15, 2025, from https://imbalanced-learn.org/stable/references/generated/imblearn.over_sampling.SMOTE.html
Joblib developers. (2025). Joblib: Running Python functions as pipeline jobs. Retrieved October 15, 2025, from https://joblib.readthedocs.io/en/stable/
Downloads
Published
Issue
Section
License
Copyright (c) 2025 International Journal of Scientific Research in Science and Technology

This work is licensed under a Creative Commons Attribution 4.0 International License.
https://creativecommons.org/licenses/by/4.0