Tobacco Consumption and Mortality Rate Prediction using Machine Learning

Authors

  • Anila R Nambiar, Shaheena K V, Saran P MCA, Acharya Institute of Technology, Bengaluru, Karnataka, India Author

DOI:

https://doi.org/10.70849/IJSCI

Keywords:

Tobacco, Mortality Prediction, Machine Learning, Random Forest, Healthcare Analytics

Abstract

Cigarette smoking is one of the major reasons behind preventable deaths in the world and in India it was reported that approximately 28.6 percent of adults now smoked. Such a high level of prevalence poses a serious threat to the occurrence of chronic diseases and early death. The current research aims at forecasting the probability of a person dying as a result of using tobacco products by using machine learning methods incorporating lifestyle, medical, and demographic factors. Two models, Random Forest Regressor and Linear Regression were used to analyze a dataset of 2,000 patients. It was found that Random Forest was more predictive accurate (R 2 = 0.85, MAE = X ) than was Linear Regression (R 2 = 0.50, MAE = Y ). The analysis of feature importance also indicated that the risk was best determined by age, frequency of smoking, and access to healthcare services. These findings prove the existence of the potential of predictive analytics to help medical professionals to identify high-risk individuals in a more timely manner. Finally, such a strategy may be used to advocate the development of specific interventions and may help decrease the health toll of tobacco use in the long term.

Downloads

Published

25-11-2025

How to Cite

[1]
Anila R Nambiar, Shaheena K V, Saran P, “Tobacco Consumption and Mortality Rate Prediction using Machine Learning”, Int. J. Sci. Inno. Eng., vol. 2, no. 11, pp. 1351–1360, Nov. 2025, doi: 10.70849/IJSCI.