Statistical count models for predicting the number of treatment dropouts in Pediatric Tuberculosis patients
Main Article Content
Abstract
Tuberculosis (TB) is a bacterial infectious disease that remains a challenging and unexplored public health concern in pediatric patients. This study aims to fit a suitable count-based regression model and identify the key factors associated with treatment dropout in paediatric TB patients. The retrospective data of 2086 pediatric TB patients was obtained from the Nikshay database at the Department of TB and Respiratory Diseases of Sir Sunderlal Hospital, Banaras Hindu University, Varanasi, Uttar Pradesh, India. The analysis utilized various count data models, including Poisson, Negative Binomial, Zero-Inflated, Hurdle and Zero- truncated models. Model performance was assessed using Akaike Information Criterion, Bayesian Information Criterion and log likelihood values which confirmed the models’ effectiveness.The findings suggest that some key factors, such as “Missed followup” and “Contact tracing not done”, as well as patients’ failing to access the nutritional and financial support provided by the National Tuberculosis Elimination Programme, are significant factors contributing to patients’ non-adherence to TB treatment. These insights can inform public health strategies aimed at increasing support services, enhancing TB control programs, and contributing to broader TB elimination efforts.
Article Details

This work is licensed under a Creative Commons Attribution 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution License that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).
References
1. Akaike, H., Petrov, B. N. & Csaki, F. Second international symposium on information theory 1973.
2. Azevedo, A. M., Silva, Í. J., Nery, M. C., Rocha, H. P. & Santana, R. A. Counting models for
overdispersed data: A review with application to tuberculosis data. Brazilian Journal of Biometrics
41, 274–286 (2023).
3. Balakrishnan, M. & Varadharajan, R. Spatial patterns and multilevel analysis of factors associated
with paediatric tuberculosis in India. Indian Journal of Tuberculosis 72, S12–S17 (2025).
4. Barreto-Duarte, B. et al. Tuberculosis burden and determinants of treatment outcomes according to age in Brazil: a nationwide study of 896,314 cases reported between 2010 and
2019. Frontiers in Medicine 8, 706689 (2021).
5. Basa, S. & Venkatesh, S. Study on default and its factors associated among Tuberculosis patients
treated under DOTS in Mayurbhanj District, Odisha. Journal of Health Research and Reviews
(In Developing Countries) 2, 25–28 (2015).
6. Bekalo, D. B. & Kebede, D. T. Zero-inflated models for count data: an application to number
of antenatal care service visits. Annals of data science 8, 683–708 (2021).
7. Cameron, A. C. & Trivedi, P. K. Regression analysis of count data 53 (Cambridge university
press, 2013).
8. Central TB Division, M. o. H. & FamilyWelfare, G. o. I. National Strategic Plan for Tuberculosis
Elimination 2017–2025 https : / / tbcindia . mohfw. gov . in / wp - content / uploads / 2023 / 05 /
National-Strategic-Plan-2017-25.pdf . Accessed: 2025-05-01. 2017.
9. Chakrabarti, A. & Ghosh, J. K. AIC, BIC and recent advances in model selection. Philosophy
of statistics, 583–605 (2011).
10. Cruyff, M. J. & Van Der Heijden, P. G. Point and interval estimation of the population size
using a zero-truncated negative binomial regression model. Biometrical Journal: Journal of Mathematical Methods in Biosciences 50, 1035–1050 (2008).
11. CTD, C. T. D. India TB Report 2024 https://tbcindia.mohfw.gov.in/2024/10/11/india-tbreport-
2024/. Accessed: 2025-05-01. 2024.
12. Dwivedi, A. K., Dwivedi, S. N., Deo, S., Shukla, R. & Kopras, E. Statistical models for predicting
number of involved nodes in breast cancer patients. Health 2, 641 (2010).
13. Fajri, A., Rahmi, N., Maharani, P. A. & Amal, M. I. Modeling tuberculosis in children under
five using poisson and negative binomial regression. Desimal: Jurnal Matematika 7, 311–320
(2024).
14. Greene, W. H. Accounting for excess zeros and sample selection in Poisson and negative binomial regression models (1994).
15. Haight, F. A. Handbook of the Poisson distribution. (No Title) (1967).
16. Hall, D. B. & Zhang, Z. Marginal models for zero inflated clustered data. Statistical modelling
4, 161–180 (2004).
17. Hilbe, J. M. Modeling count data (Cambridge University Press, 2014).
18. Hutchinson, M. K. & Holtman, M. C. Analysis of count data using poisson regression. Research
in nursing & health 28, 408–418 (2005).
19. Karazsia, B. T. & Van Dulmen, M. H. Regression models for count data: illustrations using
longitudinal predictors of childhood injury. Journal of pediatric psychology 33, 1076–1084 (2008).
20. Kibria, B. G. Applications of some discrete regression models for count data. Pakistan Journal
of Statistics and Operation Research, 1–16 (2006).
21. Lambert, D. Zero-inflated Poisson regression, with an application to defects in manufacturing.
Technometrics 34, 1–14 (1992).
22. MacCallum, R. C., Zhang, S., Preacher, K. J. & Rucker, D. D.On the practice of dichotomization
of quantitative variables. Psychological methods 7, 19 (2002).
23. Mullahy, J. Specification and testing of some modified count data models. Journal of econometrics 33, 341–365 (1986).
24. Muniswamy, B & Lavanya, M. ZERO TRUNCATED POISSON REGRESSION MODEL
FOR REPRODUCTIVE PATTERNS ON COUNT DATA. Reliability: Theory & Applications
20, 1070–1088 (2025).
25. Mwalili, S. M., Lesaffre, E. & Declerck, D. The zero-inflated negative binomial regression
model with correction for misclassification: an example in caries research. Statistical methods in
medical research 17, 123–139 (2008).
26. Myers, R. H., Montgomery, D. C., Vining, G. G. & Robinson, T. J. Generalized linear models:
with applications in engineering and the sciences (JohnWiley & Sons, 2012).
27. Rose, C. E., Martin, S. W.,Wannemuehler, K. A. & Plikaytis, B. D.On the use of zero-inflated
and hurdle models for modeling vaccine adverse event count data. Journal of biopharmaceutical
statistics 16, 463–481 (2006).
28. Schwarz, G. Estimating the dimension of a model. The annals of statistics, 461–464 (1978).
29. Seddon, J. A. et al. Counting children with tuberculosis: why numbers matter. The International
Journal of Tuberculosis and Lung Disease 19, S9–S16 (2015).
30. Shrestha, N. Detecting multicollinearity in regression analysis. American journal of applied
mathematics and statistics 8, 39–42 (2020).
31. Verma, P., Swain, P. K., Singh, K. K. & Khetan, M. Count data regression modeling: an
application to spontaneous abortion. Reproductive Health 17, 1–9 (2020).
32. WHO, W. H. O. Global tuberculosis report 2024 https : / / www . who . int / teams / global -
programme-on-tuberculosis-and-lung-health/tb-reports/global-tuberculosis-report-2024/.
Accessed: 2025-05-01. 2024.
33. Winkelmann, R. & Zimmermann, K. F. Count data models for demographic data. Mathematical
Population Studies 4, 205–221 (1994).
34. Yotenka, R. & Banapon, A. Modelling the Number of Tuberculosis (TB) Cases in Indonesia using
Poisson Regression and Negative Binomial Regression in The 2nd International Seminar on Science
and Technology (ISSTEC 2019) (2020), 36–42.
35. Zeileis, A., Kleiber, C. & Jackman, S. Regression models for count data in R. Journal of statistical
software 27, 1–25 (2008).