Publication
Development of CART model for prediction of tuberculosis treatment loss to follow up in the state of São Paulo, Brazil: A case–control study
datacite.subject.fos | Ciências Médicas::Medicina Clínica | |
datacite.subject.fos | Ciências Naturais::Ciências da Computação e da Informação | |
datacite.subject.sdg | 07:Energias Renováveis e Acessíveis | |
datacite.subject.sdg | 09:Indústria, Inovação e Infraestruturas | |
datacite.subject.sdg | 11:Cidades e Comunidades Sustentáveis | |
dc.contributor.author | Yamaguti, Verena Hokino | |
dc.contributor.author | Alves, Domingos | |
dc.contributor.author | Rijo, Rui, Rui Pedro Charters Lopes | |
dc.contributor.author | Miyoshi, Newton Shydeo Brandão | |
dc.contributor.author | Ruffino-Netto, Antônio | |
dc.date.accessioned | 2025-07-30T12:00:52Z | |
dc.date.available | 2025-07-30T12:00:52Z | |
dc.date.issued | 2020-09 | |
dc.description | Article number - 104198 | |
dc.description.abstract | Background: Tuberculosis is the leading cause of infectious disease-related death, surpassing even the immunodeficiency virus. Treatment loss to follow up and irregular medication use contribute to persistent morbidity and mortality. This increases bacillus drug resistance and has a negative impact on disease control. Objective: This study aims to develop a computational model that predicts the loss to follow up treatment in tuberculosis patients, thereby increasing treatment adherence and cure, reducing efforts regarding treatment relapses and decreasing disease spread. Methods: This is a case-controlled study. Included in the data set were 103,846 tuberculosis cases from the state of São Paulo. They were collected using the TBWEB, an information system used as a tuberculosis treatment monitor, containing samples from 2006 to 2016. This set was later resampled into 6 segments with a 1-1 ratio. This ratio was used to avoid any bias during the model construction. Results: The Classification and Regression Trees were used as the prediction model. Training and test sets accounted for 70% in the former and 30% in the latter of the tuberculosis cases. The model displayed an accuracy of 0.76, F-measure of 0.77, sensitivity of 0.80 and specificity of 0.71. The model emphasizes the relationship between several variables that had been identified in previous studies as related to patient cure or loss to follow up treatment in tuberculosis patients. Conclusion: It was possible to construct a predictive model for loss to follow up treatment in tuberculosis patients using Classification and Regression Trees. Although the fact that the ideal predictive ability was not achieved, it seems reasonable to propose the use of Classification and Regression Trees models to predict likelihood of treatment follow up to support healthcare professionals in minimising the loss to follow up. | eng |
dc.description.sponsorship | We would like to thank the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior – Brasil (CAPES) – Finance Code 001 and São Paulo Research Foundation (FAPESP) (Grant Nos. 2018/23963-2 and 2018/00307-2). | |
dc.identifier.citation | Verena Hokino Yamaguti, Domingos Alves, Rui Pedro Charters Lopes Rijo, Newton Shydeo Brandão Miyoshi, Antônio Ruffino-Netto, Development of CART model for prediction of tuberculosis treatment loss to follow up in the state of São Paulo, Brazil: A case–control study, International Journal of Medical Informatics, Volume 141, 2020, 104198, ISSN 1386-5056, https://doi.org/10.1016/j.ijmedinf.2020.104198. | |
dc.identifier.doi | 10.1016/j.ijmedinf.2020.104198 | |
dc.identifier.eissn | 1872-8243 | |
dc.identifier.issn | 1386-5056 | |
dc.identifier.uri | http://hdl.handle.net/10400.8/13792 | |
dc.language.iso | eng | |
dc.peerreviewed | yes | |
dc.publisher | Elsevier | |
dc.relation.hasversion | https://www.sciencedirect.com/science/article/pii/S1386505619314133?via%3Dihub | |
dc.relation.ispartof | International Journal of Medical Informatics | |
dc.rights.uri | http://creativecommons.org/licenses/by/4.0/ | |
dc.subject | Tuberculosis | |
dc.subject | Treatment loss to follow up | |
dc.subject | Prediction model | |
dc.subject | Feature selection | |
dc.title | Development of CART model for prediction of tuberculosis treatment loss to follow up in the state of São Paulo, Brazil: A case–control study | eng |
dc.type | journal article | |
dspace.entity.type | Publication | |
oaire.citation.endPage | 5 | |
oaire.citation.startPage | 1 | |
oaire.citation.title | International Journal of Medical Informatics | |
oaire.citation.volume | 141 | |
oaire.version | http://purl.org/coar/version/c_970fb48d4fbd8a85 | |
person.familyName | Rijo | |
person.givenName | Rui | |
person.identifier.orcid | 0000-0002-9348-0474 | |
person.identifier.scopus-author-id | 36861366200 | |
relation.isAuthorOfPublication | e69d7599-392c-4f8f-a96a-bf0a0d15c8b1 | |
relation.isAuthorOfPublication.latestForDiscovery | e69d7599-392c-4f8f-a96a-bf0a0d15c8b1 |
Files
Original bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- Development of CART model for prediction of tuberculosis treatment loss to follow up in the state of São Paulo, Brazil A case–control study.pdf
- Size:
- 973.45 KB
- Format:
- Adobe Portable Document Format
- Description:
- Background: Tuberculosis is the leading cause of infectious disease-related death, surpassing even the immunodeficiency virus. Treatment loss to follow up and irregular medication use contribute to persistent morbidity and mortality. This increases bacillus drug resistance and has a negative impact on disease control. Objective: This study aims to develop a computational model that predicts the loss to follow up treatment in tuberculosis patients, thereby increasing treatment adherence and cure, reducing efforts regarding treatment relapses and decreasing disease spread. Methods: This is a case-controlled study. Included in the data set were 103,846 tuberculosis cases from the state of São Paulo. They were collected using the TBWEB, an information system used as a tuberculosis treatment monitor, containing samples from 2006 to 2016. This set was later resampled into 6 segments with a 1-1 ratio. This ratio was used to avoid any bias during the model construction. Results: The Classification and Regression Trees were used as the prediction model. Training and test sets accounted for 70% in the former and 30% in the latter of the tuberculosis cases. The model displayed an accuracy of 0.76, F-measure of 0.77, sensitivity of 0.80 and specificity of 0.71. The model emphasizes the relationship between several variables that had been identified in previous studies as related to patient cure or loss to follow up treatment in tuberculosis patients. Conclusion: It was possible to construct a predictive model for loss to follow up treatment in tuberculosis patients using Classification and Regression Trees. Although the fact that the ideal predictive ability was not achieved, it seems reasonable to propose the use of Classification and Regression Trees models to predict likelihood of treatment follow up to support healthcare professionals in minimising the loss to follow up.
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.32 KB
- Format:
- Item-specific license agreed upon to submission
- Description: