Increasing Quality in Deep Learning Models through Stratification

Patiño-Pérez, Darwin; Collantes-Farah, Alex; Ordoñez-Valencia, Maylee; Herrera-Rivas, Luisa

Increasing Quality in Deep Learning Models through Stratification (#1787)

Read Article

Date of Conference

July 17-19, 2024

Published In

"Sustainable Engineering for a Diverse, Equitable, and Inclusive Future at the Service of Education, Research, and Industry for a Society 5.0."

Location of Conference

Costa Rica

Authors

Patiño-Pérez, Darwin

Collantes-Farah, Alex

Ordoñez-Valencia, Maylee

Herrera-Rivas, Luisa

Abstract

In this study, increasing quality in deep learning models through data stratification was investigated, focusing on a dataset of patients with diabetes mellitus. An artificial neural network (ANN) was created with the objective of predicting the probability of diabetes complications based on multiple clinical and biomedical characteristics of the patients. To improve model quality, a data stratification approach was implemented, dividing the data set into homogeneous groups to ensure balanced representation of the classes of interest in the training and testing data sets. Data stratification proved to be an effective strategy to improve the quality of the deep learning model. By maintaining a balanced distribution of diabetes complication classes in the training and testing data sets, the model was able to learn more representative patterns and better generalize to unseen data. This resulted in a significant improvement in the accuracy and generalizability of the model, resulting in more accurate and useful predictions for early identification of complications in patients with diabetes mellitus. Together, these findings highlight the importance of data stratification in increasing quality in deep learning models, especially in clinical applications such as predicting medical complications. This approach has the potential to improve healthcare by enabling more accurate and earlier identification of risks for patients, which can lead to more effective interventions and better overall health outcomes.

Read Article