Research Article | Peer-Reviewed

Enhancing Early Tuberculosis Detection Using CGAN Augmentation and Deep Transfer Learning Models

Received: 13 October 2025     Accepted: 29 October 2025     Published: 28 November 2025
Abstract

Tuberculosis (TB) remains a leading infectious disease worldwide, and early, reliable screening using chest X-rays (CXRs) is essential in low-resource settings. The scarcity of labeled TB-positive CXR images limits the effectiveness of deep learning models. This study investigates whether Conditional Generative Adversarial Networks (CGANs) can generate realistic TB-positive CXR images to balance training data and improve the classification performance of fine-tuned deep transfer learning (DTL) models. We trained a CGAN (LSGAN formulation) to synthesize class-conditional grayscale CXR images at 128 × 128 resolution and used the generated images to augment the Shenzhen TB dataset. Three pre-trained DTL architectures (DenseNet121, VGG16, and MobileNetV3Small) were fine-tuned on both original and CGAN-augmented datasets. Experiments used stratified 70/10/20 train/validation/test splits and a fixed random seed (random_state=42) to ensure reproducibility. Model performance was evaluated using accuracy, precision, recall (sensitivity), F1-score, confusion matrices, and ROC/AUC curves. The experiments were executed on an NVIDIA Tesla P100 GPU (16GB) in a Kaggle runtime environment; the baseline experimental run (CGAN plus classifier) completed in a wall-clock time of 39 minutes 30 seconds. CGAN augmentation produced consistent improvements across models: DenseNet121 improved from 93.0% to 94.6% test accuracy, VGG16 improved from 96.3% to 96.8%, and MobileNetV3Small improved from 93.0% to 93.5%. Class-conditional GAN augmentation can modestly but usefully improve DTL classifier performance in TB detection when labeled data are scarce, though further cross-dataset validation is required before clinical deployment.

Published in International Journal of Data Science and Analysis (Volume 11, Issue 6)
DOI 10.11648/j.ijdsa.20251106.14
Page(s) 186-204
Creative Commons

This is an Open Access article, distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution and reproduction in any medium or format, provided the original work is properly cited.

Copyright

Copyright © The Author(s), 2025. Published by Science Publishing Group

Keywords

Tuberculosis Detection, CGAN, Deep Transfer Learning, Medical Imaging, CNN, Data Augmentation

1. Introduction
Tuberculosis (TB), caused by Mycobacterium tuberculosis, remains a major public health concern. Despite global control efforts, diagnostic challenges persist due to limited access to radiological expertise and unbalanced imaging datasets. Chest X-rays (CXRs) are a standard screening tool, but interpretation accuracy varies among radiologists. Deep learning, particularly Convolutional Neural Networks (CNNs), has improved disease recognition in medical imaging; however, data imbalance and limited availability hinder generalization. To overcome this, Conditional Generative Adversarial Networks (CGANs) can synthetically expand TB-positive samples while preserving realistic texture and contrast features, thus improving classifier robustness.
In medical imaging, flexible statistical methods have long been used to detect changes in patterns over time and across images. Statistical change-detection methods applied to series of MRI scans have shown that such models can identify even subtle disease-related changes. This line of work parallels newer machine learning techniques, such as using GANs to synthesize medical images: GAN-generated images have been used to enlarge training sets and improve the identification of liver conditions when real images were scarce. Together, these studies trace a progression from classical statistics, to change-detection methods, to AI that learns from examples, with the shared goal of capturing differences in data more faithfully so that detection and diagnosis in medical imaging become more reliable.
2. Literature Review
2.1. Detection of Diseases Using Deep Transfer Learning Techniques
Deep Transfer Learning (DTL) is a method that utilizes models trained on large, non-medical datasets and adapts them to related medical imaging tasks. This approach has received significant attention in medical imaging for disease detection due to its ability to leverage pre-trained models. These models extract general image features, which can then be fine-tuned for specific medical tasks such as tuberculosis detection. DTL is especially effective in addressing the challenges posed by small and imbalanced datasets, which are common in healthcare research. For example, models pre-trained on natural image data can be fine-tuned on chest X-rays (CXRs) to improve performance in lung cancer detection. Similarly, DTL has been employed for brain tumor classification using magnetic resonance imaging (MRI) scans, proving effective even with limited annotated data.
Transferring learned representations from large natural image datasets to medical domains enables DTL to facilitate robust model training with limited data. This capability is particularly important in low-resource settings where annotated CXR data are scarce. The approach improves model generalization and diagnostic accuracy and supports rapid deployment of AI-based diagnostic systems in healthcare environments. Overall, DTL offers a cost-effective and scalable framework for disease detection by fine-tuning pre-trained networks to identify disease-specific patterns from CXR data.
2.2. Diagnosis Using CGAN from Chest X-ray Images
Generative Adversarial Networks (GANs) are neural networks composed of a generator and a discriminator that compete to produce realistic synthetic data. Conditional GANs (CGANs), a specific type of GAN, have demonstrated significant potential for augmenting medical image datasets by generating realistic synthetic images. CGANs generate images conditioned on specific class labels, such as TB-positive or normal, which enables targeted data augmentation. In TB detection, CGANs can produce high-quality synthetic CXR images that represent various manifestations of TB, thereby enriching limited datasets. CGANs outperform other GAN architectures on image-quality measures such as the Fréchet Inception Distance (FID), making them suitable for medical image enhancement tasks.
Augmented datasets generated by CGANs enhance the performance of classifiers and segmentation models by introducing more diverse training examples. Prior work has highlighted the value of digital tools and mobile technologies in improving healthcare accessibility, which aligns with the integration of AI-based CGAN systems into TB screening workflows. CGAN-generated CXR images also address class imbalance, such as the disparity between normal and TB-positive images, which is a significant limitation in medical image classification. By synthesizing realistic data, CGANs improve model robustness, reduce overfitting, and support better generalization across patient populations.
2.3. Use of CGAN and DTL to Classify Chest X-ray Images
The integration of CGANs with DTL techniques establishes a robust hybrid approach for CXR classification. Studies have demonstrated that synthetic CXR images generated by CGANs can be effectively used to fine-tune pre-trained convolutional neural networks (CNNs), improving classification performance for TB and pneumonia detection. Related work has validated the effectiveness of CGAN-augmented DTL models for pneumonia diagnosis, where data augmentation contributed to enhanced overall classification accuracy.
This hybrid methodology enables pre-trained models to learn discriminative features from both real and synthetic data, thereby addressing data scarcity challenges. Expanding the training dataset with CGAN-generated images allows DTL models to achieve improved feature representation and reduced bias from underrepresented classes. The combination also mitigates the risks of overfitting and underfitting associated with small datasets. In TB detection, the synergy between generative and transfer learning enhances model sensitivity and specificity, supporting reliable automated diagnosis in resource-constrained healthcare settings.
2.4. Transfer Learning in X-ray Image Disease Detection
Transfer learning has become an essential tool in disease detection applications, including TB diagnosis, as it enables pre-trained models to adapt to specific medical domains. DTL allows models trained on large-scale datasets such as ImageNet to be repurposed for medical tasks. The success of lightweight architectures such as VGG16 in TB diagnosis has also been highlighted, as these architectures balance computational efficiency and performance. Transfer learning is similarly effective when annotated datasets are limited; models such as ResNet and InceptionV3 have achieved strong results even when fine-tuned on small CXR datasets.
Beyond TB, artificial neural networks have been used to predict diabetes prevalence among Kenyan adults, illustrating the generalizability of data-driven methods in healthcare analytics.
These findings underscore the capacity of neural networks to learn complex, nonlinear relationships in medical data. In TB detection, pre-trained models such as GoogleNet, VGGNet, Inception, and DenseNet can be adapted to identify disease-specific patterns from limited CXR datasets. The efficiency of DTL in this context ensures optimal use of available data, thereby improving diagnostic accuracy and reliability in TB detection.
2.5. The Classification Performance of Deep Transfer Learning Techniques for Chest X-rays
Osman Güler et al. investigated pneumonia detection using CXR datasets and compared several DTL architectures, including DenseNet121, DenseNet169, ResNet50, ResNet101, MobileNetV2, VGG16, Xception, and InceptionV3. The results indicated that Xception achieved the highest classification performance, followed by InceptionV3. Other work extended this approach by integrating CNNs with traditional machine learning models such as random forests, which enhanced predictive robustness in image-based tasks. These hybrid frameworks highlight the potential of combining discriminative and generative models to improve performance and generalization in TB imaging.
Additionally, researchers have investigated TB detection using DTL models pre-trained on ImageNet, achieving an accuracy of 97.07% on full CXRs and approximately 99.9% on segmented lung regions obtained using U-Net.
These findings confirm that pre-trained CNNs can achieve near-perfect performance in TB classification when fine-tuned appropriately. One study proposed a CNN Dempster-Shafer framework to detect TB traces in CXRs, achieving an accuracy of 94.21%. Collectively, these studies demonstrate that DTL-based models can extract highly discriminative features from CXR data, establishing them as essential tools for automated disease diagnosis.
2.6. Diagnosis Using Conditional Generative Adversarial Networks from X-ray Images
To address class imbalance and data scarcity in CXR datasets, a multi-scale CGAN with an attention mechanism has been proposed. This model synthesizes high-resolution images for various diseases and efficiently manages long-distance dependencies in image generation through self-attention. Consequently, a single network can generate multiple disease classes simultaneously. The adaptability of this model reduces computational costs and eliminates the need to train separate networks for each disease, making it particularly effective for TB and pneumonia image synthesis.
Tomohiro Kikuchi et al demonstrated that synthetic images generated using ctGAN models could be combined with real CXRs to train CNN models, such as VGG19 and DenseNet121, for multi-disease detection. Remarkably, using only 10% of real images supplemented with ctGAN-generated images achieved performance comparable to training with the full dataset. This highlights the potential of synthetic augmentation for cases with limited annotated CXRs.
Leveraging synthetic labels derived from conditional models enables researchers to effectively address data limitations in deep learning-based medical image analysis. As a result, CGANs expand data availability and enhance diagnostic accuracy and consistency in CXR-based disease detection.
Deep Transfer Learning (DTL) architectures such as VGG16, DenseNet121, and MobileNetV3 have demonstrated strong performance in medical imaging. These models transfer feature knowledge from large-scale datasets like ImageNet to specialized tasks, reducing computational cost and training data requirements. Combining CGANs and DTL yields augmented, class-balanced datasets, addressing the common limitation of data scarcity while boosting predictive performance.
3. Methodology
3.1. Proposed Model Overview
Algorithm 1: Suggested Lightweight DTL Model based on CGAN for CXR Classification
Data: CXR images Xinput, Yinput, where Yinput = {y | y ∈ {NORMAL, TUBERCULOSIS}}
Result: A trained DTL model that classifies each CXR image x ∈ Xinput
Preprocessing:
1) Resize the CXR images to 128 × 128 pixels
2) Normalize the CXR images
3) Denoise and enhance contrast using CLAHE
4) Augment the dataset with additional TB-positive images using the CGAN
Lightweight DTL models M = {MobileNetV3Small, DenseNet121, VGG16}
for each model m ∈ M do
    Initialize learning rate µ = 0.001
    for epoch = 1 to 200 do
        for each mini-batch (Xi, yi) in (Xtrain, ytrain) do
            Update the weights of the DTL model m(·)
Evaluation:
for each x ∈ Xtest do
    Evaluate the performance of all DTL models m ∈ M
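The control flow of Algorithm 1 can be sketched as follows. This is a minimal illustration in which NumPy arrays stand in for CXR images and the fine-tuning step is a placeholder; the function names are ours, not from the paper's released code.

```python
import numpy as np

def normalize(img):
    # Scale pixel intensities from [0, 255] to [-1, 1]
    # (matching the tanh output range of the CGAN generator)
    return img.astype(np.float32) / 127.5 - 1.0

# The three lightweight DTL models named in Algorithm 1
MODELS = ["MobileNetV3Small", "DenseNet121", "VGG16"]

def run_pipeline(images):
    # Preprocess: stack normalized 128 x 128 images into one tensor
    x = np.stack([normalize(im) for im in images])
    results = {}
    for name in MODELS:
        # Placeholder: fine-tune model `name` (LR 0.001, up to 200 epochs)
        # on mini-batches of (Xtrain, ytrain); here we only record the shape
        results[name] = x.shape
    return results

imgs = [np.random.randint(0, 256, (128, 128), dtype=np.uint8) for _ in range(4)]
out = run_pipeline(imgs)
```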
3.2. Conditional Generative Adversarial Network (CGAN)
A CGAN consists of two adversarial components: the generator G and the discriminator D. The generator learns to produce synthetic samples that mimic real TB-positive CXRs, while the discriminator distinguishes between real and generated images.
The discriminator loss under the Least Squares GAN (LSGAN) framework is:
L_D = ½ E_{x∼p_data}[(D(x | y) − 1)²] + ½ E_{z∼p_z}[D(G(z | y))²]   (1)
and the generator loss is defined as:
L_G = ½ E_{z∼p_z}[(D(G(z | y)) − 1)²]   (2)
where y denotes the class label (TB-positive or normal), x represents real images, and z is a latent noise vector.
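Equations (1) and (2) translate directly into code; a minimal NumPy sketch (function names are ours) operating on batches of discriminator scores:

```python
import numpy as np

def lsgan_d_loss(d_real, d_fake):
    # Eq. (1): L_D = 1/2 E[(D(x|y) - 1)^2] + 1/2 E[D(G(z|y))^2]
    return 0.5 * np.mean((d_real - 1.0) ** 2) + 0.5 * np.mean(d_fake ** 2)

def lsgan_g_loss(d_fake):
    # Eq. (2): L_G = 1/2 E[(D(G(z|y)) - 1)^2]
    return 0.5 * np.mean((d_fake - 1.0) ** 2)

# A perfect discriminator scores real samples 1 and fakes 0, giving zero D loss;
# a generator that fully fools D (scores of 1 on fakes) gives zero G loss.
d_perfect = lsgan_d_loss(np.ones(8), np.zeros(8))
g_perfect = lsgan_g_loss(np.ones(8))
```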
Activation functions used include the Leaky ReLU for intermediate layers and Tanh for generator output:
f(x) = x for x ≥ 0;  f(x) = 0.01x for x < 0   (3)
Figure 1. Architecture of the Conditional GAN (CGAN). The generator and discriminator networks are trained simultaneously to produce realistic synthetic CXRs.
3.3. Deep Transfer Learning Models
Figure 2. Illustration of feature map generation in convolutional neural networks (3D view). (a) Convolutional layers applying sliding filters to the input image or input feature map to produce output feature maps. (b) Pooling layers reducing spatial dimensions by summarizing regions of the previous layer’s feature maps to generate the next layer’s feature maps.
Three pre-trained CNN architectures were fine-tuned: DenseNet121, VGG16, and MobileNetV3Small. The input to neuron i in layer l, denoted Z_i^l, can be calculated using:

Z_i^l = B_i^l + Σ_{a=1}^{M} Σ_{b=1}^{M} W_{ab} X^{l−1}_{(i+a)(j+b)}   (4)

where B_i^l is a bias matrix and W is a mask (filter) matrix of size M × M. The bias and weights enable the network to learn richer representations. Parameters a and b are typically incremented in steps of 1. The convolutional layer then employs the activation function defined in the following equation.
Net=r(Zil)(5)
Where r(·) is a function that introduces nonlinearity, which is essential in DTL methods. Two common activation functions, used to accelerate the learning process, are the Rectified Linear Unit (ReLU) and the sigmoid. The ReLU function is defined as:
r(u) = max(0, u)   (6)
It outputs the input value if it is positive, or zero otherwise, allowing faster convergence during training. The sigmoid activation, on the other hand, creates a smooth S-shaped curve transitioning between 0 and 1, which can be beneficial for specific classification tasks. The sigmoid function is defined as:
f(x) = 1 / (1 + e^{−x})   (7)
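Both activations are one-liners; a quick NumPy sketch of Eqs. (6) and (7):

```python
import numpy as np

def relu(u):
    # Eq. (6): pass positive values through, zero otherwise
    return np.maximum(0.0, u)

def sigmoid(x):
    # Eq. (7): smooth S-shaped curve between 0 and 1
    return 1.0 / (1.0 + np.exp(-x))
```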
Given an input tensor X of size n × m × c, where n and m are the spatial resolutions and c is the number of channels, a CNN is trained as follows. The mathematical expressions for the various layers of the CNN are defined below.
The convolutional layer’s output, Yl,j, for the j-th neuron in the l-th layer is given by:
Y_{l,j} = σ( Σ_{i∈S_j} X_{l−1,i} ×_n W_{l,i,j} + b_{l,j} )   (8)
Here, Xl−1,i is the input tensor of the i-th neuron in the (l − 1)-th layer, wl,i,j represents the weights connecting the i-th neuron of the (l − 1)-th layer to the j-th neuron of the l-th layer, and
bl,j is the corresponding bias. The term Xl−1,i ×n wl,i,j denotes the mode-n product, a key operation in tensor algebra. Sj is the set of indices for neurons in the (l − 1)-th layer that are connected to the j-th neuron in the l-th layer. The activation function σ is the ReLU function, defined as:
σ(Y_{l,j}) = max(0, Y_{l,j})   (9)
Following the convolutional layer, the Max Pooling layer down-samples the feature maps. The output of the j-th neuron in the pooling layer, MPl,j, is determined by the maximum value within a specific region Rj of the previous layer’s output:
MP_{l,j} = max_{i∈R_j}(Y_{l,i})   (10)
The fully connected layers process the pooled features. The output Zl of a fully connected layer is obtained, by combining outputs from all neurons in the previous layer using learned weights and biases, as a weighted sum of the previous layer’s output MPl−1 plus a bias term, followed by an activation function Al:
Z_l = W_l · MP_{l−1} + b_{l,j}   (11)
A_l = σ(Z_l)   (12)
Here, Wl is the weight matrix and bl,j is the bias. The final output of the CNN, Ycnn, is the result of a linear function applied to the output of the last fully connected layer, ZL:
Y_cnn = linear(Z_L)   (13)
Here, L represents the total number of layers in the CNN.
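The convolution, ReLU, and max-pooling steps of Eqs. (8)-(10) can be illustrated with a minimal single-channel NumPy sketch (a toy example, not the fine-tuned networks themselves):

```python
import numpy as np

def conv2d_valid(x, w, b):
    # Single-channel 2-D convolution with "valid" padding: each output cell is
    # the filter applied to one k x k window plus a bias (cf. Eq. 8)
    n, m = x.shape
    k = w.shape[0]
    out = np.empty((n - k + 1, m - k + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(x[i:i + k, j:j + k] * w) + b
    return out

def max_pool(x, s=2):
    # Eq. (10): maximum over non-overlapping s x s regions
    n, m = x.shape
    return x[:n - n % s, :m - m % s].reshape(n // s, s, m // s, s).max(axis=(1, 3))

x = np.arange(16.0).reshape(4, 4)            # toy 4 x 4 "image"
y = np.maximum(0.0, conv2d_valid(x, np.ones((2, 2)), 0.0))  # conv + ReLU (Eqs. 8-9)
p = max_pool(y)                              # down-sampled feature map (Eq. 10)
```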
3.4. Pre-trained DTL Models Architectures
MobileNetV3S Model
MobileNetV3-Small is a compact variant of the MobileNetV3 architecture designed for mobile and edge devices, with fewer layers and a narrower width to reduce model size and computational requirements. Its primary building blocks are inverted residual blocks integrated with Squeeze-and-Excitation (SE) modules, which recalibrate channel-wise feature responses.
MobileNetV3-Small employs two specialized activation functions:
1) h_sigmoid(x) = ReLU6(x + 3)/6, a "hard" version of the sigmoid, and
2) h_swish(x) = x · h_sigmoid(x), an efficient approximation of the swish activation.
The MobileNetV3 architecture provides a balance between speed and performance for on-device applications.
These steps are mathematically expressed as:

z = σ(BN(W1 · x)),   y = W3 · δ(BN(W2 ∗ z))   (14)

Figure 3. The MobileNetV3 Small model architecture.
where:
1) x is the input feature map,
2) W1, W2, W3 are the weights for expansion, depthwise, and projection layers respectively,
3) ∗ denotes depthwise convolution,
4) σ and δ are non-linear activation functions, typically ReLU6,
5) and BN represents Batch Normalization.
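The hard activations above are cheap to compute; a sketch, assuming the standard MobileNetV3 definitions:

```python
import numpy as np

def relu6(x):
    # ReLU capped at 6
    return np.minimum(np.maximum(x, 0.0), 6.0)

def h_sigmoid(x):
    # "hard" sigmoid used by MobileNetV3: ReLU6(x + 3) / 6
    return relu6(x + 3.0) / 6.0

def h_swish(x):
    # efficient approximation of swish: x * h_sigmoid(x)
    return x * h_sigmoid(x)
```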
VGG-16 Model
VGG-16 contains 13 convolutional layers with 3 × 3 filters. The convolution filters perform linear transformations on the input, and each convolutional layer is followed by a Rectified Linear Unit (ReLU) activation function, which speeds up training.
x^(l) = s(W^(l) ∗ x^(l−1) + b^(l))   (15)
where ∗ denotes the convolution operation, s(·) represents the ReLU nonlinearity, and b(l) are the bias parameters. Down-sampling is performed by max-pooling operations:
x^(l) = MaxPool(x^(l−1))   (16)
Following the flattening of the convolutional output, the fully connected layers are defined as:
z = Flatten(x^(L)),   (17)
h = ReLU(W_1 z + b_1),   (18)
The loss function calculates the minimum distance between each ground truth class and the predicted candidates, where d is the distance function defined as:
d(c_i, c_k) = 0 if c_i = c_k, and 1 otherwise   (19)
and the error function is defined as:
E = (1/n) Σ_k min_i d(c_i, G_k)   (20)
VGG-16 uses 13 convolutional and 3 fully connected layers, resulting in a total of 16 weight layers. Despite its simplicity, it remains a foundational architecture in computer vision.
Figure 4. The VGGNet architecture.
DenseNet-121 Model
DenseNet uses dense blocks as its core network structure; within each dense block, the spatial dimensions of the feature maps are kept fixed to support concatenation. Unlike traditional CNNs, this architecture yields L(L+1)/2 direct connections among L layers, thereby enhancing information flow and mitigating gradient vanishing. These direct links between all layers improve gradient flow and feature reuse. Each layer in DenseNet receives the feature maps of all preceding layers as input, defined mathematically as:
x_l = H_l([x_0, x_1, ..., x_{l−1}])   (21)
where [x0,x1,...,xl−1] represents the aggregation of all prior feature maps, and Hl represents a composite function comprising BN, ReLU, and Convolution (Conv) operations.
Transition layers, placed between dense blocks, implement down-sampling with convolution and pooling, decreasing the spatial resolution and the number of channels by half. The composite function for each dense layer is defined as:
H_l(x) = W_l ∗ ReLU(BN(x))   (22)
where ∗ denotes the convolution operation. This architecture enables DenseNet to achieve superior parameter efficiency and improved accuracy on large-scale image classification benchmarks .
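The concatenation pattern of Eq. (21) can be sketched with toy composite functions; the 1-channel H_l below is a stand-in for the real BN-ReLU-Conv block:

```python
import numpy as np

def num_direct_connections(L):
    # A dense block with L layers has L(L+1)/2 direct connections
    return L * (L + 1) // 2

def dense_block(x0, layers):
    # Eq. (21): each composite function H_l takes the concatenation of all
    # preceding feature maps along the channel axis as its input
    feats = [x0]
    for H in layers:
        feats.append(H(np.concatenate(feats, axis=-1)))
    return np.concatenate(feats, axis=-1)

# Toy H_l producing one "feature" channel summarizing its input
H = lambda x: x.mean(axis=-1, keepdims=True)
out = dense_block(np.ones((2, 2, 3)), [H, H, H])  # 3 input + 3 grown channels
```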
Figure 5. The DenseNet-121 architecture showing four dense blocks separated by transition layers. Each dense block consists of multiple composite layers (1×1 and 3×3 convolutions) with feature map concatenation.
3.5. Model Evaluation
Performance was measured using the following metrics: Accuracy, Precision, Recall, Specificity, and F1-Score, where TP, TN, FP, and FN denote true positives, true negatives, false positives, and false negatives, respectively.
Accuracy
Accuracy = (TP + TN) / (TP + TN + FP + FN)   (23)
Precision
Precision = TP / (TP + FP)   (24)
Recall (Sensitivity)
Recall (Sensitivity) = TP / (TP + FN)   (25)
Specificity
Specificity = TN / (TN + FP)   (26)
F1-Score
F1-Score = (2 × Precision × Recall) / (Precision + Recall)   (27)
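All five metrics follow directly from the four confusion-matrix counts; a self-contained NumPy sketch (function name is ours):

```python
import numpy as np

def metrics(y_true, y_pred):
    # Count the four confusion-matrix cells for binary labels (1 = TB-positive)
    tp = int(np.sum((y_true == 1) & (y_pred == 1)))
    tn = int(np.sum((y_true == 0) & (y_pred == 0)))
    fp = int(np.sum((y_true == 0) & (y_pred == 1)))
    fn = int(np.sum((y_true == 1) & (y_pred == 0)))
    acc = (tp + tn) / (tp + tn + fp + fn)              # Eq. (23)
    prec = tp / (tp + fp) if tp + fp else 0.0          # Eq. (24)
    rec = tp / (tp + fn) if tp + fn else 0.0           # Eq. (25)
    spec = tn / (tn + fp) if tn + fp else 0.0          # Eq. (26)
    f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0  # Eq. (27)
    return dict(accuracy=acc, precision=prec, recall=rec,
                specificity=spec, f1=f1)

m = metrics(np.array([1, 1, 1, 0, 0, 0]), np.array([1, 1, 0, 0, 0, 1]))
```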
4. Results and Discussion
4.1. Dataset and Preprocessing
The publicly available Shenzhen Tuberculosis Chest X-ray Masked Dataset was used. The dataset contained two class folders, normal and TB, with masked images; preprocessing and augmentation were implemented in Python/TensorFlow. The experiments were executed on a Kaggle GPU runtime using an NVIDIA Tesla P100 (16 GB). The baseline experimental run (CGAN training plus classifier training and evaluation) reported a wall-clock runtime of 39 minutes and 30 seconds.
Data were split into 70% training, 10% validation, and 20% testing subsets. Preprocessing included CLAHE, denoising (fastNlMeansDenoising), histogram equalization, and resizing to 128 × 128 as a trade-off between preserving diagnostically relevant structures, namely lung fields and lesion texture, and limiting GPU memory usage on the P100 hardware. The CGAN settings (latent dimension = 100, Adam LR = 0.0002 with beta1 = 0.5, batch size 32, 200 epochs, LeakyReLU activations with alpha = 0.2, and LSGAN (MSE) loss) were selected after preliminary tuning to achieve stable training behavior while avoiding prolonged training that risks mode collapse. For classifier training, the base networks were initially frozen and trained for 5 epochs with only the classification head, after which the top 30 layers were unfrozen for fine-tuning; data augmentation (rotation, shift, zoom, and horizontal flips) was also applied. The complete training script and saved outputs (accuracy and loss curves, FID/IS logs, ROC images, and confusion matrices) were preserved for reproducibility.
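The 70/10/20 stratified split with a fixed seed can be reproduced in spirit as follows; this is a NumPy sketch (the paper's own pipeline presumably used library utilities, and the function below is ours):

```python
import numpy as np

def stratified_split(labels, fracs=(0.7, 0.1, 0.2), seed=42):
    # Shuffle each class independently with a fixed seed, then carve out
    # train/validation/test index lists in the given proportions
    rng = np.random.default_rng(seed)
    train, val, test = [], [], []
    for c in np.unique(labels):
        idx = np.flatnonzero(labels == c)
        rng.shuffle(idx)
        n = len(idx)
        n_tr = int(round(fracs[0] * n))
        n_va = int(round(fracs[1] * n))
        train += idx[:n_tr].tolist()
        val += idx[n_tr:n_tr + n_va].tolist()
        test += idx[n_tr + n_va:].tolist()   # remainder goes to the test set
    return train, val, test

# Toy imbalanced label vector: 70 normal (0), 30 TB-positive (1)
labels = np.array([0] * 70 + [1] * 30)
tr, va, te = stratified_split(labels)
```

Because the split is per class, both classes contribute to each subset in the same 70/10/20 proportions, preserving the class ratio.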
Figure 6. Comparison of class distributions before and after GAN-based augmentation.
Figure 6 shows the dataset class distribution. The left panel shows the original distribution, where tuberculosis-negative cases significantly outnumber tuberculosis-positive cases. The right panel shows the balanced dataset after augmentation, where synthetic tuberculosis-positive samples generated by the GAN are added to achieve class balance.
4.2. Comparison of Classification Accuracy of the CNNs
Figure 7. CNN model performance comparison.
Figure 7 illustrates the impact of CGAN data augmentation on the test accuracy of MobileNetV3Small, DenseNet121, and VGG16.
4.3. Hyperparameter Setting
Table 1. Hyperparameters and Their Values.

Hyperparameter (CGAN Training)       | Value
Batch Size                           | 32
GAN Epochs                           | 200
Latent Dimension                     | 100
Generator LR (Adam)                  | 0.0002
Discriminator LR (Adam)              | 0.0002
Discriminator Dropout                | 0.25
Generator Activation                 | tanh (output), LeakyReLU (0.2)

Hyperparameter (Classifier Training) | Value
Classifier Epochs                    | 10
Classifier Batch Size                | 32
Classifier LR (Adam)                 | 0.0001
Classifier Hidden Layer              | Dense (128), ReLU
Regularization                       | Dropout, Early stopping, LR scheduler
Data Augmentation                    | Rotation (10), shift (0.05), zoom (0.05), horizontal flip

4.4. Classification Performance Metrics on Original vs CGAN-Augmented Datasets of the CNN Models
The improvements in MobileNetV3Small were small. Accuracy increased from 0.930 to 0.935, sensitivity remained nearly unchanged (0.841 vs. 0.838), and precision improved from 0.895 to 0.916. This indicates the model produced fewer false-positive predictions, though its ability to catch tuberculosis cases was essentially unchanged.
Table 2. MobileNetV3Small Classification Metrics: Original vs. Balanced Dataset.

Dataset                   | Accuracy | Precision | Recall (Sensitivity) | AUC   | F1-Score
MobileNetV3Small Original | 0.930    | 0.895     | 0.841                | 0.960 | 0.864
MobileNetV3Small Balanced | 0.935    | 0.916     | 0.838                | 0.967 | 0.870

Table 3. DenseNet121 Classification Metrics: Original vs. Balanced Dataset.

Dataset              | Accuracy | Precision | Recall (Sensitivity) | AUC   | F1-Score
DenseNet121 Original | 0.930    | 0.914     | 0.821                | 0.960 | 0.858
DenseNet121 Balanced | 0.946    | 0.942     | 0.859                | 0.969 | 0.894

For DenseNet121, accuracy improved from 0.930 to 0.946 and sensitivity increased from 0.821 to 0.859. Precision and F1-score also improved. This indicates a decrease in the number of missed tuberculosis cases, strengthening the model's diagnostic robustness.
Table 4. VGG16 Classification Metrics: Original vs. Balanced Dataset.

Dataset        | Accuracy | Precision | Recall (Sensitivity) | AUC   | F1-Score
VGG16 Original | 0.963    | 0.941     | 0.924                | 0.981 | 0.932
VGG16 Balanced | 0.968    | 0.964     | 0.918                | 0.972 | 0.939

VGG16 was very effective at extracting tuberculosis features from CXR images, and its baseline performance was strong. Using the balanced dataset improved accuracy from 0.963 to 0.968 and precision from 0.941 to 0.964, reducing false-positive predictions. Sensitivity decreased slightly from 0.924 to 0.918.
4.5. CNN Classifier Performance
Table 5. MobileNetV3Small Original Dataset Performance.

MobileNetV3 Original Dataset Report
             | precision | recall | f1-score | support
Normal       | 0.94      | 0.97   | 0.96     | 700
TB           | 0.85      | 0.71   | 0.77     | 140
accuracy     |           |        | 0.93     | 840
macro avg    | 0.89      | 0.84   | 0.86     | 840
weighted avg | 0.93      | 0.93   | 0.93     | 840

On the original dataset, MobileNetV3Small achieved an overall accuracy of 93%. The Normal class, with 700 samples, was classified with high precision (0.94), recall (0.97), and F1-score (0.96), indicating that the model was good at detecting normal cases. However, its performance on the TB class was lower: precision 0.85, recall 0.71, and an F1-score of 0.77, showing the model was less accurate at predicting TB cases and often missed them. Missed tuberculosis cases (false negatives) are a concern for clinical use. The macro-averaged F1-score was 0.86, while the weighted F1-score was 0.93; the higher weighted score reflects the larger Normal class.
Table 6. MobileNetV3Small Balanced Dataset Performance.

MobileNetV3 Balanced Dataset Report
             | precision | recall | f1-score | support
Normal       | 0.94      | 0.98   | 0.96     | 700
TB           | 0.89      | 0.69   | 0.78     | 140
accuracy     |           |        | 0.93     | 840
macro avg    | 0.92      | 0.84   | 0.87     | 840
weighted avg | 0.93      | 0.93   | 0.93     | 840

On the balanced dataset, the overall accuracy remained at 93%. The Normal class still performed well (precision 0.94, recall 0.98, F1 0.96). For the tuberculosis class, precision improved to 0.89, indicating fewer false positives; recall decreased slightly to 0.69, and the F1-score increased to 0.78. The macro-averaged F1-score increased to 0.87, and the weighted F1-score remained at 0.93. Balancing the data improved tuberculosis precision but did not fully solve the low recall; further work, such as better data augmentation or model tuning, is needed to reduce missed tuberculosis cases.
Table 7. DenseNet121 Original Dataset Performance.

DenseNet121 Original Dataset Report
             | precision | recall | f1-score | support
Normal       | 0.93      | 0.98   | 0.96     | 700
TB           | 0.89      | 0.66   | 0.76     | 140
accuracy     |           |        | 0.93     | 840
macro avg    | 0.91      | 0.82   | 0.86     | 840
weighted avg | 0.93      | 0.93   | 0.93     | 840

On the original dataset, DenseNet121 achieved an overall accuracy of 93%. The Normal class had precision 0.93, recall 0.98, and F1 0.96. For the tuberculosis class, precision was 0.89, recall dropped to 0.66, and the F1-score was 0.76, showing the model missed some TB cases despite high precision. The macro-averaged F1-score was 0.86; the weighted F1-score was 0.93, reflecting the dominance of the Normal class.
Table 8. DenseNet121 Balanced Dataset Performance.

DenseNet121 Balanced Dataset Report
             | precision | recall | f1-score | support
Normal       | 0.95      | 0.99   | 0.97     | 700
TB           | 0.94      | 0.73   | 0.82     | 140
accuracy     |           |        | 0.95     | 840
macro avg    | 0.94      | 0.86   | 0.89     | 840
weighted avg | 0.95      | 0.95   | 0.94     | 840

When trained on the balanced dataset, DenseNet121 achieved an improved overall accuracy of 95%. The Normal class had precision 0.95, recall 0.99, and F1 0.97. The TB class saw better results: precision 0.94, recall 0.73, and F1 0.82. The macro F1 increased to 0.89, and the weighted F1 rose to 0.94. Balancing improved TB detection, especially precision, but TB recall still needs improvement.
Table 9. VGG16 Original Dataset Performance.

VGG16 Original Dataset Report
             | precision | recall | f1-score | support
Normal       | 0.97      | 0.98   | 0.98     | 700
TB           | 0.91      | 0.86   | 0.89     | 140
accuracy     |           |        | 0.96     | 840
macro avg    | 0.94      | 0.92   | 0.93     | 840
weighted avg | 0.96      | 0.96   | 0.96     | 840

The VGG16 model achieved an accuracy of 96% on the test dataset. For the Normal class, the metrics were very high: precision 0.97, recall 0.98, and F1-score 0.98. For tuberculosis, the model achieved a precision of 0.91, recall of 0.86, and F1-score of 0.89; precision was high, but the lower recall means some TB cases were missed. The macro-averaged scores were precision 0.94, recall 0.92, and F1 0.93, and the weighted averages were 0.96, showing good overall balance. Improving TB detection remains important.
Table 10. VGG16 Balanced Dataset Performance.

VGG16 Balanced Dataset Report
             | precision | recall | f1-score | support
Normal       | 0.97      | 0.99   | 0.98     | 700
TB           | 0.96      | 0.84   | 0.90     | 140
accuracy     |           |        | 0.97     | 840
macro avg    | 0.96      | 0.92   | 0.94     | 840
weighted avg | 0.97      | 0.97   | 0.97     | 840

On the balanced dataset, VGG16 accuracy increased to 97%. The Normal class stayed near perfect, with precision 0.97, recall 0.99, and F1 0.98. The TB class improved in precision (0.96) and F1-score (0.90), although recall dipped slightly to 0.84. The macro-averaged F1-score increased to 0.94, and the weighted F1-score remained stable at 0.97. These improvements show that balanced data helped detect TB better and reduced false positives, enhancing the robustness of the VGG16 model.
4.6. Training and Validation Binary Accuracy and Loss of Fine-tuned CNN Models
For DenseNet121, dataset balancing improved accuracy from 93.0% to 94.6% and recall from 0.821 to 0.859, indicating fewer missed tuberculosis-positive cases and better diagnostic robustness. For VGG16, balancing slightly improved accuracy from 0.963 to 0.968 and precision from 0.941 to 0.964, reducing false positives while maintaining high sensitivity. For MobileNetV3Small, improvements were modest: accuracy increased slightly from 0.930 to 0.935 and precision from 0.895 to 0.916, while sensitivity remained nearly unchanged, yielding more reliable positive predictions.
Figure 8. DenseNet121 Training and Validation Accuracy and Loss curves. Top row: Original dataset; bottom row: Balanced dataset. Accuracy (left) and Loss (right) are shown side by side.
Figure 9. VGG16 Training and Validation Accuracy and Loss curves. Top row: Original dataset; bottom row: Balanced dataset. Accuracy (left) and Loss (right) are shown side by side.
Figure 10. MobileNetV3Small Training and Validation Accuracy and Loss curves. Top row: Original dataset; bottom row: Balanced dataset. Accuracy (left) and Loss (right) are shown side by side.
4.7. Confusion Matrix (CM) of the CNN Models
The confusion matrix is a performance summary that provides more insight into the achieved testing accuracy than a single figure. Figures 11, 12, and 13 illustrate the binary confusion matrices of the three CNN models on both the original and CGAN-augmented (balanced) datasets.
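Matrices of this form can be produced with scikit-learn; the sketch below uses toy labels (not the study's predictions) to show the layout used in Figures 11-13, with true classes on the rows and predicted classes on the columns:

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Toy ground-truth and predicted labels (0 = Normal, 1 = TB);
# in the study these would come from the held-out test split.
y_true = np.array([0, 0, 0, 0, 1, 1, 1, 0])
y_pred = np.array([0, 0, 1, 0, 1, 0, 1, 0])

# Layout: [[TN, FP],
#          [FN, TP]]
cm = confusion_matrix(y_true, y_pred, labels=[0, 1])
tn, fp, fn, tp = cm.ravel()
print(cm)
```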
4.7.1. DenseNet Model Confusion Matrix
Figure 11. Confusion Matrices for DenseNet121: Original vs. Balanced Data.
The number of true positives (TP) increased from 92 to 102, indicating that the model correctly identified more tuberculosis-positive cases after balancing. False negatives (FN) decreased from 48 to 38, meaning fewer cases were missed. True negatives (TN) rose slightly from 689 to 693, improving identification of tuberculosis-negative cases, and false positives (FP) decreased from 11 to 7, so fewer healthy cases were incorrectly flagged as positive. Overall, balancing the dataset improved both sensitivity and precision by reducing false negatives and false positives, resulting in a more robust model.
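The standard per-class rates follow directly from these four counts. A minimal helper (a sketch, not the study's code), evaluated on the balanced DenseNet121 counts quoted above, reproduces the 94.6% test accuracy reported for that run:

```python
def rates(tp, fn, tn, fp):
    """Derive standard binary-classification metrics from confusion-matrix counts."""
    total = tp + fn + tn + fp
    accuracy = (tp + tn) / total
    precision = tp / (tp + fp)           # positive predictive value
    sensitivity = tp / (tp + fn)         # recall on the TB class
    specificity = tn / (tn + fp)         # recall on the Normal class
    return accuracy, precision, sensitivity, specificity

# Balanced-dataset counts for DenseNet121 quoted above
acc, prec, sens, spec = rates(tp=102, fn=38, tn=693, fp=7)
print(f"accuracy={acc:.3f} precision={prec:.3f} "
      f"sensitivity={sens:.3f} specificity={spec:.3f}")
```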
4.7.2. VGG16 Model Confusion Matrix
TP declined slightly from 99 to 97 and FN rose from 41 to 43, so a few more tuberculosis-positive cases were missed. TN increased from 682 to 688 and FP decreased from 18 to 12, making the model better at ruling out non-tuberculosis cases, at the cost of a minor trade-off in sensitivity. Balancing mainly helped the model classify negative cases accurately and reduced false alarms, which improved overall accuracy and precision.
4.7.3. MobileNetV3s Model Confusion Matrix
The model correctly predicted 'Normal' 682 times and 'tuberculosis' 99 times, indicating a high number of true negatives and true positives, respectively. There were 41 false negatives, where 'tuberculosis' was incorrectly predicted as 'Normal', and 18 false positives, where 'Normal' cases were incorrectly classified as 'tuberculosis'. The balancing process improved the model's ability to identify true negatives, which increased to 688, while true positives decreased from 99 to 97 and false negatives rose from 41 to 43. A low false-negative rate is crucial in medical diagnostics, as it minimizes the risk of missed tuberculosis diagnoses.
Figure 12. Confusion Matrices for VGG16: Original vs. Balanced Data.
Figure 13. Confusion Matrices for MobileNetV3 Small: Original vs. Balanced Data.
4.8. Analysis of Receiver Operating Characteristic (ROC) Curves for the CNN Models
4.8.1. DenseNet ROC Curve
Figure 14. ROC Curves for DenseNet121: Original vs. Balanced Data.
The ROC curve climbs sharply above the line of no discrimination, showing the model's high sensitivity and specificity. This performance suggests that the DenseNet121 model distinguishes very well between positive and negative cases, maintaining a high true positive rate with minimal false positives.
4.8.2. VGG16 ROC Curve
Figure 15. ROC Curves for VGG16: Original vs. Balanced Data.
The high AUC value indicates that the VGG16 model separates the positive and negative classes well. Its true positive rate stays strong across different thresholds while the false positive rate remains low, demonstrating consistent classification performance.
4.8.3. MobileNetV3s ROC Curve
Figure 16. ROC Curves for MobileNetV3s: Original vs. Balanced Data.
Figure 16 shows that the MobileNetV3Small ROC curve has an AUC of 0.96, indicating accurate class separation. The increase to AUC = 0.967 after balancing the dataset underscores the model's ability to detect genuine positives while keeping false positives low, supporting its use as a reliable lightweight classifier in this high-stakes task.
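The AUC values quoted for Figures 14-16 summarize the ranking quality of the models' predicted probabilities. A minimal sketch of how such curves and AUC values are computed with scikit-learn, using toy scores rather than the study's outputs:

```python
import numpy as np
from sklearn.metrics import roc_curve, roc_auc_score

# Toy predicted TB probabilities (hypothetical values, not the study's outputs);
# in the experiments these would be the classifiers' sigmoid outputs on the test split.
y_true = np.array([0, 0, 0, 1, 1, 0, 1, 1])
y_score = np.array([0.1, 0.3, 0.35, 0.8, 0.25, 0.2, 0.9, 0.4])

# roc_curve sweeps the decision threshold; roc_auc_score integrates the curve.
fpr, tpr, thresholds = roc_curve(y_true, y_score)
auc = roc_auc_score(y_true, y_score)
print(f"AUC = {auc:.3f}")
```

Plotting `tpr` against `fpr` for each model on the original and balanced test sets yields curves of the kind shown in Figures 14-16.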
5. Conclusion
The models showed improved classification performance with CGAN-based augmentation, particularly in recall and F1-score. DenseNet121 achieved the greatest sensitivity improvement (0.821 to 0.859), reducing the likelihood of missed tuberculosis cases. Across the models, TB-positive recall remained modest (≈0.84 in several models), indicating that despite higher precision some positive cases are still missed, an issue with direct clinical implications; mitigation strategies such as ensemble methods, attention-based architectures, and additional cross-dataset fine-tuning are discussed in the recommendations. VGG16 achieved the highest overall performance, with an accuracy of 0.968, precision of 0.964, and recall of 0.918, while MobileNetV3Small excelled mainly in precision. A plausible explanation is that VGG16's deeper stack of convolutional blocks and relatively large parameter count can capture patterns that correlate with tuberculosis signs on CXRs, whereas MobileNetV3Small is optimized for parameter efficiency and latency rather than maximal discriminative power on fine medical features.
The performance metrics analysis reveals that architectural design and data balancing collectively enhance model performance. Dense connectivity, uniform depth, and efficient convolutional operations each contribute distinct advantages in feature learning, ultimately improving the sensitivity and specificity of the CNN classifiers.
These findings confirm that CGAN-based augmentation effectively addresses class imbalance and limited medical data. This enhances model reliability in tuberculosis detection. The study demonstrated that conditioning CGANs can generate realistic and class-specific CXR images that improve the training process of deep learning models.
This study demonstrates that integrating CGAN-based data augmentation with fine-tuned DTL architectures consistently improves tuberculosis detection performance. DenseNet121, trained on CGAN-augmented data, achieved the best balance between accuracy and interpretability. Future research should explore ensemble fusion of CNNs and the application of CGANs to multimodal diagnostic datasets.
Future research should focus on hybrid architectures that combine the strengths of DenseNet121 and MobileNetV3Small; ensemble or attention-based fusion models could further enhance diagnostic robustness. Experiments should be repeated across multiple seeds, with means, standard deviations, and statistical tests reported. Further work should also evaluate the models on external datasets, explore segmentation-integrated networks to boost sensitivity on positive cases, consider human-in-the-loop evaluation for clinical validation, and fine-tune the models on domain-specific CXR datasets to enhance sensitivity to subtle disease-related variations.
Abbreviations
CGAN: Conditional Generative Adversarial Network
WHO: World Health Organization
CT: Computed Tomography
CXR: Chest X-ray
GAN: Generative Adversarial Network
CNN: Convolutional Neural Network
DCNN: Deep Convolutional Neural Network
AI: Artificial Intelligence
DL: Deep Learning
DTL: Deep Transfer Learning
ML: Machine Learning
MRI: Magnetic Resonance Imaging
GPU: Graphics Processing Unit
TP: True Positive
TN: True Negative
FP: False Positive
FN: False Negative
AUC: Area Under the Curve
ReLU: Rectified Linear Unit
CLAHE: Contrast Limited Adaptive Histogram Equalization
ROC: Receiver Operating Characteristic
CM: Confusion Matrix

Acknowledgments
I express my heartfelt gratitude to all individuals who contributed to the development and success of this research project. I am profoundly thankful to Almighty God for the gift of life, strength, and good health throughout the course of this study.
I extend my sincere appreciation to my supervisors, Dr. Herbert Imboga, Dr. Susan Mwelu, and Prof. Anthony Waititu, for their invaluable guidance, mentorship, and consistent support. Their expertise, commitment to teaching, and dedication to academic research have been a significant source of inspiration. It has been an honor to undertake this research under their expert supervision and to learn from their immense experience and insight.
I am also grateful to the faculty and staff of the Department of Statistics and Actuarial Science at Jomo Kenyatta University of Agriculture and Technology for providing essential facilities and a supportive research environment. My appreciation extends to my colleagues and fellow researchers for their constructive feedback, collaboration, and insightful discussions, which significantly enhanced this work.
Lastly, I am deeply grateful to my family and friends for their steadfast support, patience, and encouragement. Their confidence in me has provided ongoing motivation and strength throughout this journey.
Author Contributions
Teresia Waithera Kamau: Writing – review & editing
Anthony Waititu: Supervision
Herbert Imboga: Supervision
Susan Mwelu: Supervision
Conflicts of Interest
The authors declare no conflicts of interest.
Cite This Article
Kamau, T. W., Waititu, A., Imboga, H., Mwelu, S. (2025). Enhancing Early Tuberculosis Detection Using CGAN Augmentation and Deep Transfer Learning Models. International Journal of Data Science and Analysis, 11(6), 186-204. https://doi.org/10.11648/j.ijdsa.20251106.14
