计算机科学
人工智能
二元分类
二进制数
机器学习
支持向量机
数学
算术
作者
Srijita Bandopadhyay,Srimonti Dutta,Imran Haider,Bhavaraju Anuraag,Jerry Zhu,Saad Ahmed Bazaz
标识
DOI:10.1109/iatmsi60426.2024.10503258
摘要
This paper delves into the challenges of binary classification using imbalanced datasets, particularly when instances of interest are infrequent. It explores a comprehensive approach that integrates Synthetic Minority Over-sampling Technique (SMOTE), Generative Adversarial Networks (GANs), and Variational Autoencoders (VAEs) to enhance classification outcomes. Traditional classification models tend to favor the majority class, while the impact of imbalanced misclassification costs is often overlooked. The integration of SMOTE, GANs, and VAEs in binary classification, or SMOTE-GAN-VAE, addresses these challenges by generating synthetic instances, refining data representations, and capturing latent features. To evaluate the effectiveness of various data generation methods, a credit card fraud dataset is used. The performance metrics considered include F0.5-score, F1-score, and F2-score, which account for both precision and recall. The results indicate that SMOTE-GAN-VAE outperforms individual methods, such as SMOTE, GANs, and VAEs, demonstrating its potential to enhance data representation and classification accuracy, and outperformed the β- VAE filtered approach employed in previous literature.
科研通智能强力驱动
Strongly Powered by AbleSci AI