简体   繁体   English

使用 Keras 深度学习的不平衡数据集

[英]imbalanced dataset with Keras deep learning

I have a datasets that looks like this: Training (Class 0: 471, Class 1: 986) Testing (Class 0: 177, Class 1: 246. I split my data as 80% for training and 20% for validation. I know that is an imbalanced dataset, and I have tried Class_weight but the problem remains. I have retrained my Baseline CNN and I always have a result like as attached in the picture. Could someone help me?我有一个看起来像这样的数据集:训练(0 类:471,1 类:986)测试(0 类:177,1 类:246。我将数据分成 80% 用于训练,20% 用于验证。我知道这是一个不平衡的数据集,我试过 Class_weight 但问题仍然存在。我重新训练了我的基线 CNN,我总是得到如图所示的结果。有人可以帮助我吗? 我训练后的结果

I faced a similar problem while classifying events in 5 imbalanced categories.我在将事件分为 5 个不平衡类别时遇到了类似的问题。 I found this loss function that implement a weighted categorical cross-entropy: https://gist.github.com/noparade/aaa8584e6e90ad64936e333e4e08ca5f Combined with the Nadam optimizer, it allowed me to get over 95% true positive for all my categories.我发现这个损失函数实现了加权分类交叉熵: https : //gist.github.com/noparade/aaa8584e6e90ad64936e333e4e08ca5f结合 Nadam 优化器,它让我在所有类别中获得超过 95% 的真阳性。

声明:本站的技术帖子网页,遵循CC BY-SA 4.0协议,如果您需要转载,请注明本站网址或者原文地址。任何问题请咨询:yoyou2525@163.com.

 
粤ICP备18138465号  © 2020-2024 STACKOOM.COM