使用NGN算法改進(jìn)不平衡數(shù)值數(shù)據(jù)的研究

打開文本圖片集
Research on improving imbalanced numerical data using NGN algorithm
Xing Changzheng,Zheng Xin,LiangJunfeng (CollegeofElectronic& Information Engineering,Liaoning Technical University,HuludaoLioning1251o5,China)
Abstract:When minorityclassamplesare scarce,traditional oversampling methods struggleto increasethesamplecount. This paper introduced a NGN algorithm that synthesized new data byadding generator-generated dataas noise to theoriginal minorityclassamplesuntilbalance wasachieved.Thegenerator employedafour-layerfullyconnectednetworkandintegrated low-structureandhigh-structurefeaturegenerationtechniquestoenhancethequalityanddiversityofthegenerateddata.For verylimited minorityclassamples,NGNgeneratednewsamples,mergedthemwiththeoriginal minorityclassdata,and performedclustering to achieve balance withinclusters while minimizing the impactof noise.The study evaluated NGNon6unbalanceddatasets,applied4oversamplingalgorithms tobalancethedatasets,andclasifiedthebalanceddatasetsusing4classificationmethods.TheexperimentalresultsdemonstratethatNGNefectivelyincreasesthenumberof minorityclasssamples, enhances the model’sability to learn minority classfeatures,and significantly improves classification performance.
Key words:numerical generator network(NGN);generator;noise;extremely scarce minority class;balance
0 引言
數(shù)據(jù)不平衡的問(wèn)題源自于樣本分布的不均衡。(剩余15909字)