1. A Generative Adversarial Network Structure for Learning with Small Numerical Data Sets
- Author
-
Kuan-Cheng Huang, Szu-Chou Chen, Der-Chiang Li, Yao-San Lin, and Centre for Chinese Language and Culture
- Subjects
Technology ,Computer science ,QH301-705.5 ,QC1-999 ,computer.software_genre ,Standard deviation ,Virtual Sample ,Language [Humanities] ,Small Datasets ,General Materials Science ,Limit (mathematics) ,Biology (General) ,Instrumentation ,QD1-999 ,Fluid Flow and Transfer Processes ,Structure (mathematical logic) ,Process Chemistry and Technology ,Small number ,Physics ,generative adversarial network ,General Engineering ,Volume (computing) ,Engineering (General). Civil engineering (General) ,Computer Science Applications ,Identification (information) ,Chemistry ,Data mining ,virtual sample ,TA1-2040 ,Generative adversarial network ,computer ,Generative grammar ,small datasets - Abstract
In recent years, generative adversarial networks (GANs) have been proposed to generate simulated images, and some works of literature have applied GAN to the analysis of numerical data in many fields, such as the prediction of building energy consumption and the prediction and identification of liver cancer stages. However, these studies are based on sufficient data volume. In the current era of globalization, the demand for rapid decision‐making is increasing, but the data available in a short period of time is scarce. As a result, machine learning may not provide precise results. Obtaining more information from a small number of samples has become an important issue. Therefore, this study aimed to modify the generative adversarial network structure for learning with small numerical datasets, starting with the Wasserstein GAN (WGAN) as the GAN architecture, and using mega‐trend‐diffusion (MTD) to limit the bound of virtual samples that the GAN generates. The model verification of our proposed structure was conducted with two datasets in the UC Irvine Machine Learning Repository, and the performance was evaluated using three criteria: accuracy, standard deviation, and p‐value. The experiment result shows that, using this improved GAN architecture (WGAN_MTD), small sample data can also be used to generate virtual samples that are similar to real samples through GAN. Published version
- Published
- 2021