特異データを意識した学習データの人工合成
项目来源
项目主持人
项目受资助机构
项目编号
立项年度
立项时间
研究期限
项目级别
受资助金额
学科
学科代码
基金类别
关键词
参与者
参与机构
1.Privacy-Aware Table Data Generation by Adversarial Gradient Boosting Decision Tree
- 关键词:
- adversarial learning; decision trees; tree ensembles; privacy evaluation;K-ANONYMITY; MODEL
- Jiang, Shuai;Iwata, Naoto;Kamei, Sayaka;Alam, Kazi Md. Rokibul;Morimoto, Yasuhiko
- 《MATHEMATICS》
- 2025年
- 13卷
- 15期
- 期刊
Privacy preservation poses significant challenges in third-party data sharing, particularly when handling table data containing personal information such as demographic and behavioral records. Synthetic table data generation has emerged as a promising solution to enable data analysis while mitigating privacy risks. While Generative Adversarial Networks (GANs) are widely used for this purpose, they exhibit limitations in modeling table data due to challenges in handling mixed data types (numerical/categorical), non-Gaussian distributions, and imbalanced variables. To address these limitations, this study proposes a novel adversarial learning framework integrating gradient boosting trees for synthesizing table data, called Adversarial Gradient Boosting Decision Tree (AGBDT). Experimental evaluations on several datasets demonstrate that our method outperforms representative baseline models regarding statistical similarity and machine learning utility. Furthermore, we introduce a privacy-aware adaptation of the framework by incorporating k-anonymization constraints, effectively reducing overfitting to source data while maintaining practical usability. The results validate the balance between data utility and privacy preservation achieved by our approach.
...
