基于属性学习的零样本图像分类研究

项目来源

国家自然科学基金(NSFC)

项目主持人

程玉虎

项目受资助机构

中国矿业大学

立项年度

2017

立项时间

未公开

项目编号

61772532

项目级别

国家级

研究期限

未知 / 未知

受资助金额

62.00万元

学科

信息科学-人工智能-模式识别与数据挖掘

学科代码

F-F06-F0605

基金类别

面上项目

关键词

属性学习 ; 深度神经网络 ; 领域适应 ; 零样本图像分类 ; 多任务学习 ; Zero-shot image classification ; Attribute learning ; Deep neural network ; Multi-task learning ; Domain adaptation

参与者

潘杰;张倩;刘健;孔毅;吕恩辉;李冬青;顾扬;宝阿春

参与机构

中国矿业大学;徐州医科大学;中国科学院国家空间科学中心

项目标书摘要:训练数据和测试数据的分布不同使得零样本图像分类成为一个非常困难的学习任务。本项目拟通过对所有对象类共同的属性描述,将以前学到的各类属性知识迁移到新的对象类上,从而有效解决零样本学习场景下的图像分类问题。具体包括:利用深度神经网络构建一个深度属性学习模型,同时实现图像的深层次特征表达及属性分类器训练,以回避人工选取特征的主观性和解决浅层学习方法导致的属性分类器泛化能力弱的问题;借鉴多任务学习思想,利用共享的图像底层特征来协同学习属性分类器(属性排序函数)和图像分类器,以同时提升属性预测(相对属性的排序)精度和图像分类精度;利用领域适应学习技术,从不同层次(单源、多源)、不同角度(分类器适应、特征表示适应)来解决属性学习的领域偏移问题,以使得在可见类图像上训练得到的属性分类器能够准确预测未见类图像的属性。研究成果不仅可以丰富和发展现有的机器学习理论,而且可以推广应用到模式识别的诸多相关领域。

Application Abstract: Because the data distributions between training and testing samples are different,zero-shot image classification becomes a very difficult learning task.In this project,we aim to effectively solve the image classification problem under the zero-shot learning scenario by describing all object classes with common attributes and transferring the attribute knowledge from known classes to new classes.The main contents in our research include the following aspects.A deep attribute learning model based on deep neural networks is constructed to simultaneously realize the deep-level feature representation of images and the training of attribute classifier.The designed deep attribute learning model can not only avoid the subjectivity resulted from manually select features,but also solve the poor generalization problem of attribute classifier due to shallow learning methods.By borrowing the idea of multi-task learning,the attribute classifier(or the attribute ranking functions)and the image classifier are collaboratively learned by using the shared low-level image features.Thus,the attribute prediction(or the relative attribute ranking)accuracy and the image classification accuracy can be simultaneously improved.By using the domain adaptation learning technique,the domain shift problem of attribute learning is solved from different levels(single-source and multi-source)and different perspectives(classifier adaptation and feature representation adaptation).Thus,the obtained attribute classifier that is trained with seen images can accurately predict the attributes of unseen images.The research fruits not only can enrich and develop the existing machine learning theory,but also can be extended to many pattern recognition-related fields.

项目受资助省

江苏省

项目结题报告(全文)

由于标记样本的缺乏,已标记类别不可能涵盖所有的对象类,这种零样本学习问题场景广泛存在于计算机视觉、图像分类、人脸和语音识别等领。本项目利用深/宽度学习、多任务学习和迁移学习等技术,通过对所有对象类共同的属性描述,将以前学到的各类属性知识迁移到新的对象类上,从而有效解决零样本学习场景下的图像分类问题,主要从下述2个方面开展研究工作:1深度网络能够从无标签的原始图像中自动提取出具有良好描述能力的图像特征。相较于深度网络,宽度学习系统具有结构简单、易于与其他模型结合等优点。为此,项目组针对轻量型深度网络、新型宽度网络构造等相关问题展开了研究,提出了:基于自注意力机制的生成对抗网络、自适应多尺度图卷积网络、多路径集成卷积网络、权重共享多级多尺度集成卷积网络、基于反卷积特征提取的深度卷积网络、基于监督超图和样本扩充的卷积网络、领域适配CycleGan网络、领域适应宽度网络、基于块对角约束的多阶段卷积宽度网络;2利用构造的深度和宽度网络,在零样本图像分类方面,提出了:基于深度加权属性预测的零样本学习、基于自适应多核校准的多源域属性适应学习、基于图正则化特征选择的零样本学习、基于多任务扩展属性组的零样本学习、基于多任务混合属性关系与属性固有特征的零样本学习、基于特征原型的零样本学习、基于耦合自编码与高斯混合模型的零样本学习、基于关系有向图正则化的属性三因子分解模型、基于属性核矩阵的生成特征领域自适应模型、基于混合属性的零样本学习、基于加权重构混合属性组的零样本图像分类模型、基于增强属性—特征的宽度属性预测模型。通过研究,项目组取得的成果为:在科学出版社出版专著1部;在国际国内学术期刊上发表/录用论文43篇;授权发明专利7件;培养博士研究生、硕士研究生8名;获江苏省优秀硕士学位论文奖1项。

  • 排序方式:
  • 1
  • /
  • 1.Prioritised experience replay based on sample optimisation

    • 关键词:
    • ;Agent learning;Average values;Mean values;Optimisations;Sampling method;Training process;Training speed
    • Wang, Xuesong;Xiang, Haopeng;Cheng, Yuhu;Yu, Qiang
    • 《3rd Asian Conference on Artificial Intelligence Technology, ACAIT 2019》
    • 2020年
    • July 5, 2019 - July 7, 2019
    • Chongqing, China
    • 会议

    The sample-based prioritised experience replay proposed in this study is aimed at how to select samples to the experience replay, which improves the training speed and increases the reward return. In the traditional deep Q-networks (DQNs), it is subjected to random pickup of samples into the experience replay. However, the effect of each sample is different for the training process of agent. A better sampling method will make the agent training more effective. Therefore, when selecting a sample to the experience replay, the authors first allow the agent to learn randomly through the sample optimisation network, and take the average value returned after each study, so that the mean value is used as a threshold for selecting samples to the experience replay. Second, on the basis of sample optimisation, the authors increase the priority update and use the idea of reward-shaping to give additional reward values to the returns of certain samples, which speeds up the agent training. Compared with traditional DQN and the prioritised experience replay DQN, this study uses OpenAI Gym as platform to improve agent learning efficiency.
    © 2020 Institution of Engineering and Technology. All rights reserved.

    ...
  • 2.Low Rank Subspace Clustering via Discrete Constraint and Hypergraph Regularization for Tumor Molecular Pattern Discovery

    • 关键词:
    • Low rank subspace clustering; gene expression data; discrete constraint;hypergraph regularization; Schatten p-norm; tumor clustering;NONNEGATIVE MATRIX FACTORIZATION; GENE-EXPRESSION; DIMENSIONALITYREDUCTION
    • Liu, Jian;Cheng, Yuhu;Wang, Xuesong;Cui, Xiaoluo;Kong, Yi;Du, Junping
    • 《12th International Conference on Intelligent Computing 》
    • 2018年
    • AUG 02-05, 2016
    • Lanzhou, PEOPLES R CHINA
    • 会议

    Tumor clustering is a powerful approach for cancer class discovery which is crucial to the effective treatment of cancer. Many traditional clustering methods such as NMF-based models, have been widely used to identify tumors. However, they cannot achieve satisfactory results. Recently, subspace clustering approaches have been proposed to improve the performance by dividing the original space into multiple low-dimensional subspaces. Among them, low rank representation is becoming a popular approach to attain subspace clustering. In this paper, we propose a novel Low Rank Subspace Clustering model via Discrete Constraint and Hypergraph Regularization (DHLRS). The proposed method learns the cluster indicators directly by using discrete constraint, which makes the clustering task simple. For each subspace, we adopt Schatten p-norm to better approximate the low rank constraint. Moreover, Hypergraph Regularization is adopted to infer the complex relationship between genes and intrinsic geometrical structure of gene expression data in each subspace. Finally, the molecular pattern of tumor gene expression data sets is discovered according to the optimized cluster indicators. Experiments on both synthetic data and real tumor gene expression data sets prove the effectiveness of proposed DHLRS.

    ...
  • 3.Spectral-Spatial Feature Extraction for HSI Classification Based on Supervised Hypergraph and Sample Expanded CNN

    • 关键词:
    • Convolutional neural network (CNN); feature extraction (FE);hyperspectral image (HSI); hypergraph; sample expansion;DIMENSIONALITY REDUCTION; GRAPH; TEMPERATURE; SPARSE
    • Kong, Yi;Wang, Xuesong;Cheng, Yuhu
    • 《International-Association-for-Pattern-Recognition TC 7 Workshopon Pattern Recognition in Remote Sensing 》
    • 2018年
    • AUG, 2018
    • Beijing, PEOPLES R CHINA
    • 会议

    Hyperspectral image (HSI) classification remains a challenging problem due to unique characteristics of HSI data (such as numerous bands and strong correlations in the spectral and spatial domains) and small sample size. To address such concerns, we propose a novel spectral-spatial feature extraction method for HSI classification by employing graph embedding and deep learning (DL) models. Since the conventional graph cannot capture the complex manifold relationship of HSI data, and there exist the observations of within-class variation as well as the similarity between different classes in the spectral domain, we construct the supervised within-class/between-class hypergraph (SWBH) to extract the spectral features ofHSI. Since it is difficult for DL models to learn representative features for HSI data when the labeled training samples are limited, we propose the random zero settings to newly generate a large amount of labeled HSI samples for the training of convolutional neural network (CNN). The designed sample expanded CNN (SECNN) is used to extract the HSI spatial features. Thus, the spectral-spatial features of HSI can be learned by integrating the features extracted from SWBH and SECNN, respectively. Experiments on three real HSI datasets demonstrate higher classification accuracy of the proposed SWBH-SECNN method.

    ...
  • 排序方式:
  • 1
  • /