基于脑启发的PolSAR图像深层协同表示学习与分类

项目来源

国家自然科学基金(NSFC)

项目主持人

侯彪

项目受资助机构

西安电子科技大学

项目编号

61671350

立项年度

2016

立项时间

未公开

研究期限

未知 / 未知

项目级别

国家级

受资助金额

58.00万元

学科

信息科学-电子学与信息系统-信息获取与处理

学科代码

F-F01-F0113

基金类别

面上项目

关键词

深度学习 ; 图像分割 ; 脑启发 ; 表示学习 ; 图像分类 ; image segmentation ; deep learning ; Brain-inspired ; representation learning ; image classification

参与者

唐旭；任博；文载道；吴倩；王建龙；任仲乐；牟树根；张广

参与机构

西北工业大学；中国人民解放军空军工程大学

项目标书摘要：多模态特征学习与协同表示和鲁棒的分类器设计是极化SAR分类中重要的两个核心问题。然而大规模极化SAR的数据量中通常仅有少量有效的类别标签，给这两个问题的有效解决带来了困难。本项目受到大脑感知、学习表示与决策融合过程的启发，构造一个多模块的深层神经网络来逐层解决以上问题。首先，本项目构造两个并行的低级感知皮层网络实现无监督的学习PolSAR图像特征与极化特征；其次，在上述两个低级网络之上，模拟构造高级前额皮层网络将底层多源特征进行协同表示与融合；最后，在高层网络中加入语义先验知识以及少量的监督信息，在贝叶斯推理的框架下实现决策分类，并用RADARSAT2和PALSAR的全极化SAR数据验证其有效性。期望在提高极化SAR图像分类效果的同时，进一步完善和促进深度学习等理论的研究和应用。成果在本领域重要期刊和会议上发表论文15-20篇，申报国家发明专利6-8项，联合培养博士、硕士5-8名。

Application Abstract: Multimodal feature learning and collaborative representation,together with a robust classifier construction are two crucial problems in PolSAR classification.However,large scale of PolSAR data with only a few labelled information increases their difficulties.Inspired by human brain of sensing,learning representation and decision fusion process.This project aims at constructing a deep neural network consisting of several basic networks to address these issues in a layer-wise manner.Firstly,we construct two networks imitating the low level sensing the cerebral cortex in order to respectively learn the PolSAR image and polarimetric features in parallel without supervised information.These features will be subsequently fused in a higher level network which performs the same role as the prefrontal cortex.By incorporating several semantic prior knowledge as well as a limited supervised information in the high-level network,the classification can be implemented via Bayesian inference.We validate the effectiveness of the proposed method by the full-polarimetric data of RADARSAT 2 and PALSAR.We hope our methods not only to improve performance of Polarimetric SAR image classification,but also further boosting and carving out compressed sensing theory and its applications.We will publish 10-15 journals and conferences,apply 6-8 patents and bring up 5-8 Ph.Ds and masters.

项目受资助省

陕西省

项目结题报告(全文)

本项目针对目前有限的物理散射机理驱动的分解模型不能涵盖所有地物类型的散射方式给分类带来的问题，设计自适应PolSAR目标极化分解的新框架，采用堆栈自编码网络自适应的学习极化分解的方式，获得更抽象的极化特征。针对PolSAR地物类型复杂、多样使得目前的特征表示难以自适应性，且缺少完备性与判别性，利用深度卷积置信网络来学习较深层的PolSAR图像特征。针对PolSAR“同类异谱图”和“同谱图异类”问题带来的分类精度下降的问题，考虑将极化特征空间、图像特征空间和语义特征空间进行多模态特征融合，挖掘三个特征空间的关联关系来实现对PolSAR图像的协同表示。将PolSAR数据的分类决策问题建模为一个类标重构的逆问题，通过挖掘类标空间的先验特性，在弱监督学习的算法框架下，建立少量标签指导的特征表示模型，然后设计合适的重构算法，恢复类别标签。建立了基于半监督学习的自适应奇异标签检测方法，对于含噪声类标的数据，采用半监督学习的策略，通过挖掘特征的本征结构先验，然后自适应的根据这种结构得到每个数据对应类标的置信度，以此来区分奇异类标。获教育部自然科学奖二等奖1项，获陕西省科学技术奖一等奖1项，出版专著3部，发表论文47篇，获批国家发明专利19项，依托本项目的研究，我们培养博士生10人，已毕业5人；硕士生30人，已毕业20人。

排序方式：时间相关性
显示方式：列表摘要

1.Multiscale CNN with Autoencoder Regularization Joint Contextual Attention Network for SAR Image Classification

关键词：
Convolution;Radar imaging;Image classification;Deep learning;Neural networks;Asymmetric structures;Classification accuracy;Classification algorithm;Fundamental research;Intelligent technology;Postprocessing methods;SAR image classifications;Synthetic aperture radar (SAR) images

Wu, Zitong;Hou, Biao;Jiao, Licheng
《IEEE Transactions on Geoscience and Remote Sensing》
2021年
59卷
2期
期刊

Synthetic aperture radar (SAR) image classification is a fundamental research direction in image interpretation. With the development of various intelligent technologies, deep learning techniques are gradually being applied to SAR image classification. In this study, a new SAR classification algorithm known as the multiscale convolutional neural network with an autoencoder regularization joint contextual attention network (MCAR-CAN) is proposed. The MCAR-CAN has two branches: The autoencoder regularization branch and the context attention branch. First, autoencoder regularization is used for the reconstruction of the input to regularize the classification in the autoencoder regularization branch. Multiscale input and an asymmetric structure of the autoencoder branch cause the network more to be focused on classification than on reconstruction. Second, the attention mechanism is used to produce an attention map in which each attention weight corresponds to a context correlation in attention branch. The robust features are obtained by the attention mechanism. Finally, the features obtained by the two branches are spliced for classification. In addition, a new training strategy and a postprocessing method are designed to further improve the classification accuracy. Experiments performed on the data from three SAR images demonstrated the effectiveness and robustness of the proposed algorithm.
© 1980-2012 IEEE.

...

2.Modified Tensor Distance-Based Multiview Spectral Embedding for PolSAR Land Cover Classification

关键词：
Synthetic aperture radar;Polarimeters;Classification (of information);Pixels;Embeddings;Complementary property;Embedding algorithms;Land cover classification;Machine learning methods;Pixel based classifications;Polarimetric synthetic aperture radars;Polarimetric target decomposition;State-of-the-art methods

Ren, Bo;Hou, Biao;Chanussot, Jocelyn;Jiao, Licheng
《IEEE Geoscience and Remote Sensing Letters》
2020年
17卷
12期
期刊

This letter proposes a novel method for combining multiview features in polarimetric synthetic aperture radar (PolSAR) for land cover classification. It is well-known that feature extraction and classifier design are two significant steps in machine learning methods for PolSAR data interpretation. Each PolSAR pixel can be represented in different feature spaces, such as polarimetric data scattering, or the polarimetric target decomposition spaces. In this letter, a tensor-based multiview embedding algorithm is proposed to fuse those features from different spaces in order to obtain a distinctive set of features for the subsequent classification. Based on the pixel-based classification tasks, a modified tensor distance (MTD) is designed to accurately calculate the distance between tensors. It emphasizes the importance of the central pixel, and decreases the influence of the neighbors in the feature patch when calculating tensor distance. Furthermore, the complementary properties of different views are exploited by an MTD measured tensor multiview spectral embedding method, so as to obtain relevant low-dimensional features. Compared with state-of-the-art methods, the validation and effectiveness of the proposed method is demonstrated on two real PolSAR data sets.
© 2004-2012 IEEE.

...

3.Semi-Supervised PolSAR Image Classification Based on Improved Tri-Training With a Minimum Spanning Tree

Shuang Wang；Yanhe Guo；Wenqiang Hua；Xinan Liu；Guoxin Song；Biao Hou；Licheng Jiao；
《》
0年
卷
期
期刊

4.Dynamic Immunization Node Model for Complex Networks Based on Community Structure and Threshold

关键词：
Immune system; Integrated circuit modeling; Viruses (medical);Adaptation models; Complex networks; Probability; Dynamic propagationmodel; immune threshold; node immunization; propagation probability;INFORMATION DIFFUSION; MATHEMATICAL-THEORY; MODULARITY

Shang, Ronghua;Zhang, Weitong;Jiao, Licheng;Zhang, Xiangrong;Stolkin, Rustam
《IEEE TRANSACTIONS ON CYBERNETICS》
2022年
52卷
3期
期刊

In the information age of big data, and increasingly large and complex networks, there is a growing challenge of understanding how best to restrain the spread of harmful information, for example, a computer virus. Establishing models of propagation and node immunity are important parts of this problem. In this article, a dynamic node immune model, based on the community structure and threshold (NICT), is proposed. First, a network model is established, which regards nodes carrying harmful information as new nodes in the network. The method of establishing the edge between the new node and the original node can be changed according to the needs of different networks. The propagation probability between nodes is determined by using community structure information and a similarity function between nodes. Second, an improved immune gain, based on the propagation probability of the community structure and node similarity, is proposed. The improved immune gain value is calculated for neighbors of the infected node at each time step, and the node is immunized according to the hand-coded parameter: immune threshold. This can effectively prevent invalid or insufficient immunization at each time step. Finally, an evaluation index, considering both the number of immune nodes and the number of infected nodes at each time step, is proposed. The immune effect of nodes can be evaluated more effectively. The results of network immunization experiments, on eight real networks, suggest that the proposed method can deliver better network immunization than several other well-known methods from the literature.

...

5.基于深度学习的遥感图像地物分类

关键词：
光学遥感图像;地物分类;标签迭代更新;特征融合;坐标损失

容拓拓
指导老师：西安电子科技大学侯彪
0年
学位论文

遥感图像地物分类是遥感图像的重要应用领域之一。随着卫星成像技术和航天技术的快速发展,越来越多的高分辨率遥感卫星投入使用,人类可获取的遥感图像数量在不断上升、分辨率也在不断提高。在高分辨率遥感图像中,不同地物要素的类别特征显著,纹理细节丰富,这些特点是实现遥感图像地物分类的重要前提。然而,相比于自然图像,高分辨率遥感图像的信息更加丰富、场景更加复杂,因此对遥感图像直接应用通用计算机视觉处理算法往往效果欠佳。本文针对高分辨率遥感图像的成像特征,对现有用于自然图像像素级分类任务的算法进行改进。本文中,分别从数据集,深度卷积网络的网络结构以及损失函数三个方向对现有的算法进行优化,提升了算法最终的性能。本文的主要研究内容如下:1.提出了低层特征提取模块和标签迭代更新策略。低层特征提取模块针对遥感图像单位像素表征信息更多这一特点,从浅层特征中提取更多特征用于最终的分类决策;标签迭代更新策略针对遥感图像中人工标注存在误差这一问题,使用网络的输出对原始的人工标注进行修正,利用修正的标签继续训练算法模型,不断重复“训练-修正”步骤直到网络再次收敛,通过网络自身学习的信息对数据偏差导致的网络模型性能下降进行修正,提升算法的最终性能。2.提出了基于U型网络的深度特征融合方法。U型网络先编码后解码过程中不仅损失了分辨率而且浅层深层特征融合不充分。为了提高算法的特征融合能力,引入了注意力模块,该模块用于浅层特征与深层特征融合中提高精细边缘的提取能力;引入了空间金字塔池化模块,该模块用于对编码解码阶段特征损失的分辨率信息进行补充。为了解决U型网络不同深度信息缺乏交互的问题,引入了网络跨层特征融合模块,对不同层特征进行迭代融合,从而更好地补充细节信息。3.提出了基于标签残差的坐标损失。当前像素级分类中最常使用的交叉熵损失只关注单个像素点分类是否正确,但是无法关注到图像全局上下文信息的损失。为了促使算法模型在训练过程中关注到图像级别的损失,本文提出了基于标签残差的坐标损失,通过在两个空间方向对图像标签的残差信息进行建模,利用两个一维特征表征图像级别差异以辅助交叉熵损失函数对网络进行优化。通过交叉熵损失与坐标损失的共同约束,从而提高网络最终的性能。

...

6.基于深度卷积神经网络的光学遥感图像地物分类

关键词：
遥感图像地物分类;注意力机制;卷积神经网络;密集上采样卷积;生成对抗网络

柳阳飞
指导老师：西安电子科技大学侯彪
0年
学位论文

遥感图像地物分类是遥感图像的重要应用之一。随着国家在遥感领域的大力探索发展,遥感图像成像质量有了很大提高,图像分辨率越来越小,获取的信息越来越丰富,为地物分类任务带来优势的同时,也提出了更高的要求和挑战。传统的地物分类算法由于图像分辨率有限,能够区分的类别较少,随着分辨率的提高,越来越多的类别清晰可见,类别数也随之增加。为了更好的区分这些类别,首先必须提取出这些类别的特征,随着近些年卷积神经网络在图像领域的广泛应用,一种新的基于卷积神经网络的遥感图像地物分类方法呼之欲出。本论文基于深度卷积神经网络在自然图像的语义分割算法,分析遥感图像与自然图像的区别与联系,提出了一系列针对遥感图像地物分类任务的改进方法,获得了令人满意的分割效果。本文具体内容如下所述:1.本文提出了一种基于注意力机制和多尺度特征的地物分类方法。该方法基于目前自然图像语义分割最先进的Deep Lab-V3+算法结构,针对遥感图像场景复杂的问题,使用注意力机制可以使特征提取结构更加关注复杂背景下的典型地物目标要素;针对遥感图像不同类别尺寸差异大的问题,重新设计了新的多尺度特征提取模块来适应不同尺寸大小的目标;为了在数据分布不均匀的情况下也能得到较好的模型参数,使用了一种全新的类别权重计算方法,有效解决了类别不均衡问题。2.本文提出了一种基于高分辨率特征和密集上采样卷积的地物分类方法。该方法借鉴了HRNet网络新颖的结构,在特征提取过程中始终保持较高的特征分辨率,这对于小目标类别的特征提取和特征恢复有很大的帮助。为了提高算法的分割结果,在HRNet网络后又接入了一种完全依靠参数学习的解码结构—密集上采样卷积,同时为了更好利用提取的高分辨率特征,使用了边缘分割损失和Focal Loss损失的加权融合,进一步提高分割边缘的准确性。3.本文提出了一种基于生成对抗网络的遥感图像生成方法。该方法利用生成对抗网络,实现了从分割标签图像生成真实遥感图像的迁移过程,能够有效解决地物分类任务中数据量不足的问题,进而提高了地物分类的准确率和鲁棒性。

...

7.基于空间信息和长短时记忆网络的极化SAR图像分类

关键词：
极化SAR图像分类;栈式自编码网络;特征选择;长短时记忆网络;超像素分割

程杰
指导老师：西安电子科技大学侯彪
0年
学位论文

极化合成孔径雷达作为一种高精度成像雷达,其发射的电磁波具有很强的穿透力,在军事、工农业中发挥着重要的作用,因此有关极化SAR图像解译的研究逐渐发展起来,其中极化SAR分类问题是遥感图像解译领域的一大分支。传统的极化SAR分类方法依赖于极化SAR数据的相干特性,而没有利用图像的空间信息,不能达到很好的分类效果。本文的主要工作是基于空间信息和长短时记忆网络的极化SAR图像分类的研究,以深度置信网络和长短时记忆网络为基础,将其应用于极化SAR分类中,本文的主要工作如下:1.研究了一种基于图像空间信息的深度置信网络的极化SAR分类方法。该方法以深度置信网络为基础分类网络,首先将待分类的极化数据进行预处理并得到其相干矩阵,然后利用相干矩阵对深度置信网络进行训练,提取极化SAR数据的特征,并经过softmax层进行初步分类,最后引入数据的局部空间信息对初分类结果加以优化,取得了较好的分类结果。2.研究了一种基于SAE-LSTM网络的极化SAR分类方法。由于极化SAR数据中包含丰富的极化散射信息,仅对极化相干矩阵进行处理不足以表征完整的地物信息,因此本文首先利用多种目标分解方法对极化SAR数据进行处理,得到其多维特征向量,然后利用SAE网络对获得的多维特征向量进行特征选择,获得对分类效果贡献度大的特征。最后,考虑极化SAR数据的空间时序信息,利用LSTM网络实现对极化SAR图像的分类。3.研究了一种基于超像素分割的SAE-LSTM极化SAR分类方法。该方法首先利用SAE网络对多维特征向量进行特征选择,选择出有利于分类的特征向量。其次,采用基于特征分解的超像素分割算法,将选择出来的特征用于生成超像素块。最后,将LSTM网络得到的初分类结果与超像素块进行比对,根据超像素块的标签去校正与之对应位置的预测标签,得到最终的分类结果。

...

8.基于生成对抗网络的SAR图像生成与数据扩充研究

关键词：
生成对抗网络;SAR图像生成;分布特性;结构特性;图像补全

周驿枰
指导老师：西安电子科技大学侯彪
0年
学位论文

合成孔径雷达（Synthetic Aperture Radar,SAR）在军事和民生领域中发挥着重要的作用,它一直是雷达遥感领域的研究热点。随着深度学习技术的不断发展,SAR图像理解与解译的研究方法也得到新的启发。但是,基于深度学习的研究方法一般需要充足的带标签数据,而SAR图像却存在着资源昂贵,难以获得的问题,这限制了深度学习技术在SAR图像上的应用。因此,针对SAR图像的生成和扩充研究具有重要的研究意义。本文通过对技术现状研究和分析,尝试使用生成对抗网络进行SAR图像的生成与数据扩充,并探究生成数据对于SAR图像分类的意义。本文在生成对抗网络的基础上,分析并引入了SAR图像的特性,具体的工作包含以下方面:（1）基于分布和LBP-GAN的SAR图像生成方法。通过分析SAR图像不同于光学图像的成像机理与特性,对传统的生成对抗网络引入SAR图像信息。提出一种基于分布的生成方法,改进了生成对抗网络的噪声潜在空间,使其更符合SAR图像的统计分布信息。为了约束网络对于SAR图像纹理特征的学习,提出了一种LBP损失函数。两种方法分别从低级像素特征和高级结构特征捕获SAR图像特性,使得生成的图像更接近真实图像,且具有较好的生成效果和生成效率,有助于提高分类精度。（2）基于结构相似生成对抗网络的SAR图像生成方法。针对高分辨SAR图像生成结果模糊、出现噪声这一问题,通过改进网络结构,利用马尔可夫鉴别器网络学习高分辨SAR图像的区域细节特征和纹理特征,提出结构相似损失函数,保证生成图像与真实图像在结构上的相似度,通过结合金字塔的层级网络结构,进行单张SAR图像的内部混合模式学习,在整体上提高生成图像的效果。该方法进一步提高了生成图像与真实图像的相似度,同时基于生成方法扩充的数据提高了分类的准确度。（3）基于随机掩膜和图像补全的SAR图像生成方法。针对SAR图像生成多样性不足的问题,提出利用图像补全的思想进行图像生成。通过多元补全方法,利用样本间的混合模式学习,捕获SAR图像的整体结构特性,提高生成的稳定性。设计了随机掩膜的方法,从而适应SAR图像不同地物的结构,提高随机性,使结果多样化,扩充的数据具有抗噪性,有助于提高分类准确度。

...

9.Incremental Land Cover Classification via Label Strategy and Adaptive Weights

关键词：
Remote sensing; Task analysis; Semantic segmentation; Data models;Training; Feature extraction; Predictive models; Incremental learning;land cover classification; semantic segmentation

Ren, Bo;Wang, Zhao;Hou, Biao;Liu, Bo;Wu, Zitong;Chanussot, Jocelyn;Jiao, Licheng
《IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING》
2023年
61卷
期
期刊

During incremental learning tasks, catastrophic forgetting occurs when old models are updated with new information. To address this issue, we propose a novel method called label strategy and adaptive weights (LSAW) that improves the incremental learning process. The label strategy introduces the old classes and solves the problem of how to reasonably use the wrong samples predicted by the old model. In the cross-entropy (CE) loss, we apply a threshold to filter the pseudolabels predicted by the old model. Subsequently, we merge the pixel samples with high probability with the current label. The probability here refers to the probability that the pixel belongs to the true class. This process enables the introduction of information from old classes that are not directly accessible in the current stage. Moreover, this information is relatively reliable, and the model exhibits confidence in its accuracy. For the remaining pixels, we retain all classes' information through label smoothing. In the distillation function, the old class and background pixel samples are selected for distillation according to the prediction map of the old classes. The weights of the classes are adaptively updated and adjusted using specific label information from each batch and the different stages of incremental learning. As demonstrated by the results of our experiment, on three remote sensing image datasets: China Computer Federation (CCF), Potsdam, and Vaihingen, our method achieves the best results.

...

10.基于Atlas 200DK的遥感图像飞机目标检测实现

关键词：
昇腾AI处理器;Atlas 200DK;飞机目标检测;遥感图像

吕睿娟
指导老师：西安电子科技大学侯彪
0年
学位论文

遥感图像随着成像技术的不断发展,可获取的图片数量以及质量也与日俱增。海量的遥感图像,对研究人员来说,其中包含的信息不管是在城市管理还是军事等方面都有极大的参考价值与研究价值。随着遥感图像分辨率的不断提高,可获得信息量也随着增长,而传统的计算处理设备在处理大尺寸的遥感图像时,需要强大的硬件支持来提高计算的性能。但是强大的硬件支持往往意味着更高的功耗,因此低功耗并且具有强大算力的加速设备受到研究人员的关注。昇腾AI系列处理器是由华为自主研发的AI处理器,以其优良的软硬件架构为基础,向开发人员提供了两种不同语言的开发接口,同时兼具了强大算力和低功耗的特性,在计算机视觉领域展现了明显的优势。昇腾AI系列处理器主要分为昇腾910和昇腾310系列,在本文中主要使用了搭载昇腾310的边缘计算设备Atlas 200DK,进行光学遥感图像飞机目标检测算法的实现以及优化加速的方法,并且对比了算法在优化前后的检测性能比较。围绕此任务做了以下工作:（1）设计遥感图像飞机目标检测算法的实现流程,并将此目标检测算法在华为Atlas 200DK上进行实现。本文中主要需要实现的目标检测算法是YOLO v3,该算法在实现过程中,需遵循Atlas 200DK的开发流程进行应用开发以及算法实现。本次开发过程中使用了python语言进行整体的程序设计,在程序实现时,首先对开发环境和运行环境进行搭建,并依照ACL的应用开发流程,完成目标检测算法的模型转换,以及具体的应用开发任务。同时,针对该目标检测应用搭建了基于Py Qt5的飞机目标检测系统,将目标检测结果实时结果显示。（2）针对目标检测算法进行进一步的优化,具体的优化方式是针对本次开发所使用的数据集进行k-means聚类生成该数据集的anchors,然后基于华为Atlas 200DK的开发特性,结合昇腾AI处理器的性能优化策略进行目标检测算法的进一步优化,充分利用昇腾AI处理器在设计时已完成优化的网络结构以及算子集合,实现算法优化以及检测性能的提升。最后,本文对比了算法优化前后在Atlas 200DK上的推理性能,检测率以及虚警率等目标检测性能,可以看出,目标检测算法在优化后的推理检测速度得到了提升。

...

排序方式：时间相关性
显示方式：列表摘要