基于脑启发的PolSAR图像深层协同表示学习与分类

项目来源

国家自然科学基金(NSFC)

项目主持人

侯彪

项目受资助机构

西安电子科技大学

项目编号

61671350

立项年度

2016

立项时间

未公开

项目级别

国家级

研究期限

未知 / 未知

受资助金额

58.00万元

学科

信息科学-电子学与信息系统-信息获取与处理

学科代码

F-F01-F0113

基金类别

面上项目

关键词

深度学习 ; 图像分割 ; 脑启发 ; 表示学习 ; 图像分类 ; image segmentation ; deep learning ; Brain-inspired ; representation learning ; image classification

参与者

唐旭;任博;文载道;吴倩;王建龙;任仲乐;牟树根;张广

参与机构

西北工业大学;中国人民解放军空军工程大学

项目标书摘要:多模态特征学习与协同表示和鲁棒的分类器设计是极化SAR分类中重要的两个核心问题。然而大规模极化SAR的数据量中通常仅有少量有效的类别标签,给这两个问题的有效解决带来了困难。本项目受到大脑感知、学习表示与决策融合过程的启发,构造一个多模块的深层神经网络来逐层解决以上问题。首先,本项目构造两个并行的低级感知皮层网络实现无监督的学习PolSAR图像特征与极化特征;其次,在上述两个低级网络之上,模拟构造高级前额皮层网络将底层多源特征进行协同表示与融合;最后,在高层网络中加入语义先验知识以及少量的监督信息,在贝叶斯推理的框架下实现决策分类,并用RADARSAT2和PALSAR的全极化SAR数据验证其有效性。期望在提高极化SAR图像分类效果的同时,进一步完善和促进深度学习等理论的研究和应用。成果在本领域重要期刊和会议上发表论文15-20篇,申报国家发明专利6-8项,联合培养博士、硕士5-8名。

Application Abstract: Multimodal feature learning and collaborative representation,together with a robust classifier construction are two crucial problems in PolSAR classification.However,large scale of PolSAR data with only a few labelled information increases their difficulties.Inspired by human brain of sensing,learning representation and decision fusion process.This project aims at constructing a deep neural network consisting of several basic networks to address these issues in a layer-wise manner.Firstly,we construct two networks imitating the low level sensing the cerebral cortex in order to respectively learn the PolSAR image and polarimetric features in parallel without supervised information.These features will be subsequently fused in a higher level network which performs the same role as the prefrontal cortex.By incorporating several semantic prior knowledge as well as a limited supervised information in the high-level network,the classification can be implemented via Bayesian inference.We validate the effectiveness of the proposed method by the full-polarimetric data of RADARSAT 2 and PALSAR.We hope our methods not only to improve performance of Polarimetric SAR image classification,but also further boosting and carving out compressed sensing theory and its applications.We will publish 10-15 journals and conferences,apply 6-8 patents and bring up 5-8 Ph.Ds and masters.

项目受资助省

陕西省

项目结题报告(全文)

本项目针对目前有限的物理散射机理驱动的分解模型不能涵盖所有地物类型的散射方式给分类带来的问题,设计自适应PolSAR目标极化分解的新框架,采用堆栈自编码网络自适应的学习极化分解的方式,获得更抽象的极化特征。针对PolSAR地物类型复杂、多样使得目前的特征表示难以自适应性,且缺少完备性与判别性,利用深度卷积置信网络来学习较深层的PolSAR图像特征。针对PolSAR“同类异谱图”和“同谱图异类”问题带来的分类精度下降的问题,考虑将极化特征空间、图像特征空间和语义特征空间进行多模态特征融合,挖掘三个特征空间的关联关系来实现对PolSAR图像的协同表示。将PolSAR数据的分类决策问题建模为一个类标重构的逆问题,通过挖掘类标空间的先验特性,在弱监督学习的算法框架下,建立少量标签指导的特征表示模型,然后设计合适的重构算法,恢复类别标签。建立了基于半监督学习的自适应奇异标签检测方法,对于含噪声类标的数据,采用半监督学习的策略,通过挖掘特征的本征结构先验,然后自适应的根据这种结构得到每个数据对应类标的置信度,以此来区分奇异类标。获教育部自然科学奖二等奖1项,获陕西省科学技术奖一等奖1项,出版专著3部,发表论文47篇,获批国家发明专利19项,依托本项目的研究,我们培养博士生10人,已毕业5人;硕士生30人,已毕业20人。

  • 排序方式:
  • 9
  • /
  • 1.Multiscale CNN with Autoencoder Regularization Joint Contextual Attention Network for SAR Image Classification

    • 关键词:
    • Convolution;Radar imaging;Image classification;Deep learning;Neural networks;Asymmetric structures;Classification accuracy;Classification algorithm;Fundamental research;Intelligent technology;Postprocessing methods;SAR image classifications;Synthetic aperture radar (SAR) images
    • Wu, Zitong;Hou, Biao;Jiao, Licheng
    • 《IEEE Transactions on Geoscience and Remote Sensing》
    • 2021年
    • 59卷
    • 2期
    • 期刊

    Synthetic aperture radar (SAR) image classification is a fundamental research direction in image interpretation. With the development of various intelligent technologies, deep learning techniques are gradually being applied to SAR image classification. In this study, a new SAR classification algorithm known as the multiscale convolutional neural network with an autoencoder regularization joint contextual attention network (MCAR-CAN) is proposed. The MCAR-CAN has two branches: The autoencoder regularization branch and the context attention branch. First, autoencoder regularization is used for the reconstruction of the input to regularize the classification in the autoencoder regularization branch. Multiscale input and an asymmetric structure of the autoencoder branch cause the network more to be focused on classification than on reconstruction. Second, the attention mechanism is used to produce an attention map in which each attention weight corresponds to a context correlation in attention branch. The robust features are obtained by the attention mechanism. Finally, the features obtained by the two branches are spliced for classification. In addition, a new training strategy and a postprocessing method are designed to further improve the classification accuracy. Experiments performed on the data from three SAR images demonstrated the effectiveness and robustness of the proposed algorithm.
    © 1980-2012 IEEE.

    ...
  • 2.Modified Tensor Distance-Based Multiview Spectral Embedding for PolSAR Land Cover Classification

    • 关键词:
    • Synthetic aperture radar;Polarimeters;Classification (of information);Pixels;Embeddings;Complementary property;Embedding algorithms;Land cover classification;Machine learning methods;Pixel based classifications;Polarimetric synthetic aperture radars;Polarimetric target decomposition;State-of-the-art methods
    • Ren, Bo;Hou, Biao;Chanussot, Jocelyn;Jiao, Licheng
    • 《IEEE Geoscience and Remote Sensing Letters》
    • 2020年
    • 17卷
    • 12期
    • 期刊

    This letter proposes a novel method for combining multiview features in polarimetric synthetic aperture radar (PolSAR) for land cover classification. It is well-known that feature extraction and classifier design are two significant steps in machine learning methods for PolSAR data interpretation. Each PolSAR pixel can be represented in different feature spaces, such as polarimetric data scattering, or the polarimetric target decomposition spaces. In this letter, a tensor-based multiview embedding algorithm is proposed to fuse those features from different spaces in order to obtain a distinctive set of features for the subsequent classification. Based on the pixel-based classification tasks, a modified tensor distance (MTD) is designed to accurately calculate the distance between tensors. It emphasizes the importance of the central pixel, and decreases the influence of the neighbors in the feature patch when calculating tensor distance. Furthermore, the complementary properties of different views are exploited by an MTD measured tensor multiview spectral embedding method, so as to obtain relevant low-dimensional features. Compared with state-of-the-art methods, the validation and effectiveness of the proposed method is demonstrated on two real PolSAR data sets.
    © 2004-2012 IEEE.

    ...
  • 4.Dynamic Immunization Node Model for Complex Networks Based on Community Structure and Threshold

    • 关键词:
    • Immune system; Integrated circuit modeling; Viruses (medical);Adaptation models; Complex networks; Probability; Dynamic propagationmodel; immune threshold; node immunization; propagation probability;INFORMATION DIFFUSION; MATHEMATICAL-THEORY; MODULARITY
    • Shang, Ronghua;Zhang, Weitong;Jiao, Licheng;Zhang, Xiangrong;Stolkin, Rustam
    • 《IEEE TRANSACTIONS ON CYBERNETICS》
    • 2022年
    • 52卷
    • 3期
    • 期刊

    In the information age of big data, and increasingly large and complex networks, there is a growing challenge of understanding how best to restrain the spread of harmful information, for example, a computer virus. Establishing models of propagation and node immunity are important parts of this problem. In this article, a dynamic node immune model, based on the community structure and threshold (NICT), is proposed. First, a network model is established, which regards nodes carrying harmful information as new nodes in the network. The method of establishing the edge between the new node and the original node can be changed according to the needs of different networks. The propagation probability between nodes is determined by using community structure information and a similarity function between nodes. Second, an improved immune gain, based on the propagation probability of the community structure and node similarity, is proposed. The improved immune gain value is calculated for neighbors of the infected node at each time step, and the node is immunized according to the hand-coded parameter: immune threshold. This can effectively prevent invalid or insufficient immunization at each time step. Finally, an evaluation index, considering both the number of immune nodes and the number of infected nodes at each time step, is proposed. The immune effect of nodes can be evaluated more effectively. The results of network immunization experiments, on eight real networks, suggest that the proposed method can deliver better network immunization than several other well-known methods from the literature.

    ...
  • 5.Incremental Land Cover Classification via Label Strategy and Adaptive Weights

    • 关键词:
    • Remote sensing; Task analysis; Semantic segmentation; Data models;Training; Feature extraction; Predictive models; Incremental learning;land cover classification; semantic segmentation
    • Ren, Bo;Wang, Zhao;Hou, Biao;Liu, Bo;Wu, Zitong;Chanussot, Jocelyn;Jiao, Licheng
    • 《IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING》
    • 2023年
    • 61卷
    • 期刊

    During incremental learning tasks, catastrophic forgetting occurs when old models are updated with new information. To address this issue, we propose a novel method called label strategy and adaptive weights (LSAW) that improves the incremental learning process. The label strategy introduces the old classes and solves the problem of how to reasonably use the wrong samples predicted by the old model. In the cross-entropy (CE) loss, we apply a threshold to filter the pseudolabels predicted by the old model. Subsequently, we merge the pixel samples with high probability with the current label. The probability here refers to the probability that the pixel belongs to the true class. This process enables the introduction of information from old classes that are not directly accessible in the current stage. Moreover, this information is relatively reliable, and the model exhibits confidence in its accuracy. For the remaining pixels, we retain all classes' information through label smoothing. In the distillation function, the old class and background pixel samples are selected for distillation according to the prediction map of the old classes. The weights of the classes are adaptively updated and adjusted using specific label information from each batch and the different stages of incremental learning. As demonstrated by the results of our experiment, on three remote sensing image datasets: China Computer Federation (CCF), Potsdam, and Vaihingen, our method achieves the best results.

    ...
  • 6.PolSAR Scene Classification via Low-Rank Constrained Multimodal Tensor Representation

    • 关键词:
    • PolSAR; scene classification; multimodal features; low-rank; tensorialrepresentations;CANONICAL CORRELATION-ANALYSIS; MANIFOLD ALIGNMENT; SCATTERING MODEL;COLOR FEATURES; LAND-COVER; TEXTURE; DECOMPOSITION; FRAMEWORK; FUSION
    • Ren, Bo;Chen, Mengqian;Hou, Biao;Hong, Danfeng;Ma, Shibin;Chanussot, Jocelyn;Jiao, Licheng
    • 《REMOTE SENSING》
    • 2022年
    • 14卷
    • 13期
    • 期刊

    Polarimetric synthetic aperture radar (PolSAR) data can be acquired at all times and are not impacted by weather conditions. They can efficiently capture geometrical and geographical structures on the ground. However, due to the complexity of the data and the difficulty of data availability, PolSAR image scene classification remains a challenging task. To this end, in this paper, a low-rank constrained multimodal tensor representation method (LR-MTR) is proposed to integrate PolSAR data in multimodal representations. To preserve the multimodal polarimetric information simultaneously, the target decompositions in a scene from multiple spaces (e.g., Freeman, H/A/alpha, Pauli, etc.) are exploited to provide multiple pseudo-color images. Furthermore, a representation tensor is constructed via the representation matrices and constrained by the low-rank norm to keep the cross-information from multiple spaces. A projection matrix is also calculated by minimizing the differences between the whole cascaded data set and the features in the corresponding space. It also reduces the redundancy of those multiple spaces and solves the out-of-sample problem in the large-scale data set. To support the experiments, two new PolSAR image data sets are built via ALOS-2 full polarization data, covering the areas of Shanghai, China, and Tokyo, Japan. Compared with state-of-the-art (SOTA) dimension reduction algorithms, the proposed method achieves the best quantitative performance and demonstrates superiority in fusing multimodal PolSAR features for image scene classification.

    ...
  • 7.A dual-stream high resolution network: Deep fusion of GF-2 and GF-3 data for land cover classification

    • 关键词:
    • Heterogeneous data fusion; SAR-optical; Land cover classification;Multi-modalities;SAR; IMAGERY
    • Ren, Bo;Ma, Shibin;Hou, Biao;Hong, Danfeng;Chanussot, Jocelyn;Wang, Jianlong;Jiao, Licheng
    • 《INTERNATIONAL JOURNAL OF APPLIED EARTH OBSERVATION AND GEOINFORMATION》
    • 2022年
    • 112卷
    • 期刊

    Land cover classification (LCC) is an important application in remote sensing data interpretation and invariably faces big intra-class variance and sample imbalance in remote sensing images. The optical image is obtained by satellites capturing the spectral information of the Earth's surface, and the synthetic aperture radar (SAR) image is produced by the satellite actively transmitting and receiving the electromagnetic wave signals reflected from land covers. Because of the limitations of the optical image, a single modality (optical image) might be disturbed by external conditions, especially complex weather. Using heterogeneous SAR and optical images for LCC can reduce the negative impact caused by single-modal data damage, and multi-modal data can also be used as supplementary information to enhance classification accuracy. However, general LCC methods mainly focus on remote sensing data of a single modality without fully considering the multi-modalities of land covers. Therefore, we propose a dual-stream deep high-resolution network (DDHRNet) to deeply integrate SAR and optical data at the feature level in every branch. The network can effectively exploit the complementary information in heterogeneous images. It improves classification performance and achieves significant improvements in the classification of clouded images. A multi-modal squeeze-and-excitation (MSE) module is also utilized to fuse the features. Compared with the ordinary methods, MSE modules can lead to an improvement of about 1% to 5% in overall accuracy (OA), Kappa coefficients, and mean intersection over union (mIoU). Besides, in order to evaluate our method, we describe in detail the preprocessing process of Gaofen-2 (GF2) and Gaofen-3 (GF3) data before they are used in the LCC task. The experiments show that the proposed method performs well compared with other excellent segmentation methods and obtains the best performance on heterogeneous images from GF2 and GF3. The code and datasets are available at: https://github.com/XD-MG/DDHRNet.

    ...
  • 8.PolSAR Scene Classification via Low-Rank Constrained Multimodal Tensor Representation(Open Access)

    • Ren, Bo ; Chen, Mengqian ; Hou, Biao ; Hong, Danfeng ; Ma, Shibin ; Chanussot, Jocelyn ; Jiao, Licheng
    • 《Remote Sensing》
    • 2022年
    • 14卷
    • 13期
    • 期刊

    Polarimetric synthetic aperture radar (PolSAR) data can be acquired at all times and are not impacted by weather conditions. They can efficiently capture geometrical and geographical structures on the ground. However, due to the complexity of the data and the difficulty of data availability, PolSAR image scene classification remains a challenging task. To this end, in this paper, a low-rank constrained multimodal tensor representation method (LR-MTR) is proposed to integrate PolSAR data in multimodal representations. To preserve the multimodal polarimetric information simultaneously, the target decompositions in a scene from multiple spaces (e.g., Freeman, H/A/α, Pauli, etc.) are exploited to provide multiple pseudo-color images. Furthermore, a representation tensor is constructed via the representation matrices and constrained by the low-rank norm to keep the cross-information from multiple spaces. A projection matrix is also calculated by minimizing the differences between the whole cascaded data set and the features in the corresponding space. It also reduces the redundancy of those multiple spaces and solves the out-of-sample problem in the large-scale data set. To support the experiments, two new PolSAR image data sets are built via ALOS-2 full polarization data, covering the areas of Shanghai, China, and Tokyo, Japan. Compared with state-of-the-art (SOTA) dimension reduction algorithms, the proposed method achieves the best quantitative performance and demonstrates superiority in fusing multimodal PolSAR features for image scene classification. © 2022 by the authors. Licensee MDPI, Basel, Switzerland.

    ...
  • 9.Parameter selection of Touzi decomposition and a distribution improved autoencoder for PolSAR image classification

    • 关键词:
    • Polarization decomposition; Data distribution; Touzi decomposition;Parameter selection; Feature extraction; Autoencoder network;CONVOLUTIONAL NEURAL-NETWORK; LAND-COVER; MUTUAL INFORMATION; MODEL
    • Wang, Jianlong;Hou, Biao;Ren, Bo;Zhang, Yake;Yang, Meijuan;Wang, Shuang;Jiao, Licheng
    • 《ISPRS JOURNAL OF PHOTOGRAMMETRY AND REMOTE SENSING》
    • 2022年
    • 186卷
    • 期刊

    Touzi decomposition provides more scattering information corresponding to parameters describing different targets. It is possible to explore the selection method of parameters, which plays an important role in polarimetric synthetic aperture radar (PolSAR) image classification. Therefore, this paper presents an innovative parameter ordering scheme based on the histogram of parameters obtained by Touzi decomposition. Then, a parameter selection scheme is proposed by analyzing the influence of the increase of parameters on the overall accuracy obtained by the Softmax classifier. Based on the Gamma distribution, the parameter selection method of eigenvalue decomposition is also given. As the data distribution of the selected parameters does not always satisfy the Gaussian distribution, it may not be sufficient to directly apply the autoencoder network. Thus, the loss function of the autoencoder network is modified by the construction of different data error terms according to a different data distribution form of the selected parameters, and then an improved autoencoder network is proposed. The process of feature extraction and classification can be regarded as a whole network to better accomplish the task of classification. An extensive set of experiments are done on four real PolSAR images. Compared with the classification results of all parameters, the gap in the overall accuracy of the parameters obtained by the proposed selection scheme is only about 1%. In terms of the classification overall accuracy, considering the distribution of parameters in the autoencoder network and the construction of the whole classification network can improve it by about 6-7% for four data sets. Code is available at https://github.com/ Justin20220123/ISPRS2022.

    ...
  • 10.Reconstruction Error-Based Decomposition Feature Selection for PolSAR Image

    • 关键词:
    • Feature selection; reconstruction error; sparse variational auto encoder(VAE) (SVAE) feature selection (SVAE-FS); target decomposition;POLARIMETRIC SAR DATA; UNSUPERVISED CLASSIFICATION; HYPERSPECTRALIMAGES; SCATTERING MODEL; SEGMENTATION; INFORMATION; SIGNATURES;NETWORK; CUTS
    • Yang, Chen;Hou, Biao;Guo, Xianpeng;Ren, Bo;Chanussot, Jocelyn;Wang, Shuang;Jiao, Licheng
    • 《IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING》
    • 2022年
    • 60卷
    • 期刊

    Target decomposition features are the cornerstone of subsequent analyses for PolSAR images. Generally, adopting single or several decomposition algorithms limits the representation ability for original terrain characteristics. Using all the existing decomposition features, however, will definitely increase computational complexity. Besides, some features even have a negative effect on the following tasks. To address these problems, a sparse variational autoencoder feature selection framework (SVAE-FS) is proposed in this article. In detail, the encoder transforms the original feature set into latent space and then decoder reconstructs the corresponding pseudo set on this latent space. Similarly, a pseudo subset is subsequently obtained by the SVAE. The discrepancy, namely reconstruction error, between the pseudo set and the pseudo subset is taken as an evaluation criterion which reflects the feature representation ability of pseudo subset. Sparse constraint in the encoder makes the representative features stand out. Meanwhile, the linear feature transformation layer of the encoder enables the SVAE to evaluate different scale subsets without repeated training. Finally, a greedy selection approach with search scale K is proposed to find the suboptimal subset. This procedure not only reduces time consumption, but also ensures the performance of the subset. The selected features are analyzed on four real PolSAR datasets according to the terrain scattering characteristics. Furthermore, these features have achieved competitive performance on three PolSAR image tasks.

    ...
  • 排序方式:
  • 9
  • /