无监督层次表征学习模型及其在遥感影像解译中的应用

项目来源

国家自然科学基金(NSFC)

项目主持人

刘芳

项目受资助机构

西安电子科技大学

立项年度

2018

立项时间

未公开

项目编号

61871310

研究期限

未知 / 未知

项目级别

国家级

受资助金额

16.00万元

学科

信息科学-电子学与信息系统-信息获取与处理

学科代码

F-F01-F0113

基金类别

面上项目

无监督学习 ; SAR影像无监督分割 ; 极不匀质区域地物表示 ; 方向结构滤波器 ; 层次表征学习 ; unsupervised learning ; hierarchical representation learning ; unsupervised Segmentation of SAR Image ; representation of ground objects in extremely uneven regions ; directional structure filter

参与者

李婉；张雅科；崔元浩；黄欣研；骞晓雪；杨苗苗；李鹏芳；庞婷尹；李硕

参与机构

陕西科技大学

项目标书摘要：大场景高分辨SAR影像在语义空间中的区域图的作用下，被划分为混合集聚结构地物像素子空间、结构地物像素子空间和匀质地物像素子空间。针对大场景高分辨率SAR影像存在获取类别标签样本数据难的问题，通过对三个不同结构地物的像素子空间的无监督分割来解决SAR影像的无监督分割。由于极不匀质区域地物表示的复杂性与该区域目标大小和形状的多样性、相邻目标空间拓扑结构的多样性、目标散射特性和相干斑噪声等有密切的关系，与匀质区域和不匀质区域地物表示的复杂性相比，极不匀质区域地物表示的难度更大，导致混合集聚结构地物像素子空间的无监督分割更具有挑战性；因此，本项目拟重点研究极不匀质区域的无监督层次表征学习模型和方法来实现混合集聚结构地物像素子空间的无监督分割，为无监督表征学习探索出一条非深层且可解释的新的研究途径。

Application Abstract: Under the application of the regional map in the semantic space,the large scene and high-resolution SAR images are divided into hybrid aggregated,structural and homogeneous pixel-level subspaces.As it is difficult to obtain the labeled samples for the large scene and high-resolution SAR images,the unsupervised segmentation problem of the large scene and high-resolution SAR images is solved by the unsupervised segmentation of three structure pixel-level subspaces.The complexity of the object representation of the extremely unhomogeneous pixel-level subspace has a close relationship with the diversity of the size and shape of the target in the region,the diversity of spatial topological structures with adjacent targets,the target scattering characteristics and the speckle noise,etc.So compared with the homogeneous and unhomogeneous region,the extremely unhomogeneous region is more difficult to represent,which leads to the unsupervised segmentation of the hybird aggreated structure pixel-level subspace more challenging.Therefore,this project will focus on the unsupervised hierarchical represention learning model and method to realize the unsupervised segemention of the hybird aggregated stucture pixel-level subspace,and explore a new non-deep and interpretable apporach to realize unsupervised representation learning.

项目受资助省

陕西省

项目结题报告(全文)

由于SAR图像中极不匀质区域地物表示的复杂性与该区域目标大小和形状的多样性、相邻目标空间拓扑结构的多样性、目标散射特性和相干斑噪声等有密切的关系。针对SAR图像极不匀质区域地物的复杂形状结构表征难的问题，本项目首先构建了能捕捉方向信息的Ridgelet核函数和Curvelet核函数，并基于这些核函数设计了方向结构滤波器；接着，为了以无监督学习的方式获取初始结构滤波器，该项目利用了素描线段的方向和个数等信息，建立了基于素描方向信息的能量保真目标函数，并提出了相应优化方法。理论分析和实验表明本项目提出的基于素描方向信息的能量保真模型和优化方法在无需样本标签的条件下，能更好地求解出那些与原始影像块中蕴含的复杂地物结构尽量匹配的初始结构滤波器。在此基础上，针对目标形状边界的建模问题，本项目提出了以素描线段为单位构建能包含边界信息在内的几何结构块，设计了几何结构块中描述边界邻域内像素值之间约束关系的函数，建立了基于几何结构块约束关系的结构能量保真模型，并提出了相应优化方法。理论分析和实验表明与强调重构块与原始影像块的整体Frobenius范数最小化不同，本项目提出的模型在关注它们之间整体Frobenius范数较小化的同时，还充分考虑了目标形状边界的建模问题。最后，在语义空间和像素空间信息交互框架下，针对混合地物结构像素子空间的无监督分割问题，本项目提出了一种基于素描方向统计信息和特征学习的两阶段聚类的混合像素子空间分割方法，该方法将人工设计的特征和学习得到的特征通过两阶段聚类有机地结合起来，使得分割性能得到较大提升。

排序方式：时间相关性
显示方式：列表摘要

1.Large Motion Video Super-Resolution with Dual Subnet and Multi-Stage Communicated Upsampling

Liu, Hongying ; Zhao, Peng ; Ruan, Zhubo ; Shang, Fanhua ; Liu, Yuanyuan
《35th AAAI Conference on Artificial Intelligence, AAAI 2021》
2021年
会议

必应学术

2.Supervised Edge Attention Network for Accurate Image Instance Segmentation

关键词：
Computer vision;Convolution;Image segmentation;Attention mechanisms;Background noise;Baseline models;Bounding box

Chen, Xier;Lian, Yanchao;Jiao, Licheng;Wang, Haoran;Gao, YanJie;Lingling, Shi
《16th European Conference on Computer Vision, ECCV 2020》
2020年
August 23, 2020 - August 28, 2020
Glasgow, United kingdom
会议

Effectively keeping boundary of the mask complete is important in instance segmentation. In this task, many works segment instance based on a bounding box from the box head, which means the quality of the detection also affects the completeness of the mask. To circumvent this issue, we propose a fully convolutional box head and a supervised edge attention module in mask head. The box head contains one new IoU prediction branch. It learns association between object features and detected bounding boxes to provide more accurate bounding boxes for segmentation. The edge attention module utilizes attention mechanism to highlight object and suppress background noise, and a supervised branch is devised to guide the network to focus on the edge of instances precisely. To evaluate the effectiveness, we conduct experiments on COCO dataset. Without bells and whistles, our approach achieves impressive and robust improvement compared to baseline models. Code is at https://github.com//IPIU-detection/SEANet. © 2020, Springer Nature Switzerland AG.

...

3.AttAN: Attention adversarial networks for 3D point cloud semantic segmentation

关键词：
Semantic Web;Semantics;High dimensional spaces;High order correlation;Multi-scale features;Segmentation accuracy;Segmentation results;Semantic segmentation;State-of-the-art methods;Three dimensional space

Zhang, Gege;Ma, Qinghua;Jiao, Licheng;Liu, Fang;Sun, Qigong
《29th International Joint Conference on Artificial Intelligence, IJCAI 2020》
2020年
January 1, 2021
Yokohama, Japan
会议

3D point cloud semantic segmentation has attracted wide attention with its extensive applications in autonomous driving, AR/VR, and robot sensing fields. However, in existing methods, each point in the segmentation results is predicted independently from each other. This property causes the non-contiguity of label sets in three-dimensional space and produces many noisy label points, which hinders the improvement of segmentation accuracy. To address this problem, we first extend adversarial learning to this task and propose a novel framework Attention Adversarial Networks (AttAN). With high-order correlations in label sets learned from the adversarial learning, segmentation network can predict labels closer to the real ones and correct noisy results. Moreover, we design an additive attention block for the segmentation network, which is used to automatically focus on regions critical to the segmentation task by learning the correlation between multi-scale features. Adversarial learning, which explores the underlying relationship between labels in high-dimensional space, opens up a new way in 3D point cloud semantic segmentation. Experimental results on ScanNet and S3DIS datasets show that this framework effectively improves the segmentation quality and outperforms other state-of-the-art methods.

...

排序方式：时间相关性
显示方式：列表摘要