面向服务机器人的无监督领域自适应目标检测方法研究

项目来源

国家自然科学基金(NSFC)

项目主持人

叶茂

项目受资助机构

电子科技大学

项目编号

61773093

立项年度

2017

立项时间

未公开

研究期限

未知 / 未知

项目级别

国家级

受资助金额

66.00万元

学科

信息科学-人工智能-机器感知与机器视觉

学科代码

F-F06-F0604

基金类别

面上项目

关键词

深度神经网络 ; 回复式神经网络 ; 目标检测 ; 生成对抗网络 ; Object Detection ; Deep Neural Network ; Recurrent Neural Network ; Generative Adversarial Nets

参与者

李凡;邢冠宇;徐培;任东晓;李旭东;唐宋;张锋;刘丹;淦艳

参与机构

电子科技大学;四川大学;浙江科技学院;上海理工大学;南京邮电大学;重庆大学

项目标书摘要:当服务机器人工作于新场景时,因源场景与新场景数据分布的不一致,目标检测效果通常会下降。迁移学习是一个很好的解决手段,但目前绝大多数领域自适应目标检测方法要求在迁移的时候保留源样本,或者少量新场境样本有标签。这些要求服务机器人通常无法满足。针对这种情况,在以往深度学习和机器视觉研究基础上,本课题提出通过网络调控和一致特征学习等方法实现目标检测器对新场景的无监督迁移。研究内容有:基于网络调控的目标检测神经网络迁移方法研究;基于无监督一致特征学习的目标检测神经网络迁移方法研究;结合上下文信息与融入多模态信息的无监督领域自适应目标检测方法研究。创新之处有:不保留源训练集、没有新场景目标标签,基于无监督学习的目标检测网络迁移模型;基于调控网络的迁移学习框架;基于网络调控和无监督学习的上下文和多模态信息的融合方法。取得的成果不仅能丰富目标检测方法和机器学习理论,也有很重要的社会与经济价值。

Application Abstract: When a service robot is working in a new scene,because of the inconsistency of the data distributions between the source and the new scenes,the detection performance of object detector always will drop rapidly.Transfer learning is a good solution;however,almost all of present domain adaption methods require that the training set is kept or some of target samples are labeled.These requirements for service robots are usually not met.From the previous research experiences in the fields of deep learning and machine vision,we propose to study the unsupervised transfer methods which are based on network control and consistent feature learning.The research contents include:1network control framework for transferring neural network based object detector,2unsupervised consistent feature learning model for transferring neural network based object detector,3unsupervised context feature learning methods and the corresponding object detector transferring methods,4unsupervised multi-modal information learning methods and the corresponding object detector transferring methods.The main innovation points are:1network control framework for transferring learning,2unsupervised consistent feature learning model,3the combing methods which absorb context and multi-modal information based on network control and unsupervised learning.The results obtained through these studies are expected not only to enrich the methods of object detection of robot and machine learning theory,but also to make important contributions to social and economic development.

项目受资助省

四川省

项目结题报告

面向服务机器人的无监督领域自适应目标检测方法研究结题报告(全文)

  • 排序方式:
  • 1
  • /
  • 2.Memory-Augmented Episodic Value Network

    • 关键词:
    • Deep learning;Corresponding state;Deep reinforcement learning;Environmental state;Episodic memory;Learn+;Memory modules;Reinforcement learnings;State-value functions;Value network;Visual Navigation
    • Zeng, Fanyu;Xing, Guangyu;Han, Guang
    • 《2022 IEEE Conference on Games, CoG 2022》
    • 2022年
    • August 21, 2022 - August 24, 2022
    • Beijing, China
    • 会议

    In this paper, we propose memory-augmented episodic value network (M-EVN) to learn a differentiable planning-based policy with episodic memory in maze games. The episodic memory module associates the environmental state to its corresponding state value function, and outputs a weighted sum of state value functions with similar states to improve the agents' navigation performance in partially observable mazes. In addition, we introduce a Net-in-Net architecture to make M-EVN differentiable by error backpropagation and learn an explicit planning computation. We train M-EVN in 2D maze games, and the experimental results show that the M-EVN agent outperforms the original value iteration network (VIN) in the partially observable maze games. © 2022 IEEE.

    ...
  • 5.Distribution-Aware Coordinate Representation for Human Pose Estimation

    • 关键词:
    • Computer vision;Coordinate representations;De facto standard;Decoding methods;Design limitations;Encoding process;Human pose estimations;Joint coordinates;Original images
    • Zhang, Feng;Zhu, Xiatian;Dai, Hanbin;Ye, Mao;Zhu, Ce
    • 《2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020》
    • 2020年
    • June 14, 2020 - June 19, 2020
    • Virtual, Online, United states
    • 会议

    While being the de facto standard coordinate representation for human pose estimation, heatmap has not been investigated in-depth. This work fills this gap. For the first time, we find that the process of decoding the predicted heatmaps into the final joint coordinates in the original image space is surprisingly significant for the performance. We further probe the design limitations of the standard coordinate decoding method, and propose a more principled distributionaware decoding method. Also, we improve the standard coordinate encoding process (i.e.Transforming ground-Truth coordinates to heatmaps) by generating unbiased/accurate heatmaps. Taking the two together, we formulate a novel Distribution-Aware coordinate Representation of Keypoints (DARK) method. Serving as a model-Agnostic plug-in, DARK brings about significant performance boost to existing human pose estimation models. Extensive experiments show that DARK yields the best results on two common benchmarks, MPII and COCO. Besides, DARK achieves the 2nd place entry in the ICCV 2019 COCO Keypoints Challenge. The code is available online. © 2020 IEEE.

    ...
  • 6.Hierarchical Reinforcement Learning Based on Continuous Subgoal Space

    • 关键词:
    • Neural networks;Robots;Multi agent systems;Complex networks;Complex environments;Hierarchical policy;Hierarchical reinforcement learning;High level policies;Neural network training;Reward function;Robot navigation;Simultaneous training
    • Wang, Chen;Zeng, Fanyu;Ge, Shuzhi Sam;Jiang, Xin
    • 《2020 IEEE International Conference on Real-Time Computing and Robotics, RCAR 2020》
    • 2020年
    • September 28, 2020 - September 29, 2020
    • Virtual, Asahikawa, Hokkaido, Japan
    • 会议

    Multi-Agents are designed at different temporal levels to decomposed a complex task into several simple ones in hierarchical reinforcement learning. At the beginning of neural network training, the great changes of the low-level policy would cause the unstable transitions of the high-level one. In this paper, we propose a hierarchical policy combined with PPO and DDPG to deal with the simultaneous training of multi-Agents. To create an end-To-end policy, neural networks are employed to extract scene features in both low-level and high-level policies. In the meanwhile, a novel internal reward function is designed to enhance the goal achieving ability of low-level policy. A lightweight and fast gridworld Gym environment, MiniGrid, is used to test its validity. We found that the hierarchical policy is able to explore and plan without dense rewards. This attribute has a considerable influence on the study of robot navigation, especially in large and complex environment. © 2020 IEEE.

    ...
  • 7.Squeeze Criterion GANs: Double Adversarial Learning Method

    • 关键词:
    • Learning systems;Image enhancement;Adversarial learning;Adversarial networks;Mathematical derivation;Network structures;Real distribution;Training process;Training techniques;Variant models
    • Gan, Yan;Xiang, Tao;Ye, Mao
    • 《3rd International Conference on Machine Learning for Cyber Security, ML4CS 2020》
    • 2020年
    • October 8, 2020 - October 10, 2020
    • Guangzhou, China
    • 会议

    Generative adversarial networks (GANs) have attracted much attention since it is able to effective learn from an unknown real distribution. However, the instability of the training process greatly affects the quality of the generated images. To address this problem, the network structure-based, loss-based variant model and some training techniques are proposed. Unfortunately, there are some problems with the above methods, such as the limited effect of stabilizing the training process, the complex mathematical derivation, and the lack of universality of training techniques for different tasks. To this end, we propose a novel squeeze criterion GANs. In this method, we design a pseudo real module to synthesize adversarial sample and the double identity discriminator is designed. Then, the generated image and adversarial sample, as well as the generated image and real image form double adversarial learning. Through double adversarial learning, it forms a squeeze criterion to stabilize the training process of generator and discriminator. Finally, experimental results show that the proposed method has well portability and stabilizes the training process of existing GANs, and improves the quality of generated images. © 2020, Springer Nature Switzerland AG.

    ...
  • 排序方式:
  • 1
  • /