面向服务机器人的无监督领域自适应目标检测方法研究
项目来源
项目主持人
项目受资助机构
立项年度
立项时间
项目编号
项目级别
研究期限
受资助金额
学科
学科代码
基金类别
关键词
参与者
参与机构
项目受资助省
项目结题报告
面向服务机器人的无监督领域自适应目标检测方法研究结题报告(全文)
1.Attention-to-Embedding Framework forMulti-instance Learning
- Yang, Mei ; Zhang, Yu-Xuan ; Ye, Mao ; Min, Fan
- 《Lecture Notes in Computer Science 》
- 2022年
- 会议
2.Memory-Augmented Episodic Value Network
- 关键词:
- Deep learning;Corresponding state;Deep reinforcement learning;Environmental state;Episodic memory;Learn+;Memory modules;Reinforcement learnings;State-value functions;Value network;Visual Navigation
- Zeng, Fanyu;Xing, Guangyu;Han, Guang
- 《2022 IEEE Conference on Games, CoG 2022》
- 2022年
- August 21, 2022 - August 24, 2022
- Beijing, China
- 会议
In this paper, we propose memory-augmented episodic value network (M-EVN) to learn a differentiable planning-based policy with episodic memory in maze games. The episodic memory module associates the environmental state to its corresponding state value function, and outputs a weighted sum of state value functions with similar states to improve the agents' navigation performance in partially observable mazes. In addition, we introduce a Net-in-Net architecture to make M-EVN differentiable by error backpropagation and learn an explicit planning computation. We train M-EVN in 2D maze games, and the experimental results show that the M-EVN agent outperforms the original value iteration network (VIN) in the partially observable maze games. © 2022 IEEE.
...3.Source-Style Transferred Mean Teacher for Source-data Free Object Detection(Open Access)
- Zhang, Dan ; Ye, Mao ; Xiong, Lin ; Li, Shuaifeng ; Li, Xue
- 《ACM International Conference Proceeding Series》
- 2021年
- 会议
4.Divided Caption Model with Global Attention
- Yamin, Cheng ; Hancong, Duan ; Zitian, Zhao ; Zhi, Wang
- 《ACM International Conference Proceeding Series》
- 2021年
- 会议
5.Distribution-Aware Coordinate Representation for Human Pose Estimation
- 关键词:
- Computer vision;Coordinate representations;De facto standard;Decoding methods;Design limitations;Encoding process;Human pose estimations;Joint coordinates;Original images
- Zhang, Feng;Zhu, Xiatian;Dai, Hanbin;Ye, Mao;Zhu, Ce
- 《2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020》
- 2020年
- June 14, 2020 - June 19, 2020
- Virtual, Online, United states
- 会议
While being the de facto standard coordinate representation for human pose estimation, heatmap has not been investigated in-depth. This work fills this gap. For the first time, we find that the process of decoding the predicted heatmaps into the final joint coordinates in the original image space is surprisingly significant for the performance. We further probe the design limitations of the standard coordinate decoding method, and propose a more principled distributionaware decoding method. Also, we improve the standard coordinate encoding process (i.e.Transforming ground-Truth coordinates to heatmaps) by generating unbiased/accurate heatmaps. Taking the two together, we formulate a novel Distribution-Aware coordinate Representation of Keypoints (DARK) method. Serving as a model-Agnostic plug-in, DARK brings about significant performance boost to existing human pose estimation models. Extensive experiments show that DARK yields the best results on two common benchmarks, MPII and COCO. Besides, DARK achieves the 2nd place entry in the ICCV 2019 COCO Keypoints Challenge. The code is available online. © 2020 IEEE.
...6.Hierarchical Reinforcement Learning Based on Continuous Subgoal Space
- 关键词:
- Neural networks;Robots;Multi agent systems;Complex networks;Complex environments;Hierarchical policy;Hierarchical reinforcement learning;High level policies;Neural network training;Reward function;Robot navigation;Simultaneous training
- Wang, Chen;Zeng, Fanyu;Ge, Shuzhi Sam;Jiang, Xin
- 《2020 IEEE International Conference on Real-Time Computing and Robotics, RCAR 2020》
- 2020年
- September 28, 2020 - September 29, 2020
- Virtual, Asahikawa, Hokkaido, Japan
- 会议
Multi-Agents are designed at different temporal levels to decomposed a complex task into several simple ones in hierarchical reinforcement learning. At the beginning of neural network training, the great changes of the low-level policy would cause the unstable transitions of the high-level one. In this paper, we propose a hierarchical policy combined with PPO and DDPG to deal with the simultaneous training of multi-Agents. To create an end-To-end policy, neural networks are employed to extract scene features in both low-level and high-level policies. In the meanwhile, a novel internal reward function is designed to enhance the goal achieving ability of low-level policy. A lightweight and fast gridworld Gym environment, MiniGrid, is used to test its validity. We found that the hierarchical policy is able to explore and plan without dense rewards. This attribute has a considerable influence on the study of robot navigation, especially in large and complex environment. © 2020 IEEE.
...7.Squeeze Criterion GANs: Double Adversarial Learning Method
- 关键词:
- Learning systems;Image enhancement;Adversarial learning;Adversarial networks;Mathematical derivation;Network structures;Real distribution;Training process;Training techniques;Variant models
- Gan, Yan;Xiang, Tao;Ye, Mao
- 《3rd International Conference on Machine Learning for Cyber Security, ML4CS 2020》
- 2020年
- October 8, 2020 - October 10, 2020
- Guangzhou, China
- 会议
Generative adversarial networks (GANs) have attracted much attention since it is able to effective learn from an unknown real distribution. However, the instability of the training process greatly affects the quality of the generated images. To address this problem, the network structure-based, loss-based variant model and some training techniques are proposed. Unfortunately, there are some problems with the above methods, such as the limited effect of stabilizing the training process, the complex mathematical derivation, and the lack of universality of training techniques for different tasks. To this end, we propose a novel squeeze criterion GANs. In this method, we design a pseudo real module to synthesize adversarial sample and the double identity discriminator is designed. Then, the generated image and adversarial sample, as well as the generated image and real image form double adversarial learning. Through double adversarial learning, it forms a squeeze criterion to stabilize the training process of generator and discriminator. Finally, experimental results show that the proposed method has well portability and stabilizes the training process of existing GANs, and improves the quality of generated images. © 2020, Springer Nature Switzerland AG.
...
