面向人机合作协同的机器人运动技能获取和执行研究

项目来源

国家自然科学基金(NSFC)

项目主持人

傅剑

项目受资助机构

武汉理工大学

项目编号

61773299

立项年度

2017

立项时间

未公开

研究期限

未知 / 未知

项目级别

国家级

受资助金额

63.00万元

学科

信息科学-自动化-自动化检测技术与装置

学科代码

F-F03-F0306

基金类别

面上项目

关键词

示范学习 ; 人机协作 ; 强化学习 ; 轨迹规划 ; 运动技能获取 ; motor skill acquisition ; human-robot collaboration ; trajectory planning ; Learning from demonstration ; reinforcement learning

参与者

向馗;庞牧野;罗璠;陈向成;魏达;曹策;刘冰;杜宇澄

参与机构

武汉理工大学;武汉大学;安徽大学

项目标书摘要:源于巨大的市场需求和前景,人机协作近年来成为机器人领域的前沿和热点。机器人如何学习到人类完成任务的个人偏好并自主适应,是当前亟待解决的问题,也是本项目研究的主要内容。本项目创造性地从机器人运动技能获取和执行视角出发开展研究。.1设计关节/操作空间上冗余的并发行为基元投射人机协作数据,实现在对偶空间上数据面向任务类别的分类聚集和分布,构建融合意图识别、个人偏好学习和适应的一体模型。2基于强化学习框架,结合表征学习的特征抽取、协作者反馈的引导学习来实现行为基元面向指标集的自主、快速适应和优化。3研究设计融合语义信息的操作算子对行为基元进行时空维度的复合,探究结合行为—运动模板库和长短期记忆网络对人体序列运动进行分割和识别,进而实现结合指标集的离散符号序列与连续运动规划的互相转化。.本研究将会揭示人体协作运动技能获得和提高的潜在模式和规律,为人机合作协同的研究提供新的研究方法和思路。

Application Abstract: Due to the huge market demand and charming prospect,the human-robot collaboration has become the cutting edge and hot spot in the field of robotics in recent years.It is an urgent problem for collaborative robot to learn and adapt to people’s preference about the task,which is addressed in this proposal.The principle investigator(PI)adopt a novel perspective,robot’s motor skill acquisition and execution,to conduct the research in this proposal...1)PI will design concurrent behavior primitives with redundancy in the joint and operation space,by which the collaborative data is projected onto the kernel of the behavior primitives.So the data(weight coefficient)in the dual space cluster with respect to the different task and cover a relative domain.In this way,an integrated interaction model for people’s intention identification,preference learning and adaption is constructed...2)Autonomous adaption and optimization of behavior primitive to meet the target is realized via feature extraction by representation learning and reward shaping by people’s feedback in the framework of reinforcement learning...3)Composition of the behavior primitives within the respective spatial and temporal dimensions is studied by introducing the operators with semantic information.Also,segmentation and identification of people’s sequence motions by behavior-motor library and LSTM are investigated.Moreover,interconnection between discrete symbol sequence with indicators and continuous motion planning is realized based on the previous achievement...In this study,we will reveal the potential patterns and rules of the acquisition and improvement of human collaborative motor skills,and provide new research methods and ideas for the research of human-robot collaboration.

项目受资助省

湖北省

项目结题报告(全文)

源于巨大的市场需求和前景,人机协作近年来成为机器人领域的前沿和热点。机器人如何学习到人类个体完成任务的个人偏好并自主适应,是亟待解决的问题也是本项目研究的主要内容。本项目创造性地从机器人运动技能获取和执行视角出发开展研究。具体而言:1)针对如何基于示范任务学习让机器人自主获得完成新任务能力的问题,我们提出双空间交替学习的思路和途径。它将当前机器人运动技能获取的 LfDRL 三阶段统一考虑,提出iLWR-PI2-AL算法实现了策略表达、模仿学习、策略提升的滚动优化。2)针对如何构建将时空耦合信息转化为可调制的运动模型并满足预设的条件约束的运动基元。同时在人机交互和协作中,以适配人类的行为意图变化而做出在线自适应调整的问题,我们提出面向多任务人机交互的MTiProMP模型,并结合解构和迭代策略实现了面向行为意图的多任务人机自适应交互、切换和协同。3)在面向不同任务运动技能的获取中,如何能能自主地掌握到完成任务的该技巧非常关键,它体现为各关节之间面向特定任务的隐含模式。我们提出双环结构启发式搜索的强化学习框架和 PI2-CMA-KCCA 算法用来加速面向新任务的运动技能获取。发现和预测关节间运动基元间和运动基元线内相关模式,实现了行为基元高效策略搜索。传统机器人操作和规划研究都是面对具体问题分别采用不同的模型和假设(彼此异构),这与人体本身基于同构模式来实现不同的运动技能有很大的不同。结合神经系统学、运动学和认识学的研究成果,本研究提出一种通过赋予机器人协作运动技能来实现人机合作协同的新思路和途径。通过构建协作行为基元,并结合模仿学习和强化学习实现运动技能传递(策略表达、模仿学习和策略提升)和人机交互协同(时间索引协作框架、状态索引协作框架),在机器人运动技能获取研究上做出有益的探索。该研究一定程度上揭示了人体协作运动技能获得和提高的潜在模式和规律,为人机合作协同的研究提供了新的方法和思路。

  • 排序方式:
  • 1
  • /
  • 1.Mixed Orientation ProMPs and Their Application in Attitude Trajectory Planning

    • 关键词:
    • Encoding (symbols);Robot programming;Trajectories;Academic research;GMM algorithm;Mixed orientations;Motion primitives;Movement primitives;Orientation probabilistic movement primitive;Probabilistics;Riemannian manifold;Robot motion;Trajectory Planning
    • Fu, Jian;Yang, Zhu;Li, Xiaolong
    • 《2nd International Conference on Cognitive Computation and Systems, ICCCS 2023》
    • 2024年
    • October 14, 2023 - October 15, 2023
    • Urumqi, China
    • 会议

    The application of motion primitives to encode robot motion has garnered considerable attention in the field of academic research. Existing models predominantly focus on reproducing task trajectory in relation to position, often neglecting the significance of orientation. Orientation Probabilistic Movement Primitives (ProMPs) indirectly encode motion primitives for attitude by utilizing their trajectory probabilities on Riemannian manifolds, specifically the 3-sphere S3. However, assuming a Gaussian distribution imposes constraints on its abilities. We propose Mixed Orientation ProMPs to enhance trajectory planning and minimize the occurrence of singular configurations. This model consists of multiple separate Gaussian distributions in the tangent space, enabling the approximation of any distribution. Furthermore, optimization objective functions of the Lagrangian type can incorporate constraints, such as singularity avoidance, and others. Finally, the effectiveness and reliability of the algorithm were validated through trajectory planning experiments conducted on the UR5 robotic arm. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

    ...
  • 2.MixStyle-Based Dual-Channel Feature Fusion for Person Re-Identification

    • 关键词:
    • Complex networks;Deep neural networks;Extraction;Image enhancement;Feature-based;Features extraction;Features fusions;Generalisation;Generalization performance;Global feature;Loss function improvement;Loss functions;Mixstyle;Person re identifications
    • Fu, Jian;Li, Xiaolong;Yang, Zhu
    • 《2nd International Conference on Cognitive Computation and Systems, ICCCS 2023》
    • 2024年
    • October 14, 2023 - October 15, 2023
    • Urumqi, China
    • 会议

    The problem of Person Re-Identification is still a big challenge, as the complex network structure and unsatisfactory generalization performance of widely used deep neural networks make them unsuitable for application to real-world problems. In this paper, we propose a global feature-based person re-identification network with strong generalization. The extracted features part contains two channels of feature fusion: the feature extraction module and the feature generalization module. The feature generalization module is a new MixStyle module added to the feature extraction module, which can effectively mix the style information of images under different domains or even the same domain to form multiple potential domain features, thus improving the generalization performance of the model. In addition, this paper also makes some improvements to the loss function by adding a new constraint on the positive sample pair distance, which makes it possible to maximizes the reduction of intra-class distance in addition to pushing the distance between different classes during the training process. Experimental results on two datasets, Market1501 and DukeMTMC, demonstrate that the method proposed in this paper exhibits strong generalization performance for the person re-identification problem and outperforms current global feature-based person re-identification methods. © The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2024.

    ...
  • 3.Study of DNN Network Architecture Search for Robot Vision

    • 关键词:
    • Deep neural networks;Robot vision;'current;Network pruning;Neural network architecture;Neural network architecture search;Neural network model;Neural network pre-train;Neural network pruning;Neural-networks;Robot manipulation
    • Fu, Jian;Wang, Qifeng
    • 《8th IEEE International Conference on Advanced Robotics and Mechatronics, ICARM 2023》
    • 2023年
    • July 8, 2023 - July 10, 2023
    • Sanya, China
    • 会议

    Robot vision, which integrates measurement and perception, plays an important role in robot manipulation applications. Although the current deep learning-based visual neural network models have high perception capabilities, their network structures are usually too large to be implemented on embedded devices for robot vision. In this paper, we propose neural network architecture search (NAS) that combines pre-training and pruning operations to simplify deep neural network architectures. It not only solves this problem without losing network accuracy, but also significantly alleviates the difficulties of long network computation time and redundant search space in traditional NAS methods. Finally, the experimental results show that the neural network generated by the proposed algorithm outperforms the artificially designed neural network, which demonstrates the effectiveness of the method. At the end of the paper, the rationality of the method is proved by experiments and comparisons. The performance of the new algorithm and the generated neural network is better than that of the artificially designed neural network. © 2023 IEEE.

    ...
  • 4.Guided Policy Search Methods: A Review

    • 关键词:
    • Deep learning;Reinforcement learning;Manipulators;Complex control;Development process;Development trends;Future improvements;Policy search;Theoretical research;Training data;Trajectory optimization
    • Du, Jinyu;Fu, Jian;Li, Cong
    • 《2020 5th International Seminar on Computer Technology, Mechanical and Electrical Engineering, ISCME 2020》
    • 2021年
    • October 30, 2020 - November 1, 2020
    • Shenyang, Virtual, China
    • 会议

    Guided policy search methods (GPSs) have become important methods in the field of reinforcement learning in recent years. GPSs are a kind of policy search methods that utilize trajectory optimization methods to generate training data, guiding supervised learning. In theoretical research, GPSs combine convex optimization and deep learning, and have achieved fruitful results. In practical applications, they have achieved good results in complex control fields such as robots learning, especially manipulator operations. This paper mainly elaborates the development process and improvement route of GPSs. Firstly, the theoretical knowledge related to the GPSs is introduced. Secondly, the framework and basic methods of GPSs are analyzed; Thirdly, various improved GPSs based on the basic methods are generalized. Finally, the development and future improvement directions of GPSs are summarized, and the problems and future development trends are discussed.
    © Published under licence by IOP Publishing Ltd.

    ...
  • 5.Guided Policy Search Methods: A Review

    • 关键词:
    • Deep learning;Reinforcement learning;Manipulators;Complex control;Development process;Development trends;Future improvements;Policy search;Theoretical research;Training data;Trajectory optimization
    • Du, Jinyu;Fu, Jian;Li, Cong
    • 《2020 5th International Seminar on Computer Technology, Mechanical and Electrical Engineering, ISCME 2020》
    • 2021年
    • October 30, 2020 - November 1, 2020
    • Shenyang, Virtual, China
    • 会议

    Guided policy search methods (GPSs) have become important methods in the field of reinforcement learning in recent years. GPSs are a kind of policy search methods that utilize trajectory optimization methods to generate training data, guiding supervised learning. In theoretical research, GPSs combine convex optimization and deep learning, and have achieved fruitful results. In practical applications, they have achieved good results in complex control fields such as robots learning, especially manipulator operations. This paper mainly elaborates the development process and improvement route of GPSs. Firstly, the theoretical knowledge related to the GPSs is introduced. Secondly, the framework and basic methods of GPSs are analyzed; Thirdly, various improved GPSs based on the basic methods are generalized. Finally, the development and future improvement directions of GPSs are summarized, and the problems and future development trends are discussed.
    © Published under licence by IOP Publishing Ltd.

    ...
  • 排序方式:
  • 1
  • /