Collaborative Research:EP... - Zhong-Ping... - 美国国家科学基金(NSF...

Collaborative Research:EPCN:Distributed Optimization-based Control of Large-Scale Nonlinear Systems with Uncertainties and Application to Robotic Networks

项目来源

美国国家科学基金(NSF)

项目主持人

Zhong-Ping Jiang

项目受资助机构

NEW YORK UNIVERSITY

项目编号

2210320

立项年度

2022

立项时间

未公开

项目级别

国家级

研究期限

未知 / 未知

受资助金额

300000.00美元

学科

未公开

学科代码

未公开

基金类别

Standard Grant

关键词

EPCN-Energy-Power-Ctrl-Netwrks ; Control systems&applications ; CONTROL SYSTEMS ; ROBOTICS

参与者AI

郭雅婷;吴思;徐茜;刘腾飞;丁进良

参与机构AI

东北大学

项目标书摘要：Control and optimization need to be conducted simultaneously in numerous applications:smart grids,transportation networks,cooperative robotics,healthcare,and other autonomous systems interacting via wireless or physically-linked communications.These two tasks are typically treated distinctly,approached by independent designs.As a result,the two tasks interfere with one another and require performance compromises in at least of the two.For instance,optimality is obtained,but slowly,or convergence is rapid,but to suboptimal motions.A deep integration of control and optimization holds great promise.The integration is made difficult by the surge in complexity of contemporary control systems,reflected in the dynamic order,model uncertainty,and unreliable networking.The key challenge for concurrently running the mutually interfering optimization and control is the stability of the overall system or,if stability is ensured,the convergence rate.The control-optimization interference has been the hallmark of both classical adaptive control(controller-estimator interference)and extremum seeking(optimizer-controller interference),which are special cases of concurrent control and optimization.This project will advance the mathematical foundations of distributed optimization-based control and develop new tools and methods for real-time distributed optimization-based control design of large-scale and nonlinear uncertain systems.The methodology will be validated by means of cooperative robotic networks.The tools developed in this project,for real-time distributed optimization-based control algorithms for large-scale nonlinear systems with uncertainties,are of transformative nature.The algorithms designed will be applicable to heretofore intractable large-scale systems,including uncertain networked nonlinear systems and robotic networks described by Euler-Lagrange equations.To de-conflict the entanglement of optimization and control,the PIs pursue three research tasks:(1)the synthesis of distributed optimization algorithms that are robust to uncertainties,(2)the design of tracking controllers for each local system to follow in real time the desired output that aims to globally minimize certain global cost,and(3)the integration of optimization and control algorithms for global convergence of optimization algorithms and stability of the closed-loop network.The project builds on the PIs’foundational contributions in nonlinear small-gain theory,fortified uncertainty-attenuating controllers and estimators for modular adaptive control design,and on their complementary skillsets in learning-based control and in real-time optimization by extremum seeking.The deliverable is a controller-optimizer co-design with a greatly enlarged applicability,in terms of the generality of the nonlinear plants and the achieved robustness and adaptivity,as compared to current methods which rely on linearly-bounded interactions among the modules.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

排序方式：时间相关性
显示方式：列表摘要

1.Continuous Safety-Critical Control of Mobile Robots With Set-Valued Feedback in Body-Fixed Coordinate Frame

关键词：
Safety; Vectors; Mobile robots; Standards; Qualifications; Coordinatemeasuring machines; Closed loop systems; Velocity measurement;Trajectory; Systematics; Cluttered environment; Lipschitz continuity;safety-critical control;CONTROL BARRIER FUNCTIONS

Wu, Si;Liu, Tengfei;Zhang, Weidong;Ding, Jinliang;Jiang, Zhong-Ping;Chai, Tianyou
《IEEE TRANSACTIONS ON AUTOMATIC CONTROL》
2026年
71卷
2期
期刊

This article investigates a safety-critical control problem for a mobile robot navigating cluttered environments. Obstacles are represented as a closed set, and a direction-distance function is used to model the obstacle measurement process. Based on real-time obstacle measurements, a safety-critical controller is expected to maintain a safe distance between the robot and obstacles while ensuring that the robot's velocity, considered as the control input, follows a given velocity command signal as closely as possible. This article contributes a regularization method for the obstacle-measurement model in the body-fixed frame and a systematic design of a Lipschitz continuous safety-critical controller based on the regularized obstacle-measurement model and quadratic programming (QP). Specifically, a novel Lipschitz-truncated regularization term is used to replace the conventional quadratic regularization in Moreau-Yosida regularization. It is guaranteed that, for each local minimum distance given by the original direction-distance function and its corresponding direction, there exists a neighborhood around that direction where the value of the regularized direction-distance function does not exceed the local minimum distance. This property is crucial for constructing a feasible set of control input that ensures safety in all directions using only a finite number of constraints. The QP framework is then employed to integrate these safety constraints with nominal velocity commands. A specific positive basis is employed to meet the linear independence constraint qualification, ensuring the Lipschitz continuity of the QP-based safety-critical controller. The effectiveness of the proposed design is demonstrated through numerical simulations and physical experiments.

...

2.Adaptive-Dynamic-Programming-Regulated Extremum Seeking for Distributed Feedback Optimization

关键词：
Multi-agent systems; Cost function; Silicon; Costs; Symmetric matrices;Closed loop systems; Aerodynamics; Regulation; Polynomials; Distributedfeedback devices; Adaptive dynamic programming (ADP); extremum seeking(ES); feedback optimization; multiagent systems;NASH EQUILIBRIUM SEEKING; TIME; STABILITY; SYSTEMS

Liu, Tong;Krstic, Miroslav;Jiang, Zhong-Ping
《IEEE TRANSACTIONS ON AUTOMATIC CONTROL》
2025年
70卷
11期
期刊

This note studies the distributed feedbak optimization for linear multiagent systems without precise knowledge of cost functions and agent dynamics. The goal is to regulate the outputs of the agents toward an unknown minimizer of a sum of local costs. To achieve this, distributed reference signals are combined with an extremum seeking mechanism to search for the minimizer. Meanwhile, each agent steers its output toward the designed reference signal using a learning-based adaptive optimal tracker. The entire process relies only on measurements of local costs and input-state data along the agents' trajectories. Moreover, the overall feedback loop has three time scales: tracking and consensus of the reference signals are the fastest, periodic sinusoidal perturbation is the medium, and optimization of the global cost is the slowest. Through this time-scale separation, the closed-loop system is guaranteed to be practically exponentially stable at an equilibrium of interest, along with the convergence of the output of each agent to a small neighborhood of the desired minimizer. A numerical example of robotic networks demonstrates the efficacy of the proposed method.

...

3.Continuous safety-critical control of Euler–Lagrange systems subject to multiple obstacles and velocity constraints

关键词：
Control theory;Safety engineering;Control laws;Control problems;Euler-Lagrange systems;Feasible set;Lipschitz continuity;Obstacles constraints;Position constraint;Quadratic programming algorithms;Velocity constraints;Velocity tracking

Liu, Zhi;Wu, Si;Liu, Tengfei;Jiang, Zhong-Ping
《Automatica》
2025年
180卷
期
期刊

This paper studies the safety-critical control problem for Euler–Lagrange (EL) systems subject to multiple ball obstacles and velocity constraints. A key strategy is to exploit the underlying cascade structure of EL systems to design a new safety-critical controller featuring an inner–outer-loop structure. In particular, the outer-loop control law is developed based on quadratic programming (QP) to avoid ball obstacles and generate velocity reference signals fulfilling the velocity limitation. Taking full advantage of the energy conservation property, a nonlinear velocity-tracking control law is designed to form the inner loop. One major difficulty is caused by the possible non-Lipschitz continuity of the standard QP algorithm when there are multiple constraints. To solve this problem, we propose a new feasible-set reshaping technique such that the refined QP algorithm with the reshaped feasible set admits a Lipschitz continuity property. Additionally, inspired by small-gain analysis, we construct a max-type Lyapunov-like function to integrate the safety constraints and the velocity-tracking error, and prove the achievement of the safety-critical control objective. The effectiveness of the proposed design is validated through numerical simulations and experiments on a 2-link planar manipulator. © 2025

...

4.Feedback Optimization of Nonlinear Strict-Feedback Systems

关键词：
Feedback optimization; internal model; output regulation; small-gaintheorem;SMALL-GAIN THEOREM; LYAPUNOV FORMULATION; OUTPUT REGULATION;STABILIZATION

Liu, Tong;Liu, Tengfei;Jiang, Zhong-Ping
《JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY》
2025年
38卷
2期
期刊

Feedback optimization aims at regulating the output of a dynamical system to a value that minimizes a cost function. This problem is beyond the reach of the traditional output regulation theory, because the desired value is generally unknown and the reference signal evolves according to a gradient flow using the system's real-time output. This paper complements the output regulation theory with the nonlinear small-gain theory to address this challenge. Specifically, the authors assume that the cost function is strongly convex and the nonlinear dynamical system is in lower triangular form and is subject to parametric uncertainties and a class of external disturbances. An internal model is used to compensate for the effects of the disturbances while the cyclic small-gain theorem is invoked to address the coupling between the reference signal, the compensators, and the physical system. The proposed solution can guarantee the boundedness of the closed-loop signals and regulate the output of the system towards the desired minimizer in a global sense. Two numerical examples illustrate the effectiveness of the proposed method.

...

5.Invariant and Dual-Invariant Subspaces of k-Valued Networks

关键词：
Bearing spaces; finite lattices; finite rings; invariant subspace;k-valued (control) networks;BOOLEAN CONTROL NETWORKS; CONSENSUS NETWORKS; SET STABILITY;OBSERVABILITY; SYSTEMS; MULTISTATIONARITY; STABILIZATION

Cheng, Daizhan;Qi, Hongsheng;Zhang, Xiao;Ji, Zhengping
《JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY》
2025年
38卷
2期
期刊

For k-valued (control) networks, two types of (control) invariant subspaces are proposed, namely, the state-invariant and dual-invariant subspaces, which are subspaces of the state space and dual space, respectively. Algorithms are presented to check whether a dual subspace is dual- (control) invariant, and to construct state feedback controls. The bearing space of k-valued (control) networks is introduced. Using the structure of the bearing space, the universal invariant subspace is presented, which is independent of the dynamics of particular networks. Finally, the relation between the state-invariant subspaces and the dual-invariant subspaces of a network is investigated. A duality property shows that if a dual subspace is invariant, then its perpendicular state subspace is also invariant, and vice versa.

...

6.基于改进混合A*的状态受限非完整移动机器人路径规划

关键词：
混合A*算法;路径规划;非完整约束

郭雅婷;吴思;徐茜;刘腾飞;丁进良
《控制工程》
2025年
卷
期
期刊

针对大型密闭空间内部作业且状态受限的移动机器人，基于混合A*算法、图搜索算法、曲线拟合等思想，提出了一种改进混合A*算法，来保证生成的安全路径处处满足此类状态受限移动机器人的约束。提出的算法可改进混合A*的传统六邻居节点选取策略，考虑移动机器人的固有运动和姿态约束并设计新的邻居节点选择策略，能够有效提升路径的灵活性和连续性，减少后续路径优化负担；采用抛物线曲线代替Reeds-Shepp曲线生成节点间的路径，可以在满足移动机器人非完整约束的同时有效避免移动机器人发生危险位姿，极大的减小了移动机器人的路径跟踪难度，易于设计和实现；最后基于MATLAB仿真环境，在多种工作场景下进行仿真实验验证该方法的有效性。

...

7.Data-Driven Combined Longitudinal and Lateral Control for the Car Following Problem

关键词：
Adaptation models; Vehicle dynamics; Mathematical models;Transportation; Roads; Reinforcement learning; Nonlinear dynamicalsystems; Electronic mail; Dynamic programming; Accuracy; Adaptivedynamic programming (ADP); combined longitudinal and lateral control;connected vehicles;ADAPTIVE CRUISE CONTROL; VEHICLE-FOLLOWING CONTROL; AUTOMATED VEHICLES;STABILITY; SYSTEM

Cui, Leilei;Chakraborty, Sayan;Ozbay, Kaan;Jiang, Zhong-Ping
《IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY》
2025年
卷
期
期刊

This article studies the problem of data-driven combined longitudinal and lateral control of autonomous vehicles (AVs) such that the AV can stay within a safe but minimum distance from its leading vehicle and, at the same time, in the lane. Most of the existing methods for combined longitudinal and lateral control are either model-based or developed by purely data-driven methods such as reinforcement learning. Traditional model-based control approaches are insufficient to address the adaptive optimal control design issue for AVs in dynamically changing environments and are subject to model uncertainty. Moreover, the conventional reinforcement learning approaches require a large volume of data, and cannot guarantee the stability of the vehicle. These limitations are addressed by integrating the advanced control theory with reinforcement learning techniques. To be more specific, by utilizing adaptive dynamic programming (ADP) techniques and using the motion data collected from the vehicles, a policy iteration algorithm is proposed such that the control policy is iteratively optimized in the absence of the precise knowledge of the AV's dynamical model. Furthermore, the stability of the AV is guaranteed with the control policy generated at each iteration of the algorithm. The efficiency of the proposed approach is validated by the integrated simulation of SUMO and CommonRoad.

...

8.Learning-based adaptive optimal control of linear time-delay systems: A value iteration approach

关键词：
Adaptive control systems;Continuous time systems;Integer linear programming;Metal cutting;Optimal control systems;Reinforcement learning;Riccati equations;Adaptive dynamic programming;Adaptive optimal control;Controller design method;Learning-based control;Linear time-delay system;Optimal controller;Value iteration;Value iteration algorithm

Cui, Leilei;Pang, Bo;Krstić, Miroslav;Jiang, Zhong-Ping
《Automatica》
2025年
171卷
期
期刊

This paper proposes a novel learning-based adaptive optimal controller design method for a class of continuous-time linear time-delay systems. A key strategy is to exploit the state-of-the-art reinforcement learning (RL) techniques and adaptive dynamic programming (ADP), and propose a data-driven method to learn the near-optimal controller without the precise knowledge of system dynamics. Specifically, a value iteration (VI) algorithm is proposed to solve the infinite-dimensional Riccati equation for the linear quadratic optimal control problem of time-delay systems using finite samples of input-state trajectory data. It is rigorously proved that the proposed VI algorithm converges to the near-optimal solution. Compared with the previous literature, the nice features of the proposed VI algorithm are that it is directly developed for continuous-time systems without discretization and an initial admissible controller is not required for implementing the algorithm. The efficacy of the proposed methodology is demonstrated by two practical examples of metal cutting and autonomous driving. © 2024 Elsevier Ltd

...

9.Robust Stability and Near-Optimality for Policy Iteration: For Want of Recursive Feasibility, All is Not Lost

关键词：
Costs; Stability analysis; Robust stability; Cost function; Convergence;Closed loop systems; Robustness; Algorithm design and analysis;asymptotic stability; cost function; control system synthesis; dynamicprogramming; Lyapunov methods; nonlinear systems; optimal control;predictive control; robustness;MODEL-PREDICTIVE CONTROL; HORIZON OPTIMAL-CONTROL; TIME

Granzotto, Mathieu;De Silva, Olivier Lindamulage;Postoyan, Romain;Nesic, Dragan;Jiang, Zhong-Ping
《IEEE TRANSACTIONS ON AUTOMATIC CONTROL》
2024年
69卷
12期
期刊

We consider deterministic nonlinear discrete-time systems whose inputs are generated by policy iteration (PI) for undiscounted cost functions. We first assume that PI is recursively feasible, in the sense that the optimization problems solved at each iteration admit a solution. In this case, we provide novel conditions to establish recursive robust stability properties for a general attractor, meaning that the policies generated at each iteration ensure a robust KL -stability property with respect to a general state measure. We then derive novel explicit bounds on the mismatch between the (suboptimal) value function returned by PI at each iteration and the optimal one. However, we show by a counterexample that PI may fail to be recursively feasible, disallowing the mentioned stability and near-optimality guarantees. We therefore also present a modification of PI so that recursive feasibility is guaranteed a priori under mild conditions. This modified algorithm, called PI+, is shown to preserve the recursive robust stability when the attractor is compact. In addition, PI+ enjoys the same near-optimality properties as its PI counterpart under the same assumptions.

...

10.Robust Reinforcement Learning for Risk-Sensitive Linear Quadratic Gaussian Control

关键词：
Policy optimization (PO); risk-sensitive linear quadratic Gaussian(LQG); robust reinforcement learning; Policy optimization (PO);risk-sensitive linear quadratic Gaussian (LQG); robust reinforcementlearning;ZERO-SUM GAMES; STATE STABILITY; INPUT

Cui, Leilei;Basar, Tamer;Jiang, Zhong-Ping
《IEEE TRANSACTIONS ON AUTOMATIC CONTROL》
2024年
69卷
11期
期刊

This article proposes a novel robust reinforcement learning framework for discrete-time linear systems with model mismatch that may arise from the sim-to-real gap. A key strategy is to invoke advanced techniques from control theory. Using the formulation of the classical risk-sensitive linear quadratic Gaussian control, a dual-loop policy optimization algorithm is proposed to generate a robust optimal controller. The dual-loop policy optimization algorithm is shown to be globally and uniformly convergent, and robust against disturbances during the learning process. This robustness property is called small-disturbance input-to-state stability and guarantees that the proposed policy optimization algorithm converges to a small neighborhood of the optimal controller as long as the disturbance at each learning step is relatively small. In addition, when the system dynamics is unknown, a novel model-free off-policy policy optimization algorithm is proposed. Finally, numerical examples are provided to illustrate the proposed algorithm.

...

排序方式：时间相关性
显示方式：列表摘要