Collaborative Research:EP... - Zhong-Ping... - 美国国家科学基金(NSF...

Collaborative Research:EPCN:Distributed Optimization-based Control of Large-Scale Nonlinear Systems with Uncertainties and Application to Robotic Networks

项目来源

美国国家科学基金(NSF)

项目主持人

Zhong-Ping Jiang

项目受资助机构

NEW YORK UNIVERSITY

立项年度

2022

立项时间

未公开

项目编号

2210320

项目级别

国家级

研究期限

未知 / 未知

受资助金额

300000.00美元

学科

未公开

学科代码

未公开

基金类别

Standard Grant

关键词

EPCN-Energy-Power-Ctrl-Netwrks ; Control systems&applications ; CONTROL SYSTEMS ; ROBOTICS

参与者AI

郭雅婷;吴思;徐茜;刘腾飞;丁进良

参与机构AI

东北大学

项目标书摘要：Control and optimization need to be conducted simultaneously in numerous applications:smart grids,transportation networks,cooperative robotics,healthcare,and other autonomous systems interacting via wireless or physically-linked communications.These two tasks are typically treated distinctly,approached by independent designs.As a result,the two tasks interfere with one another and require performance compromises in at least of the two.For instance,optimality is obtained,but slowly,or convergence is rapid,but to suboptimal motions.A deep integration of control and optimization holds great promise.The integration is made difficult by the surge in complexity of contemporary control systems,reflected in the dynamic order,model uncertainty,and unreliable networking.The key challenge for concurrently running the mutually interfering optimization and control is the stability of the overall system or,if stability is ensured,the convergence rate.The control-optimization interference has been the hallmark of both classical adaptive control(controller-estimator interference)and extremum seeking(optimizer-controller interference),which are special cases of concurrent control and optimization.This project will advance the mathematical foundations of distributed optimization-based control and develop new tools and methods for real-time distributed optimization-based control design of large-scale and nonlinear uncertain systems.The methodology will be validated by means of cooperative robotic networks.The tools developed in this project,for real-time distributed optimization-based control algorithms for large-scale nonlinear systems with uncertainties,are of transformative nature.The algorithms designed will be applicable to heretofore intractable large-scale systems,including uncertain networked nonlinear systems and robotic networks described by Euler-Lagrange equations.To de-conflict the entanglement of optimization and control,the PIs pursue three research tasks:(1)the synthesis of distributed optimization algorithms that are robust to uncertainties,(2)the design of tracking controllers for each local system to follow in real time the desired output that aims to globally minimize certain global cost,and(3)the integration of optimization and control algorithms for global convergence of optimization algorithms and stability of the closed-loop network.The project builds on the PIs’foundational contributions in nonlinear small-gain theory,fortified uncertainty-attenuating controllers and estimators for modular adaptive control design,and on their complementary skillsets in learning-based control and in real-time optimization by extremum seeking.The deliverable is a controller-optimizer co-design with a greatly enlarged applicability,in terms of the generality of the nonlinear plants and the achieved robustness and adaptivity,as compared to current methods which rely on linearly-bounded interactions among the modules.This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the Foundation's intellectual merit and broader impacts review criteria.

排序方式：时间相关性
显示方式：列表摘要

1.Resilient Learning-Based Control for Partially Observable Systems under DoS Attacks

关键词：
Artificial intelligence;Digital communication systems;Feedback control;Iterative methods;Learning algorithms;Learning systems;Optimal control systems;Robust control;Denialof- service attacks;Discrete;time systems;Input-output;Learning-based control;Optimal controller;Output data;Output-feedback;Partially observable systems;Resiliency;Resilient control

Chakraborty, Sayan;Jiang, Zhong-Ping
《15th IFAC Workshop on Adaptive and Learning Control Systems, ALCOS 2025》
2025年
July 2, 2025 - July 4, 2025
Mexico City, Mexico
会议

This paper addresses the challenge of designing resilient control systems under Denial of Service (DoS) attacks for discrete-Time systems. A learning-based framework is proposed to reconstruct lost measurements and compute optimal controllers using input-output data, eliminating the need for a complete system model. By leveraging state reconstruction techniques, the framework estimates missing information during DoS periods, ensuring robust control performance. Two algorithms, based on policy iteration (PI) and value iteration (VI), are developed to learn the optimal feedback control policy. The effectiveness of the proposed methodology is illustrated via a numerical example. © 2025 The Authors.

...

2.Safe Output-Feedback Control for a Class of Linear Systems with Multiple Output Constraints

关键词：
Constraint satisfaction problems;Feedback control;Linear systems;Quadratic programming;Robust control;Constraint Satisfaction;Control framework;Control problems;Disjointness;Feasible set;Lipschitz continuity;Luenberger observers;Multiple outputs;Output feedback controls;Virtual controller

Wu, Si;Liu, Tengfei;Ding, Jinliang;Li, Yuzhe;Chai, Tianyou;Jiang, Zhong-Ping
《64th IEEE Conference on Decision and Control, CDC 2025》
2025年
December 9, 2025 - December 12, 2025
Rio de Janeiro, Brazil
会议

This paper considers the constraint-satisfaction control problem for a class of linear systems subject to multiple output constraints. A novel output-feedback control framework is developed, with a refined quadratic programming (QP) based virtual controller and a Luenberger observer. The main contribution lies in the refined QP-based virtual controller, which employs a positive basis to construct its feasible set, ensuring (local) Lipschitz continuity while preserving robustness. Under the assumptions that the plant is controllable and observable and that the constraints satisfy a disjointness condition, the proposed controller can guarantee the satisfaction of all output constraints. The effectiveness of the proposed controller is demonstrated through numerical simulations on an identified quadrotor model. © 2025 IEEE.

...

3.Safety-Critical Control of a Class of Underactuated Systems: A Case Study on Wheeled Inverted Pendulum

关键词：
Cascade connections;Inverted pendulum;Safety engineering;Case-studies;Feedforward systems;Reference signals;Safety constraint;Safety-critical control;State-variables;Under-actuated systems;Virtual controller;Wheel track;Wheeled inverted pendulum

Wu, Si;Wang, Shuai;Liu, Tengfei;Jiang, Zhong-Ping
《2025 European Control Conference, ECC 2025》
2025年
June 24, 2025 - June 27, 2025
Thessaloniki, Greece
会议

This paper studies the safety-critical control of a class of underactuated systems, with a focus on a wheeled inverted pendulum as a case study. A safety-critical controller is desired to simultaneously keep the pendulum upright, restrict the position of the wheel within a desired range, and ensure that the rotational angular velocity of the wheel tracks a nominal reference signal as closely as possible. We first reformulate the safety constraints as saturated constraints on the state variables of a cascade connection of an integrator and a feedforward system, thereby mitigating conflicts among the safety constraints. Then, to ensure the satisfaction of the constraints, we propose a safety-critical controller that consists of a safety-critical virtual controller based on control barrier functions (CBFs) and a tracking controller designed using forwarding methods. Nonlinear small-gain techniques are employed to ensure the stable integration of the two controllers and the achievement of the safety-critical control objectives. © 2025 EUCA.

...

4.Finite-time Consensus for Third-Order Multi-Agent Systems with Directed Communication Networks

关键词：
Multi agent systems;Communications networks;Consensus problems;Edge nodes;Edge-based;Finite-time;Finite-time consensus;High order systems;Higher-order systems;Spanning tree;Third order

Zhang, Xirui;Zhao, Zhi-Liang;Liu, Tengfei;Jiang, Zhong-Ping
《3rd Conference on Fully Actuated System Theory and Applications, FASTA 2024》
2024年
May 10, 2024 - May 12, 2024
Shenzhen, China
会议

This paper focuses on third-order multi-agent systems with directed communication networks. It is for the first time that the finite-time consensus problem is solved for third-order multi-agent systems under digraphs containing spanning trees. Two explicit-coefficient finite-time protocols, namely, edge-based and node-based, are proposed. First, we prove that asymptotic consensus is achievable with the reduced linear protocol. Then, we show that finite-time consensus is achievable with our explicit-coefficient finite-time protocols through homogeneous theory. Finally, numerical simulations are carried out to demonstrate the effectiveness of the proposed protocols. © 2024 IEEE.

...

5.Distributed Feedback Optimization of Networked Nonlinear Systems Using Relative Output Measurements

关键词：
Feedback;Perturbation techniques;Controller designs;Controller state;Distributed controller;Distributed feedback;Distributed optimization;Feedback optimization;Optimization problems;Real- time;Singular perturbations methods;Time gradients

Qin, Zhengyan;Liu, Tao;Liu, Tengfei;Jiang, Zhong-Ping
《2024 European Control Conference, ECC 2024》
2024年
June 25, 2024 - June 28, 2024
Stockholm, Sweden
会议

This paper investigates the distributed feedback optimization problem of nonlinear multi-agent systems. In such systems, each agent can measure the relative outputs between itself and its neighbors but lacks access to their absolute states and internal controller states. By combining distributed optimization and singular perturbation methods, a novel distributed controller design is presented, that relies solely on each agent's real-time gradient values of its local objective function and its relative output measurements to neighboring agents. The boundedness of the closed-loop signals and the convergence of the agent outputs to the minimizer of the total cost are proved rigorously. A numerical example is conducted to validate the effectiveness of the proposed approach. © 2024 EUCA.

...

6.Resilient Learning-Based Control Under Denial-of-Service Attacks

关键词：
Closed loop systems;Deep reinforcement learning;Denial-of-service attack;Discrete time control systems;Linear systems;Optimal control systems;Reinforcement learning;Closed loop stability;Denial of Service;Denialof- service attacks;Discrete time linear systems;Input state;Learn+;Optimal controller;Policy iteration;Reinforcement learning method;Upper Bound

Chakraborty, Sayan;Gao, Weinan;Vamvoudakis, Kyriakos G.;Jiang, Zhong-Ping
《63rd IEEE Conference on Decision and Control, CDC 2024》
2024年
December 16, 2024 - December 19, 2024
Milan, Italy
会议

In this paper, we have proposed a resilient reinforcement learning method for discrete-time linear systems with unknown parameters, under denial-of-service (DoS) attacks. The proposed method is based on policy iteration that learns the optimal controller from input-state data amidst DoS attacks. We achieve an upper bound for the DoS duration to ensure closed-loop stability. The resilience of the closed-loop system, when subjected to DoS attacks with the learned controller and an internal model, has been thoroughly examined. The effectiveness of the proposed methodology is demonstrated on an inverted pendulum on a cart. © 2024 IEEE.

...

7.Reinforcement-Learning-Based Risk-Sensitive Optimal Feedback Mechanisms of Biological Motor Control

关键词：
;

Cui, Leilei;Pang, Bo;Jiang, Zhong-Ping
《62nd IEEE Conference on Decision and Control, CDC 2023》
2023年
December 13, 2023 - December 15, 2023
Singapore, Singapore
会议

Risk sensitivity is a fundamental aspect of biological motor control that accounts for both the expectation and variability of movement cost in the face of uncertainty. However, most computational models of biological motor control rely on model-based risk-sensitive optimal control, which requires an accurate internal representation in the central neural system to predict the outcomes of motor commands. In reality, the dynamics of human-environment interaction is too complex to be accurately modeled, and noise further complicates system identification. To address this issue, this paper proposes a novel risk-sensitive computational mechanism for biological motor control based on reinforcement learning (RL) and adaptive dynamic programming (ADP). The proposed ADP-based mechanism suggests that humans can directly learn an approximation of the risk-sensitive optimal feedback controller from noisy sensory data without the need for system identification. Numerical validation of the proposed mechanism is conducted on the arm-reaching task under divergent force field. The preliminary computational results align with the experimental observations from the past literature of computational neuroscience. © 2023 IEEE.

...

8.A Reinforcement Learning Look at Risk-Sensitive Linear Quadratic Gaussian Control

关键词：
Digital control systems;Discrete time control systems;Gaussian distribution;Gaussian noise (electronic);Iterative methods;Learning algorithms;Learning systems;Linear systems;Optimization;Real time systems;Dual loops;Input-to-state stability;Linear quadratic Gaussian control;Optimal solutions;Optimization algorithms;Policy optimization;Reinforcement learnings;Robust reinforcement learning

Cui, Leilei;Başar, Tamer;Jiang, Zhong-Ping
《5th Annual Conference on Learning for Dynamics and Control, L4DC 2023》
2023年
June 15, 2023 - June 16, 2023
Philadelphia, PA, United states
会议

In this paper, we propose a robust reinforcement learning method for a class of linear discrete-time systems to handle model mismatches that may be induced by sim-to-real gap. Under the formulation of risk-sensitive linear quadratic Gaussian control, a dual-loop policy optimization algorithm is proposed to iteratively approximate the robust and optimal controller. The convergence and robustness of the dual-loop policy optimization algorithm are rigorously analyzed. It is shown that the dual-loop policy optimization algorithm uniformly converges to the optimal solution. In addition, by invoking the concept of small-disturbance input-to-state stability, it is guaranteed that the dual-loop policy optimization algorithm still converges to a neighborhood of the optimal solution when the algorithm is subject to a sufficiently small disturbance at each step. When the system matrices are unknown, a learning-based off-policy policy optimization algorithm is proposed for the same class of linear systems with additive Gaussian noise. The numerical simulation is implemented to demonstrate the efficacy of the proposed algorithm. © 2023 L. Cui, T. Başar & Z.-P. Jiang.

...

9.Adaptive Optimal Output Regulation of Discrete-Time Linear Systems: A Reinforcement Learning Approach

关键词：
;

Chakraborty, Sayan;Gao, Weinan;Vamvoudakis, Kyriakos G.;Jiang, Zhong-Ping
《62nd IEEE Conference on Decision and Control, CDC 2023》
2023年
December 13, 2023 - December 15, 2023
Singapore, Singapore
会议

In this paper, we solve the optimal output regulation problem for discrete-time systems without precise knowledge of the system model. Drawing inspiration from reinforcement learning and adaptive dynamic programming, a data-driven solution is developed that enables asymptotic tracking and disturbance rejection. Notably, it is discovered that the proposed approach for discrete-time output regulation differs from the continuous-time approach in terms of the persistent excitation condition required for policy iteration to be unique and convergent. To address this issue, a new persistent excitation condition is introduced to ensure both uniqueness and convergence of the data-driven policy iteration. The efficacy of the proposed methodology is validated by an inverted pendulum on a cart example. © 2023 IEEE.

...

排序方式：时间相关性
显示方式：列表摘要