Journal: International Journal of Adaptive Control and Signal Processing

Observer-based adaptive optimal output containment control problem of linear heterogeneous Multiagent systems with relative output measurements

Abstract

This paper develops a relative output-feedback-based solution to the containment control of linear heterogeneous multiagent systems. A distributed optimal control protocol is presented for the followers that not only ensures that their outputs fall into the convex hull of the leaders' outputs but also optimizes their transient performance. The proposed optimal solution is composed of a feedback part, depending on the followers' state, and a feed-forward part, depending on the convex hull of the leaders' state. To comply with most real-world applications, the feedback and feed-forward states are assumed to be unavailable and are estimated using two distributed observers. First, a distributed observer is designed to estimate each agent's state using only its relative output measurements and the information that it receives from its neighbors. Second, an adaptive distributed observer is designed, which uses the exchange of information between followers over a communication network to estimate the convex hull of the leaders' state. The proposed observer relaxes the restrictive requirement that all followers have access to complete knowledge of the leaders' dynamics. An off-policy reinforcement learning algorithm on an actor-critic structure is then developed to solve the optimal containment control problem online, using relative output measurements and without requiring the leaders' dynamics. Finally, the theoretical results are verified by numerical simulations.
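To make the containment objective concrete, the following is a minimal sketch under strong simplifying assumptions: scalar single-integrator followers, stationary leaders, full access to neighbor outputs, and a plain consensus-style update. It is not the observer-based optimal protocol of the paper, and the names `simulate_containment`, `follower_adj`, and `leader_adj` are hypothetical. It only illustrates what it means for follower outputs to end up inside the convex hull of the leaders' outputs, which for scalars is the interval between the smallest and largest leader value.

```python
# Illustrative sketch only (assumed toy model, not the paper's method):
# followers obey y_i' = -sum_j a_ij (y_i - y_j) - sum_k g_ik (y_i - r_k),
# where r_k are constant leader outputs.

def simulate_containment(leader_outputs, follower_init, follower_adj,
                         leader_adj, dt=0.01, steps=5000):
    """Forward-Euler integration of the toy containment dynamics."""
    y = list(follower_init)
    n = len(y)
    for _ in range(steps):
        rates = []
        for i in range(n):
            rate = 0.0
            # Disagreement with neighboring followers.
            for j in range(n):
                rate -= follower_adj[i][j] * (y[i] - y[j])
            # Disagreement with the leaders this follower can observe.
            for k, r in enumerate(leader_outputs):
                rate -= leader_adj[i][k] * (y[i] - r)
            rates.append(rate)
        y = [yi + dt * ri for yi, ri in zip(y, rates)]
    return y


# Two leaders and three followers on a path graph:
# leader 0 -- follower 0 -- follower 1 -- follower 2 -- leader 1
leaders = [0.0, 10.0]
follower_init = [-5.0, 20.0, 3.0]          # all start outside or off-target
follower_adj = [[0, 1, 0],
                [1, 0, 1],
                [0, 1, 0]]
leader_adj = [[1, 0],                      # follower 0 sees leader 0
              [0, 0],                      # follower 1 sees no leader
              [0, 1]]                      # follower 2 sees leader 1

final = simulate_containment(leaders, follower_init, follower_adj, leader_adj)

# For scalar outputs the convex hull is the interval [min, max] of the leaders.
in_hull = all(min(leaders) - 1e-6 <= v <= max(leaders) + 1e-6 for v in final)
# final is approximately [2.5, 5.0, 7.5], so in_hull is True
```

In this toy setting every follower is connected, directly or through other followers, to at least one leader, which is what lets the outputs settle at convex combinations of the leader values. The paper's setting is harder: the followers are heterogeneous, only relative outputs are measured, and the leaders' dynamics are unknown to the followers.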
