...
首页> 外文期刊>Mobile networks & applications >Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Reinforcement Learning
【24h】

Efficient QoS Provisioning for Adaptive Multimedia in Mobile Communication Networks by Reinforcement Learning

机译:通过强化学习为移动通信网络中的自适应多媒体提供有效的QoS

获取原文
获取原文并翻译 | 示例
           

摘要

The scarcity and large fluctuations of link bandwidth in wireless networks have motivated the development of adaptive multimedia services in mobile communication networks, where it is possible to increase or decrease the bandwidth of individual ongoing flows. This paper studies the issues of quality of service (QoS) provisioning in such systems. In particular, call admission control and bandwidth adaptation are formulated as a constrained Markov decision problem. The rapid growth in the number of states and the difficulty in estimating state transition probabilities in practical systems make it very difficult to employ classical methods to find the optimal policy. We present a novel approach that uses a form of discounted reward reinforcement learning known as Q-learning to solve QoS provisioning for wireless adaptive multimedia. Q-learning does not require the explicit state transition model to solve the Markov decision problem; therefore more general and realistic assumptions can be applied to the underlying system model for this approach than in previous schemes. Moreover, the proposed scheme can efficiently handle the large state space and action set of the wireless adaptive multimedia QoS provisioning problem. Handoff dropping probability and average allocated bandwidth are considered as QoS constraints in our model and can be guaranteed simultaneously. Simulation results demonstrate the effectiveness of the proposed scheme in adaptive multimedia mobile communication networks.
机译:无线网络中链路带宽的稀缺性和大的波动促使移动通信网络中自适应多媒体服务的发展,在那里可以增加或减少各个正在进行的流的带宽。本文研究了此类系统中服务质量(QoS)设置的问题。特别地,将呼叫准入控制和带宽自适应表述为受约束的马尔可夫决策问题。由于状态数量的快速增长以及在实际系统中估算状态转换概率的难度,使得采用经典方法来找到最优策略非常困难。我们提出一种新颖的方法,该方法使用一种称为Q学习的折扣奖励强化学习形式来解决无线自适应多媒体的QoS设置。 Q学习不需要显式的状态转移模型来解决马尔可夫决策问题。因此,与以前的方案相比,可以将更一般和现实的假设应用于该方法的基础系统模型。而且,所提出的方案可以有效地处理无线自适应多媒体QoS供应问题的大状态空间和动作集。在我们的模型中,切换丢弃概率和平均分配带宽被视为QoS约束,并且可以同时得到保证。仿真结果证明了该方案在自适应多媒体移动通信网络中的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号