Adaptive Bandwidth Allocation via Uncertainty-Constrained Deep Reinforcement Learning
Computer Science and Information Systems, Tome 23 (2026) no. 1

Voir la notice de l'article provenant de la source Computer Science and Information Systems website

With the rapid growth of network services, traditional static bandwidth allocation schemes can no longer meet the demands of multi-user, dynamic, and QoS-sensitive applications. Ensuring both efficiency and stability in bandwidth allocation remains a significant challenge, especially under high variability and uncer-tainty conditions. To address this, we propose a novel algorithm named Uncertainty-Constrained Stability-aware Deep Reinforcement Learning (UCS-DRL) for dynamic bandwidth allocation. UCS-DRL adopts a dual-policy architecture: a task policy that learns optimal bandwidth allocation decisions, and a stability policy guided by uncertainty-aware value estimation to identify and mitigate potential risky or unstable behaviors during deployment. Furthermore, the framework incorporates a curiosity-driven exploration mechanism based on Random Network Distillation, which enhances exploration efficiency by encouraging the agent to visit informative and under-explored states. Experimental results show that UCS-DRL achieves high bandwidth utilization and service quality while reducing policy volatility and risky actions, balancing performance and robustness in dynamic bandwidth allocation.
Keywords: Dynamic Resource Allocation, Reinforcement Learning, Uncertainty Estimation, Stability-aware Control, Dual-policy Framework
Li Wei; Wu Yong; Yan Dong. Adaptive Bandwidth Allocation via Uncertainty-Constrained Deep Reinforcement Learning. Computer Science and Information Systems, Tome 23 (2026) no. 1. http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a14/
@article{CSIS_2026_23_1_a14,
     author = {Li Wei and Wu Yong and Yan Dong},
     title = {Adaptive {Bandwidth} {Allocation} via {Uncertainty-Constrained} {Deep} {Reinforcement} {Learning}},
     journal = {Computer Science and Information Systems},
     year = {2026},
     volume = {23},
     number = {1},
     url = {http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a14/}
}
TY  - JOUR
AU  - Li Wei
AU  - Wu Yong
AU  - Yan Dong
TI  - Adaptive Bandwidth Allocation via Uncertainty-Constrained Deep Reinforcement Learning
JO  - Computer Science and Information Systems
PY  - 2026
VL  - 23
IS  - 1
UR  - http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a14/
ID  - CSIS_2026_23_1_a14
ER  - 
%0 Journal Article
%A Li Wei
%A Wu Yong
%A Yan Dong
%T Adaptive Bandwidth Allocation via Uncertainty-Constrained Deep Reinforcement Learning
%J Computer Science and Information Systems
%D 2026
%V 23
%N 1
%U http://geodesic.mathdoc.fr/item/CSIS_2026_23_1_a14/
%F CSIS_2026_23_1_a14