A Dueling Deep Recurrent <span class="nowrap"><svg xmlns:xlink="http://www.w3.org/1999/xlink" xmlns="http://www.w3.org/2000/svg" style="vertical-align:-2.838901pt" id="M1" height="14.2383pt" version="1.1" viewBox="-0.0657574 -11.3994 12.5675 14.2383" width="12.5675pt"><g transform="matrix(.017,0,0,-0.017,0,0)"><path id="g113-82" d="M699 368C699 549 574 666 407 666C186 666 23 488 23 277C23 113 129 -3 288 -13L307 -26C431 -111 501 -139 533 -147C559 -154 613 -163 658 -164L666 -141C597 -111 507 -66 430 -11L416 -1C580 42 699 190 699 368ZM601 371C601 227 518 54 381 22L354 40L278 24C175 47 120 145 120 269C120 451 235 631 398 631C540 631 601 521 601 371Z"/></g></svg>-</span>Network Framework for Dynamic Multichannel Access in Heterogeneous Wireless Networks

Haitao Chen,Haitao Zhao,Li Zhou,Jiao Zhang,Yan Liu,Xiaoqian Pan,Xingguang Liu,Jibo Wei

A Dueling Deep Recurrent <span class="nowrap"><svg xmlns:xlink="http://www.w3.org/1999/xlink" xmlns="http://www.w3.org/2000/svg" style="vertical-align:-2.838901pt" id="M1" height="14.2383pt" version="1.1" viewBox="-0.0657574 -11.3994 12.5675 14.2383" width="12.5675pt"><g transform="matrix(.017,0,0,-0.017,0,0)"><path id="g113-82" d="M699 368C699 549 574 666 407 666C186 666 23 488 23 277C23 113 129 -3 288 -13L307 -26C431 -111 501 -139 533 -147C559 -154 613 -163 658 -164L666 -141C597 -111 507 -66 430 -11L416 -1C580 42 699 190 699 368ZM601 371C601 227 518 54 381 22L354 40L278 24C175 47 120 145 120 269C120 451 235 631 398 631C540 631 601 521 601 371Z"/></g></svg>-</span>Network Framework for Dynamic Multichannel Access in Heterogeneous Wireless Networks

2022

This paper investigates a deep reinforcement learning algorithm based on dueling deep recurrent -network (Dueling DRQN) for dynamic multichannel access in heterogeneous wireless networks. Specifically, we consider the scenario that multiple heterogeneous users with different MAC protocols share multiple independent channels. The goal of the intelligent node is to learn a channel access strategy that achieves high throughput by making full use of the underutilized channels. Two key challenges for the intelligent node are (i) there is no prior knowledge of spectrum environment or the other nodes’ behaviors; (ii) the spectrum environment is partially observable, and the spectrum states have complex temporal dynamics. In order to overcome the aforementioned challenges, we first embed the long short-term memory layer (LSTM) into the deep -network (DQN) to aggregate historical observations and capture the underlying temporal feature in the heterogeneous networks. And second, we employ the dueling architecture to overcome the observability problem of dynamic environment in neural networks. Simulation results show that our approach can learn the optimal access policy in various heterogeneous networks and outperforms the state-of-the-art policies.

Correction
Source
Cite
Save
Machine Reading By IdeaReader

References

Citations