Department of Electrical, Computer, and Biomedical Engineering Faculty Publications

Distributive dynamic spectrum access through deep reinforcement learning: A reservoir computing-based approach

Hao Hsuan Chang, Virginia Polytechnic Institute and State University
Hao Song, Virginia Polytechnic Institute and State University
Yang Yi, Virginia Polytechnic Institute and State University
Jianzhong Zhang, Samsung Research America
Haibo He, University of Rhode IslandFollow
Lingjia Liu, Virginia Polytechnic Institute and State University

Document Type

Article

Date of Original Version

4-1-2019

Abstract

Dynamic spectrum access (DSA) is regarded as an effective and efficient technology to share radio spectrum among different networks. As a secondary user (SU), a DSA device will face two critical problems: 1) avoiding causing harmful interference to primary users (PUs) and 2) conducting effective interference coordination with other SUs. These two problems become even more challenging for a distributed DSA network where there is no centralized controllers for SUs. In this paper, we investigate communication strategies of a distributive DSA network under the presence of spectrum sensing errors. To be specific, we apply the powerful machine learning tool, deep reinforcement learning (DRL), for SUs to learn 'appropriate' spectrum access strategies in a distributed fashion assuming NO knowledge of the underlying system statistics. Furthermore, a special type of recurrent neural network, called the reservoir computing (RC), is utilized to realize DRL by taking advantage of the underlying temporal correlation of the DSA network. Using the introduced machine learning-based strategy, SUs could make spectrum access decisions distributedly relying only on their own current and past spectrum sensing outcomes. Through extensive experiments, our results suggest that the RC-based spectrum access strategy can help the SU to significantly reduce the chances of collision with PUs and other SUs. We also show that our scheme outperforms the myopic method which assumes the knowledge of system statistics, and converges faster than the Q-learning method when the number of channels is large.

Publication Title, e.g., Journal

IEEE Internet of Things Journal

Volume

Issue

Citation/Publisher Attribution

Chang, Hao Hsuan, Hao Song, Yang Yi, Jianzhong Zhang, Haibo He, and Lingjia Liu. "Distributive dynamic spectrum access through deep reinforcement learning: A reservoir computing-based approach." IEEE Internet of Things Journal 6, 2 (2019): 1938-1948. doi: 10.1109/JIOT.2018.2872441.

Link to Full Text

COinS

DOI

https://doi.org/10.1109/JIOT.2018.2872441

Department of Electrical, Computer, and Biomedical Engineering Faculty Publications

Distributive dynamic spectrum access through deep reinforcement learning: A reservoir computing-based approach

Document Type

Date of Original Version

Abstract

Publication Title, e.g., Journal

Volume

Issue

Citation/Publisher Attribution

DOI

Search

Browse

Author Corner

Department of Electrical, Computer, and Biomedical Engineering Faculty Publications

Distributive dynamic spectrum access through deep reinforcement learning: A reservoir computing-based approach

Authors

Document Type

Date of Original Version

Abstract

Publication Title, e.g., Journal

Volume

Issue

Citation/Publisher Attribution

Share

DOI

Search

Browse

Author Corner