1. Learning spectrum opportunities in non-stationary radio environments
- Author
-
Visa Koivunen and Jan Oksanen
- Subjects
Index (economics) ,Computer science ,Frequency band ,Flexible spectrum use ,Distributed computing ,opportunistic spectrum access ,02 engineering and technology ,Interference (wave propagation) ,01 natural sciences ,Multi-armed bandit ,Radio spectrum ,0202 electrical engineering, electronic engineering, information engineering ,Fading ,cognitive radio ,dynamic propagation environment ,multi-armed bandit ,ta112 ,ta213 ,SIMPLE (military communications protocol) ,business.industry ,010401 analytical chemistry ,020206 networking & telecommunications ,0104 chemical sciences ,Cognitive radio ,Telecommunications ,business - Abstract
Learning-based sensing policies for multi-band flexible spectrum use, in particular cognitive radios operating in non-stationary radio environments are proposed. The proposed policies stem from the stochastic non-stationary restless multi-armed bandit formulation of opportunistic spectrum access. The non-stationary radio environment assumed in this paper is an appropriate model for a realistic cognitive radio systems, where the obtainable data rates depend on many unknown time-varying factors. These are e.g. mobility, fading and primary user activity. The developed policies are index policies, where the index of a frequency band depends on the discounted average reward of the band and a recency-based exploration bonus. The exploration bonus encourages sensing frequency bands that have not been explored for a long time. However, there is a maximum number of time instances when any band can remain unexplored. These index policies are computationally simple making them attractive for mobile cognitive radios. In our simulation examples, we demonstrate that the proposed policies can often provide higher cumulative data rate than other existing state-of-the-art policies.
- Published
- 2017
- Full Text
- View/download PDF