Back to Search
Start Over
Learning to Construct a Solution for the Agile Satellite Scheduling Problem With Time-Dependent Transition Times
- Source :
- IEEE Transactions on Systems, Man, and Cybernetics: Systems; October 2024, Vol. 54 Issue: 10 p5949-5963, 15p
- Publication Year :
- 2024
-
Abstract
- The agile earth observation satellite scheduling problem (AEOSSP) with time-dependent transition times is a complex combinational optimization problem that has emerged from the development of large-scale satellite management techniques. To address this problem, we propose a deep reinforcement learning-based construction model (DRL-CM) that consists of five parts: 1) a Markov decision process (MDP); 2) a feature engineering; 3) a constructive heuristic neural network (CHNN); 4) an RL training method; and 5) an evaluation system. Specifically, the CHNN comprises six modules containing three special components that we propose: a dynamic encoder, a dynamic global layer, and a two-stage attention layer. First, we build the MDP of the AEOSSP and the feature engineering with effective features required for decision-making. Second, we design the CHNN to function as the MDP policy and train it with an RL model. Finally, we propose a comprehensive evaluation system for the validation of our model. The experimental results indicate that the proposed DRL-CM outperforms the state-of-the-art algorithm in terms of both optimization speed and quality. In addition, the feature engineering and network architecture built in our model are verified to be effective in comprehensive experiments.
Details
- Language :
- English
- ISSN :
- 21682216 and 21682232
- Volume :
- 54
- Issue :
- 10
- Database :
- Supplemental Index
- Journal :
- IEEE Transactions on Systems, Man, and Cybernetics: Systems
- Publication Type :
- Periodical
- Accession number :
- ejs67440025
- Full Text :
- https://doi.org/10.1109/TSMC.2024.3411640