What is it about?

A mathematical model for satellite scheduling based on the Markov Decision Process (MDP) was developed, tailored to the characteristics of EOSSP. The problem was solved using meta-reinforcement learning (meta-RL) with the Proximal Policy Optimization (PPO) algorithm. Comparisons with other algorithms indicate that the meta-RL algorithm demonstrates advantages such as fast convergence, strong generalization ability, short execution time, and high overall rewards in large-scale scheduling problems.

Featured Image

Read the Original

This page is a summary of: Task Scheduling for Single Satellite Observation Based on Meta-Reinforcement Learning, Journal of Aerospace Information Systems, July 2025, American Institute of Aeronautics and Astronautics (AIAA),
DOI: 10.2514/1.i011619.
You can read the full text:

Read

Contributors

Be the first to contribute to this page