Task Scheduling for Single Satellite Observation Based on Meta-Reinforcement Learning

Zhi Li; Zhibin Li; Yongjie Bai

doi:10.2514/1.i011619

What is it about?

A mathematical model for satellite scheduling based on the Markov Decision Process (MDP) was developed, tailored to the characteristics of EOSSP. The problem was solved using meta-reinforcement learning (meta-RL) with the Proximal Policy Optimization (PPO) algorithm. Comparisons with other algorithms indicate that the meta-RL algorithm demonstrates advantages such as fast convergence, strong generalization ability, short execution time, and high overall rewards in large-scale scheduling problems.

This page is a summary of: Task Scheduling for Single Satellite Observation Based on Meta-Reinforcement Learning, Journal of Aerospace Information Systems, July 2025, American Institute of Aeronautics and Astronautics (AIAA),
DOI: 10.2514/1.i011619.
You can read the full text:

Read

Contributors

Be the first to contribute to this page

Task Scheduling for Single Satellite Observation Based on Meta-Reinforcement Learning

What is it about?

Contributors

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management

Task Scheduling for Single Satellite Observation Based on Meta-Reinforcement Learning

What is it about?

Featured Image

Read the Original

Contributors

Share this page:

Discover more

Medical Research

Life Sciences

Physical Sciences

Technology and Engineering

Environmental Research

Arts and Humanities

Social Sciences

Business and Management