Federici et al., 2022 - Google Patents
Deep Reinforcement Learning for Robust Spacecraft Guidance and ControlFederici et al., 2022
View PDF- Document ID
- 11553781719698803250
- Author
- Federici L
- Zavoli A
- De Matteis G
- Publication year
External Links
Snippet
This Ph. D. thesis aims to investigate new guidance and control algorithms based on deep neural networks and reinforcement learning, with application in nextgeneration space missions, which are expected to require greater levels of autonomy and robustness. Unlike …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
- G06N3/082—Learning methods modifying the architecture, e.g. adding or deleting nodes or connections, pruning
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/06—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons
- G06N3/063—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means
- G06N3/0635—Physical realisation, i.e. hardware implementation of neural networks, neurons or parts of neurons using electronic means using analogue means
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
- G05B13/027—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion using neural networks only
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B17/00—Systems involving the use of models or simulators of said systems
- G05B17/02—Systems involving the use of models or simulators of said systems electric
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Moerland et al. | A0c: Alpha zero in continuous action space | |
| CN112906882B (en) | Reverse reinforcement learning with model predictive control | |
| US6882992B1 (en) | Neural networks for intelligent control | |
| CN101566829B (en) | Method for computer-aided open loop and/or closed loop control of a technical system | |
| Piccinin et al. | Deep Reinforcement Learning-based policy for autonomous imaging planning of small celestial bodies mapping | |
| Williams et al. | Trajectory planning with deep reinforcement learning in high-level action spaces | |
| Zhang et al. | Model‐Free Attitude Control of Spacecraft Based on PID‐Guide TD3 Algorithm | |
| Mustafa | Towards continuous control for mobile robot navigation: A reinforcement learning and slam based approach | |
| Martinsen | End-to-end training for path following and control of marine vehicles | |
| CN119861572B (en) | A method for interplanetary orbit transfer based on hidden state and reinforcement learning | |
| Federici et al. | Deep Reinforcement Learning for Robust Spacecraft Guidance and Control | |
| Federici et al. | Meta-reinforcement learning with transformer networks for space guidance applications | |
| Cheng et al. | Human motion prediction using adaptable neural networks | |
| Yen et al. | Coordination of exploration and exploitation in a dynamic environment | |
| Gavra et al. | Evolutionary reinforcement learning: Hybrid approach for safety-informed fault-tolerant flight control | |
| Verma et al. | ANN Based ANFIS controller Design Using Hybrid Meta-Heuristic Tuning Approach for Cart Inverted Pendulum System | |
| Federici et al. | Improving reinforcement learning performance in spacecraft guidance and control through meta-learning: a comparison on planetary landing | |
| Marchetti et al. | A hybrid neural network-genetic programming intelligent control approach | |
| Amin et al. | System identification via artificial neural networks-applications to on-line aircraft parameter estimation | |
| Zhang et al. | Reinforcement Learning from Demonstrations by Novel Interactive Expert and Application to Automatic Berthing Control Systems for Unmanned Surface Vessel | |
| Rath et al. | On‐line extreme learning algorithm based identification and non‐linear model predictive controller for way‐point tracking application of an autonomous underwater vehicle | |
| Guzman et al. | Adaptive model predictive control by learning classifiers | |
| Coulson | Data-enabled predictive control: Theory and practice | |
| Pozzi et al. | Imitation learning-driven approximation of stochastic control models | |
| Brandonisio | Ai-based guidance for spacecraft proximity operations around uncooperative targets |