Cook et al., 2024 - Google Patents
Autonomous intelligent reinforcement inferred symbolism
Cook et al., 2024
- Document ID
- 16775526925273107253
- Author
- Cook B
- Hammer P
- Publication year
- 2024
- Publication venue
- International Conference on Artificial General Intelligence
Snippet
This paper introduces AIRIS (Autonomous Intelligent Reinforcement Inferred Symbolism) to enable causality-based artificial intelligent agents. The system builds sets of causal rules from observations of changes in its environment which are typically caused by the actions of …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G06N5/043—Distributed expert systems, blackboards
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/04—Architectures, e.g. interconnection topology
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
- G06N5/046—Forward inferencing, production systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/004—Artificial life, i.e. computers simulating life
- G06N3/006—Artificial life, i.e. computers simulating life based on simulated virtual individual or collective life forms, e.g. single "avatar", social simulations, virtual worlds
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/0265—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric the criterion being a learning criterion
Similar Documents
| Publication | Title | Publication Date |
|---|---|---|
| Ibarz et al. | How to train your robot with deep reinforcement learning: lessons we have learned | |
| Goyal et al. | Retrieval-augmented reinforcement learning | |
| Soares et al. | Agent foundations for aligning machine intelligence with human interests: a technical research agenda | |
| Beetz et al. | CRAM—A Cognitive Robot Abstract Machine for everyday manipulation in human environments | |
| Yoneda et al. | Statler: State-maintaining language models for embodied reasoning | |
| Liu | Autonomous agents and multi-agent systems: explorations in learning, self-organization and adaptive computation | |
| Hammoudeh | A concise introduction to reinforcement learning | |
| Sloss et al. | 2019 evolutionary algorithms review | |
| Kelly et al. | Emergent tangled program graphs in partially observable recursive forecasting and ViZDoom navigation tasks | |
| Wang et al. | Multi-agent imitation learning with copulas | |
| König et al. | Decentralized evolution of robotic behavior using finite state machines | |
| Reitter et al. | Accountable modeling in ACT-UP, a scalable, rapid-prototyping ACT-R implementation | |
| Gabor et al. | Preparing for the unexpected: Diversity improves planning resilience in evolutionary algorithms | |
| Kartasev | Integrating reinforcement learning into behavior trees by hierarchical composition | |
| Sun et al. | Retrieval-augmented hierarchical in-context reinforcement learning and hindsight modular reflections for task planning with LLMs | |
| Cook et al. | Autonomous intelligent reinforcement inferred symbolism | |
| Kelly et al. | Temporal memory sharing in visual reinforcement learning | |
| Xu et al. | Artificial open world for evaluating AGI: a conceptual design | |
| Sartor et al. | Intrinsically motivated high-level planning for agent exploration | |
| Musumeci et al. | Adaptive team behavior planning using human coach commands | |
| Eberbach | The $-calculus process algebra for problem solving: A paradigmatic shift in handling hard computational problems | |
| Latzke et al. | Imitative reinforcement learning for soccer playing robots | |
| da Mota | A team strategy programming language applied to robotic soccer | |
| Jia et al. | A Brain-Inspired Harmonized Learning With Concurrent Arbitration for Enhancing Motion Planning in Fuzzy Environments | |
| Miikkulainen | Evolutionary supervised machine learning |