Callaway et al., 2021 - Google Patents
Human planning as optimal information seekingCallaway et al., 2021
View PDF- Document ID
- 7501013599901942414
- Author
- Callaway F
- Van Opheusden B
- Gul S
- Das P
- Krueger P
- Lieder F
- Griffiths T
- Publication year
- Publication venue
- Manuscript in preparation
External Links
Snippet
A critical aspect of human intelligence is our ability to plan, that is, to use a model of the world to simulate, evaluate, and select among hypothetical future actions. However, exhaustive planning is intractable because the number of possible action sequences …
- 241000282414 Homo sapiens 0 title abstract description 38
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/04—Inference methods or devices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/02—Computer systems based on biological models using neural network models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N5/00—Computer systems utilising knowledge based models
- G06N5/02—Knowledge representation
- G06N5/022—Knowledge engineering, knowledge acquisition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N7/00—Computer systems based on specific mathematical models
- G06N7/005—Probabilistic networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q40/00—Finance; Insurance; Tax strategies; Processing of corporate or income taxes
- G06Q40/06—Investment, e.g. financial instruments, portfolio management or fund management
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computer systems based on biological models
- G06N3/12—Computer systems based on biological models using genetic models
- G06N3/126—Genetic algorithms, i.e. information processing using digital simulations of the genetic system
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/10—Office automation, e.g. computer aided management of electronic mail or groupware; Time management, e.g. calendars, reminders, meetings or time accounting
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
- G06Q10/063—Operations research or analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F19/00—Digital computing or data processing equipment or methods, specially adapted for specific applications
- G06F19/30—Medical informatics, i.e. computer-based analysis or dissemination of patient or disease data
- G06F19/34—Computer-assisted medical diagnosis or treatment, e.g. computerised prescription or delivery of medication or diets, computerised local control of medical devices, medical expert systems or telemedicine
- G06F19/345—Medical expert systems, neural networks or other automated diagnosis
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Callaway et al. | Rational use of cognitive resources in human planning | |
| Callaway et al. | Human planning as optimal information seeking | |
| US7707131B2 (en) | Thompson strategy based online reinforcement learning system for action selection | |
| Cheng et al. | Preference-based policy iteration: Leveraging preference learning for reinforcement learning | |
| Yang et al. | A survey on reinforcement learning for combinatorial optimization | |
| Metaxiotis et al. | Integrating fuzzy logic into decision suppport systems: current research and future prospects | |
| Chang et al. | Data-driven experimental design and model development using Gaussian process with active learning | |
| Jain et al. | Measuring how people learn how to plan. | |
| Semogan et al. | A rule-based fuzzy diagnostics decision support system for tuberculosis | |
| Xu et al. | Deep reinforcement learning for quantitative trading | |
| Ramezankhani et al. | A transductive learning-based early warning system for housing and stock markets with off-policy optimization | |
| Ren et al. | RiskMiner: Discovering Formulaic Alphas via Risk Seeking Monte Carlo Tree Search | |
| Lipovetzky | Planning for novelty: Width-based algorithms for common problems in control, planning and reinforcement learning | |
| Amhraoui et al. | Expected Lenient Q-learning: a fast variant of the Lenient Q-learning algorithm for cooperative stochastic Markov games | |
| Garcia et al. | Inverse engineering preferences in the graph model for conflict resolution | |
| Kitzis et al. | Broadening the tests of learning models | |
| Antonov et al. | Exploring replay | |
| Chang et al. | COMB: Scalable concession-driven opponent models using bayesian learning for preference learning in bilateral multi-issue automated negotiation | |
| Li | A Design Trajectory Map of Human-AI Collaborative Reinforcement Learning Systems: Survey and Taxonomy | |
| Shachter et al. | Using potential influence diagrams for probabilistic inference and decision making | |
| Hajek et al. | Interval-valued intuitionistic fuzzy cognitive maps for stock index forecasting | |
| Lu | AlphaSMT: A reinforcement learning guided SMT solver | |
| Velázquez-Vargas et al. | Learning to Move and Plan like the Knight: Sequential Decision Making with a Novel Motor Mapping | |
| Lu | Automated Machine Learning and Data-Driven Decision Support System for Strategy Management in Organizational Activities | |
| Bauman et al. | A Deep Learning Approach to Goal-Based Portfolio Optimization in Non-Stationary Environments |