Keyphrases
Continuous Action: 84%
Acrobot: 75%
Action Space: 75%
State Action: 72%
Continuous State Space: 66%
Action Selection: 64%
Reinforcement Learning: 64%
Continuous Action Space: 58%
Minimum Time: 56%
Cart-pole: 50%
Swing-up Control: 50%
State-action-reward-state-action (SARSA): 50%
Balance Control: 50%
Genetic Programming: 50%
Controller: 43%
Swing-up: 42%
Newton's Method: 41%
Reinforcement Learning Algorithm: 33%
Selection Strategy: 33%
Benchmark Problems: 31%
Eligibility Traces: 25%
Value Function: 20%
Nelder-Mead: 18%
Number of Trials: 16%
CACLA: 14%
Policy Function: 14%
Handstand: 11%
Torque Values: 10%
Extended Period: 10%
Momentum Term: 8%
Sarsa Algorithm: 8%
Update Equation: 8%
Performance Improvement: 8%
Gradient Descent: 8%
Policy Networks: 8%
Nelder-Mead Method: 8%
Space Problem: 8%
Spacecraft Control: 8%
Success Rate: 8%
Selection Time: 8%
Optimization Techniques: 8%
Multilayer Perceptron: 8%
Control Problem: 8%
Direct Action: 8%
High Failure Rate: 8%
Optimization Methods: 8%
Parameter Values: 8%
Balance Problem: 6%
Derivative-free Method: 6%
Action Value: 6%

Engineering
Action Space: 100%
Reinforcement Learning: 87%
Acrobot: 75%
Continuous State: 67%
Newton's Method: 43%
Benchmark Problem: 35%
Success Rate: 25%
Selection Method: 25%
Value Function: 22%
Discretization: 20%
Term Approach: 12%
Direct Action: 12%
Extended Period: 10%
Free Method: 8%
Partial Derivative: 8%
Good Result: 8%
Gradient Descent: 6%
Space Problem: 6%
Optimization Technique: 6%
Perceptron: 6%
Optimization Method: 6%
Applied Torque: 5%
Fitness Function: 5%