CActorFromQFunction Class Reference
Actor who
creates his Policy on a Q Function. More...
#include <cactorcritic.h>
Inheritance diagram for CActorFromQFunction:
List of all
members.
Detailed Description
Actor who creates his Policy on a Q Function.
The CActorFromQFunction updates it's Q-Function on the
particular state action pair according to the critic he got for
that state action pair. Since we are using a Q-Function the actor
from Q-Function uses QETraces to boost learning. The policy from
the actor is usually a Softmax Policy using the Q-Function, this
Policy must be created by the user exclusivly.
The Q-Function update for this actor is Q(s,a)_new = Q(s,a)_old
+ beta * td, where td is the value coming from the critic.
- See also:
- CActorFromQFunctionAndPolicy.
Constructor & Destructor Documentation
| |
Creates an Actor using the specified Q-Function to adopt his
Policy.
|
| virtual
CActorFromQFunction::~CActorFromQFunction
|
( |
|
) |
[virtual] |
|
Member Function Documentation
| |
Returns the used ETraces.
|
| |
Returns the used Q-Function.
|
| virtual void
CActorFromQFunction::newEpisode
|
( |
|
) |
[virtual] |
|
| |
Updates the Q-Function.
The actor first updates the Etraces (i.e. mulitply all ETraces
with gamma*lambda and then adds the state to the ETraces). Then the
Q-Function is updated by the Etraces Object with the value beta *
critic.
- See also:
- CQETraces
Implements CActor.
Reimplemented in CActorFromQFunctionAndPolicy.
|
Member Data Documentation
| |
The Etraces used for the QFunction.
|
| |
The Q Function of the actor.
|
The documentation for this class was generated from the following
file:
|