CGreedyDistribution Class Reference
Class for a greedy
action distribution. More...
#include <cpolicies.h>
Inheritance diagram for CGreedyDistribution:
List of all
members.
|
Public Member Functions
|
|
virtual void
|
getDistribution
(CStateCollection *state,
CActionSet
*availableActions, double *values)
|
| |
Returns the distribution of the actions that
is sampled by an stochastic policy.
|
Detailed Description
Class for a greedy action distribution.
This class implements a greedy action distribution, so the
probability for the best rated action is always 1, and for the rest
0. If there are more than one greedy action, always the first
action will be taken. Its understood that this distribution is not
differentiable.
Member Function Documentation
| virtual void
CGreedyDistribution::getDistribution
|
( |
CStateCollection *
|
state,
|
|
|
CActionSet *
|
availableActions,
|
|
|
double * |
values |
|
) |
[virtual] |
|
| |
Returns the distribution of the actions that is sampled by an
stochastic policy.
The function gets as input the current state, all available
actions, and the Q-Values (actually it can be any kind of value,
rating an action) of the actions as a double array. Usually only
this Q-Values are used for the distribution (the state is only used
for special exploration policies). The function has to overwrite
the Q-Values in double array with the distribution values.
Implements CActionDistribution.
|
The documentation for this class was generated from the following
file:
|