Reinforcement Learning Toolbox 2.0
last updated:
General
Documentation
Manual
Tutorial
Class Reference
Master Thesis
Examples
Related Papers
Downloads
Links
News
mailto:webmaster
Main Page     Class Hierarchy   Compound List   File List   Compound Members   File Members

CGreedyDistribution Class Reference

Class for a greedy action distribution. More...

#include <cpolicies.h>

Inheritance diagram for CGreedyDistribution:

CActionDistribution CParameterObject CParameters List of all members.


Public Member Functions

virtual void  getDistribution (CStateCollection *state, CActionSet *availableActions, double *values)
  Returns the distribution of the actions that is sampled by an stochastic policy.


Detailed Description

Class for a greedy action distribution.

This class implements a greedy action distribution, so the probability for the best rated action is always 1, and for the rest 0. If there are more than one greedy action, always the first action will be taken. Its understood that this distribution is not differentiable.


Member Function Documentation

virtual void CGreedyDistribution::getDistribution CStateCollection state,
CActionSet availableActions,
double *  values
[virtual]
 

Returns the distribution of the actions that is sampled by an stochastic policy.

The function gets as input the current state, all available actions, and the Q-Values (actually it can be any kind of value, rating an action) of the actions as a double array. Usually only this Q-Values are used for the distribution (the state is only used for special exploration policies). The function has to overwrite the Q-Values in double array with the distribution values.

Implements CActionDistribution.


The documentation for this class was generated from the following file: