Reinforcement Learning Toolbox 2.0
last updated:
General
Documentation
Manual
Tutorial
Class Reference
Master Thesis
Examples
Related Papers
Downloads
Links
News
mailto:webmaster
Main Page     Class Hierarchy   Compound List   File List   Compound Members   File Members

CEpsilonGreedyDistribution Class Reference

Class for the epsilon greedy action distribution. More...

#include <cpolicies.h>

Inheritance diagram for CEpsilonGreedyDistribution:

CActionDistribution CParameterObject CParameters List of all members.


Public Member Functions

  CEpsilonGreedyDistribution (double epsilon)
virtual void  getDistribution (CStateCollection *state, CActionSet *availableActions, double *values)
  Returns the distribution of the actions that is sampled by an stochastic policy.


Detailed Description

Class for the epsilon greedy action distribution.

This class implements the epsilon greedy action distribution. Epsilon greedy policies take the greedy (best rated) action with probability (1 - epsilon) and a random action with probability epsilon. If there are more than one greedy action, always the first action will be taken. To set epsilon please use the parameter "EpsilonGreedy" or the constructor of the class. Its understood that this distribution is not differentiable.

The class CEpsilonGreedyDistribution has following Parameters:

  • "EpsilonGreedy" : epsilon

Constructor & Destructor Documentation

CEpsilonGreedyDistribution::CEpsilonGreedyDistribution double  epsilon  ) 
 

Member Function Documentation

virtual void CEpsilonGreedyDistribution::getDistribution CStateCollection state,
CActionSet availableActions,
double *  values
[virtual]
 

Returns the distribution of the actions that is sampled by an stochastic policy.

The function gets as input the current state, all available actions, and the Q-Values (actually it can be any kind of value, rating an action) of the actions as a double array. Usually only this Q-Values are used for the distribution (the state is only used for special exploration policies). The function has to overwrite the Q-Values in double array with the distribution values.

Implements CActionDistribution.


The documentation for this class was generated from the following file: