Reinforcement Learning Toolbox 2.0
last updated:
General
Documentation
Manual
Tutorial
Class Reference
Master Thesis
Examples
Related Papers
Downloads
Links
News
mailto:webmaster
Main Page     Class Hierarchy   Compound List   File List   Compound Members   File Members

CAbsoluteSoftMaxDistribution Class Reference

#include <cpolicies.h>

Inheritance diagram for CAbsoluteSoftMaxDistribution:

CActionDistribution CParameterObject CParameters List of all members.


Public Member Functions

  CAbsoluteSoftMaxDistribution (double maxAbsValue)
virtual void  getDistribution (CStateCollection *state, CActionSet *availableActions, double *values)
  Returns the distribution of the actions that is sampled by an stochastic policy.

virtual bool  isDifferentiable ()

Constructor & Destructor Documentation

CAbsoluteSoftMaxDistribution::CAbsoluteSoftMaxDistribution double  maxAbsValue  ) 
 

Member Function Documentation

virtual void CAbsoluteSoftMaxDistribution::getDistribution CStateCollection state,
CActionSet availableActions,
double *  values
[virtual]
 

Returns the distribution of the actions that is sampled by an stochastic policy.

The function gets as input the current state, all available actions, and the Q-Values (actually it can be any kind of value, rating an action) of the actions as a double array. Usually only this Q-Values are used for the distribution (the state is only used for special exploration policies). The function has to overwrite the Q-Values in double array with the distribution values.

Implements CActionDistribution.

virtual bool CAbsoluteSoftMaxDistribution::isDifferentiable  )  [inline, virtual]
 

Reimplemented from CActionDistribution.


The documentation for this class was generated from the following file: