Reinforcement Learning Toolbox 2.0

CVFunctionResidualLearner Class Reference

#include <cvfunctionlearner.h>

Inheritance diagram for CVFunctionResidualLearner:

CVFunctionGradientLearner, CVFunctionLearner, CSemiMDPRewardListener, CErrorSender, CSemiMDPListener, CParameterObject, CParameters


Public Member Functions

  CVFunctionResidualLearner (CRewardFunction *rewardFunction, CGradientVFunction *vfunction, CResidualFunction *residual, CResidualGradientFunction *residualGradient, CAbstractBetaCalculator *betaCalc)
  ~CVFunctionResidualLearner ()
virtual void  updateVFunction (CStateCollection *oldState, CStateCollection *newState, int duration, double td)
  Updates the V-Function by calling the update method of the etrace object.

virtual void  newEpisode ()
  Resets the etraces.

CGradientVETraces *  getResidualETraces ()


Protected Member Functions

virtual void  addETraces (CStateCollection *oldState, CStateCollection *newState, int duration, double td)


Protected Attributes

CGradientVETraces *  residualGradientTraces
CGradientVETraces *  directGradientTraces
CGradientVETraces *  residualETraces
CAbstractBetaCalculator *  betaCalculator

Constructor & Destructor Documentation

CVFunctionResidualLearner::CVFunctionResidualLearner (CRewardFunction *rewardFunction,
    CGradientVFunction *vfunction,
    CResidualFunction *residual,
    CResidualGradientFunction *residualGradient,
    CAbstractBetaCalculator *betaCalc)

CVFunctionResidualLearner::~CVFunctionResidualLearner ()
 

Member Function Documentation

virtual void CVFunctionResidualLearner::addETraces (CStateCollection *oldState,
    CStateCollection *newState,
    int duration,
    double td) [protected, virtual]

CGradientVETraces* CVFunctionResidualLearner::getResidualETraces () [inline]

virtual void CVFunctionResidualLearner::newEpisode () [virtual]
 

Resets the etraces.

Reimplemented from CVFunctionLearner.

virtual void CVFunctionResidualLearner::updateVFunction (CStateCollection *oldState,
    CStateCollection *newState,
    int duration,
    double td) [virtual]
 

Updates the V-Function by calling the update method of the etrace object.

First the etraces are multiplied by the attenuation factor (lambda * gamma)^duration, then the etrace of the current step is added, and then the V-Function is updated by the update function of the etrace object. The update factor is td * learningrate.

Reimplemented from CVFunctionLearner.
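
The three steps described for updateVFunction can be illustrated with a minimal, self-contained sketch. It is not the toolbox's implementation: SketchETraces and sketchUpdateVFunction are hypothetical names, the V-Function is assumed to be a plain weight vector, and lambda, gamma, and learningRate are passed explicitly rather than read from a parameter object. The sketch uses a single trace vector and does not model the separate direct and residual gradient traces or the beta calculator of this class.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical stand-in for an eligibility-trace object.
// This is NOT the toolbox's CGradientVETraces class.
struct SketchETraces {
    std::vector<double> traces;  // one trace per V-Function weight

    explicit SketchETraces(std::size_t n) : traces(n, 0.0) {}

    // Step 1: multiply every trace by (lambda * gamma)^duration.
    void attenuate(double lambda, double gamma, int duration) {
        const double factor = std::pow(lambda * gamma, duration);
        for (double &e : traces) e *= factor;
    }

    // Step 2: add the gradient (etrace) of the current step.
    void add(const std::vector<double> &gradient) {
        assert(gradient.size() == traces.size());
        for (std::size_t i = 0; i < traces.size(); ++i) traces[i] += gradient[i];
    }

    // Step 3: move the weights along the traces, scaled by the update factor.
    void updateWeights(std::vector<double> &weights, double factor) const {
        assert(weights.size() == traces.size());
        for (std::size_t i = 0; i < traces.size(); ++i) weights[i] += factor * traces[i];
    }

    // Counterpart of newEpisode(): clear all traces.
    void reset() { traces.assign(traces.size(), 0.0); }
};

// Assumed update order, following the description above:
// attenuate, add the current step, then update with td * learningRate.
void sketchUpdateVFunction(SketchETraces &etraces, std::vector<double> &weights,
                           const std::vector<double> &gradient, int duration,
                           double td, double lambda, double gamma,
                           double learningRate) {
    etraces.attenuate(lambda, gamma, duration);
    etraces.add(gradient);
    etraces.updateWeights(weights, td * learningRate);
}
```

In this sketch a caller would invoke reset() at the start of each episode and sketchUpdateVFunction once per step, passing the step's temporal-difference error as td.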


Member Data Documentation

CAbstractBetaCalculator* CVFunctionResidualLearner::betaCalculator [protected]
 
CGradientVETraces* CVFunctionResidualLearner::directGradientTraces [protected]
 
CGradientVETraces* CVFunctionResidualLearner::residualETraces [protected]
 
CGradientVETraces* CVFunctionResidualLearner::residualGradientTraces [protected]
 

The documentation for this class was generated from the following file: