Reinforcement Learning Toolbox 2.0
last updated:
General
Documentation
Manual
Tutorial
Class Reference
Master Thesis
Examples
Related Papers
Downloads
Links
News
mailto:webmaster
Main Page     Class Hierarchy   Compound List   File List   Compound Members   File Members

CContinuousActionGradientPolicy Class Reference

#include <ccontinuousactiongradientpolicy.h>

Inheritance diagram for CContinuousActionGradientPolicy:

CContinuousActionController CGradientFunction CStateObject CAgentController CGradientUpdateFunction CActionObject CParameterObject CParameterObject CLearnDataObject CParameters CParameters CParameterObject CParameters CContinuousActionFeaturePolicy CContinuousActionPolicyFromGradientFunction CContinuousActionSigmoidPolicy CContinuousTimeAndActionSigmoidVMGradientPolicy List of all members.


Public Member Functions

  CContinuousActionGradientPolicy (CContinuousAction *contAction, CStateProperties *modelState)
  ~CContinuousActionGradientPolicy ()
virtual void  getNextContinuousAction (CStateCollection *state, CContinuousActionData *action)=0
virtual int  getNumWeights ()=0
  Returns the number of weights.

virtual void  getWeights (double *parameters)=0
  Function for getting all weights.

virtual void  setWeights (double *parameters)=0
  Function for setting all weights.

virtual void  getGradient (CStateCollection *inputState, int outputDimension, CFeatureList *gradientFeatures)=0
virtual void  getGradientPre (ColumnVector *input, ColumnVector *outputErrors, CFeatureList *gradientFeatures)
  Interface for calculating the gradient given the input and the outputerror.

virtual void  getFunctionValuePre (ColumnVector *input, ColumnVector *output)
  Interface for calculating the output value.

virtual void  resetData ()=0
  Interface for resetting the weights.



Protected Member Functions

virtual void  updateWeights (CFeatureList *dParams)=0
  Interface for updating the weights.



Protected Attributes

CStateProperties modelState

Constructor & Destructor Documentation

CContinuousActionGradientPolicy::CContinuousActionGradientPolicy CContinuousAction contAction,
CStateProperties modelState
 
CContinuousActionGradientPolicy::~CContinuousActionGradientPolicy  ) 
 

Member Function Documentation

virtual void CContinuousActionGradientPolicy::getFunctionValuePre ColumnVector *  input,
ColumnVector *  output
[virtual]
 

Interface for calculating the output value.

Implements CGradientFunction.

virtual void CContinuousActionGradientPolicy::getGradient CStateCollection inputState,
int  outputDimension,
CFeatureList gradientFeatures
[pure virtual]
 

Implemented in CContinuousActionPolicyFromGradientFunction, CContinuousActionFeaturePolicy, CContinuousActionSigmoidPolicy, and CContinuousTimeAndActionSigmoidVMGradientPolicy.

virtual void CContinuousActionGradientPolicy::getGradientPre ColumnVector *  input,
ColumnVector *  outputErrors,
CFeatureList gradientFeatures
[virtual]
 

Interface for calculating the gradient given the input and the outputerror.

Implements CGradientFunction.

virtual void CContinuousActionGradientPolicy::getNextContinuousAction CStateCollection state,
CContinuousActionData action
[pure virtual]
 

Implements CContinuousActionController.

Implemented in CContinuousActionPolicyFromGradientFunction, CContinuousActionFeaturePolicy, CContinuousActionSigmoidPolicy, and CContinuousTimeAndActionSigmoidVMGradientPolicy.

virtual int CContinuousActionGradientPolicy::getNumWeights  )  [pure virtual]
 

Returns the number of weights.

Implements CGradientUpdateFunction.

Implemented in CContinuousActionPolicyFromGradientFunction, CContinuousActionFeaturePolicy, CContinuousActionSigmoidPolicy, and CContinuousTimeAndActionSigmoidVMGradientPolicy.

virtual void CContinuousActionGradientPolicy::getWeights double *  parameters  )  [pure virtual]
 

Function for getting all weights.

The double array is assumed to be large enough. This isn't checked!

Implements CGradientUpdateFunction.

Implemented in CContinuousActionPolicyFromGradientFunction, CContinuousActionFeaturePolicy, CContinuousActionSigmoidPolicy, and CContinuousTimeAndActionSigmoidVMGradientPolicy.

virtual void CContinuousActionGradientPolicy::resetData  )  [pure virtual]
 

Interface for resetting the weights.

Implements CGradientUpdateFunction.

Implemented in CContinuousActionPolicyFromGradientFunction, CContinuousActionFeaturePolicy, CContinuousActionSigmoidPolicy, and CContinuousTimeAndActionSigmoidVMGradientPolicy.

virtual void CContinuousActionGradientPolicy::setWeights double *  parameters  )  [pure virtual]
 

Function for setting all weights.

The double array is assumed to be large enough. This isn't checked!

Implements CGradientUpdateFunction.

Implemented in CContinuousActionPolicyFromGradientFunction, CContinuousActionFeaturePolicy, CContinuousActionSigmoidPolicy, and CContinuousTimeAndActionSigmoidVMGradientPolicy.

virtual void CContinuousActionGradientPolicy::updateWeights CFeatureList dParams  )  [protected, pure virtual]
 

Interface for updating the weights.

Implements CGradientUpdateFunction.

Implemented in CContinuousActionPolicyFromGradientFunction, CContinuousActionFeaturePolicy, CContinuousActionSigmoidPolicy, and CContinuousTimeAndActionSigmoidVMGradientPolicy.


Member Data Documentation

CStateProperties* CContinuousActionGradientPolicy::modelState [protected]
 

The documentation for this class was generated from the following file: