Reinforcement Learning Toolbox 2.0
last updated:
General
Documentation
Manual
Tutorial
Class Reference
Master Thesis
Examples
Related Papers
Downloads
Links
News
mailto:webmaster
Main Page     Class Hierarchy   Compound List   File List   Compound Members   File Members

RLToolbox Class List

Here are the classes, structs, unions and interfaces with brief descriptions:
C1DSplittingCondition
CAbsoluteSoftMaxDistribution
CAbstractBetaCalculator
CAbstractFeatureStochasticEstimatedModel Base class for all estimated models
CAbstractFeatureStochasticModel Interface for all model classes
CAbstractQETraces Interface for Q-ETraces
CAbstractQFunction Interface for all Q-Functions
CAbstractStateDiscretizer Interface for all state discretizer
CAbstractVETraces Class representing etraces for a V-Function
CAbstractVFunction Interface reprenting a Value Function
CAcroBotExpRewardFunction
CAcroBotHeightRewardFunction
CAcroBotModel
CAcroBotRewardFunction
CAcroBotVelocityRewardFunction
CAction Class Representing an action the agent or any other SMDP can choose from
CActionData Interface for saving changable data of an action
CActionDataSet Class for mantaining the action data objects of all actions of an action set
CActionDistribution Action Distribution classes define the distributions of stochastic Policies
CActionList Class for logging a sequence of actions
CActionObject Base Class for all Classes which have to maintain an action set
CActionOutput
CActionSet Class maintaining all the actions available for a certain object (normally a CActionObject like a controller)
CActionStatistics Actionstatistics - for comparison of policies and agent controllers
CActionStatisticsComparator Baseclass for all ActionStatisticComparators, used by
CActor Interface for all Actors
CActorForMultipleAgents
CActorFromActionValue Actor class which can only decide beween 2 different action, depending on the action value of the current state
CActorFromContinuousActionGradientPolicy
CActorFromQFunction Actor who creates his Policy on a Q Function
CActorFromQFunctionAndPolicy Actor which uses a QFunction and his Policy for the update
CAdaptiveEtaCalculator Adaptive Learning Rate Calculator Interface class
CAdaptiveParameterBoundedValuesCalculator Super class for all classes which use bounded target values
CAdaptiveParameterCalculator Interface for all adaptive Parameter Calculators
CAdaptiveParameterFromAverageRewardCalculator Adaptive Parameter Calculator which calculates the parameter's value from the current average reward
CAdaptiveParameterFromNEpisodesCalculator Adaptive Parameter Calculator which calculates the parameter's value from the number of learning episodes
CAdaptiveParameterFromNStepsCalculator Adaptive Parameter Calculator which calculates the parameter's value from the number of learning steps
CAdaptiveParameterFromValueCalculator Adaptive Parameter Calculator which calculates the parameter's value from the current value of a V-Function
CAdaptiveParameterUnBoundedValuesCalculator Super class for all classes which use unbounded target values
CAdaptiveTargetGraph
CAdaptiveTargetGraphController
CAdvantageLearner
CAdvantageUpdating
CAgent The class represents the main acting Object of the Learning System, the agent
CAgentController Class for navigating the agent
CAgentLogger Class for logging a whole training trial
CAgentStatisticController Agent controller which returns additionally an statistic object for the action
CAverageReinforcementBaseLineCalculator
CAverageRewardCalculator
CAverageRewardSameStateCalculator
CAverageRewardTestSuiteEvaluator
CBatchCAQDataGenerator
CBatchDataGenerator
CBatchEpisodeUpdate Class for doing batch updates during the learning trial
CBatchGradientLearner
CBatchLearningPolicy
CBatchQDataGenerator
CBatchStepUpdate Class for doing Batch Updates
CBatchVDataGenerator
CCAGradientPolicyInputDerivationCalculator
CCAGradientPolicyNumericInputDerivationCalculator
CCALinearFAQETraces
CCALinearFAQFunction
CCartPoleHeightRewardFunction
CCartPoleModel
CCartPoleRewardFunction
CComposedQETraces This is the E-Trace class for the CComposedQFunction class
CComposedQFunction
CComposedTransitionFunction
CConjugateGradientLearner
CConstantBetaCalculator
CConstantGradientFunctionUpdater
CConstantReinforcementBaseLineCalculator
CContinuousAction
CContinuousActionAddController
CContinuousActionController
CContinuousActionData Class for saving the continuous Values from a ContinuosAction
CContinuousActionFeaturePolicy
CContinuousActionGradientPolicy
CContinuousActionLinearFA
CContinuousActionPolicy
CContinuousActionPolicyFromGradientFunction
CContinuousActionProperties
CContinuousActionQFunction
CContinuousActionRandomPolicy
CContinuousActionSigmoidPolicy
CContinuousActionSmoother
CContinuousCoulomResidual
CContinuousDynamicProgramming
CContinuousDynamicQProgramming
CContinuousDynamicVProgramming
CContinuousEulerResidual
CContinuousMCQEvaluation
CContinuousRBFAction
CContinuousStateList
CContinuousStateRegion
CContinuousTimeAndActionBangBangVMPolicy
CContinuousTimeAndActionSigmoidVMGradientPolicy
CContinuousTimeAndActionSigmoidVMPolicy
CContinuousTimeAndActionTransitionFunction
CContinuousTimeAndActionVMPolicy
CContinuousTimeParameters
CContinuousTimeQFunctionFromTransitionFunction
CContinuousTimeTransitionFunction
CContinuousTimeVMPolicy
CControllerAnalyzer Controller analyzer
CDataCollector
CDataCollectorFromAgentLogger
CDataPreprocessor
CDataSet
CDataSet1D
CDeterministicController Controller class makes a given controller deterministic
CDirectGradient
CDiscreteResidual
CDiscreteStateOperatorAnd The and operator combines several discrete states to one discrete state
CDiscreteStochasticEstimatedModel Estimated Model for Discrete States
CDistributions Collection of Distributions
CDivergentQFunctionException This exception is thrown if a value function has become divergent
CDivergentVFunctionException This exception is thrown if a value function has become divergent
CDynamicLinearContinuousTimeModel
CDynamicProgramming Collection of static functions for dynamic Programming
CEnvironmentModel The environment in which the agent can act
CEpisode Class for logging a single Episode
CEpisodeHistory Interface for all classes which are able to store episodes
CEpisodeHistorySubset
CEpisodeMatlabOutput
CEpisodeOutput This Class writes each step and start of a new episode in readable form to a file
CEpisodeOutputStateChanged Class writes only the steps in which the specified state changes in readable form to a file
CEpsilonGreedyDistribution Class for the epsilon greedy action distribution
CErrorListener
CErrorSender
CEvaluator
CExplorationQFunction
CExtendedAction This abstract class represents extended actions like behaviors or hierarchical SMDPs
CExtendedActionTransitionFunction
CExtendedPrimitiveAction
CExtraLinearRegressionModelForestLearner
CExtraLinearRegressionModelTree
CExtraModelTree
CExtraRegressionForestFeatureLearner
CExtraRegressionForestLearner
CExtraRegressionForestTrainer
CExtraRegressionTree
CExtraTree< TreeData >
CExtraTreeRBFLinearWeightForest
CExtraTreeRegressionForest
CExtraTreesSplittingConditionFactory
CFeature Class for storing a single feature (feature Index and feature Faktor
CFeatureCalculator Base class for all Feature Calculators
CFeatureFunction The feature function for storing features in an double array
CFeatureList Class for storing features as sparse array
CFeatureMap
CFeatureOperatorAnd Combines 2 or more feature states with an "And" operatation
CFeatureOperatorOr Combines 2 or more feature states with an "Or" operatation
CFeatureQFunction Composed feature Q-Function
CFeatureRewardFunction Class for calculating the reward given a feature not a state
CFeatureRewardFunctionFromValueFunction
CFeatureRewardModel
CFeatureStateNNInput
CFeatureStateRewardFunction
CFeatureStateRewardModel
CFeatureStochasticEstimatedModel Estimated Model for feature States
CFeatureStochasticModel Class for loading and storing a fixed Model
CFeatureVETraces This class is used as ETraces for feature V-Functions
CFeatureVFunction
CFeatureVRegressionTreeFunction
CFittedCAQIteration
CFittedIteration
CFittedQIteration
CFittedQIterationAnalyzer
CFittedQNewFeatureCalculator
CFittedVIteration
CForest< TreeData > ***************** CForest ***********************
CForestFeatureCalculator< TreeData >
CFunctionComperator Super class of the V-Function Comperators and Q-Function comperators
CGlobalGridWorldDiscreteState
CGPOMDPGradientCalculator
CGradientCalculator
CGradientFunction Interface for all functions which support gradient update and gradient and output calculation
CGradientFunctionUpdater
CGradientLearner
CGradientQETraces E-Trace class for all gradient Q-Functions
CGradientQFunction
CGradientUpdateFunction Interface class for all function which gradient update
CGradientVETraces ETraces for gradient functions
CGradientVFunction Interface for all classes that can use gradients for updating
CGraphAdaptiveTargetDynamicProgramming
CGraphController
CGraphDebugger
CGraphDynamicProgramming
CGraphLogger
CGraphTarget
CGraphTransition
CGraphTransitionAdaptiveTarget
CGraphValueFromValueFunctionCalculator
CGreedyASComparator Comparator for CMulticontrollerGreedyPolicy prefers best action
CGreedyDistribution Class for a greedy action distribution
CGridFeatureCalculator Abstract Superclass for all feature calculator that partition the state space with a grid
CGridWorld
CGridWorldAction
CGridWorldController
CGridWorldController::GridControllerRecord
CGridWorldModel
CGSearchPolicyGradientUpdater
CHiearchicalAgent
CHierarchicalController Class for calculating a hierarchical execution of a hierarchical structure
CHierarchicalSemiMarkovDecisionProcess Subclass of CSemiMarkovDecisionProcess, used for hierarchical Learning
CHierarchicalStack A list of Actions representing the actual hierarchical Stack
CHierarchicalStackEpisode Class for logging the Hierarchical Stack of a training trial
CHierarchicalStackListener Listener who gets the Hierarchical Stack instead of an action
CHierarchicalStackSender Base class for sending the hierarchical stack to the listeners
CIndividualEtaCalculator Eta Calculator, which allows you to set individual learning rates for each weight
CKDRectangle
CKDTree
CKDTreeMedianSplittingFactory
CKNearestLeaves< TreeData >
CKNearestNeighbors
CKNearestNeighborsTreeData< DataElement, TreeData >
CKNearestRBFCenters
CLeaf< TreeData >
CLeafIndexFactory
CLearnDataObject Interface for all objects that learn, like V-Functions or Q-Functions
CLeastSquaresLearner
CLinearActionContinuousTimeTransitionFunction
CLinearFAContinuousAction
CLinearInterpolationFeatureCalculator
CLinearMultiFeatureCalculator Super class for all feature calculators that use a grid and where more than one feature can be active
CLinearRegression
CLinearRegressionDataFactory
CLineSearchGradientFunctionUpdater
CListenerTestSuite
CLocal4GridWorldState
CLocal4XGridWorldState
CLocal8GridWorldState
CLocalGridWorldDiscreteState
CLocalLinearLearner
CLocalLinearRegression
CLocalRBFLearner
CLocalRBFRegression
CLocalRegression
CLSTDLambda
CLSTDOfflineEpisodePolicyEvaluation
CLSTDOnlinePolicyEvaluation
CMapping< OutputValue >
CMatlabEpisodeOutputLogger
CMatlabQAnalyzerLogger
CMatlabVAnalyzerLogger
CMeanStdPreprocessor
CModelStateDiscretizer Class calculating a single discrete state from several state variables
CModelTree
CMonteCarloCAQLearner
CMonteCarloError
CMonteCarloQError
CMonteCarloQLearner
CMonteCarloSupervisedLearner
CMonteCarloVError
CMonteCarloVLearner
CMultiPoleAction
CMultiPoleContinuousReward
CMultiPoleController
CMultiPoleDiscreteController
CMultiPoleDiscreteState
CMultiPoleFailedState
CMultiPoleModel
CMultiStepAction
CMultiStepActionData Class for saving the duration and the finished flag from a MultiStepAction
CMyArray< T1 > A multidimensional Array
CMyArray2D< T1 > 2 dimensional Array
CMyArray3D< T1 > 3 dimensional array
CMyException Interface for all Exception classes used by the toolbox
CNeuralNetworkStateModifier State Modifier used for Neural Networks input states
CNewFeatureCalculator
CNewFeatureCalculatorDataGenerator
CNode< TreeData >
CNumericPolicyGradientCalculator
COfflineEpisodePolicyEvaluation
COnlinePolicyEvaluation
COptimalVFunctionFromQFunction
CParameterObject Super class of all Objects mantaining parameters
CParameters This class represents a parameterset
CPEGASUSAnalyticalPolicyGradientCalculator
CPEGASUSNumericPolicyGradientCalculator
CPEGASUSPolicyGradientCalculator
CPEGreedyASComparator Comparator for CMulticontrollerGreedyPolicy prefers epsilon-greedy action over best action
CPendulumModel
CPendulumRewardFunction
CPendulumUpTimeCalculator
CPickupAction
CPOASComparator Comparator for CMulticontrollerGreedyPolicy prefers actions of a specific owner
CPolicyEvaluation
CPolicyEvaluationGradientFunction
CPolicyEvaluationTestSuite
CPolicyEvaluator
CPolicyGradientCalculator
CPolicyGradientTestSuite
CPolicyGradientWeightDecayListener
CPolicyGreedynessEvaluator
CPolicyIteration
CPolicyIterationNewFeatures
CPolicyIterationTestSuite
CPolicySameStateEvaluator
CPrimitiveAction
CPrimitiveActionStateChange This class represents an primitive action which gets executed until a specific (mostly discrete) state changes
CPrioritizedSweeping Class for model based Prioritized Sweeping
CPutdownAction
CQAverageTDErrorLearner
CQAverageTDVarianceLearner
CQETraces Q-ETraces consisting of V-Etraces (one for each action)
CQFunction Compounded Q-Function consisting of V-Funcions
CQFunctionAnalyzer Analyzer for Q-Functions
CQFunctionComperator Comperator for 2 Q-Functions
CQFunctionFromGradientFunction
CQFunctionFromStochasticModel Converts a VFunction and a Model to a Q-Function
CQFunctionFromTransitionFunction
CQFunctionSum
CQGreedyPolicy Greedy Policy based on a Q-Function
CQLearner Class for Q-Learning
CQLSTDLambda
CQStochasticExplorationPolicy
CQStochasticPolicy Stochastic Policy which computes its propabilities from the Q-Values of
CRaceTrack
CRaceTrackDiscreteState
CRandomMaxPolicyGradientCalculator
CRandomPolicyGradientCalculator
CRangeSearch
CRBFBasisFunction
CRBFBasisFunctionLinearWeight
CRBFCenterFeatureCalculator
CRBFCenterNetwork
CRBFCenterNetworkKDTree
CRBFCenterNetworkSimpleSearch
CRBFDataFactory
CRBFExtraRegressionForest
CRBFExtraRegressionTree
CRBFFeatureCalculator Represents a normalized RBF network
CRBFForestLearner
CRBFLinearWeightDataFactory
CRBFLinearWeightForest
CRBFRegressionTreeOutputMapping
CRegion
CRegressionFactory
CRegressionForest ***************** CRegressionForest ***********************
CRegressionMultiMapping
CRegressionTreeFunction
CRegressionTreeQFunction
CRegressionTreeVFunction
CREINFORCELearner
CReinforcementBaseLineCalculator
CResidualBetaFunction
CResidualFunction
CResidualGradientFunction
CRewardAsVFunction
CRewardEpisode Logs the reward during an Episode
CRewardFunction Class for clalculating the reward for the learning objects
CRewardFunctionFromValueFunction
CRewardHistory
CRewardHistorySubset
CRewardLogger
CRewardPerEpisodeCalculator
CSampleTransition
CSamplingBasedGraph
CSamplingBasedTransitionModel
CSamplingBasedTransitionModelFromTransitionFunction
CSarsaLearner Class for Sarsa Learning
CSelectiveExplorationCalculator
CSemiMarkovDecisionProcess Class for providing the general Functions for the learning Environment
CSemiMDPLastNRewardFunction Reward Function for Behaviours
CSemiMDPListener Interface for all SemiMDP Listeners
CSemiMDPRewardListener Represents SMDP Listener which also need a reward
CSemiMDPSender Class for sending the State-Action-State Tuple to the Listeners
CSemiMDPTransition Transition for the semiMDP case
CSingleStateDiscretizer Discretizes the original States i'th continuous state into the specified partitions
CSingleStateFeatureCalculator Superclass for calculating features from a single continuous state variable
CSingleStateLinearInterpolationFeatureCalculator Class for linear interpolation features of a single continuous state variable
CSingleStateRBFFeatureCalculator Represents a 1-D RBF-network layed over a specified continous state variable
CSmallLocalGridWorldDiscreteState
CSoftMaxDistribution Soft Max Distribution for Stochastic Policies
CSplittingCondition
CSplittingConditionFactory
CState Represents a single state
CStateActionErrorLearner
CStateActionTransitions Class for storing the Backward and Forward Transitions for a given state action pair
CStateCollection Interface for a state collection
CStateCollectionImpl Implementation of the CStateCollection interface
CStateCollectionList Class for storing a sequence of state collections
CStateErrorLearner
CStateList Class for logging a list of States
CStateModifier Abstract class for calculating modified states from the original Model state
CStateModifiersObject Base Class for all Classes that have to maintain a list of modifiers
CStateMultiModifier Interface for all state modifier how have acces to several other state modifier for state calculation
CStateObject Base class for all classes which have to manage with states
CStateOutput
CStateProperties Class defining the Properties of a State
CStateReward Reward Function that only depends on the current state
CStateVariablesChooser
CStateVETraces State ETraces stores the state object itself
CStaticContinuousAction
CStep Class storing a single step (State-Action-State Tuple)
CStepHistory Class mantaining an unordered set of steps, which can be used for Learning
CStochasticModelAction
CStochasticPolicy Class for modeling a stochastic policy
CStoredEpisodeModel Serves as environment model for an agent, simulating the episodes from an agent logger
CSubGoalBehaviour
CSubGoalController
CSubGoalOutput
CSubsetFactory
CSupervisedFeatureGradientCalculator
CSupervisedGradientCalculator
CSupervisedGradientLearner
CSupervisedLearner
CSupervisedNeuralNetworkMatlabLearner
CSupervisedNeuralNetworkTorchLearner
CSupervisedQFunctionLearner
CSupervisedQFunctionLearnerFromLearners
CSupervisedQFunctionWeightedLearner
CSupervisedQFunctionWeightedLearnerFromLearners
CSupervisedWeightedLearner
CTaxiDomain
CTaxiHierarchicalBehaviour
CTaxiIsTargetDiscreteState
CTDGradientLearner
CTDLearner Class for temporal Difference Learning
CTDResidualLearner
CTestSuite
CTestSuiteCollection
CTestSuiteEvaluator
CTestSuiteEvaluatorLogger
CTestSuiteLoggerFromEvaluator
CTilingFeatureCalculator Tilings represent a grid layed over the state space
CTorchFunction Interface to integrate Torch Machines in the learning systems
CTorchGradientEtaCalculator
CTorchGradientFunction Class for learning with Torch-Gradient machines
CTorchVFunction
CTransition Transition for the Markov Case
CTransitionFunction
CTransitionFunctionEnvironment
CTransitionFunctionFromStochasticModel
CTransitionFunctionInputDerivationCalculator
CTransitionFunctionNumericalInputDerivationCalculator
CTransitionList Class for storing Transitions
CTree< TreeData >
CTreeDataFactory< TreeData >
CTreeElement< TreeData >
CUnknownDataQFunction
CUnknownDataQFunctionFromLocalRBFRegression
CValueCalculator
CValueIteration The Value Iteration Algorithm
CValueSameStateCalculator
CVariableBetaCalculator
CVarioEta Vario Eta Learning Rate Calculator
CVAverageTDErrorLearner
CVAverageTDVarianceLearner
CVectorQuantizationFactory
CVFunctionAnalyzer Analyzer for V-Functions
CVFunctionComperator Comperator for 2 V-Functions
CVFunctionFromGradientFunction Class for learning with Torch-Gradient machines
CVFunctionFromQFunction
CVFunctionGradientLearner
CVFunctionInputDerivationCalculator Interface class for calculating the gradient dV(x)/dx
CVFunctionLearner TD Learner for Value Function learning
CVFunctionNumericInputDerivationCalculator Calculating the input derivation of a V-Function numerically
CVFunctionResidualLearner
CVFunctionSum
CVisitStateActionCounter
CVisitStateActionEstimator
CVisitStateCounter
CVLSTDLambda
CVMStochasticPolicy Stochastic Policy which calculates its action from a Dynamic Model and a V-Function
CVPolicyLearner
CVTable Value Function as a table
CZeroReward
CZeroVFunction Value Function always returning zero
DataSubset
EvaluationValue