Functions
PO_ParallelETUCT.cc File Reference
#include "PO_ParallelETUCT.hh"
#include <algorithm>
#include <sys/time.h>
Include dependency graph for PO_ParallelETUCT.cc:

Go to the source code of this file.

Functions

void * poParallelModelLearningStart (void *arg)
void * poParallelSearchStart (void *arg)

Detailed Description

Implements my real-time model-based RL architecture which uses UCT with eligiblity traces for planning. This version of UCT plans over states augmented with k-action histories. The modified version of UCT used is presented in: L. Kocsis and C. Szepesv´ari, "Bandit based monte-carlo planning," in ECML-06. Number 4212 in LNCS. Springer, 2006, pp. 282-293. The real-time architecture is presented in: Hester, Quinlan, and Stone, "A Real-Time Model-Based Reinforcement Learning Architecture for Robot Control", arXiv 1105.1749, 2011.

Author:
Todd Hester

Definition in file PO_ParallelETUCT.cc.


Function Documentation

void* poParallelModelLearningStart ( void *  arg)

Thread that loops, continually updating model with new experiences.

Definition at line 534 of file PO_ParallelETUCT.cc.

void* poParallelSearchStart ( void *  arg)

Parallel thread that continually does uct search from agent's current state.

Definition at line 1114 of file PO_ParallelETUCT.cc.



rl_agent
Author(s): Todd Hester
autogenerated on Thu Jun 6 2019 22:00:14