1994

Finding structure in reinforcement learning

1995

UTree algorithm (McCallum 1995)

  • Finds state abstractions from sample interactions with the environment, focusing directly on modeling the value function

TD models: modeling the world at a mixture of time scales


1998

Hierarchical solution of Markov Decision Processes using macro-actions

  • Milos Hauskrecht et al. @ UAI 1998
  • Focuses on how to construct macro-actions automatically
  • A macro-action is a local policy defined for a particular region
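A macro-action in this sense can be sketched as a local policy attached to a region of the state space, executed until the state leaves that region. A minimal sketch; the chain environment, `region`, and `step` function below are illustrative stand-ins, not from the paper:

```python
# Minimal sketch: a macro-action as a local policy over a region.
def run_macro(state, region, local_policy, step):
    """Follow the region's local policy until the state exits the region."""
    trajectory = [state]
    while state in region:
        state = step(state, local_policy[state])
        trajectory.append(state)
    return trajectory

# Toy deterministic chain: moving "right" from state s lands in s + 1.
step = lambda s, a: s + 1 if a == "right" else s - 1
region = {0, 1, 2}
policy = {0: "right", 1: "right", 2: "right"}
print(run_macro(0, region, policy, step))  # [0, 1, 2, 3]
```

The macro terminates exactly when control leaves the region, which is what lets a higher-level planner treat it as a single abstract action.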

1999

Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning


2004

MDP homomorphism (Ravindran 2004)

  • A mapping between MDPs that preserves transition and reward structure, used for state–action abstraction
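For a deterministic MDP the homomorphism condition is easy to check numerically: a state map f and a state-dependent action map g must commute with the transition function and preserve rewards. A toy sketch on a mirror-symmetric 4-state chain (the chain, the maps, and all names here are illustrative, not from the thesis):

```python
# Toy check of the MDP homomorphism condition on a deterministic,
# mirror-symmetric chain 0-1-2-3 (illustrative example).
S = [0, 1, 2, 3]
A = ["left", "right"]

def T(s, a):  # ground transitions: bounded moves on the chain
    return min(s + 1, 3) if a == "right" else max(s - 1, 0)

def R(s, a):  # reward depends only on whether s is an inner state
    return 1.0 if s in (1, 2) else 0.0

f = {0: 0, 1: 1, 2: 1, 3: 0}  # state map: fold the chain at its midpoint

def g(s, a):  # action map: mirror left/right on the folded half
    if s >= 2:
        return {"left": "in", "right": "out"}[a]
    return {"right": "in", "left": "out"}[a]

def T_abs(z, b):  # abstract transitions on states {0, 1}
    return 1 if b == "in" else 0

def R_abs(z, b):
    return 1.0 if z == 1 else 0.0

# Homomorphism condition (deterministic case):
# f(T(s, a)) == T'(f(s), g(s, a)) and R(s, a) == R'(f(s), g(s, a)).
ok = all(f[T(s, a)] == T_abs(f[s], g(s, a)) and R(s, a) == R_abs(f[s], g(s, a))
         for s in S for a in A)
print(ok)  # True
```

In the general stochastic case the condition instead equates the abstract transition probability with the ground probability summed over each preimage block of f.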

2005

Theoretical results on reinforcement learning with temporally abstract behaviors


2006

Controlled Markov Process (CMP) homomorphisms

  • A CMP is an MDP without a reward function.

2009

Binary action search for learning continuous-action control policies

  • Discretizes continuous-action control via binary search over the action range, so each continuous action choice becomes a short sequence of binary decisions.
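The idea can be sketched as repeated halving of the action range, where each step asks a binary policy whether to move the candidate action up or down. The `decide` comparator here is a hypothetical stand-in for that learned policy:

```python
def binary_action_search(decide, low, high, depth=20):
    """Select a continuous action via a sequence of binary higher/lower decisions."""
    for _ in range(depth):
        mid = (low + high) / 2.0
        if decide(mid):  # binary policy says: the best action lies above mid
            low = mid
        else:
            high = mid
    return (low + high) / 2.0

# With an oracle comparator, the search homes in on the target action 0.7.
a = binary_action_search(lambda m: m < 0.7, 0.0, 1.0)
print(round(a, 3))  # 0.7
```

After `depth` decisions the interval has width (high − low) / 2^depth, so a handful of binary choices suffices for fine-grained continuous control.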

2011

Automatic construction of temporally extended actions for MDPs using bisimulation metrics

  • Tags: #mdp
  • Pablo Samuel Castro et al. @ EWRL 2011
  • Option framework
      • ℐ ⊆ S is the set of states where the option is available
      • π is the option’s policy
      • β : S → [0, 1] is the probability of the option terminating at each state
      • An option is started in a state s ∈ ℐ; the policy π is followed until the option is terminated, as dictated by β
  • Bisimulation metrics
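The option components listed above (initiation set, internal policy, termination probabilities) can be sketched directly. A minimal sketch; the chain environment, `step` function, and termination rule are illustrative, not from the paper:

```python
import random
from dataclasses import dataclass
from typing import Callable, Set

# Minimal sketch of the option framework (illustrative example).
@dataclass
class Option:
    init_set: Set[int]             # states where the option is available
    policy: Callable[[int], str]   # the option's internal policy
    beta: Callable[[int], float]   # termination probability at each state

def execute(option, state, step, rng=random.Random(0)):
    """Start the option in `state`; follow its policy until beta says stop."""
    assert state in option.init_set
    while rng.random() >= option.beta(state):
        state = step(state, option.policy(state))
    return state

# A "go to the right end" option on a 6-state chain.
go_right = Option(init_set={0, 1, 2, 3, 4},
                  policy=lambda s: "right",
                  beta=lambda s: 1.0 if s == 5 else 0.0)
print(execute(go_right, 0, lambda s, a: min(s + 1, 5)))  # 5
```

Because execution runs until β terminates it, an option behaves as a single temporally extended action from the point of view of the higher-level MDP.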

2024