HOME cs.uu.nl home education contact library calendar search UU.NL
about us research people archive services jobs

publications by dr. H.P. van Hasselt

Hado van Hasselt

dr. H.P. van Hasselt

some publications

Hasselt, H.P. van (2011, January 17). Insights in reinforcement rearning : formal analysis and empirical evaluation of temporal-difference learning algorithms. UU Universiteit Utrecht (278 pag.) ( Utrecht University). Prom./coprom.: prof. dr. J-J.Ch. Meyer, L. Schomaker & dr. M.A. Wiering.

Hasselt, H.P. van (2010). Double Q-learning. In J. Lafferty, C..K..I. Williams, J. Shawe-Taylor, R.S. Zemel, A. Culotta & A. Culotta (Eds.), Advances in Neural Information Processing Systems 23 (pp. 2613-2621).

Seijen, H., Hasselt, H.P. van, Whiteson, S. & Wiering, M.A. (2009). A Theoretical and Empirical Analysis of Expected Sarsa. In Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADRPL 2009) (pp. 177-184). IEEE Press.

Westra, J., Hasselt, H.P. van, Dignum, F.P.M. & Dignum, M.V. (2009). Adaptive Serious Games Using Agent Organizations. In F.P.M. Dignum, J.M. Bradshaw, BG Silverman & W..A. Doesburg (Eds.), Agents for Games and Simulations (pp. 206-220). Berlin / Heidelberg: Springer.

Westra, J., Hasselt, H.P. van, Dignum, M.V. & Dignum, F.P.M. (2008). On-line Adapting Games using Agent Organizations. In IEEE Symposium on Computational Intelligence and Games (CIG). Perth, Australia.

Wiering, M.A. & Hasselt, H.P. van (2009). The QV Family Compared to Other Reinforcement Learning Algorithms. In Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2009) (pp. 101-108). IEEE Press.

Hasselt, H.P. van & Wiering, M.A. (2009). Using Continuous Action Spaces to Solve Discrete Problems. In Proceedings of the 2009 International Joint Conference on Neural Networks (IJCNN 2009) (pp. 1149-1156). Atlanta, GA: IEEE Press.

Wiering, M.A. & Hasselt, H.P. van (2008). Ensemble Algorithms in Reinforcement Learning. IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics, 38(4), 930-936.

Hasselt, H.P. van & Wiering, M.A. (2007). Convergence of Model-Based Temporal Difference Learning for Control. In Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL) (pp. 60-67).

Hasselt, H.P. van & Wiering, M.A. (2007). Reinforcement Learning in Continuous Action Spaces. In Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL) (pp. 272-279).

Wiering, M.A. & Hasselt, H.P. van (2007). Two Novel On-policy Reinforcement Learning Algorithms based on TD(lambda)-methods. In Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL) (pp. 280-287).


valid-html401 webmaster@cs.uu.nl, Thu, 23 May 2013 23:37:47 +0200 ← Departement Informatica, Universiteit Utrecht