
Hasselt, H.P. van (2011, January 17). Insights in reinforcement rearning : formal analysis and empirical evaluation of temporal-difference learning algorithms. UU Universiteit Utrecht (278 pag.) ( Utrecht University). Prom./coprom.: prof. dr. J-J.Ch. Meyer, L. Schomaker & dr. M.A. Wiering. Hasselt, H.P. van (2010). Double Q-learning. In J. Lafferty, C..K..I. Williams, J. Shawe-Taylor, R.S. Zemel, A. Culotta & A. Culotta (Eds.), Advances in Neural Information Processing Systems 23 (pp. 2613-2621). Seijen, H., Hasselt, H.P. van, Whiteson, S. & Wiering, M.A. (2009). A Theoretical and Empirical Analysis of Expected Sarsa. In Proceedings of the IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADRPL 2009) (pp. 177-184). IEEE Press. Westra, J., Hasselt, H.P. van, Dignum, F.P.M. & Dignum, M.V. (2009). Adaptive Serious Games Using Agent Organizations. In F.P.M. Dignum, J.M. Bradshaw, BG Silverman & W..A. Doesburg (Eds.), Agents for Games and Simulations (pp. 206-220). Berlin / Heidelberg: Springer. Westra, J., Hasselt, H.P. van, Dignum, M.V. & Dignum, F.P.M. (2008). On-line Adapting Games using Agent Organizations. In IEEE Symposium on Computational Intelligence and Games (CIG). Perth, Australia. Wiering, M.A. & Hasselt, H.P. van (2009). The QV Family Compared to Other Reinforcement Learning Algorithms. In Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL 2009) (pp. 101-108). IEEE Press. Hasselt, H.P. van & Wiering, M.A. (2009). Using Continuous Action Spaces to Solve Discrete Problems. In Proceedings of the 2009 International Joint Conference on Neural Networks (IJCNN 2009) (pp. 1149-1156). Atlanta, GA: IEEE Press. Wiering, M.A. & Hasselt, H.P. van (2008). Ensemble Algorithms in Reinforcement Learning. IEEE transactions on systems, man, and cybernetics. Part B, Cybernetics, 38(4), 930-936. Hasselt, H.P. van & Wiering, M.A. (2007). Convergence of Model-Based Temporal Difference Learning for Control. In Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL) (pp. 60-67). Hasselt, H.P. van & Wiering, M.A. (2007). Reinforcement Learning in Continuous Action Spaces. In Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL) (pp. 272-279). Wiering, M.A. & Hasselt, H.P. van (2007). Two Novel On-policy Reinforcement Learning Algorithms based on TD(lambda)-methods. In Proceedings of the IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning (ADPRL) (pp. 280-287).