Accelerating Critic Learning in Approximate Dynamic Programming via Value Templates and Perceptual Learning
Sponsor
This work was partially supported by the National Science Foundation under grant ECS-9904378
Published In
Proceedings of the International Joint Conference on Neural Networks
Document Type
Citation
Publication Date
9-25-2003
Abstract
The concept of value templates and perceptual learning are introduced as refinements to the reinforcement learning (RL) paradigm. We demonstrate a method for accelerating Dual Heuristic Programming (DHP) critic training using value templates and perceptual learning. Both faster and more stable learning are achieved by using the value template and utilizing its inherent constraints to regularize the perceptual learning task. The method is demonstrated by tuning a neurofuzzy control system for a highly nonlinear 2nd order plant proposed by Sanner and Slotine. We take advantage of the TSK model framework throughout to keep the controller, critic, and model components used in DHP highly interpretable.
Locate the Document
https://doi.org/10.1109/IJCNN.2003.1224035
DOI
10.1109/IJCNN.2003.1224035
Persistent Identifier
https://archives.pdx.edu/ds/psu/37267
Citation Details
Shannon, T. T., Santiago, R. A., & George, G. L. (2003, July). Accelerating critic learning in approximate dynamic programming via value templates and perceptual learning. In Proceedings of the International Joint Conference on Neural Networks, 2003. (Vol. 4, pp. 2922-2927). IEEE.