Incorporation of Perception-based Information in Robot Learning Using Fuzzy Reinforcement Learning Agents |
| |
作者姓名: | ZHOUChangjiu MENGQingchun GUOZhongwen QUWeifen YINBo |
| |
作者单位: | [1]SchoolofElectricalandElectronicEngineering,SingaporePolytechnic,Singapore139651 [2]ComputerScienceDepartment,OceanUniversityof(Qingdao,(Qingdao266003,P.R.China [3]StateKeyLaboratoryofIntelligentSystemsandTechnologiesinTsinghuaUniversity,Beijing100084,P.R.China |
| |
摘 要: | Robot learning in unstructured environments has been proved to be an extremely challenging problem, mainly because of many uncertainties always present in the real world. Human beings, on the other hand, seem to cope very well with uncertain and unpredictable environments, often relying on perception-based information. Furthermore, humans beings can also utilize perceptions to guide their learning on those parts of the perception-action space that are actually relevant to the task. Therefore, we conduct a research aimed at improving robot learning through the incorporation of both perceptionbased and measurement-based information. For this reason, a fuzzy reinforcement learning (FRL) agent is proposed in this paper. Based on a neural-fuzzy architecture, different kinds of information can be incorporated into the FRL agent to initialise its action network, critic network and evaluation feedback module so as to accelerate its learning. By making use of the global optimisation capability of GAs (genetic algorithms), a GA-based FRL (GAFRL) agent is presented to solve the local minima problem in traditional actor-critic reinforcement learning. On the other hand, with the prediction capability of the critic network, GAs can perform a more effective global search. Different GAFRL agents are constructed and verified by using the simulation model of a physical biped robot. The simulation analysis shows that the biped learning rate for dynamic balance can be improved by incorporating perception-based information on biped balancing and walking evaluation. The biped robot can find its application in ocean exploration, detection or sea rescue activity, as well as military maritimeac tivity.
|
关 键 词: | 机器人 感应装置 遗传算法 模糊神经 |
本文献已被 维普 等数据库收录! |
|