dc.contributor.author | Bendersky, Diego | |
dc.contributor.author | Santos, Juan Miguel | |
dc.date.accessioned | 2023-11-23T12:07:09Z | |
dc.date.available | 2023-11-23T12:07:09Z | |
dc.date.issued | 2006 | |
dc.identifier.citation | Bendersky, D. Learning From The Environment With A Universal Reinforcement Function / Diego Bendersky, Juan Miguel Santos // International Conference on Neural Networks and Artificial Intelligence : proceedings, Brest, 31 May – 2 June, 2006 / Edited: V. Golovko [et al.]. – Brest : BSTU, 2006. – P. 182–186 : il. – Bibliogr.: p. 186 (12 titles). | ru_RU |
dc.identifier.uri | https://rep.bstu.by/handle/data/37186 | |
dc.description | Диего Бендерски, Сантос Хуан Мигель. Изучение Окружающей Среды С Универсальной Функцией Подкрепления | ru_RU |
dc.description.abstract | Traditionally, in Reinforcement Learning, the specification of the task Ls contained in the reinforcement function (RF), and ach new task requires the definition of a new RF. But in the nature, explicit reward signals are limited, and the characteristics of the environment afTects not only how animals perform particular tasks, but also what skills an animal will develop during its life. In this work, we propose a novel use of Reinforcement Learning that consist in the learning of different abilities or skills, based on the characteristics of the environment, using a fixed and universal reinforcement function. We also show a method to build a RF for a skill using information from the optimal policy learned in a particular environment and we prove that this method is correct, i.e., the RF constructed in this way produces the same optimal policy. | ru_RU |
dc.language.iso | en | ru_RU |
dc.subject | обучение | ru_RU |
dc.subject | training | ru_RU |
dc.subject | окружающая среда | ru_RU |
dc.subject | environment | ru_RU |
dc.title | Learning From The Environment With A Universal Reinforcement Function | ru_RU |
dc.title.alternative | Изучение Окружающей Среды С Универсальной Функцией Подкрепления | ru_RU |
dc.type | Научный доклад (Working Paper) | ru_RU |