학술논문

State space construction for behavior acquisition in multi agent environments with vision and action
Document Type
Conference
Source
Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271) Computer vision Computer Vision, 1998. Sixth International Conference on. :870-875 1998
Subject
Computing and Processing
Signal Processing and Analysis
State-space methods
Robot sensing systems
Learning
Robot vision systems
Computer vision
Control theory
Adaptive systems
System identification
Information analysis
State estimation
Language
Abstract
This paper proposes a method which estimates the relationships between learner's behaviors and other agents' ones in the environment through interactions (observation and action) using the method of system identification. In order to identify the model of each agent, Akaike's Information Criterion is applied to the results of Canonical Variate Analysis for the relationship between the observed data in terms of action and future observation. Next, reinforcement learning based on the estimated state vectors is performed to obtain the optimal behavior. The proposed method is applied to a soccer playing situation, where a rolling ball and other moving agents are well modeled and the learner's behaviors are successfully acquired by the method. Computer simulations and real experiments are shown and a discussion is given.