The effects of bounding rationality on the performance and learning of CHREST agents in Tileworld

Martyn Lloyd-Kelly, Peter Lane, Fernand Gobet

Research output: Chapter in Book/Report/Conference proceedingChapter (peer-reviewed)peer-review


Learning in complex and complicated domains is fundamental to performing suitable and timely actions within them. The ability of chess masters to learn and recall huge numbers of board configurations to produce near-optimal actions provides evidence that chunking mechanisms are likely to underpin human learning. Cognitive theories based on chunking argue in favour for the notion of bounded rationality since relatively small chunks of information are learnt in comparison to the total information present in the environment. CHREST, a computational architecture that implements chunking theory, has previously been used to investigate learning in deterministic environments such as chess, where future states are solely dependent upon the actions of agents. In this paper, the CHREST architecture is implemented in agents situated in “Tileworld”, a stochastic environment whose future state depends on both the actions of agents and factors intrinsic to the environment which agents have no control over. The effects of bounding agents’ visual input on learning and performance in various scenarios where the complexity of Tileworld is altered is analysed using computer simulations. Our results show that interactions between independent variables are complex and have important implications for agents situated in stochastic environments where a balance must be struck between learning and performance.
Original languageEnglish
Title of host publicationResearch and Development in Intelligent Systems XXXI
Subtitle of host publicationIncorporating Applications and Innovations in Intelligent Systems XXII
PublisherSpringer Nature
Number of pages14
ISBN (Electronic)978-3-319-12069-0
ISBN (Print)978-3-319-12068-3
Publication statusPublished - 2014


Dive into the research topics of 'The effects of bounding rationality on the performance and learning of CHREST agents in Tileworld'. Together they form a unique fingerprint.

Cite this