Predictive Representations For Sequential Decision Making Under Uncertainty

Boularias, A

Predictive Representations For Sequential Decision Making Under Uncertainty

Boularias, A. (2010). Predictive Representations For Sequential Decision Making Under Uncertainty. PhD Thesis, Université Laval, Québec, Canada.

Item is 公開

表示: 全項目非表示: 全項目

基本情報

表示: 非表示:

アイテムのパーマリンク: https://hdl.handle.net/11858/00-001M-0000-0013-BF5C-D 版のパーマリンク: https://hdl.handle.net/21.11116/0000-0002-AC91-8

資料種別: 学位論文

ファイル

表示: ファイル

非表示: ファイル

:

Boularias-Thesis_[0].pdf (出版社版), 2MB

表示保存

ファイルのパーマリンク:
https://hdl.handle.net/21.11116/0000-0002-AC8B-0

ファイル名:
Boularias-Thesis_[0].pdf

説明:
-

OA-Status:

閲覧制限:
公開

MIMEタイプ / チェックサム:
application/pdf / [MD5]

技術的なメタデータ:

表示

著作権日付:
-

著作権情報:
-

CCライセンス:
-

作成者

表示:

非表示:

作成者:
Boularias, A¹, 著者

所属:
1External Organizations, ou_persistent22

内容説明

表示:

非表示:

キーワード: -

要旨: The problem of making decisions is ubiquitous in life. This problem becomes even more
complex when the decisions should be made sequentially. In fact, the execution of an action
at a given time leads to a change in the environment of the problem, and this change cannot be
predicted with certainty. The aim of a decision-making process is to optimally select actions
in an uncertain environment. To this end, the environment is often modeled as a dynamical
system with multiple states, and the actions are executed so that the system evolves toward
a desirable state.
In this thesis, we proposed a family of stochastic models and algorithms in order to improve
the quality of of the decision-making process. The proposed models are alternative to Markov
Decision Processes, a largely used framework for this type of problems.
In particular, we showed that the state of a dynamical system can be represented more
compactly if it is described in terms of predictions of certain future events. We also showed
that even the cognitive process of selecting actions, known as policy, can be seen as a dynamical
system. Starting from this observation, we proposed a panoply of algorithms, all based on
predictive policy representations, in order to solve different problems of decision-making, such
as decentralized planning, reinforcement learning, or imitation learning.
We also analytically and empirically demonstrated that the proposed approaches lead to
a decrease in the computational complexity and an increase in the quality of the decisions,
compared to standard approaches for planning and learning under uncertainty.

資料詳細

表示:

非表示:

言語:

日付: 出版: 2010-07

出版の状態: 出版

ページ: 192

出版情報: Québec, Canada : Université Laval

目次: -

査読: -

識別子（DOI, ISBNなど）: BibTex参照ID: 6833

学位: 博士号 (PhD)

アイテム詳細

基本情報

ファイル

関連URL

作成者

内容説明

資料詳細

関連イベント

訴訟

Project information

出版物