日本語
 
Help Privacy Policy ポリシー/免責事項
  詳細検索ブラウズ

アイテム詳細

登録内容を編集ファイル形式で保存
 
 
ダウンロード電子メール
  Predictive Representations For Sequential Decision Making Under Uncertainty

Boularias, A. (2010). Predictive Representations For Sequential Decision Making Under Uncertainty. PhD Thesis, Université Laval, Québec, Canada.

Item is

基本情報

表示: 非表示:
資料種別: 学位論文

ファイル

表示: ファイル
非表示: ファイル
:
Boularias-Thesis_[0].pdf (出版社版), 2MB
ファイルのパーマリンク:
https://hdl.handle.net/21.11116/0000-0002-AC8B-0
ファイル名:
Boularias-Thesis_[0].pdf
説明:
-
OA-Status:
閲覧制限:
公開
MIMEタイプ / チェックサム:
application/pdf / [MD5]
技術的なメタデータ:
著作権日付:
-
著作権情報:
-
CCライセンス:
-

関連URL

表示:

作成者

表示:
非表示:
 作成者:
Boularias, A1, 著者           
所属:
1External Organizations, ou_persistent22              

内容説明

表示:
非表示:
キーワード: -
 要旨: The problem of making decisions is ubiquitous in life. This problem becomes even more
complex when the decisions should be made sequentially. In fact, the execution of an action
at a given time leads to a change in the environment of the problem, and this change cannot be
predicted with certainty. The aim of a decision-making process is to optimally select actions
in an uncertain environment. To this end, the environment is often modeled as a dynamical
system with multiple states, and the actions are executed so that the system evolves toward
a desirable state.
In this thesis, we proposed a family of stochastic models and algorithms in order to improve
the quality of of the decision-making process. The proposed models are alternative to Markov
Decision Processes, a largely used framework for this type of problems.
In particular, we showed that the state of a dynamical system can be represented more
compactly if it is described in terms of predictions of certain future events. We also showed
that even the cognitive process of selecting actions, known as policy, can be seen as a dynamical
system. Starting from this observation, we proposed a panoply of algorithms, all based on
predictive policy representations, in order to solve different problems of decision-making, such
as decentralized planning, reinforcement learning, or imitation learning.
We also analytically and empirically demonstrated that the proposed approaches lead to
a decrease in the computational complexity and an increase in the quality of the decisions,
compared to standard approaches for planning and learning under uncertainty.

資料詳細

表示:
非表示:
言語:
 日付: 2010-07
 出版の状態: 出版
 ページ: 192
 出版情報: Québec, Canada : Université Laval
 目次: -
 査読: -
 識別子(DOI, ISBNなど): BibTex参照ID: 6833
 学位: 博士号 (PhD)

関連イベント

表示:

訴訟

表示:

Project information

表示:

出版物

表示: