Learning to Segment in Images and Videos with Different Forms of Supervision

Khoreva, Anna; Schiele, Bernt; Szeliski, Richard; Brox, Thomas

doi:10.22028/D291-26995

詳細要約

Learning to Segment in Images and Videos with Different Forms of Supervision

Khoreva, A., Schiele, B., Szeliski, R., & Brox, T. (2017). Learning to Segment in Images and Videos with Different Forms of Supervision. PhD Thesis, Universität des Saarlandes, Saarbrücken.

Item is 公開

表示: 全項目非表示: 全項目

基本情報

表示: 非表示:

アイテムのパーマリンク: https://hdl.handle.net/21.11116/0000-0000-293F-D 版のパーマリンク: https://hdl.handle.net/21.11116/0000-0000-33DB-0

資料種別: 学位論文

ファイル

表示: ファイル

非表示: ファイル

:

thesis.pdf (出版社版), 60MB

ファイルのパーマリンク:
-

ファイル名:
thesis.pdf

説明:
-

OA-Status:

閲覧制限:
制限付き (Max Planck Institute for Informatics, MSIN; )

MIMEタイプ / チェックサム:
application/pdf

技術的なメタデータ:

著作権日付:
-

著作権情報:
-

CCライセンス:
-

作成者

表示:

非表示:

作成者:
Khoreva, Anna^{1, 2}, 著者
Schiele, Bernt¹, 著者
Szeliski, Richard¹, 著者
Brox, Thomas³, 著者

所属:
1Computer Vision and Multimodal Computing, MPI for Informatics, Max Planck Society, ou_1116547
2International Max Planck Research School, MPI for Informatics, Max Planck Society, Campus E1 4, 66123 Saarbrücken, DE, ou_1116551
3External Organizations, ou_persistent22

内容説明

表示:

非表示:

キーワード: -

要旨: Much progress has been made in image and video segmentation over the last years. To a large extent, the success can be attributed to the strong appearance models completely learned from data, in particular using deep learning methods. However,to perform best these methods require large representative datasets for training with expensive pixel-level annotations, which in case of videos are prohibitive to obtain. Therefore, there is a need to relax this constraint and to consider alternative forms of supervision, which are easier and cheaper to collect. In this thesis, we aim to develop algorithms for learning to segment in images and videos with different levels of supervision. First, we develop approaches for training convolutional networks with weaker forms of supervision, such as bounding boxes or image labels, for object boundary estimation and semantic/instance labelling tasks. We propose to generate pixel-level approximate groundtruth from these weaker forms of annotations to train a network, which allows to achieve high-quality results comparable to the full supervision quality without any modifications of the network architecture or the training procedure. Second, we address the problem of the excessive computational and memory costs inherent to solving video segmentation via graphs. We propose approaches to improve the runtime and memory efficiency as well as the output segmentation quality by learning from the available training data the best representation of the graph. In particular, we contribute with learning must-link constraints, the topology and edge weights of the graph as well as enhancing the graph nodes - superpixels - themselves. Third, we tackle the task of pixel-level object tracking and address the problem of the limited amount of densely annotated video data for training convolutional networks. We introduce an architecture which allows training with static images only and propose an elaborate data synthesis scheme which creates a large number of training examples close to the target domain from the given first frame mask. With the proposed techniques we show that densely annotated consequent video data is not necessary to achieve high-quality temporally coherent video segmentationresults. In summary, this thesis advances the state of the art in weakly supervised image segmentation, graph-based video segmentation and pixel-level object tracking and contributes with the new ways of training convolutional networks with a limited amount of pixel-level annotated training data.

資料詳細

表示:

非表示:

言語: eng - English

日付: 受理: 2017-12-20オンライン出版: 2017出版: 2017

出版の状態: 出版

ページ: 247 p.

出版情報: Saarbrücken : Universität des Saarlandes

目次: -

査読: -

識別子（DOI, ISBNなど）: BibTex参照ID: Khorevaphd2017
DOI: 10.22028/D291-26995
URN: urn:nbn:de:bsz:291-scidok-ds-269954

学位: 博士号 (PhD)

アイテム詳細

基本情報

ファイル

関連URL

作成者

内容説明

資料詳細

関連イベント

訴訟

Project information

出版物