End-to-End Learning of Deep Structured Models for Semantic Segmentation
Licentiate thesis, 2018
This thesis summarizes the content of three papers, all of them presenting solutions to semantic segmentation problems. The applications have varied widely and several different types of data have been considered, ranging from 3D CT images to RGB images of horses. The main focus has been on developing robust and accurate models to solve these problems. The models consist of a CNN capable of learning complex image features coupled with a CRF capable of learning dependencies between output variables. Emphasis has been on creating models that are possible to train end-to-end, as well as developing corresponding optimization methods needed to enable efficient training. End-to-end training gives the CNN and the CRF a chance to learn how to interact and exploit complementary information to achieve better performance.
Semantic segmentation
deep structured models
supervised learning
convolutional neural networks
conditional random fields
Author
Måns Larsson
Chalmers, Electrical Engineering, Signal Processing and Biomedical Engineering
Areas of Advance
Information and Communication Technology
Subject Categories
Computer Vision and Robotics (Autonomous Systems)
Publisher
Chalmers
Room EL43, Hörsalsvägen 11, Campus Johanneberg
Opponent: Professor Hossein Azizpour, KTH, Sweden