End-to-End Learning of Deep Structured Models for Semantic Segmentation
Licentiatavhandling, 2018
This thesis summarizes the content of three papers, all of them presenting solutions to semantic segmentation problems. The applications have varied widely and several different types of data have been considered, ranging from 3D CT images to RGB images of horses. The main focus has been on developing robust and accurate models to solve these problems. The models consist of a CNN capable of learning complex image features coupled with a CRF capable of learning dependencies between output variables. Emphasis has been on creating models that are possible to train end-to-end, as well as developing corresponding optimization methods needed to enable efficient training. End-to-end training gives the CNN and the CRF a chance to learn how to interact and exploit complementary information to achieve better performance.
Semantic segmentation
deep structured models
supervised learning
convolutional neural networks
conditional random fields
Författare
Måns Larsson
Chalmers, Elektroteknik, Signalbehandling och medicinsk teknik
Styrkeområden
Informations- och kommunikationsteknik
Ämneskategorier
Datorseende och robotik (autonoma system)
Utgivare
Chalmers
Room EL43, Hörsalsvägen 11, Campus Johanneberg
Opponent: Professor Hossein Azizpour, KTH, Sweden