Effective and Annotation Efficient Deep Learning for Image Understanding

Effective and Annotation Efficient Deep Learning for Image Understanding
Author :
Publisher :
Total Pages : 0
Release :
ISBN-10 : OCLC:1128269061
ISBN-13 :
Rating : 4/5 ( Downloads)

Book Synopsis Effective and Annotation Efficient Deep Learning for Image Understanding by : Spyridon Gidaris

Download or read book Effective and Annotation Efficient Deep Learning for Image Understanding written by Spyridon Gidaris and published by . This book was released on 2018 with total page 0 pages. Available in PDF, EPUB and Kindle. Book excerpt: Recent development in deep learning have achieved impressive results on image understanding tasks. However, designing deep learning architectures that will effectively solve the image understanding tasks of interest is far from trivial. Even more, the success of deep learning approaches heavily relies on the availability of large-size manually labeled (by humans) data. In this context, the objective of this dissertation is to explore deep learning based approaches for core image understanding tasks that would allow to increase the effectiveness with which they are performed as well as to make their learning process more annotation efficient, i.e., less dependent on the availability of large amounts of manually labeled training data. We first focus on improving the state-of-the-art on object detection. More specifically, we attempt to boost the ability of object detection systems to recognize (even difficult) object instances by proposing a multi-region and semantic segmentation-aware ConvNet-based representation that is able to capture a diverse set of discriminative appearance factors. Also, we aim to improve the localization accuracy of object detection systems by proposing iterative detection schemes and a novel localization model for estimating the bounding box of the objects. We demonstrate that the proposed technical novelties lead to significant improvements in the object detection performance of PASCAL and MS COCO benchmarks. Regarding the pixel-wise image labeling problem, we explored a family of deep neural network architectures that perform structured prediction by learning to (iteratively) improve some initial estimates of the output labels. The goal is to identify which is the optimal architecture for implementing such deep structured prediction models. In this context, we propose to decompose the label improvement task into three steps: 1) detecting the initial label estimates that are incorrect, 2) replacing the incorrect labels with new ones, and finally 3) refining the renewed labels by predicting residual corrections w.r.t. them. We evaluate the explored architectures on the disparity estimation task and we demonstrate that the proposed architecture achieves state-of-the-art results on the KITTI 2015 benchmark.In order to accomplish our goal for annotation efficient learning, we proposed a self-supervised learning approach that learns ConvNet-based image representations by training the ConvNet to recognize the 2d rotation that is applied to the image that it gets as input. We empirically demonstrate that this apparently simple task actually provides a very powerful supervisory signal for semantic feature learning. Specifically, the image features learned from this task exhibit very good results when transferred on the visual tasks of object detection and semantic segmentation, surpassing prior unsupervised learning approaches and thus narrowing the gap with the supervised case.Finally, also in the direction of annotation efficient learning, we proposed a novel few-shot object recognition system that after training is capable to dynamically learn novel categories from only a few data (e.g., only one or five training examples) while it does not forget the categories on which it was trained on. In order to implement the proposed recognition system we introduced two technical novelties, an attention based few-shot classification weight generator, and implementing the classifier of the ConvNet based recognition model as a cosine similarity function between feature representations and classification vectors. We demonstrate that the proposed approach achieved state-of-the-art results on relevant few-shot benchmarks.


Effective and Annotation Efficient Deep Learning for Image Understanding Related Books

Effective and Annotation Efficient Deep Learning for Image Understanding
Language: en
Pages: 0
Authors: Spyridon Gidaris
Categories:
Type: BOOK - Published: 2018 - Publisher:

DOWNLOAD EBOOK

Recent development in deep learning have achieved impressive results on image understanding tasks. However, designing deep learning architectures that will effe
Interpretable and Annotation-Efficient Learning for Medical Image Computing
Language: en
Pages: 292
Authors: Jaime Cardoso
Categories: Computers
Type: BOOK - Published: 2020-10-03 - Publisher: Springer Nature

DOWNLOAD EBOOK

This book constitutes the refereed joint proceedings of the Third International Workshop on Interpretability of Machine Intelligence in Medical Image Computing,
Deep Learning in Medical Image Analysis
Language: en
Pages: 184
Authors: Gobert Lee
Categories: Medical
Type: BOOK - Published: 2020-02-06 - Publisher: Springer Nature

DOWNLOAD EBOOK

This book presents cutting-edge research and applications of deep learning in a broad range of medical imaging scenarios, such as computer-aided diagnosis, imag
Deep Learning for Medical Image Analysis
Language: en
Pages: 544
Authors: S. Kevin Zhou
Categories: Computers
Type: BOOK - Published: 2023-11-23 - Publisher: Academic Press

DOWNLOAD EBOOK

Deep Learning for Medical Image Analysis, Second Edition is a great learning resource for academic and industry researchers and graduate students taking courses
Handbook of Research on Deep Learning-Based Image Analysis Under Constrained and Unconstrained Environments
Language: en
Pages: 381
Authors: Raj, Alex Noel Joseph
Categories: Computers
Type: BOOK - Published: 2020-12-25 - Publisher: IGI Global

DOWNLOAD EBOOK

Recent advancements in imaging techniques and image analysis has broadened the horizons for their applications in various domains. Image analysis has become an