Reinforced feature points: Optimizing feature detection and description for a high-level task

Aritra Bhowmik; Stefan Gumhold; Carsten Rother; Eric Brachmann

doi:10.1109/CVPR42600.2020.00500

Reinforced feature points: Optimizing feature detection and description for a high-level task

Research output: Contribution to book/Conference proceedings/Anthology/Report › Conference contribution › Contributed › peer-review

Contributors

Aritra Bhowmik - , Chair of Computer Graphics and Visualisation (Author)
Stefan Gumhold - , Chair of Computer Graphics and Visualisation, Clusters of Excellence CeTI: Centre for Tactile Internet (Author)
Carsten Rother - , Heidelberg University (Author)
Eric Brachmann - , Heidelberg University (Author)

Abstract

We address a core problem of computer vision: Detection and description of 2D feature points for image matching. For a long time, hand-crafted designs, like the seminal SIFT algorithm, were unsurpassed in accuracy and efficiency. Recently, learned feature detectors emerged that implement detection and description using neural networks. Training these networks usually resorts to optimizing low-level matching scores, often pre-defining sets of image patches which should or should not match, or which should or should not contain key points. Unfortunately, increased accuracy for these low-level matching scores does not necessarily translate to better performance in high-level vision tasks. We propose a new training methodology which embeds the feature detector in a complete vision pipeline, and where the learnable parameters are trained in an end-to-end fashion. We overcome the discrete nature of key point selection and descriptor matching using principles from reinforcement learning. As an example, we address the task of relative pose estimation between a pair of images. We demonstrate that the accuracy of a state-of-the-art learning-based feature detector can be increased when trained for the task it is supposed to solve at test time. Our training methodology poses little restrictions on the task to learn, and works for any architecture which predicts key point heat maps, and descriptors for key point locations.

Details

Original language	English
Title of host publication	2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
Pages	4947-4956
Number of pages	10
ISBN (electronic)	978-1-7281-7168-5
Publication status	Published - 2020
Peer-reviewed	Yes

Publication series

Series	Conference on Computer Vision and Pattern Recognition (CVPR)
ISSN	1063-6919

Conference

Title	2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020
Duration	14 - 19 June 2020
City	Virtual, Online
Country	United States of America

External IDs

Scopus	85089152727

Keywords

ASJC Scopus subject areas

Software
Computer Vision and Pattern Recognition

Research Portal of the TU Dresden