End-to-end Learning for Graph Decomposition

Publikation: Beitrag in Buch/Konferenzbericht/Sammelband/GutachtenBeitrag in KonferenzbandBeigetragenBegutachtung

Beitragende

  • Jie Song - , ETH Zürich (Autor:in)
  • Bjoern Andres - , Bosch Center for Artificial Intelligence, Eberhard Karls Universität Tübingen (Autor:in)
  • Michael J Black - , Max-Planck-Institut für Intelligente Systeme (Autor:in)
  • Otmar Hilliges - , ETH Zürich (Autor:in)
  • Siyu Tang - , ETH Zürich, Max-Planck-Institut für Intelligente Systeme, Eberhard Karls Universität Tübingen (Autor:in)

Abstract

Deep neural networks provide powerful tools for pattern recognition, while classical graph algorithms are widely used to solve combinatorial problems. In computer vision, many tasks combine elements of both pattern recognition and graph reasoning. In this paper, we study how to connect deep networks with graph decomposition into an end-to-end trainable framework. More specifically, the minimum cost multicut problem is first converted to an unconstrained binary cubic formulation where cycle consistency constraints are incorporated into the objective function. The new optimization problem can be viewed as a Conditional Random Field (CRF) in which the random variables are associated with the binary edge labels. Cycle constraints are introduced into the CRF as high-order potentials. A standard Convolutional Neural Network (CNN) provides the front-end features for the fully differentiable CRF. The parameters of both parts are optimized in an end-to-end manner. The efficacy of the proposed learning algorithm is demonstrated via experiments on clustering MNIST images and on the challenging task of real-world multi-people pose estimation.

Details

OriginalspracheEnglisch
Titel2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Seiten10092-10101
PublikationsstatusVeröffentlicht - 2019
Peer-Review-StatusJa
Extern publiziertJa

Externe IDs

ORCID /0000-0001-5036-9162/work/143781899
Scopus 85081910211