End-to-end Learning for Graph Decomposition

Research output: Contribution to book/Conference proceedings/Anthology/ReportConference contributionContributedpeer-review

Contributors

  • Jie Song - , ETH Zurich (Author)
  • Bjoern Andres - , Bosch Center for Artificial Intelligence, University of Tübingen (Author)
  • Michael J Black - , Max Planck Institute for Intelligent Systems (Author)
  • Otmar Hilliges - , ETH Zurich (Author)
  • Siyu Tang - , ETH Zurich, Max Planck Institute for Intelligent Systems, University of Tübingen (Author)

Abstract

Deep neural networks provide powerful tools for pattern recognition, while classical graph algorithms are widely used to solve combinatorial problems. In computer vision, many tasks combine elements of both pattern recognition and graph reasoning. In this paper, we study how to connect deep networks with graph decomposition into an end-to-end trainable framework. More specifically, the minimum cost multicut problem is first converted to an unconstrained binary cubic formulation where cycle consistency constraints are incorporated into the objective function. The new optimization problem can be viewed as a Conditional Random Field (CRF) in which the random variables are associated with the binary edge labels. Cycle constraints are introduced into the CRF as high-order potentials. A standard Convolutional Neural Network (CNN) provides the front-end features for the fully differentiable CRF. The parameters of both parts are optimized in an end-to-end manner. The efficacy of the proposed learning algorithm is demonstrated via experiments on clustering MNIST images and on the challenging task of real-world multi-people pose estimation.

Details

Original languageEnglish
Title of host publication2019 IEEE/CVF International Conference on Computer Vision (ICCV)
Pages10092-10101
Publication statusPublished - 2019
Peer-reviewedYes
Externally publishedYes

External IDs

ORCID /0000-0001-5036-9162/work/143781899
Scopus 85081910211