We approach structured output prediction by optimizing a deep value network (DVN) to precisely estimate the task loss on different output configurations for a given input. Once the model is trained, we perform inference by gradient descent on the continuous relaxations of the output variables to find outputs with promising scores from the value network. When applied to image segmentation, the value network takes an image and a segmentation mask as inputs and predicts a scalar estimating the intersection over union between the input and ground truth masks. For multi-label classification, the DVN's objective is to correctly predict the F1 score for any potential label configuration. The DVN framework achieves the state-of-the-art results on multi-label prediction and image segmentation benchmarks.
Deep Value Networks Learn to Evaluate and Iteratively Refine Structured Outputs
Michael Gygli,Mohammad Norouzi,A. Angelova
Published 2017 in International Conference on Machine Learning
ABSTRACT
PUBLICATION RECORD
- Publication year
2017
- Venue
International Conference on Machine Learning
- Publication date
2017-03-13
- Fields of study
Computer Science
- Identifiers
- External record
- Source metadata
Semantic Scholar
CITATION MAP
EXTRACTION MAP
CLAIMS
- No claims are published for this paper.
CONCEPTS
- No concepts are published for this paper.
REFERENCES
Showing 1-58 of 58 references · Page 1 of 1
CITED BY
Showing 1-68 of 68 citing papers · Page 1 of 1