Abstract
Uncertainty quantification in image retrieval is crucial for downstream
decisions, yet it remains a challenging and largely unexplored problem. Current
methods for estimating uncertainties are poorly calibrated, computationally
expensive, or based on heuristics. We present a new method that views image
embeddings as stochastic features rather than deterministic features. Our two
main contributions are (1) a likelihood that matches the triplet constraint and
that evaluates the probability of an anchor being closer to a positive than a
negative; and (2) a prior over the feature space that justifies the
conventional l2 normalization. To ensure computational efficiency, we derive a
variational approximation of the posterior, called the Bayesian triplet loss,
that produces state-of-the-art uncertainty estimates and matches the predictive
performance of current state-of-the-art methods.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of 2021 International Conference on Computer Vision |
| Number of pages | 11 |
| Publication date | 2021 |
| Publication status | Published - 2021 |
| Event | 2021 International Conference on Computer Vision - Virtual event Duration: 11 Oct 2021 → 17 Oct 2021 https://iccv2021.thecvf.com/ |
Conference
| Conference | 2021 International Conference on Computer Vision |
|---|---|
| Location | Virtual event |
| Period | 11/10/2021 → 17/10/2021 |
| Internet address |