In this work we propose a blackbox intervention method for visual dialog models, with the aim of assessing the contribution of individual linguistic or visual components. Concretely, we conduct structured or randomized interventions that aim to impair an individual component of the model, and observe changes in task performance. We reproduce a state-of-the-art visual dialog model and demonstrate that our methodology yields surprising insights, namely that both dialog and image information have minimal contributions to task performance. The intervention method presented here can be applied as a sanity check for the strength and robustness of each component in visual dialog systems.
Examining Cooperation in Visual Dialog Models
A blackbox intervention method for visual dialog models reveals that both dialog and image components contribute minimally to task performance, serving as a robustness check for visual dialog systems.
- Year
- 2017
- Venue
- arXiv 2017
- Authors
- 5
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/1712.01329ARXIV-DEFAULT
- TL;DR
- Semantic Scholar