Stephen Gould
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13DiSA: Diffusion Step Annealing in Autoregressive Image Generation
arXiv 2025
ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models
arXiv 2025
Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection
arXiv 2024
Negative Token Merging: Image-based Adversarial Feature Guidance
arXiv 2024
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?
arXiv 2024
3D-GPT: Procedural 3D Modeling with Large Language Models
arXiv 2023
Scaling Data Generation in Vision-and-Language Navigation
ICCV 2023 1
Exploring Predicate Visual Context in Detecting Human-Object Interactions
ICCV 2023 1
Learning Navigational Visual Representations with Semantic Map Supervision
ICCV 2023 1
Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer
CVPR 2022 1
A Recurrent Vision-and-Language BERT for Navigation
arXiv 2020
Spatially Conditioned Graphs for Detecting Human-Object Interactions
ICCV 2021 10
Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering
bottom-up-and-top-down-attention-for-image-1
Affiliations
Frequent co-authors
10from 13 papers