Stephen Gould

Papers: 13

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

13papers

Authored papers

DiSA: Diffusion Step Annealing in Autoregressive Image Generation

arXiv 2025

2025

ARINAR: Bi-Level Autoregressive Feature-by-Feature Generative Models

arXiv 2025

2025

Negative Token Merging: Image-based Adversarial Feature Guidance

arXiv 2024

2024

The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?

arXiv 2024

2024

Towards Optimal Feature-Shaping Methods for Out-of-Distribution Detection

arXiv 2024

2024

3D-GPT: Procedural 3D Modeling with Large Language Models

arXiv 2023

2023

Scaling Data Generation in Vision-and-Language Navigation

ICCV 2023 1

2023

Exploring Predicate Visual Context in Detecting Human-Object Interactions

ICCV 2023 1

2023

Learning Navigational Visual Representations with Semantic Map Supervision

ICCV 2023 1

2023

Efficient Two-Stage Detection of Human-Object Interactions with a Novel Unary-Pairwise Transformer

CVPR 2022 1

2021

A Recurrent Vision-and-Language BERT for Navigation

arXiv 2020

2020

Spatially Conditioned Graphs for Detecting Human-Object Interactions

ICCV 2021 10

2020

Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering

bottom-up-and-top-down-attention-for-image-1

2017

Affiliations

No known affiliations.

Frequent co-authors

from 13 papers

Liang Zheng

Qinyu Zhao

Akshay Asthana

Dylan Campbell

Frederic Z. Zhang

Ming Xu

Yicong Hong

Hao Tan

Jaskirat Singh

Kartik Gupta