Shenghua Gao
- Papers
- 23
Cite
Notes
Only stored in your browser.
Authored papers
23LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning
arXiv 2026
OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents
arXiv 2026
SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models
arXiv 2025
Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs
arXiv 2025
Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos
arXiv 2025
2D Gaussian Splatting for Geometrically Accurate Radiance Fields
arXiv 2024
CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM
arXiv 2024
InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models
arXiv 2024
MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning
arXiv 2024
3D StreetUnveiler with Semantic-Aware 2DGS
arXiv 2024
Scaling Mesh Generation via Compressive Tokenization
CVPR 2025 1
FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction
ICCV 2025
MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis
arXiv 2024
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
michelangelo-conditional-3d-shape-generation
DebSDF: Delving into the Details and Bias of Neural Indoor Scene Reconstruction
arXiv 2023
LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation
ICCV 2023 1
Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos
CVPR 2023 1
TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting
CVPR 2022 1
PREF: Phasorial Embedding Fields for Compact Neural Representations
arXiv 2022
Dual-Space NeRF: Learning Animatable Avatars and Scene Lighting in Separate Spaces
arXiv 2022
MOS: A Low Latency and Lightweight Framework for Face Detection, Landmark Localization, and Head Pose Estimation
arXiv 2021
Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling
ECCV 2020 8
Single-Image Piece-wise Planar 3D Reconstruction via Associative Embedding
single-image-piece-wise-planar-3d-1
Affiliations
Frequent co-authors
10from 23 papers