Shenghua Gao

Papers
23

Authored papers

23

LatentChem: From Textual CoT to Latent Thinking in Chemical Reasoning

arXiv 2026

2026

OpenSearch-VL: An Open Recipe for Frontier Multimodal Search Agents

arXiv 2026

2026

SRUM: Fine-Grained Self-Rewarding for Unified Multimodal Models

arXiv 2025

2025

Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs

arXiv 2025

2025

Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos

arXiv 2025

2025

2D Gaussian Splatting for Geometrically Accurate Radiance Fields

arXiv 2024

2024

CAD-MLLM: Unifying Multimodality-Conditioned CAD Generation With MLLM

arXiv 2024

2024

InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

arXiv 2024

2024

MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning

arXiv 2024

2024

3D StreetUnveiler with Semantic-Aware 2DGS

arXiv 2024

2024

Scaling Mesh Generation via Compressive Tokenization

CVPR 2025

2024

FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction

ICCV 2025

2024

MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis

arXiv 2024

2024

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

2023

DebSDF: Delving into the Details and Bias of Neural Indoor Scene Reconstruction

arXiv 2023

2023

LivelySpeaker: Towards Semantic-Aware Co-Speech Gesture Generation

ICCV 2023

2023

Weakly Supervised Video Representation Learning with Unaligned Text for Sequential Videos

CVPR 2023

2023

TransRAC: Encoding Multi-scale Temporal Correlation with Transformers for Repetitive Action Counting

CVPR 2022

2022

PREF: Phasorial Embedding Fields for Compact Neural Representations

arXiv 2022

2022

Dual-Space NeRF: Learning Animatable Avatars and Scene Lighting in Separate Spaces

arXiv 2022

2022

MOS: A Low Latency and Lightweight Framework for Face Detection, Landmark Localization, and Head Pose Estimation

arXiv 2021

2021

Structured3D: A Large Photo-realistic Dataset for Structured 3D Modeling

ECCV 2020

2019

Single-Image Piece-wise Planar 3D Reconstruction via Associative Embedding

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 23 papers