Shijian Lu
Authored papers (25)
The Last Human-Written Paper: Agent-Native Research Artifacts
arXiv 2026
ParaVT: Taming the Tool Prior Paradox for Parallel Tool Use in Agentic Video Reinforcement Learning
arXiv 2026
RynnBrain: Open Embodied Foundation Models
arXiv 2026
R1-VL: Learning to Reason with Multimodal Large Language Models via Step-wise Group Relative Policy Optimization
arXiv 2025
LongVT: Incentivizing "Thinking with Long Videos" via Native Tool Calling
arXiv 2025
Rolling Forcing: Autoregressive Long Video Diffusion in Real Time
arXiv 2025
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
arXiv 2025
PacGDC: Label-Efficient Generalizable Depth Completion with Projection Ambiguity and Consistency
ICCV 2025
AGLA: Mitigating Object Hallucinations in Large Vision-Language Models with Assembly of Global and Local Attention
arXiv 2024
The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio
arXiv 2024
Novel View Extrapolation with Video Diffusion Priors
arXiv 2024
MMRel: A Relation Understanding Benchmark in the MLLM Era
arXiv 2024
Mitigating Object Hallucination via Concentric Causal Attention
arXiv 2024
Cross-Domain Few-Shot Segmentation via Iterative Support-Query Correspondence Mining
CVPR 2024
CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model
arXiv 2024
Segment Anything with Multiple Modalities
arXiv 2024
Vision-Language Models for Vision Tasks: A Survey
arXiv 2023
Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation
AI-Generated Images as Data Source: The Dawn of Synthetic Era
arXiv 2023
StyleRF: Zero-shot 3D Style Transfer of Neural Radiance Fields
CVPR 2023
Weakly Supervised 3D Open-vocabulary Segmentation
3D Semantic Segmentation in the Wild: Learning Generalized Models for Adverse-Condition Point Clouds
CVPR 2023
Black-box Unsupervised Domain Adaptation with Bi-directional Atkinson-Shiffrin Memory
ICCV 2023
Domain Adaptive Video Segmentation via Temporal Pseudo Supervision
arXiv 2022
Multimodal Image Synthesis and Editing: The Generative AI Era
arXiv 2021
Frequent co-authors
10 from 25 papers