Letian Zhang
- Papers
- 8
Cite
Notes
Only stored in your browser.
Authored papers
8AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration
arXiv 2026
ClinSeekAgent: Automating Multimodal Evidence Seeking for Agentic Clinical Reasoning
arXiv 2026
Your Agent, Their Asset: A Real-World Safety Analysis of OpenClaw
arXiv 2026
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
arXiv 2025
Oasis: One Image is All You Need for Multimodal Instruction Data Synthesis
ICCV 2025
OpenVision 2: A Family of Generative Pretrained Visual Encoders for Multimodal Learning
arXiv 2025
ImageNet3D: Towards General-Purpose Object-Level 3D Understanding
arXiv 2024
Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning
arXiv 2024
Affiliations
Frequent co-authors
10from 8 papers