Zhiqiang Hu
- Papers: 14
Authored papers (14)
From Perception to Action: An Interactive Benchmark for Vision Reasoning
arXiv 2026
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding
arXiv 2025
Evolving Prompts In-Context: An Open-ended, Self-replicating Perspective
arXiv 2025
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
arXiv 2025
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models
arXiv 2025
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency
arXiv 2025
Breaking the Memory Barrier: Near Infinite Batch Size Scaling for Contrastive Loss
arXiv 2024
GalleryGPT: Analyzing Paintings with Large Multimodal Models
arXiv 2024
All in an Aggregated Image for In-Image Learning
arXiv 2024
SeaLLMs 3: Open Foundation and Chat Multilingual Large Language Models for Southeast Asian Languages
arXiv 2024
InstructAV: Instruction Fine-tuning Large Language Models for Authorship Verification
arXiv 2024
Plan-and-Solve Prompting: Improving Zero-Shot Chain-of-Thought Reasoning by Large Language Models
arXiv 2023
Adaptive Supervised PatchNCE Loss for Learning H&E-to-IHC Stain Translation with Inconsistent Groundtruth Image Pairs
arXiv 2023
SeaLLMs -- Large Language Models for Southeast Asia
arXiv 2023
Frequent co-authors
10 from 14 papers