Guo Chen
- Papers: 12
Authored papers (12)
Towards Multimodal Lifelong Understanding: A Dataset and Agentic Baseline
arXiv 2026
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models
arXiv 2025
Advances in Speech Separation: Techniques, Challenges, and Future Trends
arXiv 2025
Bridging Perspectives: A Survey on Cross-view Collaborative Intelligence with Egocentric-Exocentric Vision
arXiv 2025
AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs
arXiv 2025
EagleVision: Object-level Attribute Multimodal LLM for Remote Sensing
arXiv 2025
Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding
arXiv 2024
Vinci: A Real-time Embodied Smart Assistant based on Egocentric Vision-Language Model
arXiv 2024
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios
arXiv 2024
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark
CVPR 2024
Memory-and-Anticipation Transformer for Online Action Understanding
ICCV 2023
FAST: Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
arXiv 2021
Frequent co-authors
10 from 12 papers