Ziyang Luo
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15From Storage to Experience: A Survey on the Evolution of LLM Agent Memory Mechanisms
arXiv 2026
GTA1: GUI Test-time Scaling Agent
arXiv 2025
AdamMeme: Adaptively Probe the Reasoning Capacity of Multimodal Large Language Models on Harmfulness
arXiv 2025
MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers
arXiv 2025
MM-CRITIC: A Holistic Evaluation of Large Multimodal Models as Multimodal Critique
arXiv 2025
VideoLLaMA 2: Advancing Spatial-Temporal Modeling and Audio Understanding in Video-LLMs
arXiv 2024
Aria-UI: Visual Grounding for GUI Instructions
arXiv 2024
VideoAutoArena: An Automated Arena for Evaluating Large Multimodal Models in Video Analysis through User Simulation
CVPR 2025 1
MMCode: Benchmarking Multimodal Large Language Models for Code Generation with Visually Rich Programming Problems
arXiv 2024
Towards Explainable Harmful Meme Detection through Multimodal Debate between Large Language Models
arXiv 2024
CodeHalu: Investigating Code Hallucinations in LLMs via Execution-based Verification
arXiv 2024
CofiPara: A Coarse-to-fine Paradigm for Multimodal Sarcasm Target Identification with Large Multimodal Models
arXiv 2024
CodeJudge-Eval: Can Large Language Models be Good Judges in Code Understanding?
arXiv 2024
ScratchEval: Are GPT-4o Smarter than My Child? Evaluating Large Multimodal Models with Visual Programming Challenges
arXiv 2024
WizardCoder: Empowering Code Large Language Models with Evol-Instruct
arXiv 2023
Affiliations
Frequent co-authors
10from 15 papers