Hai Zhao
- Papers
- 34
Cite
Notes
Only stored in your browser.
Authored papers
34Remember Me, Refine Me: A Dynamic Procedural Memory Framework for Experience-Driven Agent Evolution
arXiv 2025
BriLLM: Brain-inspired Large Language Model
arXiv 2025
Towards Enhanced Immersion and Agency for LLM-based Interactive Drama
arXiv 2025
MASKSEARCH: A Universal Pre-Training Framework to Enhance Agentic Search Capability
arXiv 2025
Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization
arXiv 2025
Plan-over-Graph: Towards Parallelable LLM Agent Schedule
arXiv 2025
Vript: A Video Is Worth Thousands of Words
arXiv 2024
Keep the Cost Down: A Review on Methods to Optimize LLM' s KV-Cache Consumption
arXiv 2024
DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems
arXiv 2024
KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing
arXiv 2024
Dissecting Human and LLM Preferences
arXiv 2024
On the Robustness of Editing Large Language Models
arXiv 2024
SirLLM: Streaming Infinite Retentive LLM
arXiv 2024
CoCo-Agent: A Comprehensive Cognitive MLLM Agent for Smartphone GUI Automation
arXiv 2024
Multi-modal Auto-regressive Modeling via Visual Words
arXiv 2024
Caution for the Environment: Multimodal Agents are Susceptible to Environmental Distractions
arXiv 2024
VHASR: A Multimodal Speech Recognition System With Vision Hotwords
arXiv 2024
Multimodal Chain-of-Thought Reasoning in Language Models
arXiv 2023
CSPRD: A Financial Policy Retrieval Dataset for Chinese Stock Market
arXiv 2023
Chinese Spelling Correction as Rephrasing Language Model
arXiv 2023
CMMLU: Measuring massive multitask language understanding in Chinese
arXiv 2023
Igniting Language Intelligence: The Hitchhiker's Guide From Chain-of-Thought Reasoning to Language Agents
arXiv 2023
Generative Judge for Evaluating Alignment
arXiv 2023
RefGPT: Dialogue Generation of GPT, by GPT, and for GPT
arXiv 2023
Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language Models
arXiv 2023
Enhancing Visually-Rich Document Understanding via Layout Structure Modeling
arXiv 2023
FSUIE: A Novel Fuzzy Span Mechanism for Universal Information Extraction
arXiv 2023
AutoHall: Automated Hallucination Dataset Generation for Large Language Models
arXiv 2023
Learning Better Masking for Better Language Model Pre-training
arXiv 2022
Machine Reading Comprehension: The Role of Contextualized Language Models and Beyond
arXiv 2020
Topic-Aware Multi-turn Dialogue Modeling
arXiv 2020
Semantics-aware BERT for Language Understanding
arXiv 2019
Attention Is All You Need for Chinese Word Segmentation
EMNLP 2020 11
Modeling Multi-turn Conversation with Deep Utterance Aggregation
modeling-multi-turn-conversation-with-deep-2
Affiliations
Frequent co-authors
10from 34 papers