Lichao Sun
- Papers
- 46
Cite
Notes
Only stored in your browser.
Authored papers
46Horizon-LM: A RAM-Centric Architecture for LLM Training
arXiv 2026
Expert Threshold Routing for Autoregressive Language Modeling with Dynamic Computation Allocation and Load Balancing
arXiv 2026
Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models
arXiv 2026
Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math
arXiv 2026
AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery
arXiv 2026
Graph-of-Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills
arXiv 2026
MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU
arXiv 2026
IDER: IDempotent Experience Replay for Reliable Continual Learning
arXiv 2026
LiveMedBench: A Contamination-Free Medical Benchmark for LLMs with Automated Rubric Evaluation
arXiv 2026
NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes
arXiv 2025
Generative AI for Autonomous Driving: Frontiers and Opportunities
arXiv 2025
Large Language Models Post-training: Surveying Techniques from Alignment to Reasoning
arXiv 2025
TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention
arXiv 2025
MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks
arXiv 2025
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective
arXiv 2025
EfficientLLM: Efficiency in Large Language Models
arXiv 2025
SAMed-2: Selective Memory Enhanced Medical Segment Anything Model
arXiv 2025
Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?
arXiv 2025
Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning
arXiv 2025
LLaVA-CoT: Let Vision Language Models Reason Step-by-Step
arXiv 2024
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
arXiv 2024
TrustLLM: Trustworthiness in Large Language Models
arXiv 2024
MLP-KAN: Unifying Deep Representation and Function Learning
arXiv 2024
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
arXiv 2024
GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding
arXiv 2024
MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark
arXiv 2024
GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations
arXiv 2024
UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models
arXiv 2024
Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination
arXiv 2024
BenTo: Benchmark Task Reduction with In-Context Transferability
arXiv 2024
HonestLLM: Toward an Honest and Helpful Large Language Model
arXiv 2024
Biomedical SAM 2: Segment Anything in Biomedical Images and Videos
arXiv 2024
Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset
arXiv 2024
LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?
arXiv 2024
Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination
arXiv 2024
TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones
arXiv 2023
BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks
arXiv 2023
AlignBench: Benchmarking Chinese Alignment of Large Language Models
arXiv 2023
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
arXiv 2023
DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4
arXiv 2023
Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V
arXiv 2023
InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4
arXiv 2023
Benchmarking and Analyzing Robust Point Cloud Recognition: Bag of Tricks for Defending Adversarial Examples
ICCV 2023 1
A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT
arXiv 2023
A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning
arXiv 2023
ArtGPT-4: Towards Artistic-understanding Large Vision-Language Models with Enhanced Adapter
arXiv 2023
Affiliations
Frequent co-authors
10from 46 papers