Lichao Sun

Papers
46


Authored papers

46

Horizon-LM: A RAM-Centric Architecture for LLM Training

arXiv 2026

2026

Expert Threshold Routing for Autoregressive Language Modeling with Dynamic Computation Allocation and Load Balancing

arXiv 2026

2026

Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models

arXiv 2026

2026

Can MLLMs Read Students' Minds? Unpacking Multimodal Error Analysis in Handwritten Math

arXiv 2026

2026

AutoResearch AI: Towards AI-Powered Research Automation for Scientific Discovery

arXiv 2026

2026

Graph-of-Skills: Dependency-Aware Structural Retrieval for Massive Agent Skills

arXiv 2026

2026

MegaTrain: Full Precision Training of 100B+ Parameter Large Language Models on a Single GPU

arXiv 2026

2026

IDER: IDempotent Experience Replay for Reliable Continual Learning

arXiv 2026

2026

LiveMedBench: A Contamination-Free Medical Benchmark for LLMs with Automated Rubric Evaluation

arXiv 2026

2026

NodeRAG: Structuring Graph-based RAG with Heterogeneous Nodes

arXiv 2025

2025

Generative AI for Autonomous Driving: Frontiers and Opportunities

arXiv 2025

2025

Large Language Models Post-training: Surveying Techniques from Alignment to Reasoning

arXiv 2025

2025

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

arXiv 2025

2025

MMMR: Benchmarking Massive Multi-Modal Reasoning Tasks

arXiv 2025

2025

On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective

arXiv 2025

2025

EfficientLLM: Efficiency in Large Language Models

arXiv 2025

2025

SAMed-2: Selective Memory Enhanced Medical Segment Anything Model

arXiv 2025

2025

Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

arXiv 2025

2025

Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning

arXiv 2025

2025

LLaVA-CoT: Let Vision Language Models Reason Step-by-Step

arXiv 2024

2024

Mora: Enabling Generalist Video Generation via A Multi-Agent Framework

arXiv 2024

2024

TrustLLM: Trustworthiness in Large Language Models

arXiv 2024

2024

MLP-KAN: Unifying Deep Representation and Function Learning

arXiv 2024

2024

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

arXiv 2024

2024

GUI-World: A Video Benchmark and Dataset for Multimodal GUI-oriented Understanding

arXiv 2024

2024

MLLM-as-a-Judge: Assessing Multimodal LLM-as-a-Judge with Vision-Language Benchmark

arXiv 2024

2024

GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations

arXiv 2024

2024

UniGen: A Unified Framework for Textual Dataset Generation Using Large Language Models

arXiv 2024

2024

Both Text and Images Leaked! A Systematic Analysis of Multimodal LLM Data Contamination

arXiv 2024

2024

BenTo: Benchmark Task Reduction with In-Context Transferability

arXiv 2024

2024

HonestLLM: Toward an Honest and Helpful Large Language Model

arXiv 2024

2024

Biomedical SAM 2: Segment Anything in Biomedical Images and Videos

arXiv 2024

2024

Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset

arXiv 2024

2024

LLM-as-a-Coauthor: Can Mixed Human-Written and Machine-Generated Text Be Detected?

arXiv 2024

2024

Thinking Before Looking: Improving Multimodal LLM Reasoning via Mitigating Visual Hallucination

arXiv 2024

2024

TinyGPT-V: Efficient Multimodal Large Language Model via Small Backbones

arXiv 2023

2023

BiomedGPT: A Generalist Vision-Language Foundation Model for Diverse Biomedical Tasks

arXiv 2023

2023

AlignBench: Benchmarking Chinese Alignment of Large Language Models

arXiv 2023

2023

MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use

arXiv 2023

2023

DeID-GPT: Zero-shot Medical Text De-Identification by GPT-4

arXiv 2023

2023

Multimodal ChatGPT for Medical Applications: an Experimental Study of GPT-4V

arXiv 2023

2023

InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4

arXiv 2023

2023

Benchmarking and Analyzing Robust Point Cloud Recognition: Bag of Tricks for Defending Adversarial Examples

ICCV 2023

2023

A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT

arXiv 2023

2023

A Simple Unified Uncertainty-Guided Framework for Offline-to-Online Reinforcement Learning

arXiv 2023

2023

ArtGPT-4: Towards Artistic-understanding Large Vision-Language Models with Enhanced Adapter

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

10

from 46 papers