Chao Li
- Papers
- 18
Cite
Notes
Only stored in your browser.
Authored papers
18X-OmniClaw Technical Report: A Unified Mobile Agent for Multimodal Understanding and Interaction
arXiv 2026
DREAM: Where Visual Understanding Meets Text-to-Image Generation
arXiv 2026
Seed-Coder: Let the Code Model Curate Data for Itself
arXiv 2025
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning
arXiv 2025
A Survey on Inference Optimization Techniques for Mixture of Experts Models
arXiv 2024
ScaleKD: Strong Vision Transformers Could Be Excellent Teachers
arXiv 2024
Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs
arXiv 2024
Yi: Open Foundation Models by 01.AI
arXiv 2024
Aria: An Open Multimodal Native Mixture-of-Experts Model
arXiv 2024
Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization
arXiv 2024
Crafting Customisable Characters with LLMs: Introducing SimsChat, a Persona-Driven Role-Playing Agent Framework
arXiv 2024
Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object
arXiv 2023
RFLA: A Stealthy Reflected Light Adversarial Attack in the Physical World
ICCV 2023 1
Alternating Local Enumeration (TnALE): Solving Tensor Network Structure Search with Fewer Evaluations
arXiv 2023
Generative Action Description Prompts for Skeleton-based Action Recognition
ICCV 2023 1
Ego4D: Around the World in 3,000 Hours of Egocentric Video
CVPR 2022 1
Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking
Findings (ACL) 2021 8
Image Inpainting with Learnable Bidirectional Attention Maps
image-inpainting-with-learnable-bidirectional-1
Affiliations
Frequent co-authors
10from 18 papers