0

Chao Li

Papers
18

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
18papers

Authored papers

18

X-OmniClaw Technical Report: A Unified Mobile Agent for Multimodal Understanding and Interaction

arXiv 2026

2026

DREAM: Where Visual Understanding Meets Text-to-Image Generation

arXiv 2026

2026

Seed-Coder: Let the Code Model Curate Data for Itself

arXiv 2025

2025

Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning

arXiv 2025

2025

A Survey on Inference Optimization Techniques for Mixture of Experts Models

arXiv 2024

2024

ScaleKD: Strong Vision Transformers Could Be Excellent Teachers

arXiv 2024

2024

Idea23D: Collaborative LMM Agents Enable 3D Model Generation from Interleaved Multimodal Inputs

arXiv 2024

2024

Yi: Open Foundation Models by 01.AI

arXiv 2024

2024

Aria: An Open Multimodal Native Mixture-of-Experts Model

arXiv 2024

2024

Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization

arXiv 2024

2024

Crafting Customisable Characters with LLMs: Introducing SimsChat, a Persona-Driven Role-Playing Agent Framework

arXiv 2024

2024

Soulstyler: Using Large Language Model to Guide Image Style Transfer for Target Object

arXiv 2023

2023

RFLA: A Stealthy Reflected Light Adversarial Attack in the Physical World

ICCV 2023 1

2023

Alternating Local Enumeration (TnALE): Solving Tensor Network Structure Search with Fewer Evaluations

arXiv 2023

2023

Generative Action Description Prompts for Skeleton-based Action Recognition

ICCV 2023 1

2022

Ego4D: Around the World in 3,000 Hours of Egocentric Video

CVPR 2022 1

2021

Read, Listen, and See: Leveraging Multimodal Information Helps Chinese Spell Checking

Findings (ACL) 2021 8

2021

Image Inpainting with Learnable Bidirectional Attention Maps

image-inpainting-with-learnable-bidirectional-1

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 18 papers