Liang Chen
- Papers: 34
Authored papers (34)
ClawBench: Can AI Agents Complete Everyday Online Tasks?
arXiv 2026
BabyVision: Visual Reasoning Beyond Language
arXiv 2026
UI-Venus-1.5 Technical Report
arXiv 2026
Kimi K2.5: Visual Agentic Intelligence
arXiv 2026
Kimi-VL Technical Report
arXiv 2025
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning
arXiv 2025
UI-Venus Technical Report: Building High-performance UI Agents with RFT
arXiv 2025
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think
arXiv 2025
Vulnerability-Aware Alignment: Mitigating Uneven Forgetting in Harmful Fine-Tuning
arXiv 2025
An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
arXiv 2024
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
arXiv 2024
Parameter-Efficient Fine-Tuning with Discrete Fourier Transform
arXiv 2024
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
arXiv 2024
Towards a Unified View of Preference Learning for Large Language Models: A Survey
arXiv 2024
PID: Physics-Informed Diffusion Model for Infrared Image Generation
arXiv 2024
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
arXiv 2024
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
arXiv 2024
Consecutive Batch Model Editing with HooK Layers
arXiv 2024
MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
arXiv 2024
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
arXiv 2024
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code
arXiv 2023
Large Language Models are not Fair Evaluators
arXiv 2023
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning
arXiv 2023
Prompting and Evaluating Large Language Models for Proactive Dialogues: Clarification, Target-guided, and Non-collaboration
arXiv 2023
Improved Test-Time Adaptation for Domain Generalization
CVPR 2023 1
Domain Generalization via Rationale Invariance
ICCV 2023 1
Optimizing Feature Set for Click-Through Rate Prediction
arXiv 2023
On the Off-Target Problem of Zero-Shot Multilingual Neural Machine Translation
arXiv 2023
Towards Hybrid-grained Feature Interaction Selection for Deep Sparse Network
Spiking Graph Convolutional Networks
Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection
CVPR 2022 1
Large-batch Optimization for Dense Visual Predictions
arXiv 2022
DialogSum: A Real-Life Scenario Dialogue Summarization Dataset
Findings (ACL) 2021 8
Perceiving and Modeling Density is All You Need for Image Dehazing
arXiv 2021
Frequent co-authors
10 from 34 papers