Liang Lin

Papers
28

Authored papers

28

Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation

arXiv 2026

2026

A Survey of Large Audio Language Models: Generalization, Trustworthiness, and Outlook

arXiv 2026

2026

Geometry-Editable and Appearance-Preserving Object Composition

arXiv 2025

2025

3DAffordSplat: Efficient Affordance Reasoning with 3D Gaussians

arXiv 2025

2025

MiniLongBench: The Low-cost Long Context Understanding Benchmark for Large Language Models

arXiv 2025

2025

DART: Differentiable Dynamic Adaptive Region Tokenizer for Vision Transformer and Mamba

arXiv 2025

2025

DSPNet: Dual-vision Scene Perception for Robust 3D Question Answering

CVPR 2025 1

2025

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

arXiv 2025

2025

Cross-modal Causal Relation Alignment for Video Question Grounding

2025

SR-FoT: A Syllogistic-Reasoning Framework of Thought for Large Language Models Tackling Knowledge-based Reasoning Tasks

arXiv 2025

2025

Paper Summary Attack: Jailbreaking LLMs through LLM Safety Papers

arXiv 2025

2025

MEIA: Multimodal Embodied Perception and Interaction in Unknown Environments

arXiv 2024

2024

ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning

arXiv 2024

2024

MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection

arXiv 2024

2024

WildVidFit: Video Virtual Try-On in the Wild via Image-Based Controlled Diffusion Models

arXiv 2024

2024

Identity-Preserving Talking Face Generation with Landmark and Appearance Priors

CVPR 2023 1

2023

Control-A-Video: Controllable Text-to-Video Diffusion Models with Motion Prior and Reward Feedback Learning

arXiv 2023

2023

Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation

CVPR 2024 1

2023

SUR-adapter: Enhancing Text-to-Image Pre-trained Diffusion Models with Large Language Models

arXiv 2023

2023

SkeletonMAE: Graph-based Masked Autoencoder for Skeleton Sequence Pre-training

ICCV 2023 1

2023

ScaleLong: Towards More Stable Training of Diffusion Model via Scaling Network Long Skip Connection

2023

Masked Images Are Counterfactual Samples for Robust Fine-tuning

CVPR 2023 1

2023

LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning

arXiv 2022

2022

UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression

arXiv 2022

2022

GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning

Findings (ACL) 2021 8

2021

Towards Quantifiable Dialogue Coherence Evaluation

ACL 2021 5

2021

Efficient Crowd Counting via Structured Knowledge Transfer

arXiv 2020

2020

Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting

CVPR 2021 1

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 28 papers