Jiang Bian

Papers
33

Attribution: affiliations and profile from Semantic Scholar

Authored papers

33

LIVE: Long-horizon Interactive Video World Modeling

arXiv 2026

Self-Hinting Language Models Enhance Reinforcement Learning

arXiv 2026

Efficient Document Parsing via Parallel Token Prediction

arXiv 2026

PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation

arXiv 2025

MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft

arXiv 2025

Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models

arXiv 2025

CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD Editing

arXiv 2025

See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning

arXiv 2025

AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence

arXiv 2025

villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models

arXiv 2025

HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges

arXiv 2025

MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model

arXiv 2024

VidTwin: Video VAE with Decoupled Structure and Dynamics

CVPR 2025 1

2024

FlexCAD: Unified and Versatile Controllable CAD Generation with Fine-tuned Large Language Models

arXiv 2024

End-to-End Rate-Distortion Optimized 3D Gaussian Representation

arXiv 2024

Protecting Your LLMs with Information Bottleneck

arXiv 2024

DPO Meets PPO: Reinforced Token Optimization for RLHF

arXiv 2024

MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation

arXiv 2024

C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front

arXiv 2024

InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation

arXiv 2024

Me LLaMA: Foundation Large Language Models for Medical Applications

arXiv 2024

Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training

arXiv 2024

InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models

arXiv 2024

MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models

arXiv 2023

BatteryML: An Open-source Platform for Machine Learning on Battery Degradation

arXiv 2023

A Study of Generative Large Language Model for Medical Research and Healthcare

arXiv 2023

NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time-Series Pretraining

arXiv 2023

TiC: Exploring Vision Transformer in Convolution

arXiv 2023

UniAudio: An Audio Foundation Model Toward Universal Audio Generation

arXiv 2023

EvoPrompt: Connecting LLMs with Evolutionary Algorithms Yields Powerful Prompt Optimizers

arXiv 2023

Mildly Constrained Evaluation Policy for Offline Reinforcement Learning

arXiv 2023

MoESys: A Distributed and Efficient Mixture-of-Experts Training and Inference System for Internet Services

arXiv 2022

Empowering Diffusion Models on the Embedding Space for Text Generation

arXiv 2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 33 papers