Jiang Bian
- Papers
- 33
Cite
Notes
Only stored in your browser.
Authored papers
33LIVE: Long-horizon Interactive Video World Modeling
arXiv 2026
Self-Hinting Language Models Enhance Reinforcement Learning
arXiv 2026
Efficient Document Parsing via Parallel Token Prediction
arXiv 2026
PIKE-RAG: sPecIalized KnowledgE and Rationale Augmented Generation
arXiv 2025
MineWorld: a Real-Time and Open-Source Interactive World Model on Minecraft
arXiv 2025
Text-to-CAD Generation Through Infusing Visual Feedback in Large Language Models
arXiv 2025
CAD-Editor: A Locate-then-Infill Framework with Automated Training Data Synthesis for Text-Based CAD Editing
arXiv 2025
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning
arXiv 2025
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
arXiv 2025
villa-X: Enhancing Latent Action Modeling in Vision-Language-Action Models
arXiv 2025
HeurAgenix: Leveraging LLMs for Solving Complex Combinatorial Optimization Challenges
arXiv 2025
MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model
arXiv 2024
VidTwin: Video VAE with Decoupled Structure and Dynamics
CVPR 2025 1
FlexCAD: Unified and Versatile Controllable CAD Generation with Fine-tuned Large Language Models
arXiv 2024
End-to-End Rate-Distortion Optimized 3D Gaussian Representation
arXiv 2024
Protecting Your LLMs with Information Bottleneck
arXiv 2024
DPO Meets PPO: Reinforced Token Optimization for RLHF
arXiv 2024
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
arXiv 2024
C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front
arXiv 2024
InstructAvatar: Text-Guided Emotion and Motion Control for Avatar Generation
arXiv 2024
Me LLaMA: Foundation Large Language Models for Medical Applications
arXiv 2024
Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training
arXiv 2024
InvDiff: Invariant Guidance for Bias Mitigation in Diffusion Models
arXiv 2024
MusicAgent: An AI Agent for Music Understanding and Generation with Large Language Models
arXiv 2023
BatteryML:An Open-source platform for Machine Learning on Battery Degradation
arXiv 2023
A Study of Generative Large Language Model for Medical Research and Healthcare
arXiv 2023
NuTime: Numerically Multi-Scaled Embedding for Large-Scale Time-Series Pretraining
arXiv 2023
TiC: Exploring Vision Transformer in Convolution
arXiv 2023
UniAudio: An Audio Foundation Model Toward Universal Audio Generation
arXiv 2023
EvoPrompt: Connecting LLMs with Evolutionary Algorithms Yields Powerful Prompt Optimizers
arXiv 2023
Mildly Constrained Evaluation Policy for Offline Reinforcement Learning
arXiv 2023
MoESys: A Distributed and Efficient Mixture-of-Experts Training and Inference System for Internet Services
arXiv 2022
Empowering Diffusion Models on the Embedding Space for Text Generation
arXiv 2022
Affiliations
Frequent co-authors
10from 33 papers