Bo Liu
Researcher; co-author of the TextArena multi-agent LLM benchmark.
- Role: researcher
- Unknown
- GitHub: Unknown
- Scholar: scholar.google.com/scholar
- Papers: 42
Authored papers (42)
SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture
arXiv 2026
Simple Recipe Works: Vision-Language-Action Models are Natural Continual Learners with Reinforcement Learning
arXiv 2026
From P(y|x) to P(y): Investigating Reinforcement Learning in Pre-train Space
arXiv 2026
TextArena: Multi-Agent Text-Based Games for LLM Evaluation
preprint
LLaVA-Critic-R1: Your Critic Model is Secretly a Strong Policy Model
arXiv 2025
TextArena
arXiv 2025
Vision-Zero: Scalable VLM Self-Improvement via Strategic Gamified Self-Play
arXiv 2025
Mobius: Text to Seamless Looping Video Generation via Latent Shift
arXiv 2025
SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence
arXiv 2025
GEM: A Gym for Agentic LLMs
arXiv 2025
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
arXiv 2025
MAGREF: Masked Guidance for Any-Reference Video Generation
arXiv 2025
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning
arXiv 2025
SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models
arXiv 2025
Multiple Instance Learning Framework with Masked Hard Instance Mining for Gigapixel Histopathology Image Analysis
arXiv 2025
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution
arXiv 2025
ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions
arXiv 2025
DeepSeek LLM: Scaling Open-Source Language Models with Longtermism
arXiv 2024
DeepSeek-VL: Towards Real-World Vision-Language Understanding
arXiv 2024
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
arXiv 2024
DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search
arXiv 2024
Positive Text Reframing under Multi-strategy Optimization
arXiv 2024
Feature Re-Embedding: Towards Foundation Model-Level Performance in Computational Pathology
CVPR 2024
Cautious Optimizers: Improving Training with One Line of Code
arXiv 2024
AdaFlow: Imitation Learning with Variance-Adaptive Flow-Based Policies
arXiv 2024
Asynchronous Local-SGD Training for Language Modeling
arXiv 2024
EasyAnimate: A High-Performance Long Video Generation Method based on Transformer Architecture
arXiv 2024
Natural Language Reinforcement Learning
arXiv 2024
Longhorn: State Space Models are Amortized Online Learners
arXiv 2024
Memory-Efficient LLM Training with Online Subspace Descent
arXiv 2024
AMO Sampler: Enhancing Text Rendering with Overshooting
CVPR 2025
CustomTTT: Motion and Appearance Customized Video Generation via Test-Time Training
arXiv 2024
Learning Memory Mechanisms for Decision Making through Demonstrations
arXiv 2024
DMCVR: Morphology-Guided Diffusion Model for 3D Cardiac Volume Reconstruction
arXiv 2023
LLM+P: Empowering Large Language Models with Optimal Planning Proficiency
arXiv 2023
EasyGen: Easing Multimodal Generation with BiDiffuser and LLMs
arXiv 2023
UGG: Unified Generative Grasping
arXiv 2023
Hierarchical Spatio-Temporal Representation Learning for Gait Recognition
ICCV 2023
HiH: A Multi-modal Hierarchy in Hierarchy Network for Unconstrained Gait Recognition
arXiv 2023
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution Engine
arXiv 2022
Continual Learning and Private Unlearning
arXiv 2022
GaitMM: Multi-Granularity Motion Sequence Learning for Gait Recognition
arXiv 2022
Affiliations
Frequent co-authors
10 from 42 papers