Shaohan Huang
- Papers
- 32
Cite
Notes
Only stored in your browser.
Authored papers
32SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks
arXiv 2026
LLM-in-Sandbox Elicits General Agentic Intelligence
arXiv 2026
Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models
arXiv 2026
Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity
arXiv 2026
VibeVoice Technical Report
arXiv 2025
Black-Box On-Policy Distillation of Large Language Models
arXiv 2025
BitNet b1.58 2B4T Technical Report
arXiv 2025
On-Policy RL with Optimal Reward Baseline
arXiv 2025
VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models
arXiv 2025
Geometric-Mean Policy Optimization
arXiv 2025
BitNet Distillation
arXiv 2025
Multimodal Latent Language Modeling with Next-Token Diffusion
arXiv 2024
You Only Cache Once: Decoder-Decoder Architectures for Language Models
arXiv 2024
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
arXiv 2024
Mixture of LoRA Experts
arXiv 2024
Multi-Head Mixture-of-Experts
arXiv 2024
On Domain-Specific Post-Training for Multimodal Large Language Models
arXiv 2024
Textual Aesthetics in Large Language Models
arXiv 2024
Kosmos-G: Generating Images in Context with Multimodal Large Language Models
arXiv 2023
Kosmos-2: Grounding Multimodal Large Language Models to the World
arXiv 2023
Scaling Sentence Embeddings with Large Language Models
arXiv 2023
Democratizing Reasoning Ability: Tailored Learning from Large Language Model
arXiv 2023
Dual-Alignment Pre-training for Cross-lingual Sentence Embedding
arXiv 2023
A Length-Extrapolatable Transformer
arXiv 2022
PromptBERT: Improving BERT Sentence Embeddings with Prompts
arXiv 2022
CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation
arXiv 2022
GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator
arXiv 2022
DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders
arXiv 2021
Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment
ACL 2021 5
Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training
EMNLP 2021 11
DocBank: A Benchmark Dataset for Document Layout Analysis
COLING 2020 8
TableBank: A Benchmark Dataset for Table Detection and Recognition
LREC 2020 5
Affiliations
Frequent co-authors
10from 32 papers