Shaohan Huang

Papers
32

Authored papers

32

SPPO: Sequence-Level PPO for Long-Horizon Reasoning Tasks

arXiv 2026

2026

LLM-in-Sandbox Elicits General Agentic Intelligence

arXiv 2026

2026

Breaking Training Bottlenecks: Effective and Stable Reinforcement Learning for Coding Models

arXiv 2026

2026

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

arXiv 2026

2026

VibeVoice Technical Report

arXiv 2025

2025

Black-Box On-Policy Distillation of Large Language Models

arXiv 2025

2025

BitNet b1.58 2B4T Technical Report

arXiv 2025

2025

On-Policy RL with Optimal Reward Baseline

arXiv 2025

2025

VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models

arXiv 2025

2025

Geometric-Mean Policy Optimization

arXiv 2025

2025

BitNet Distillation

arXiv 2025

2025

Multimodal Latent Language Modeling with Next-Token Diffusion

arXiv 2024

2024

You Only Cache Once: Decoder-Decoder Architectures for Language Models

arXiv 2024

2024

MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning

arXiv 2024

2024

Mixture of LoRA Experts

arXiv 2024

2024

Multi-Head Mixture-of-Experts

arXiv 2024

2024

On Domain-Specific Post-Training for Multimodal Large Language Models

arXiv 2024

2024

Textual Aesthetics in Large Language Models

arXiv 2024

2024

Kosmos-G: Generating Images in Context with Multimodal Large Language Models

arXiv 2023

2023

Kosmos-2: Grounding Multimodal Large Language Models to the World

arXiv 2023

2023

Scaling Sentence Embeddings with Large Language Models

arXiv 2023

2023

Democratizing Reasoning Ability: Tailored Learning from Large Language Model

arXiv 2023

2023

Dual-Alignment Pre-training for Cross-lingual Sentence Embedding

arXiv 2023

2023

A Length-Extrapolatable Transformer

arXiv 2022

2022

PromptBERT: Improving BERT Sentence Embeddings with Prompts

arXiv 2022

2022

CROP: Zero-shot Cross-lingual Named Entity Recognition with Multilingual Labeled Sequence Translation

arXiv 2022

2022

GanLM: Encoder-Decoder Pre-training with an Auxiliary Discriminator

arXiv 2022

2022

DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders

arXiv 2021

2021

Improving Pretrained Cross-Lingual Language Models via Self-Labeled Word Alignment

ACL 2021 5

2021

Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training

EMNLP 2021 11

2021

DocBank: A Benchmark Dataset for Document Layout Analysis

COLING 2020 8

2020

TableBank: A Benchmark Dataset for Table Detection and Recognition

LREC 2020 5

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 32 papers