Wei Shen
- Papers
- 32
Cite
Notes
Only stored in your browser.
Authored papers
32WorldAct: Activating Monolithic 3D Worlds into Interactive-Ready Object-Centric Scenes
arXiv 2026
CodePercept: Code-Grounded Visual STEM Perception for MLLMs
arXiv 2026
MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models
arXiv 2026
Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning
arXiv 2025
Skywork Open Reasoner 1 Technical Report
arXiv 2025
A Token-level Text Image Foundation Model for Document Understanding
ICCV 2025
LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs
arXiv 2025
Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch
arXiv 2025
Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model
arXiv 2025
Skywork-R1V3 Technical Report
arXiv 2025
MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents
arXiv 2025
TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling
arXiv 2025
Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy
arXiv 2025
Few-step Flow for 3D Generation via Marginal-Data Transport Distillation
arXiv 2025
AdaMuon: Adaptive Muon Optimizer
arXiv 2025
Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding
CVPR 2025 1
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
arXiv 2025
GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting
arXiv 2024
StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback
arXiv 2024
Secrets of RLHF in Large Language Models Part II: Reward Modeling
arXiv 2024
Leveraging Web-Crawled Data for High-Quality Fine-Tuning
arXiv 2024
Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning
arXiv 2024
PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer
arXiv 2024
RMB: Comprehensively Benchmarking Reward Models in LLM Alignment
arXiv 2024
FLoRA: Low-Rank Core Space for N-dimension
arXiv 2024
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation
arXiv 2024
Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback
arXiv 2024
Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation
CVPR 2023 1
LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin
arXiv 2023
SoccerNet 2023 Challenges Results
arXiv 2023
iBOT: Image BERT Pre-Training with Online Tokenizer
arXiv 2021
Micro-Batch Training with Batch-Channel Normalization and Weight Standardization
arXiv 2019
Affiliations
Frequent co-authors
10from 32 papers