Wei Shen

Papers
32

Attribution

Affiliations & profile
Semantic Scholar

Authored papers

32

WorldAct: Activating Monolithic 3D Worlds into Interactive-Ready Object-Centric Scenes

arXiv 2026

2026

CodePercept: Code-Grounded Visual STEM Perception for MLLMs

arXiv 2026

2026

MonitorBench: A Comprehensive Benchmark for Chain-of-Thought Monitorability in Large Language Models

arXiv 2026

2026

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

arXiv 2025

2025

Skywork Open Reasoner 1 Technical Report

arXiv 2025

2025

A Token-level Text Image Foundation Model for Document Understanding

ICCV 2025

2025

LeetCodeDataset: A Temporal Dataset for Robust Evaluation and Efficient Training of Code LLMs

arXiv 2025

2025

Skywork-R1V4: Toward Agentic Multimodal Intelligence through Interleaved Thinking with Images and DeepResearch

arXiv 2025

2025

Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model

arXiv 2025

2025

Skywork-R1V3 Technical Report

arXiv 2025

2025

MMBench-GUI: Hierarchical Multi-Platform Evaluation Framework for GUI Agents

arXiv 2025

2025

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

arXiv 2025

2025

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

arXiv 2025

2025

Few-step Flow for 3D Generation via Marginal-Data Transport Distillation

arXiv 2025

2025

AdaMuon: Adaptive Muon Optimizer

arXiv 2025

2025

Marten: Visual Question Answering with Mask Generation for Multi-modal Document Understanding

CVPR 2025

2025

AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence

arXiv 2025

2025

GaussianObject: High-Quality 3D Object Reconstruction from Four Views with Gaussian Splatting

arXiv 2024

2024

StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback

arXiv 2024

2024

Secrets of RLHF in Large Language Models Part II: Reward Modeling

arXiv 2024

2024

Leveraging Web-Crawled Data for High-Quality Fine-Tuning

arXiv 2024

2024

Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning

arXiv 2024

2024

PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer

arXiv 2024

2024

RMB: Comprehensively Benchmarking Reward Models in LLM Alignment

arXiv 2024

2024

FLoRA: Low-Rank Core Space for N-dimension

arXiv 2024

2024

Policy Filtration in RLHF to Fine-Tune LLM for Code Generation

arXiv 2024

2024

Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback

arXiv 2024

2024

Bidirectional Copy-Paste for Semi-Supervised Medical Image Segmentation

CVPR 2023

2023

LoRAMoE: Alleviate World Knowledge Forgetting in Large Language Models via MoE-Style Plugin

arXiv 2023

2023

SoccerNet 2023 Challenges Results

arXiv 2023

2023

iBOT: Image BERT Pre-Training with Online Tokenizer

arXiv 2021

2021

Micro-Batch Training with Batch-Channel Normalization and Weight Standardization

arXiv 2019

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 32 papers