0

Wei Wang

Papers
88

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
88papers

Authored papers

88

Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey

arXiv 2026

2026

SuperOcc: Toward Cohesive Temporal Modeling for Superquadric-based Occupancy Prediction

arXiv 2026

2026

LongCat-Next: Lexicalizing Modalities as Discrete Tokens

arXiv 2026

2026

LongCat-Flash-Thinking-2601 Technical Report

arXiv 2026

2026

HEARTS: Benchmarking LLM Reasoning on Health Time Series

arXiv 2026

2026

LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning

arXiv 2026

2026

OSF: On Pre-training and Scaling of Sleep Foundation Models

arXiv 2026

2026

EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies

arXiv 2026

2026

BubbleRAG: Evidence-Driven Retrieval-Augmented Generation for Black-Box Knowledge Graphs

arXiv 2026

2026

ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning

arXiv 2026

2026

GeoMotionGPT: Geometry-Aligned Motion Understanding with Large Language Models

arXiv 2026

2026

DeepVision-103K: A Visually Diverse, Broad-Coverage, and Verifiable Mathematical Dataset for Multimodal Reasoning

arXiv 2026

2026

Kimi K2.5: Visual Agentic Intelligence

arXiv 2026

2026

CellMaster: Collaborative Cell Type Annotation in Single-Cell Analysis

arXiv 2026

2026

SkyReels-V2: Infinite-length Film Generative Model

arXiv 2025

2025

Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library

arXiv 2025

2025

Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform

arXiv 2025

2025

EchoVLM: Dynamic Mixture-of-Experts Vision-Language Model for Universal Ultrasound Intelligence

arXiv 2025

2025

Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation

arXiv 2025

2025

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

arXiv 2025

2025

Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B

arXiv 2025

2025

How Far Are We from Genuinely Useful Deep Research Agents?

arXiv 2025

2025

Preference Leakage: A Contamination Problem in LLM-as-a-judge

arXiv 2025

2025

MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning

arXiv 2025

2025

Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content

CVPR 2025 1

2025

DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails

arXiv 2025

2025

OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training

arXiv 2025

2025

Entropy-Based Adaptive Weighting for Self-Training

arXiv 2025

2025

Instruction-Tuning Data Synthesis from Scratch via Web Reconstruction

arXiv 2025

2025

Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training

arXiv 2025

2025

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

arXiv 2025

2025

Reinforcement Mid-Training

arXiv 2025

2025

Learnable SMPLify: A Neural Solution for Optimization-Free Human Pose Inverse Kinematics

arXiv 2025

2025

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?

arXiv 2025

2025

MusiXQA: Advancing Visual Music Understanding in Multimodal Large Language Models

arXiv 2025

2025

Lessons Learned from the URGENT 2024 Speech Enhancement Challenge

arXiv 2025

2025

Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning

arXiv 2025

2025

Hierarchical Frequency Tagging Probe (HFTP): A Unified Approach to Investigate Syntactic Structure Representations in Large Language Models and the Human Brain

arXiv 2025

2025

Wan: Open and Advanced Large-Scale Video Generative Models

arXiv 2025

2025

CoSER: Coordinating LLM-Based Persona Simulation of Established Roles

arXiv 2025

2025

OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement

arXiv 2025

2025

A Retrospective Systematic Study on Hierarchical Sparse Query Transformer-assisted Ultrasound Screening for Early Hepatocellular Carcinoma

arXiv 2025

2025

In-Context LoRA for Diffusion Transformers

arXiv 2024

2024

CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents

arXiv 2024

2024

Fully Open Source Moxin-7B Technical Report

arXiv 2024

2024

ChatDiT: A Training-Free Baseline for Task-Agnostic Free-Form Chatting with Diffusion Transformers

arXiv 2024

2024

BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal Representation

arXiv 2024

2024

Enhancing Large Vision Language Models with Self-Training on Image Comprehension

arXiv 2024

2024

AllWeatherNet:Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditions

arXiv 2024

2024

QAQ: Quality Adaptive Quantization for LLM KV Cache

arXiv 2024

2024

IDEA-Bench: How Far are Generative Models from Professional Designing?

CVPR 2025 1

2024

Learning to Edit: Aligning LLMs with Knowledge Editing

arXiv 2024

2024

ProteinGPT: Multimodal LLM for Protein Property Prediction and Structure Understanding

arXiv 2024

2024

UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model

arXiv 2024

2024

Security Attacks on LLM-based Code Completion Tools

arXiv 2024

2024

Harnessing Scale and Physics: A Multi-Graph Neural Operator Framework for PDEs on Arbitrary Geometries

arXiv 2024

2024

Object Detectors in the Open Environment: Challenges, Solutions, and Outlook

arXiv 2024

2024

LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts

arXiv 2024

2024

InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct

arXiv 2024

2024

Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization

arXiv 2024

2024

Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation

arXiv 2024

2024

Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models

arXiv 2024

2024

TradingAgents: Multi-Agents LLM Financial Trading Framework

arXiv 2024

2024

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery

arXiv 2024

2024

Layer-wise Importance Matters: Less Memory for Better Performance in Parameter-efficient Fine-tuning of Large Language Models

arXiv 2024

2024

Stealth edits to large language models

arXiv 2024

2024

Detecting Conversational Mental Manipulation with Intent-Aware Prompting

arXiv 2024

2024

Beyond Answers: Transferring Reasoning Capabilities to Smaller LLMs Using Multi-Teacher Knowledge Distillation

arXiv 2024

2024

Counterfactual Explanations for Face Forgery Detection via Adversarial Removal of Artifacts

arXiv 2024

2024

CLIMB: A Benchmark of Clinical Bias in Large Language Models

arXiv 2024

2024

Qwen Technical Report

arXiv 2023

2023

InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews

arXiv 2023

2023

mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video

arXiv 2023

2023

Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World

arXiv 2023

2023

D-IF: Uncertainty-aware Human Digitization via Implicit Distribution Field

ICCV 2023 1

2023

Learning Over Molecular Conformer Ensembles: Datasets and Benchmarks

arXiv 2023

2023

Householder Projector for Unsupervised Latent Semantics Discovery

ICCV 2023 1

2023

RRHF: Rank Responses to Align Language Models with Human Feedback without tears

arXiv 2023

2023

YUAN 2.0: A Large Language Model with Localized Filtering-based Attention

arXiv 2023

2023

Lion: Adversarial Distillation of Proprietary Large Language Models

arXiv 2023

2023

SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models

arXiv 2023

2023

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models

arXiv 2023

2023

Towards Vision Enhancing LLMs: Empowering Multimodal Knowledge Storage and Sharing in LLMs

arXiv 2023

2023

mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections

arXiv 2022

2022

Code Recommendation for Open Source Software Developers

arXiv 2022

2022

Global and Local Hierarchy-aware Contrastive Framework for Implicit Discourse Relation Recognition

arXiv 2022

2022

Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning

arXiv 2022

2022

Controllable Person Image Synthesis with Spatially-Adaptive Warped Normalization

arXiv 2021

2021

Affiliations

No known affiliations.

Frequent co-authors

10

from 88 papers