Wei Wang
- Papers
- 88
Cite
Notes
Only stored in your browser.
Authored papers
88Rethinking Memory Mechanisms of Foundation Agents in the Second Half: A Survey
arXiv 2026
SuperOcc: Toward Cohesive Temporal Modeling for Superquadric-based Occupancy Prediction
arXiv 2026
LongCat-Next: Lexicalizing Modalities as Discrete Tokens
arXiv 2026
LongCat-Flash-Thinking-2601 Technical Report
arXiv 2026
HEARTS: Benchmarking LLM Reasoning on Health Time Series
arXiv 2026
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning
arXiv 2026
OSF: On Pre-training and Scaling of Sleep Foundation Models
arXiv 2026
EcoGym: Evaluating LLMs for Long-Horizon Plan-and-Execute in Interactive Economies
arXiv 2026
BubbleRAG: Evidence-Driven Retrieval-Augmented Generation for Black-Box Knowledge Graphs
arXiv 2026
ARLArena: A Unified Framework for Stable Agentic Reinforcement Learning
arXiv 2026
GeoMotionGPT: Geometry-Aligned Motion Understanding with Large Language Models
arXiv 2026
DeepVision-103K: A Visually Diverse, Broad-Coverage, and Verifiable Mathematical Dataset for Multimodal Reasoning
arXiv 2026
Kimi K2.5: Visual Agentic Intelligence
arXiv 2026
CellMaster: Collaborative Cell Type Annotation in Single-Cell Analysis
arXiv 2026
SkyReels-V2: Infinite-length Film Generative Model
arXiv 2025
Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library
arXiv 2025
Visionary: The World Model Carrier Built on WebGPU-Powered Gaussian Splatting Platform
arXiv 2025
EchoVLM: Dynamic Mixture-of-Experts Vision-Language Model for Universal Ultrasound Intelligence
arXiv 2025
Inferix: A Block-Diffusion based Next-Generation Inference Engine for World Simulation
arXiv 2025
Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem
arXiv 2025
Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B
arXiv 2025
How Far Are We from Genuinely Useful Deep Research Agents?
arXiv 2025
Preference Leakage: A Contamination Problem in LLM-as-a-judge
arXiv 2025
MindWatcher: Toward Smarter Multimodal Tool-Integrated Reasoning
arXiv 2025
Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content
CVPR 2025 1
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
arXiv 2025
OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training
arXiv 2025
Entropy-Based Adaptive Weighting for Self-Training
arXiv 2025
Instruction-Tuning Data Synthesis from Scratch via Web Reconstruction
arXiv 2025
Chasing the Tail: Effective Rubric-based Reward Modeling for Large Language Model Post-Training
arXiv 2025
A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment
arXiv 2025
Reinforcement Mid-Training
arXiv 2025
Learnable SMPLify: A Neural Solution for Optimization-Free Human Pose Inverse Kinematics
arXiv 2025
R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
arXiv 2025
MusiXQA: Advancing Visual Music Understanding in Multimodal Large Language Models
arXiv 2025
Lessons Learned from the URGENT 2024 Speech Enhancement Challenge
arXiv 2025
Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning
arXiv 2025
Hierarchical Frequency Tagging Probe (HFTP): A Unified Approach to Investigate Syntactic Structure Representations in Large Language Models and the Human Brain
arXiv 2025
Wan: Open and Advanced Large-Scale Video Generative Models
arXiv 2025
CoSER: Coordinating LLM-Based Persona Simulation of Established Roles
arXiv 2025
OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement
arXiv 2025
A Retrospective Systematic Study on Hierarchical Sparse Query Transformer-assisted Ultrasound Screening for Early Hepatocellular Carcinoma
arXiv 2025
In-Context LoRA for Diffusion Transformers
arXiv 2024
CivRealm: A Learning and Reasoning Odyssey in Civilization for Decision-Making Agents
arXiv 2024
Fully Open Source Moxin-7B Technical Report
arXiv 2024
ChatDiT: A Training-Free Baseline for Task-Agnostic Free-Form Chatting with Diffusion Transformers
arXiv 2024
BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal Representation
arXiv 2024
Enhancing Large Vision Language Models with Self-Training on Image Comprehension
arXiv 2024
AllWeatherNet:Unified Image Enhancement for Autonomous Driving under Adverse Weather and Lowlight-conditions
arXiv 2024
QAQ: Quality Adaptive Quantization for LLM KV Cache
arXiv 2024
IDEA-Bench: How Far are Generative Models from Professional Designing?
CVPR 2025 1
Learning to Edit: Aligning LLMs with Knowledge Editing
arXiv 2024
ProteinGPT: Multimodal LLM for Protein Property Prediction and Structure Understanding
arXiv 2024
UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model
arXiv 2024
Security Attacks on LLM-based Code Completion Tools
arXiv 2024
Harnessing Scale and Physics: A Multi-Graph Neural Operator Framework for PDEs on Arbitrary Geometries
arXiv 2024
Object Detectors in the Open Environment: Challenges, Solutions, and Outlook
arXiv 2024
LogicVista: Multimodal LLM Logical Reasoning Benchmark in Visual Contexts
arXiv 2024
InverseCoder: Self-improving Instruction-Tuned Code LLMs with Inverse-Instruct
arXiv 2024
Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
arXiv 2024
Template-Driven LLM-Paraphrased Framework for Tabular Math Word Problem Generation
arXiv 2024
Advancing Fine-Grained Visual Understanding with Multi-Scale Alignment in Multi-Modal Models
arXiv 2024
TradingAgents: Multi-Agents LLM Financial Trading Framework
arXiv 2024
A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery
arXiv 2024
Layer-wise Importance Matters: Less Memory for Better Performance in Parameter-efficient Fine-tuning of Large Language Models
arXiv 2024
Stealth edits to large language models
arXiv 2024
Detecting Conversational Mental Manipulation with Intent-Aware Prompting
arXiv 2024
Beyond Answers: Transferring Reasoning Capabilities to Smaller LLMs Using Multi-Teacher Knowledge Distillation
arXiv 2024
Counterfactual Explanations for Face Forgery Detection via Adversarial Removal of Artifacts
arXiv 2024
CLIMB: A Benchmark of Clinical Bias in Large Language Models
arXiv 2024
Qwen Technical Report
arXiv 2023
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews
arXiv 2023
mPLUG-2: A Modularized Multi-modal Foundation Model Across Text, Image and Video
arXiv 2023
Bongard-OpenWorld: Few-Shot Reasoning for Free-form Visual Concepts in the Real World
arXiv 2023
D-IF: Uncertainty-aware Human Digitization via Implicit Distribution Field
ICCV 2023 1
Learning Over Molecular Conformer Ensembles: Datasets and Benchmarks
arXiv 2023
Householder Projector for Unsupervised Latent Semantics Discovery
ICCV 2023 1
RRHF: Rank Responses to Align Language Models with Human Feedback without tears
arXiv 2023
YUAN 2.0: A Large Language Model with Localized Filtering-based Attention
arXiv 2023
Lion: Adversarial Distillation of Proprietary Large Language Models
arXiv 2023
SciBench: Evaluating College-Level Scientific Problem-Solving Abilities of Large Language Models
arXiv 2023
FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models
arXiv 2023
Towards Vision Enhancing LLMs: Empowering Multimodal Knowledge Storage and Sharing in LLMs
arXiv 2023
mPLUG: Effective and Efficient Vision-Language Learning by Cross-modal Skip-connections
arXiv 2022
Code Recommendation for Open Source Software Developers
arXiv 2022
Global and Local Hierarchy-aware Contrastive Framework for Implicit Discourse Relation Recognition
arXiv 2022
Improved Universal Sentence Embeddings with Prompt-based Contrastive Learning and Energy-based Learning
arXiv 2022
Controllable Person Image Synthesis with Spatially-Adaptive Warped Normalization
arXiv 2021
Affiliations
Frequent co-authors
10from 88 papers