Wenyue Hua
- Papers: 21
Authored papers (21)
Auditing Agent Harness Safety
arXiv 2026
Interactive Evaluation Requires a Design Science
arXiv 2026
A Survey on Test-Time Scaling in Large Language Models: What, How, Where, and How Well?
arXiv 2025
Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense
arXiv 2025
A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence
arXiv 2025
Magentic Marketplace: An Open-Source Environment for Studying Agentic Markets
arXiv 2025
InductionBench: LLMs Fail in the Simplest Complexity Class
arXiv 2025
When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world Environments
arXiv 2024
From Commands to Prompts: LLM-based Semantic File System for AIOS
arXiv 2024
Game-theoretic LLM: Agent Workflow for Negotiation Games
arXiv 2024
Disentangling Memory and Reasoning Ability in Large Language Models
arXiv 2024
Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?
arXiv 2024
AutoFlow: Automated Workflow Generation for Large Language Model Agents
arXiv 2024
The Impact of Reasoning Step Length on Large Language Models
arXiv 2024
What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents
arXiv 2024
Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models
arXiv 2024
RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
arXiv 2024
OpenAGI: When LLM Meets Domain Experts
NeurIPS 2023
War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World Wars
arXiv 2023
How to Index Item IDs for Recommendation Foundation Models
arXiv 2023
EntQA: Entity Linking as Question Answering
Affiliations
Frequent co-authors
10 from 21 papers