0

Wenyue Hua

Papers
21

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
21papers

Authored papers

21

Auditing Agent Harness Safety

arXiv 2026

2026

Interactive Evaluation Requires a Design Science

arXiv 2026

2026

A Survey on Test-Time Scaling in Large Language Models: What, How, Where, and How Well?

arXiv 2025

2025

Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense

arXiv 2025

2025

A Survey of Self-Evolving Agents: On Path to Artificial Super Intelligence

arXiv 2025

2025

Magentic Marketplace: An Open-Source Environment for Studying Agentic Markets

arXiv 2025

2025

InductionBench: LLMs Fail in the Simplest Complexity Class

arXiv 2025

2025

When AI Meets Finance (StockAgent): Large Language Model-based Stock Trading in Simulated Real-world Environments

arXiv 2024

2024

From Commands to Prompts: LLM-based Semantic File System for AIOS

arXiv 2024

2024

Game-theoretic LLM: Agent Workflow for Negotiation Games

arXiv 2024

2024

Disentangling Memory and Reasoning Ability in Large Language Models

arXiv 2024

2024

Exploring Concept Depth: How Large Language Models Acquire Knowledge at Different Layers?

arXiv 2024

2024

AutoFlow: Automated Workflow Generation for Large Language Model Agents

arXiv 2024

2024

The Impact of Reasoning Step Length on Large Language Models

arXiv 2024

2024

What if LLMs Have Different World Views: Simulating Alien Civilizations with LLM-based Agents

arXiv 2024

2024

Uncertainty is Fragile: Manipulating Uncertainty in Large Language Models

arXiv 2024

2024

RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios

arXiv 2024

2024

OpenAGI: When LLM Meets Domain Experts

NeurIPS 2023 11

2023

War and Peace (WarAgent): Large Language Model-based Multi-Agent Simulation of World Wars

arXiv 2023

2023

How to Index Item IDs for Recommendation Foundation Models

arXiv 2023

2023

EntQA: Entity Linking as Question Answering

entqa-entity-linking-as-question-answering-1

2021

Affiliations

No known affiliations.

Frequent co-authors

10

from 21 papers