Michael R. Lyu
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16From Runnable to Shippable: Multi-Agent Test-Driven Development for Generating Full-Stack Web Applications from Requirements
arXiv 2026
Inference-Time Scaling of Verification: Self-Evolving Deep Research Agents via Test-Time Rubric-Guided Verification
arXiv 2026
ScreenCoder: Advancing Visual-to-Code Generation for Front-End Automation via Modular Multimodal Agents
arXiv 2025
DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation
arXiv 2025
LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models
arXiv 2024
How Well Can LLMs Echo Us? Evaluating AI Chatbots' Role-Play Ability with ECHO
arXiv 2024
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
arXiv 2024
New Job, New Gender? Measuring the Social Bias in Image Generation Models
arXiv 2024
Making Long-Context Language Models Better Multi-Hop Reasoners
arXiv 2024
A Unified Debugging Approach via LLM-Based Multi-Agent Synergy
arXiv 2024
Who is ChatGPT? Benchmarking LLMs' Psychological Portrayal Using PsychoBench
arXiv 2023
VL-PET: Vision-and-Language Parameter-Efficient Tuning via Granularity Control
ICCV 2023 1
Revisiting the Reliability of Psychological Scales on Large Language Models
arXiv 2023
All Languages Matter: On the Multilingual Safety of Large Language Models
arXiv 2023
What Makes Good In-context Demonstrations for Code Intelligence Tasks with LLMs?
arXiv 2023
SemParser: A Semantic Parser for Log Analysis
arXiv 2021
Affiliations
Frequent co-authors
10from 16 papers