Neil Zhenqiang Gong
- Papers
- 7
Cite
Notes
Only stored in your browser.
7papers
Authored papers
7Large Language Models Post-training: Surveying Techniques from Alignment to Reasoning
arXiv 2025
On the Trustworthiness of Generative Foundation Models: Guideline, Assessment, and Perspective
arXiv 2025
WAInjectBench: Benchmarking Prompt Injection Detections for Web Agents
arXiv 2025
TrustLLM: Trustworthiness in Large Language Models
arXiv 2024
Certifiably Robust Image Watermark
arXiv 2024
Formalizing and Benchmarking Prompt Injection Attacks and Defenses
arXiv 2023
MetaTool Benchmark for Large Language Models: Deciding Whether to Use Tools and Which to Use
arXiv 2023
Affiliations
No known affiliations.
Frequent co-authors
10from 7 papers