This work is motivated by two key trends. On one hand, large language models (LLMs) have shown remarkable versatility in various generative tasks such as writing, drawing, and question answering, significantly reducing the time required for many routine tasks. On the other hand, researchers, whose work is not only time-consuming but also highly expertise-demanding, face increasing challenges as they have to spend more time reading, writing, and reviewing papers. This raises the question: how can LLMs potentially assist researchers in alleviating their heavy workload? This study focuses on the topic of LLMs assist NLP Researchers, particularly examining the effectiveness of LLM in assisting paper (meta-)reviewing and its recognizability. To address this, we constructed the ReviewCritique dataset, which includes two types of information: (i) NLP papers (initial submissions rather than camera-ready) with both human-written and LLM-generated reviews, and (ii) each review comes with "deficiency" labels and corresponding explanations for individual segments, annotated by experts. Using ReviewCritique, this study explores two threads of research questions: (i) "LLMs as Reviewers", how do reviews generated by LLMs compare with those written by humans in terms of quality and distinguishability? (ii) "LLMs as Metareviewers", how effectively can LLMs identify potential issues, such as Deficient or unprofessional review segments, within individual paper reviews? To our knowledge, this is the first work to provide such a comprehensive analysis.
LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing
This study evaluates the effectiveness of large language models in assisting NLP researchers with paper reviewing and metareviewing using a newly constructed dataset with human and LLM-generated reviews.
- Year
- 2024
- Venue
- arXiv 2024
- Authors
- 40
- Hosting
- Abstract onlyARXIV-DEFAULT
Cite
Notes
Only stored in your browser.
Attribution
- Abstract & full text
- arxiv.org/abs/2406.16253v3ARXIV-DEFAULT
- TL;DR
- Semantic Scholar
Abstract
Authors
40Wenting ZhaoRui ZhangFei WangZiHao WangPhilip S. YuYinghui LiFei LiuTao LiLu ChengHaoran LiRuohao GuoYibo WangHenry Peng ZouZhaowei WangNan ZhangJie FuRenze LouQin LiuMeng FangChen XingYixin CaoRuihong HuangPengzhi GaoZhongfen DengCongying XiaWenpeng YinJing GuJiangshu DuVipul GuptaShuaiqi LiuPranav Narayanan VenkitMukund SrinathHaoran Ranran ZhangTianlin LiuJiayang ChengYing SuRaj Sanjay ShahKangda WeiSurangika RanathungaEduardo Blanco