Timothy Baldwin
- Papers
- 19
Cite
Notes
Only stored in your browser.
Authored papers
19Qorgau: Evaluating LLM Safety in Kazakh-Russian Bilingual Contexts
arXiv 2025
SCALAR: Scientific Citation-based Live Assessment of Long-context Academic Reasoning
arXiv 2025
Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification
arXiv 2024
ToolGen: Unified Tool Retrieval and Calling via Generation
arXiv 2024
Against The Achilles' Heel: A Survey on Red Teaming for Generative Models
arXiv 2024
Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph
arXiv 2024
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
arXiv 2024
Learning From Failure: Integrating Negative Examples when Fine-tuning Large Language Models as Agents
arXiv 2024
ArabicMMLU: Assessing Massive Multitask Language Understanding in Arabic
arXiv 2024
BiMediX2: Bio-Medical EXpert LMM for Diverse Medical Modalities
arXiv 2024
BiMediX: Bilingual Medical Mixture of Experts LLM
arXiv 2024
Do-Not-Answer: A Dataset for Evaluating Safeguards in LLMs
arXiv 2023
Large Language Models Only Pass Primary School Exams in Indonesia: A Comprehensive Test on IndoMMLU
arXiv 2023
CMMLU: Measuring massive multitask language understanding in Chinese
arXiv 2023
Bactrian-X: Multilingual Replicable Instruction-Following Models with Low-Rank Adaptation
arXiv 2023
Multi-EuP: The Multilingual European Parliament Dataset for Analysis of Bias in Information Retrieval
arXiv 2023
NusaCrowd: Open Source Initiative for Indonesian NLP Resources
arXiv 2022
IndoBERTweet: A Pretrained Language Model for Indonesian Twitter with Effective Domain-Specific Vocabulary Initialization
EMNLP 2021 11
Liputan6: A Large-scale Indonesian Dataset for Text Summarization
Asian Chapter of the Association for Computational Linguistics 2020
Affiliations
Frequent co-authors
10from 19 papers