Martin Potthast
- Papers
- 10
Cite
Notes
Only stored in your browser.
Authored papers
10The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models
arXiv 2025
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking
arXiv 2024
Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders
arXiv 2024
GAIA Search: Hugging Face and Pyserini Interoperability for NLP Training Data Exploration
arXiv 2023
Spacerini: Plug-and-play Search Engines with Pyserini and Hugging Face
arXiv 2023
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models
TMLR
Sparse Pairwise Re-ranking with Pre-trained Transformers
arXiv 2022
Small-Text: Active Learning for Text Classification in Python
small-text-active-learning-for-text
FastWARC: Optimizing Large-Scale Web Archive Analytics
arXiv 2021
Revisiting Uncertainty-based Query Strategies for Active Learning with Transformers
Findings (ACL) 2022 5
Affiliations
Frequent co-authors
10from 10 papers
Benno Stein
Matthias Hagen
Christopher Akiki
Christopher Schröder
Ferdinand Schlatt
Maik Fröbe
Akintunde Oladipo
Aleksandra Piktus
Andreas Niekler
Bevan Koopman