Martin Potthast

110 publications

11 venues

H Index 30

Affiliation

University of Kassel, Kassel, Germany
hessian.AI, Darmstadt, Germany
ScaDS.AI, Dresden and Leipzig, Germany
Leipzig University, Leipzig, Germany
Bauhaus University, Weimar, Germany

Links

Name Venue Year citations
TITE: Token-Independent Text Encoder for Information Retrieval. SIGIR 2025 2
Large Language Model Relevance Assessors Agree With One Another More Than With Human Assessors. SIGIR 2025 5
Counterfactual Query Rewriting to Use Historical Relevance Feedback. ECIR 2025 2
Corpus Subsampling: Estimating the Effectiveness of Neural Retrieval Models on Large Corpora. ECIR 2025 6
AiReview: An Open Platform for Accelerating Systematic Reviews with LLMs. SIGIR 2025 2
Web-Scale Retrieval Experimentation with chatnoir-pyterrier. ECIR 2025 2
ReNeuIR at SIGIR 2025: The Fourth Workshop on Reaching Efficiency in Neural Information Retrieval. SIGIR 2025 4
Ranking Generated Answers - On the Agreement of Retrieval Models with Humans on Consumer Health Questions. ECIR 2025 0
The Viability of Crowdsourcing for RAG Evaluation. SIGIR 2025 6
A Test Collection for Dataset Retrieval. ECIR 2025 1
Overview of PAN 2025: Generative AI Detection, Multilingual Text Detoxification, Multi-author Writing Style Analysis, and Generative Plagiarism Detection - Extended Abstract. ECIR 2025 7
Overview of Touché 2025: Argumentation Systems - Extended Abstract. ECIR 2025 9
ImageCLEF 2025: Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications. ECIR 2025 1
TIREx Tracker: The Information Retrieval Experiment Tracker. SIGIR 2025 6
Call for Research on the Impact of Information Retrieval on Social Norms. ECIR 2025 0
The Second International Workshop on Open Web Search (WOWS). ECIR 2025 1
Set-Encoder: Permutation-Invariant Inter-passage Attention for Listwise Passage Re-ranking with Cross-Encoders. ECIR 2025 0
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-ranking. ECIR 2025 0
Resources for Combining Teaching and Research in Information Retrieval Coursework. SIGIR 2024 5
Is Google Getting Worse? A Longitudinal Investigation of SEO Spam in Search Engines. ECIR 2024 19
Overview of PAN 2024: Multi-author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification - Extended Abstract. ECIR 2024 71
The Information Retrieval Experiment Platform (Extended Abstract). IJCAI 2024 1
Overview of Touché 2024: Argumentation Systems. ECIR 2024 28
Zero-Shot Generative Large Language Models for Systematic Review Screening Automation. ECIR 2024 28
The Open Web Index - Crawling and Indexing the Web for Public Use. ECIR 2024 7
Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR. SIGIR 2024 16
The First International Workshop on Open Web Search (WOWS). ECIR 2024 5
Analyzing Adversarial Attacks on Sequence-to-Sequence Relevance Models. ECIR 2024 10
ReNeuIR at SIGIR 2024: The Third Workshop on Reaching Efficiency in Neural Information Retrieval. SIGIR 2024 5
Advancing Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications with ImageCLEF 2024. ECIR 2024 2
Evaluating Generative Ad Hoc Information Retrieval. SIGIR 2024 0
Manipulating Embeddings of Stable Diffusion Prompts. IJCAI 2024 0
Overview of Touché 2023: Argument and Causal Retrieval - Extended Abstract. ECIR 2023 9
Overview of PAN 2023: Authorship Verification, Multi-author Writing Style Analysis, Profiling Cryptocurrency Influencers, and Trigger Detection - Extended Abstract. ECIR 2023 36
The Archive Query Log: Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives. SIGIR 2023 12
Dynamic Exploratory Search for the Information Retrieval Anthology. ECIR 2023 0
Shared Tasks as Tutorials: A Methodical Approach. AAAI 2023 7
Smooth Operators for Effective Systematic Review Queries. SIGIR 2023 2
The Information Retrieval Experiment Platform. SIGIR 2023 47
Paraphrase Acquisition from Image Captions. EACL 2023 0
Modeling Appropriate Language in Argumentation. ACL 2023 6
Trigger Warning Assignment as a Multi-Label Document Classification Problem. ACL 2023 18
On Stance Detection in Image Retrieval for Argumentation. SIGIR 2023 7
Indicative Summarization of Long Discussions. EMNLP 2023 1
pybool_ir: A Toolkit for Domain-Specific Search Experiments. SIGIR 2023 3
Bootstrapped nDCG Estimation in the Presence of Unjudged Documents. ECIR 2023 10
Continuous Integration for Reproducible Shared Tasks with TIRA.io. ECIR 2023 198
Mining Health-related Cause-Effect Statements with High Precision at Large Scale. COLING 2022 3
Clickbait Spoiling via Question Answering and Passage Retrieval. ACL 2022 39
CausalQA: A Benchmark for Causal Question Answering. COLING 2022 32
Overview of PAN 2022: Authorship Verification, Profiling Irony and Stereotype Spreaders, Style Change Detection, and Trigger Detection - Extended Abstract. ECIR 2022 38
The Power of Anchor Text in the Neural Retrieval Era. ECIR 2022 4
Overview of Touché 2022: Argument Retrieval - Extended Abstract. ECIR 2022 0
Identifying Queries in Instant Search Logs. SIGIR 2021 0
The Information Retrieval Anthology. SIGIR 2021 12
Overview of PAN 2021: Authorship Verification, Profiling Hate Speech Spreaders on Twitter, and Style Change Detection - Extended Abstract. ECIR 2021 89
An Empirical Comparison of Web Page Segmentation Algorithms. ECIR 2021 13
On Classifying whether Two Texts are on the Same Side of an Argument. EMNLP 2021 14
CopyCat: Near-Duplicates Within and Between the ClueWeb and the Common Crawl. SIGIR 2021 21
Overview of Touché 2021: Argument Retrieval - Extended Abstract. ECIR 2021 0
Estimating Topic Difficulty Using Normalized Discounted Cumulated Gain. CIKM 2020 8
Web Page Segmentation Revisited: Evaluation Framework and Dataset. CIKM 2020 16
Crawling and Preprocessing Mailing Lists At Scale for Dialog Analysis. ACL 2020 8
CauseNet: Towards a Causality Graph Extracted from the Web. CIKM 2020 91
News Editorials: Towards Summarizing Long Argumentative Texts. COLING 2020 16
The Impact of Negative Relevance Judgments on NDCG. CIKM 2020 15
The Effect of Content-Equivalent Near-Duplicates on the Evaluation of Search Engines. ECIR 2020 22
Target Inference in Argument Conclusion Generation. ACL 2020 24
Shared Tasks on Authorship Analysis at PAN 2020. ECIR 2020 19
Abstractive Snippet Generation. WWW 2020 33
Touché: First Shared Task on Argument Retrieval. ECIR 2020 17
Sampling Bias Due to Near-Duplicates in Learning to Rank. SIGIR 2020 18
Efficient Pairwise Annotation of Argument Quality. ACL 2020 31
A Search Engine for Police Press Releases to Double-Check the News. ECIR 2020 0
Bias Analysis and Mitigation in the Evaluation of Authorship Verification. ACL 2019 30
Heuristic Authorship Obfuscation. ACL 2019 26
Debiasing Vandalism Detection Models at Wikidata. WWW 2019 15
A Decade of Shared Tasks in Digital Text Forensics at PAN. ECIR 2019 31
Argument Search: Assessing Argument Relevance. SIGIR 2019 51
Celebrity Profiling. ACL 2019 34
Wikipedia Text Reuse: Within and Without. ECIR 2019 0
A User Study on Snippet Generation: Text Reuse vs. Paraphrases. SIGIR 2018 12
Elastic ChatNoir: Search Engine for the ClueWeb and the Common Crawl. ECIR 2018 73
Crowdsourcing a Large Corpus of Clickbait on Twitter. COLING 2018 83
A Stylometric Inquiry into Hyperpartisan and Fake News. ACL 2018 0
A Large-Scale Query Spelling Correction Corpus. SIGIR 2017 36
WSDM Cup 2017: Vandalism Detection and Triple Scoring. WSDM 2017 30
Source Retrieval for Web-Scale Text Reuse Detection. CIKM 2017 17
Who Wrote the Web? Revisiting Influential Author Identification Research Applicable to Information Retrieval. ECIR 2016 55
Vandalism Detection in Wikidata. CIKM 2016 97
Clickbait Detection. ECIR 2016 230
Twitter Sentiment Detection via Ensemble Classification Using Averaged Confidence Scores. ECIR 2015 45
Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis. SIGIR 2015 34
Improving Cloze Test Performance of Language Learners Using Web N-Grams. COLING 2014 3
Crowdsourcing Interaction Logs to Understand Text Reuse from the Web. ACL 2013 67
ChatNoir: a search engine for the ClueWeb09 corpus. SIGIR 2012 84
Towards optimum query segmentation: in doubt without. CIKM 2012 43
Query segmentation revisited. WWW 2011 82
The power of naive query segmentation. SIGIR 2010 28
Retrieving Customary Web Language to Assist Writers. ECIR 2010 25
Netspeak - Assisting Writers in Choosing Words. ECIR 2010 10
Opinion Summarization of Web Comments. ECIR 2010 40
Cross-Language High Similarity Search: Why No Sub-linear Time Bound Can Be Expected. ECIR 2010 5
Towards comment-based cross-media retrieval. WWW 2010 6
Crowdsourcing a wikipedia vandalism corpus. SIGIR 2010 0
Measuring the descriptiveness of web comments. SIGIR 2009 21
Automatic Vandalism Detection in Wikipedia. ECIR 2008 203
A Wikipedia-Based Multilingual Retrieval Model. ECIR 2008 242
Wikipedia in the pocket: indexing technology for near-duplicate detection and high similarity search. SIGIR 2007 11
Strategies for retrieving plagiarized documents. SIGIR 2007 131
Copyright ©2019 Universität Würzburg

Impressum | Privacy | FAQ