Martin Potthast

110 publications

11 venues

H Index 30

Affiliation

University of Kassel, Kassel, Germany
hessian.AI, Darmstadt, Germany
ScaDS.AI, Dresden and Leipzig, Germany
Leipzig University, Leipzig, Germany
Bauhaus University, Weimar, Germany

Links

Name	Venue	Year	citations
TITE: Token-Independent Text Encoder for Information Retrieval.	SIGIR	2025	2
Large Language Model Relevance Assessors Agree With One Another More Than With Human Assessors.	SIGIR	2025	5
Counterfactual Query Rewriting to Use Historical Relevance Feedback.	ECIR	2025	2
Corpus Subsampling: Estimating the Effectiveness of Neural Retrieval Models on Large Corpora.	ECIR	2025	6
AiReview: An Open Platform for Accelerating Systematic Reviews with LLMs.	SIGIR	2025	2
Web-Scale Retrieval Experimentation with chatnoir-pyterrier.	ECIR	2025	2
ReNeuIR at SIGIR 2025: The Fourth Workshop on Reaching Efficiency in Neural Information Retrieval.	SIGIR	2025	4
Ranking Generated Answers - On the Agreement of Retrieval Models with Humans on Consumer Health Questions.	ECIR	2025	0
The Viability of Crowdsourcing for RAG Evaluation.	SIGIR	2025	6
A Test Collection for Dataset Retrieval.	ECIR	2025	1
Overview of PAN 2025: Generative AI Detection, Multilingual Text Detoxification, Multi-author Writing Style Analysis, and Generative Plagiarism Detection - Extended Abstract.	ECIR	2025	7
Overview of Touché 2025: Argumentation Systems - Extended Abstract.	ECIR	2025	9
ImageCLEF 2025: Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications.	ECIR	2025	1
TIREx Tracker: The Information Retrieval Experiment Tracker.	SIGIR	2025	6
Call for Research on the Impact of Information Retrieval on Social Norms.	ECIR	2025	0
The Second International Workshop on Open Web Search (WOWS).	ECIR	2025	1
Set-Encoder: Permutation-Invariant Inter-passage Attention for Listwise Passage Re-ranking with Cross-Encoders.	ECIR	2025	0
Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-ranking.	ECIR	2025	0
Resources for Combining Teaching and Research in Information Retrieval Coursework.	SIGIR	2024	5
Is Google Getting Worse? A Longitudinal Investigation of SEO Spam in Search Engines.	ECIR	2024	19
Overview of PAN 2024: Multi-author Writing Style Analysis, Multilingual Text Detoxification, Oppositional Thinking Analysis, and Generative AI Authorship Verification - Extended Abstract.	ECIR	2024	71
The Information Retrieval Experiment Platform (Extended Abstract).	IJCAI	2024	1
Overview of Touché 2024: Argumentation Systems.	ECIR	2024	28
Zero-Shot Generative Large Language Models for Systematic Review Screening Automation.	ECIR	2024	28
The Open Web Index - Crawling and Indexing the Web for Public Use.	ECIR	2024	7
Systematic Evaluation of Neural Retrieval Models on the Touché 2020 Argument Retrieval Subset of BEIR.	SIGIR	2024	16
The First International Workshop on Open Web Search (WOWS).	ECIR	2024	5
Analyzing Adversarial Attacks on Sequence-to-Sequence Relevance Models.	ECIR	2024	10
ReNeuIR at SIGIR 2024: The Third Workshop on Reaching Efficiency in Neural Information Retrieval.	SIGIR	2024	5
Advancing Multimedia Retrieval in Medical, Social Media and Content Recommendation Applications with ImageCLEF 2024.	ECIR	2024	2
Evaluating Generative Ad Hoc Information Retrieval.	SIGIR	2024	0
Manipulating Embeddings of Stable Diffusion Prompts.	IJCAI	2024	0
Overview of Touché 2023: Argument and Causal Retrieval - Extended Abstract.	ECIR	2023	9
Overview of PAN 2023: Authorship Verification, Multi-author Writing Style Analysis, Profiling Cryptocurrency Influencers, and Trigger Detection - Extended Abstract.	ECIR	2023	36
The Archive Query Log: Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives.	SIGIR	2023	12
Dynamic Exploratory Search for the Information Retrieval Anthology.	ECIR	2023	0
Shared Tasks as Tutorials: A Methodical Approach.	AAAI	2023	7
Smooth Operators for Effective Systematic Review Queries.	SIGIR	2023	2
The Information Retrieval Experiment Platform.	SIGIR	2023	47
Paraphrase Acquisition from Image Captions.	EACL	2023	0
Modeling Appropriate Language in Argumentation.	ACL	2023	6
Trigger Warning Assignment as a Multi-Label Document Classification Problem.	ACL	2023	18
On Stance Detection in Image Retrieval for Argumentation.	SIGIR	2023	7
Indicative Summarization of Long Discussions.	EMNLP	2023	1
pybool_ir: A Toolkit for Domain-Specific Search Experiments.	SIGIR	2023	3
Bootstrapped nDCG Estimation in the Presence of Unjudged Documents.	ECIR	2023	10
Continuous Integration for Reproducible Shared Tasks with TIRA.io.	ECIR	2023	198
Mining Health-related Cause-Effect Statements with High Precision at Large Scale.	COLING	2022	3
Clickbait Spoiling via Question Answering and Passage Retrieval.	ACL	2022	39
CausalQA: A Benchmark for Causal Question Answering.	COLING	2022	32
Overview of PAN 2022: Authorship Verification, Profiling Irony and Stereotype Spreaders, Style Change Detection, and Trigger Detection - Extended Abstract.	ECIR	2022	38
The Power of Anchor Text in the Neural Retrieval Era.	ECIR	2022	4
Overview of Touché 2022: Argument Retrieval - Extended Abstract.	ECIR	2022	0
Identifying Queries in Instant Search Logs.	SIGIR	2021	0
The Information Retrieval Anthology.	SIGIR	2021	12
Overview of PAN 2021: Authorship Verification, Profiling Hate Speech Spreaders on Twitter, and Style Change Detection - Extended Abstract.	ECIR	2021	89
An Empirical Comparison of Web Page Segmentation Algorithms.	ECIR	2021	13
On Classifying whether Two Texts are on the Same Side of an Argument.	EMNLP	2021	14
CopyCat: Near-Duplicates Within and Between the ClueWeb and the Common Crawl.	SIGIR	2021	21
Overview of Touché 2021: Argument Retrieval - Extended Abstract.	ECIR	2021	0
Estimating Topic Difficulty Using Normalized Discounted Cumulated Gain.	CIKM	2020	8
Web Page Segmentation Revisited: Evaluation Framework and Dataset.	CIKM	2020	16
Crawling and Preprocessing Mailing Lists At Scale for Dialog Analysis.	ACL	2020	8
CauseNet: Towards a Causality Graph Extracted from the Web.	CIKM	2020	91
News Editorials: Towards Summarizing Long Argumentative Texts.	COLING	2020	16
The Impact of Negative Relevance Judgments on NDCG.	CIKM	2020	15
The Effect of Content-Equivalent Near-Duplicates on the Evaluation of Search Engines.	ECIR	2020	22
Target Inference in Argument Conclusion Generation.	ACL	2020	24
Shared Tasks on Authorship Analysis at PAN 2020.	ECIR	2020	19
Abstractive Snippet Generation.	WWW	2020	33
Touché: First Shared Task on Argument Retrieval.	ECIR	2020	17
Sampling Bias Due to Near-Duplicates in Learning to Rank.	SIGIR	2020	18
Efficient Pairwise Annotation of Argument Quality.	ACL	2020	31
A Search Engine for Police Press Releases to Double-Check the News.	ECIR	2020	0
Bias Analysis and Mitigation in the Evaluation of Authorship Verification.	ACL	2019	30
Heuristic Authorship Obfuscation.	ACL	2019	26
Debiasing Vandalism Detection Models at Wikidata.	WWW	2019	15
A Decade of Shared Tasks in Digital Text Forensics at PAN.	ECIR	2019	31
Argument Search: Assessing Argument Relevance.	SIGIR	2019	51
Celebrity Profiling.	ACL	2019	34
Wikipedia Text Reuse: Within and Without.	ECIR	2019	0
A User Study on Snippet Generation: Text Reuse vs. Paraphrases.	SIGIR	2018	12
Elastic ChatNoir: Search Engine for the ClueWeb and the Common Crawl.	ECIR	2018	73
Crowdsourcing a Large Corpus of Clickbait on Twitter.	COLING	2018	83
A Stylometric Inquiry into Hyperpartisan and Fake News.	ACL	2018	0
A Large-Scale Query Spelling Correction Corpus.	SIGIR	2017	36
WSDM Cup 2017: Vandalism Detection and Triple Scoring.	WSDM	2017	30
Source Retrieval for Web-Scale Text Reuse Detection.	CIKM	2017	17
Who Wrote the Web? Revisiting Influential Author Identification Research Applicable to Information Retrieval.	ECIR	2016	55
Vandalism Detection in Wikidata.	CIKM	2016	97
Clickbait Detection.	ECIR	2016	230
Twitter Sentiment Detection via Ensemble Classification Using Averaged Confidence Scores.	ECIR	2015	45
Towards Vandalism Detection in Knowledge Bases: Corpus Construction and Analysis.	SIGIR	2015	34
Improving Cloze Test Performance of Language Learners Using Web N-Grams.	COLING	2014	3
Crowdsourcing Interaction Logs to Understand Text Reuse from the Web.	ACL	2013	67
ChatNoir: a search engine for the ClueWeb09 corpus.	SIGIR	2012	84
Towards optimum query segmentation: in doubt without.	CIKM	2012	43
Query segmentation revisited.	WWW	2011	82
The power of naive query segmentation.	SIGIR	2010	28
Retrieving Customary Web Language to Assist Writers.	ECIR	2010	25
Netspeak - Assisting Writers in Choosing Words.	ECIR	2010	10
Opinion Summarization of Web Comments.	ECIR	2010	40
Cross-Language High Similarity Search: Why No Sub-linear Time Bound Can Be Expected.	ECIR	2010	5
Towards comment-based cross-media retrieval.	WWW	2010	6
Crowdsourcing a wikipedia vandalism corpus.	SIGIR	2010	0
Measuring the descriptiveness of web comments.	SIGIR	2009	21
Automatic Vandalism Detection in Wikipedia.	ECIR	2008	203
A Wikipedia-Based Multilingual Retrieval Model.	ECIR	2008	242
Wikipedia in the pocket: indexing technology for near-duplicate detection and high similarity search.	SIGIR	2007	11
Strategies for retrieving plagiarized documents.	SIGIR	2007	131