Matthias Hagen

63 publications

10 venues

H Index 20


Friedrich-Schiller-Universität Jena, Institut für Informatik, Germany
Martin Luther University of Halle-Wittenberg, Institute of Computer Science, Germany


Name Venue Year Citations
The Information Retrieval Experiment Platform. SIGIR 2023 0
The Archive Query Log: Mining Millions of Search Result Pages of Hundreds of Search Engines from 25 Years of Web Archives. SIGIR 2023 0
Shared Tasks as Tutorials: A Methodical Approach. AAAI 2023 0
Continuous Integration for Reproducible Shared Tasks with ECIR 2023 0
Bootstrapped nDCG Estimation in the Presence of Unjudged Documents. ECIR 2023 0
Overview of Touché 2023: Argument and Causal Retrieval - Extended Abstract. ECIR 2023 0
Paraphrase Acquisition from Image Captions. EACL 2023 0
City of Disguise: A Query Obfuscation Game on the ClueWeb. ECIR 2022 0
Axiomatic Retrieval Experimentation with ir_axioms. SIGIR 2022 2
Towards Understanding and Answering Comparative Questions. WSDM 2022 8
Mining Health-related Cause-Effect Statements with High Precision at Large Scale. COLING 2022 0
Clickbait Spoiling via Question Answering and Passage Retrieval. ACL 2022 1
CausalQA: A Benchmark for Causal Question Answering. COLING 2022 1
The Power of Anchor Text in the Neural Retrieval Era. ECIR 2022 1
The SimIIR 2.0 Framework: User Types, Markov Model-Based Interaction Simulation, and Advanced Query Generation. CIKM 2022 0
Overview of Touché 2022: Argument Retrieval - Extended Abstract. ECIR 2022 0
Query Interpretations from Entity-Linked Segmentations. WSDM 2022 0
Misbeliefs and Biases in Health-Related Searches. CIKM 2021 2
Identifying Queries in Instant Search Logs. SIGIR 2021 0
The Information Retrieval Anthology. SIGIR 2021 2
CopyCat: Near-Duplicates Within and Between the ClueWeb and the Common Crawl. SIGIR 2021 9
Overview of Touché 2021: Argument Retrieval - Extended Abstract. ECIR 2021 0
The Effect of Content-Equivalent Near-Duplicates on the Evaluation of Search Engines. ECIR 2020 15
Touché: First Shared Task on Argument Retrieval. ECIR 2020 10
Sampling Bias Due to Near-Duplicates in Learning to Rank. SIGIR 2020 12
Efficient Pairwise Annotation of Argument Quality. ACL 2020 15
Comparative Web Search Questions. WSDM 2020 21
Abstractive Snippet Generation. WWW 2020 17
Estimating Topic Difficulty Using Normalized Discounted Cumulated Gain. CIKM 2020 2
The Impact of Negative Relevance Judgments on NDCG. CIKM 2020 6
A Search Engine for Police Press Releases to Double-Check the News. ECIR 2020 0
Query-Task Mapping. SIGIR 2019 6
TARGER: Neural Argument Mining at Your Fingertips. ACL 2019 64
Argument Search: Assessing Argument Relevance. SIGIR 2019 40
Heuristic Authorship Obfuscation. ACL 2019 14
Bias Analysis and Mitigation in the Evaluation of Authorship Verification. ACL 2019 21
Wikipedia Text Reuse: Within and Without. ECIR 2019 0
Elastic ChatNoir: Search Engine for the ClueWeb and the Common Crawl. ECIR 2018 37
Toward Voice Query Clarification. SIGIR 2018 46
Crowdsourcing a Large Corpus of Clickbait on Twitter. COLING 2018 66
Modeling Deliberative Argumentation Strategies on Wikipedia. ACL 2018 12
A User Study on Snippet Generation: Text Reuse vs. Paraphrases. SIGIR 2018 8
A Large-Scale Query Spelling Correction Corpus. SIGIR 2017 15
Source Retrieval for Web-Scale Text Reuse Detection. CIKM 2017 16
Patterns of Argumentation Strategies across Topics. EMNLP 2017 24
Clickbait Detection. ECIR 2016 169
A News Editorial Corpus for Mining Argumentation Strategies. COLING 2016 63
Axiomatic Result Re-Ranking. CIKM 2016 19
Supporting Scholarly Search with Keyqueries. ECIR 2016 20
Who Wrote the Web? Revisiting Influential Author Identification Research Applicable to Information Retrieval. ECIR 2016 53
Twitter Sentiment Detection via Ensemble Classification Using Averaged Confidence Scores. ECIR 2015 40
A Corpus of Realistic Known-Item Topics with Associated Web Pages in the ClueWeb09. ECIR 2015 8
What Users Ask a Search Engine: Analyzing One Billion Russian Question Queries. CIKM 2015 18
Improving Cloze Test Performance of Language Learners Using Web N-Grams. COLING 2014 3
Generating Acrostics via Paraphrasing and Heuristic Search. COLING 2014 9
Crowdsourcing Interaction Logs to Understand Text Reuse from the Web. ACL 2013 64
From keywords to keyqueries: content descriptors for the web. SIGIR 2013 24
Towards optimum query segmentation: in doubt without. CIKM 2012 42
ChatNoir: a search engine for the ClueWeb09 corpus. SIGIR 2012 79
Query segmentation revisited. WWW 2011 79
Query session detection as a cascade. CIKM 2011 23
Introducing the User-over-Ranking Hypothesis. ECIR 2011 12
The power of naive query segmentation. SIGIR 2010 29
