Hinrich Schütze

158 publications

19 venues

H Index 44

Affiliation

Ludwig Maximilian University of Munich, Center for Information and Language Processing, Germany
University of Stuttgart, Institute for Natural Language Processing, Germany
Stanford University, CA, USA

Name Venue Year Citations
Understanding In-Context Machine Translation for Low-Resource Languages: A Case Study on Manchu. ACL 2025 5
HYPEROFA: Expanding LLM Vocabulary to New Languages via Hypernetwork-Based Embedding Initialization. ACL 2025 1
Lost in Multilinguality: Dissecting Cross-lingual Factual Inconsistency in Transformer Language Models. ACL 2025 30
Language Mixing in Reasoning Language Models: Patterns, Impact, and Internal Causes. EMNLP 2025 10
Collapse of Dense Retrievers: Short, Early, and Literal Biases Outranking Factual Evidence. ACL 2025 12
On Relation-Specific Neurons in Large Language Models. EMNLP 2025 0
ImpliRet: Benchmarking the Implicit Fact Retrieval Challenge. EMNLP 2025 2
Your Pretrained Model Tells the Difficulty Itself: A Self-Adaptive Curriculum Learning Paradigm for Natural Language Understanding. ACL 2025 2
NoLiMa: Long-Context Evaluation Beyond Literal Matching. ICML 2025 59
LangSAMP: Language-Script Aware Multilingual Pretraining. ACL 2025 0
BMIKE-53: Investigating Cross-Lingual Knowledge Editing with In-Context Learning. ACL 2025 0
TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data. COLING 2025 0
How Transliterations Improve Crosslingual Alignment. COLING 2025 0
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks. EACL 2024 9
A Unified Data Augmentation Framework for Low-Resource Multi-domain Dialogue Generation. ECML/PKDD 2024 2
TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models. ACL 2024 3
HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy. EMNLP 2024 17
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages. NIPS/NeurIPS 2024 14
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models. ACL 2024 142
Kardeş-NLU: Transfer to Low-Resource Languages with Big Brother's Help - A Benchmark and Evaluation for Turkic Languages. EACL 2024 0
ChatZero: Zero-Shot Cross-Lingual Dialogue Generation via Pseudo-Target Language. ECAI 2024 0
Language Models with Rationality. EMNLP 2023 22
Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model. EMNLP 2023 15
How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives. ACL 2023 11
GradSim: Gradient-Based Language Grouping for Effective Multilingual Training. EMNLP 2023 4
A Crosslingual Investigation of Conceptualization in 1335 Languages. ACL 2023 15
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages. ACL 2023 142
PVGRU: Generating Diverse and Relevant Dialogue Responses via Pseudo-Variational Mechanism. ACL 2023 0
Graph-Based Multilingual Label Propagation for Low-Resource Part-of-Speech Tagging. EMNLP 2022 9
Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization. ACL 2022 19
Unsupervised Detection of Contextualized Embedding Bias with Application to Ideology. ICML 2022 1
Flow-Adapter Architecture for Unsupervised Machine Translation. ACL 2022 9
An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model Tokenizers. ACL 2022 72
CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment. ACL 2022 6
The better your Syntax, the better your Semantics? Probing Pretrained Language Models for the English Comparative Correlative. EMNLP 2022 47
CaMEL: Case Marker Extraction without Labels. ACL 2022 3
Improving Scene Graph Classification by Exploiting Knowledge from Texts. AAAI 2022 0
Continuous Entailment Patterns for Lexical Inference in Context. EMNLP 2021 3
Language Models for Lexical Inference in Context. EACL 2021 15
Multilingual LAMA: Investigating Knowledge in Multilingual Pretrained Language Models. EACL 2021 163
Discrete and Soft Prompting for Multilingual Models. EMNLP 2021 78
Does She Wink or Does She Nod? A Challenging Benchmark for Evaluating Word Understanding of Language Models. EACL 2021 5
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief. EMNLP 2021 70
Generating Datasets with Pretrained Language Models. EMNLP 2021 263
Graph Algorithms for Multiparallel Word Alignment. EMNLP 2021 7
Exploiting Cloze-Questions for Few-Shot Text Classification and Natural Language Inference. EACL 2021 0
Few-Shot Text Generation with Natural Language Instructions. EMNLP 2021 0
Neural Topic Modeling with Continual Lifelong Learning. ICML 2020 54
Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification. COLING 2020 223
Increasing Learning Efficiency of Self-Attention Networks through Direct Position Interactions, Learnable Temperature, and Convoluted Attention. COLING 2020 9
Predicting the Growth of Morphological Families from Social and Linguistic Factors. ACL 2020 12
Explainable and Discourse Topic-aware Neural Language Understanding. ICML 2020 7
Combining Word Embeddings with Bilingual Orthography Embeddings for Bilingual Dictionary Induction. COLING 2020 4
Monolingual and Multilingual Reduction of Gender Bias in Contextualized Representations. COLING 2020 44
Masking as an Efficient Alternative to Finetuning for Pretrained Language Models. EMNLP 2020 130
A Graph Auto-encoder Model of Derivational Morphology. ACL 2020 12
DagoBERT: Generating Derivational Morphology with a Pretrained Language Model. EMNLP 2020 33
TRENDNERT: A Benchmark for Trend and Downtrend Detection in a Scientific Domain. AAAI 2020 1
Are Pretrained Language Models Symbolic Reasoners over Knowledge? CoNLL 2020 76
Fine-Grained Argument Unit Recognition and Classification. AAAI 2020 0
Rare Words: A Major Problem for Contextualized Embeddings and How to Fix it by Attentive Mimicking. AAAI 2020 0
Sentence Meta-Embeddings for Unsupervised Semantic Textual Similarity. ACL 2020 0
Negated and Misprimed Probes for Pretrained Language Models: Birds Can Talk, But Cannot Fly. ACL 2020 0
BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Model Performance. ACL 2020 0
An Unsupervised Joint System for Text Generation from Knowledge Graphs and Semantic Parsing. EMNLP 2020 0
Identifying Elements Essential for BERT's Multilinguality. EMNLP 2020 0
Automatic Domain Adaptation Outperforms Manual Domain Adaptation for Predicting Financial Outcomes. ACL 2019 15
SherLIiC: A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference. ACL 2019 14
Type-aware Convolutional Neural Networks for Slot Filling. JAIR 2019 6
Probing for Semantic Classes: Diagnosing the Meaning Content of Word Embeddings. ACL 2019 38
A Multilingual BPE Embedding Space for Universal Sentiment Lexicon Induction. ACL 2019 9
Document Informed Neural Autoregressive Topic Models with Distributional Prior. AAAI 2019 0
Neural Relation Extraction within and across Sentence Boundaries. AAAI 2019 0
Learning Semantic Representations for Novel Words: Leveraging Both Form and Context. AAAI 2019 0
Neural Transductive Learning and Beyond: Morphological Generation in the Minimal-Resource Setting. EMNLP 2018 11
Two Methods for Domain Adaptation of Bilingual Tasks: Delightfully Simple and Broadly Applicable. ACL 2018 16
Multi-View Learning: Multilingual and Multi-Representation Entity Typing. EMNLP 2018 0
Evaluating neural network explanation methods using hybrid documents and morphosyntactic agreement. ACL 2018 88
Recurrent One-Hop Predictions for Reasoning over Knowledge Graphs. COLING 2018 23
Embedding Learning Through Multilingual Concept Induction. ACL 2018 20
End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions. ACL 2018 26
Corpus-Level Fine-Grained Entity Typing. JAIR 2018 0
Task-Specific Attentive Pooling of Phrase Alignments Contributes to Sentence Matching. EACL 2017 19
Global Normalization of Convolutional Neural Networks for Joint Entity and Relation Classification. EMNLP 2017 77
Past, Present, Future: A Computational Investigation of the Typology of Tense in 1000 Languages. EMNLP 2017 49
Multi-level Representations for Fine-Grained Typing of Knowledge Base Entities. EACL 2017 39
One-Shot Neural Cross-Lingual Transfer for Paradigm Completion. ACL 2017 40
End-to-End Trainable Attentive Decoder for Hierarchical Entity Classification. EACL 2017 15
Noise Mitigation for Neural Entity Typing and Relation Extraction. EACL 2017 0
Nonsymbolic Text Representation. EACL 2017 0
Neural Multi-Source Morphological Reinflection. EACL 2017 0
Exploring Different Dimensions of Attention for Uncertainty Detection. EACL 2017 0
Table Filling Multi-Task Recurrent Neural Network for Joint Entity and Relation Extraction. COLING 2016 234
Single-Model Encoder-Decoder with Explicit Morphological Representation for Reinflection. ACL 2016 90
Word Embedding Calculus in Meaningful Ultradense Subspaces. ACL 2016 49
A Piggyback System for Joint Entity Mention Detection and Linking in Web Queries. WWW 2016 34
Learning Word Meta-Embeddings. ACL 2016 108
Morphological Segmentation Inside-Out. EMNLP 2016 16
Neural Morphological Analysis: Encoding-Decoding Canonical Segments. EMNLP 2016 47
Morphological Smoothing and Extrapolation of Word Embeddings. ACL 2016 65
Intrinsic Subspace Evaluation of Word Embedding Representations. ACL 2016 37
LAMB: A Good Shepherd of Morphologically Rich Languages. EMNLP 2016 8
Simple Question Answering by Attentive Convolutional Neural Network. COLING 2016 175
MultiGranCNN: An Architecture for General Matching of Text Chunks on Multiple Levels of Granularity. ACL 2015 80
AutoExtend: Extending Word Embeddings to Embeddings for Synsets and Lexemes. ACL 2015 290
Learning Better Embeddings for Rare Words Using Distributional Representations. EMNLP 2015 9
Joint Lemmatization and Morphological Tagging with Lemming. EMNLP 2015 128
Corpus-level Fine-grained Entity Typing Using Contextual Information. EMNLP 2015 77
Online Updating of Word Representations for Part-of-Speech Tagging. EMNLP 2015 17
Multichannel Variable-Size Convolution for Sentence Classification. CoNLL 2015 162
Labeled Morphological Segmentation with Semi-Markov Models. CoNLL 2015 51
Using Mined Coreference Chains as a Resource for a Semantic Task. EMNLP 2014 26
Dependency parsing with latent refinements of part-of-speech tags. EMNLP 2014 4
Fine-Grained Contextual Predictions for Hard Sentiment Words. EMNLP 2014 0
Unsupervised Training Set Generation for Automatic Acquisition of Technical Terminology in Patents. COLING 2014 45
CoSimRank: A Flexible & Efficient Graph-Theoretic Similarity Measure. ACL 2014 45
Multi-Domain Sentiment Relevance Classification with Automatic Representation Learning. EACL 2014 2
Improving Citation Polarity Classification with Product Reviews. ACL 2014 13
Picking the Amateur's Mind - Predicting Chess Player Strength from Game Annotations. COLING 2014 0
Sentiment Relevance. ACL 2013 24
Bootstrapping Semantic Lexicons for Technical Domains. IJCNLP 2013 10
The Topology of Semantic Knowledge. EMNLP 2013 3
Towards Robust Cross-Domain Domain Adaptation for Part-of-Speech Tagging. IJCNLP 2013 14
Multilingual Lexicon Bootstrapping - Improving a Lexicon Induction System Using a Parallel Corpus. IJCNLP 2013 7
Efficient Higher-Order CRFs for Morphological Tagging. EMNLP 2013 222
Preliminary study of technical terminology for the retrieval of scientific book metadata records. SIGIR 2012 8
Automatic Detection of Point of View Differences in Wikipedia. COLING 2012 16
Crosslingual distant supervision for extracting relations of different complexity. CIKM 2012 13
Automatic generation of short informative sentiment summaries. EACL 2012 13
Towards a Generic and Flexible Citation Classifier Based on a Faceted Classification Scheme. COLING 2012 63
Piggyback: Using Search Engines for Robust Cross-Domain Named Entity Recognition. ACL 2011 41
Integrating history-length interpolation and classes in language modeling. ACL 2011 8
Sense discrimination for physics retrieval. SIGIR 2011 11
Bootstrapping coreference resolution using word associations. ACL 2011 33
Improved Modeling of Out-Of-Vocabulary Words Using Morphological Classes. ACL 2011 10
Active Learning with Amazon Mechanical Turk. EMNLP 2011 91
A Cascaded Classification Approach to Semantic Head Recognition. EMNLP 2011 10
Self-Annotation for fine-grained geospatial relation extraction. COLING 2010 13
Relational feature engineering of natural language processing. CIKM 2010 13
IR, NLP, and Visualization. ECIR 2010 1
Frequency Matters: Pitch Accents and Information Status. EACL 2009 10
Rich Bitext Projection Features for Parse Reranking. EACL 2009 9
Stopping Criteria for Active Learning of Named Entity Recognition. COLING 2008 86
A Graph-theoretic Model of Lexical Syntactic Acquisition. EMNLP 2008 10
Disorder inequality: a combinatorial approach to nearest neighbor search. WSDM 2008 45
Improving active learning recall via disjunctive boolean constraints. SIGIR 2007 1
Performance thresholding in practical text classification. CIKM 2006 59
The Effect of Corpus Size in Combining Supervised and Unsupervised Training for Disambiguation. ACL 2006 9
A Lattice-Based Framework for Enhancing Statistical Parsers with Information from Unlabeled Corpora. CoNLL 2006 2
Inclusion of Textual Documentation in the Analysis of Multidimensional Data Sets: Application to Gene Expression Data. MLJ 2003 14
Projections for Efficient Document Clustering. SIGIR 1997 292
Automatic Detection of Text Genre. ACL 1997 508
Method Combination For Document Filtering. SIGIR 1996 130
A Comparison of Classifiers and Document Representations for the Routing Problem. SIGIR 1995 573
Part-of-Speech Tagging using a Variable Memory Markov Model. ACL 1994 74
Part-of-Speech Induction from Scratch. ACL 1993 159
Word Space. NIPS/NeurIPS 1992 325
Communication and Inference through Situations. IJCAI 1991 19