Hinrich Schütze

147 publications

19 venues

H Index 37

Affiliation

Ludwig Maximilian University of Munich, Center for Information and Language Processing, Germany
University of Stuttgart, Institute for Natural Language Processing, Germany
Stanford University, CA, USA

Links

Name	Venue	Year	citations
TransMI: A Framework to Create Strong Baselines from Multilingual Pretrained Language Models for Transliterated Data.	COLING	2025	0
How Transliterations Improve Crosslingual Alignment.	COLING	2025	0
A Unified Data Augmentation Framework for Low-Resource Multi-domain Dialogue Generation.	ECML/PKDD	2024	0
Political Compass or Spinning Arrow? Towards More Meaningful Evaluations for Values and Opinions in Large Language Models.	ACL	2024	0
TransliCo: A Contrastive Learning Framework to Address the Script Barrier in Multilingual Pretrained Language Models.	ACL	2024	0
Kardeş-NLU: Transfer to Low-Resource Languages with Big Brother's Help - A Benchmark and Evaluation for Turkic Languages.	EACL	2024	0
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling Tasks.	EACL	2024	0
ChatZero: Zero-Shot Cross-Lingual Dialogue Generation via Pseudo-Target Language.	ECAI	2024	0
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages.	NIPS/NeurIPS	2024	0
HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy.	EMNLP	2024	0
Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages.	ACL	2023	0
How to Distill your BERT: An Empirical Study on the Impact of Weight Initialisation and Distillation Objectives.	ACL	2023	0
PVGRU: Generating Diverse and Relevant Dialogue Responses via Pseudo-Variational Mechanism.	ACL	2023	0
A Crosslingual Investigation of Conceptualization in 1335 Languages.	ACL	2023	0
Language Models with Rationality.	EMNLP	2023	0
GradSim: Gradient-Based Language Grouping for Effective Multilingual Training.	EMNLP	2023	0
Counting the Bugs in ChatGPT's Wugs: A Multilingual Investigation into the Morphological Capabilities of a Large Language Model.	EMNLP	2023	0
An Embarrassingly Simple Method to Mitigate Undesirable Properties of Pretrained Language Model Tokenizers.	ACL	2022	4
CoDA21: Evaluating Language Understanding Capabilities of NLP Models With Context-Definition Alignment.	ACL	2022	2
The better your Syntax, the better your Semantics? Probing Pretrained Language Models for the English Comparative Correlative.	EMNLP	2022	2
Graph-Based Multilingual Label Propagation for Low-Resource Part-of-Speech Tagging.	EMNLP	2022	0
Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization.	ACL	2022	3
Unsupervised Detection of Contextualized Embedding Bias with Application to Ideology.	ICML	2022	0
CaMEL: Case Marker Extraction without Labels.	ACL	2022	0
Flow-Adapter Architecture for Unsupervised Machine Translation.	ACL	2022	0
Improving Scene Graph Classification by Exploiting Knowledge from Texts.	AAAI	2022	0
BeliefBank: Adding Memory to a Pre-Trained Language Model for a Systematic Notion of Belief.	EMNLP	2021	24
Continuous Entailment Patterns for Lexical Inference in Context.	EMNLP	2021	3
Multilingual LAMA: Investigating Knowledge in Multilingual Pretrained Language Models.	EACL	2021	39
Graph Algorithms for Multiparallel Word Alignment.	EMNLP	2021	4
Language Models for Lexical Inference in Context.	EACL	2021	8
Few-Shot Text Generation with Natural Language Instructions.	EMNLP	2021	45
Generating Datasets with Pretrained Language Models.	EMNLP	2021	65
Discrete and Soft Prompting for Multilingual Models.	EMNLP	2021	26
Does She Wink or Does She Nod? A Challenging Benchmark for Evaluating Word Understanding of Language Models.	EACL	2021	3
Exploiting Cloze-Questions for Few-Shot Text Classification and Natural Language Inference.	EACL	2021	0
Masking as an Efficient Alternative to Finetuning for Pretrained Language Models.	EMNLP	2020	48
A Graph Auto-encoder Model of Derivational Morphology.	ACL	2020	9
Explainable and Discourse Topic-aware Neural Language Understanding.	ICML	2020	3
Increasing Learning Efficiency of Self-Attention Networks through Direct Position Interactions, Learnable Temperature, and Convoluted Attention.	COLING	2020	3
Combining Word Embeddings with Bilingual Orthography Embeddings for Bilingual Dictionary Induction.	COLING	2020	2
Are Pretrained Language Models Symbolic Reasoners over Knowledge?	CoNLL	2020	36
Automatically Identifying Words That Can Serve as Labels for Few-Shot Text Classification.	COLING	2020	81
Monolingual and Multilingual Reduction of Gender Bias in Contextualized Representations.	COLING	2020	19
DagoBERT: Generating Derivational Morphology with a Pretrained Language Model.	EMNLP	2020	17
Predicting the Growth of Morphological Families from Social and Linguistic Factors.	ACL	2020	10
TRENDNERT: A Benchmark for Trend and Downtrend Detection in a Scientific Domain.	AAAI	2020	1
Neural Topic Modeling with Continual Lifelong Learning.	ICML	2020	19
Fine-Grained Argument Unit Recognition and Classification.	AAAI	2020	0
Rare Words: A Major Problem for Contextualized Embeddings and How to Fix it by Attentive Mimicking.	AAAI	2020	0
Sentence Meta-Embeddings for Unsupervised Semantic Textual Similarity.	ACL	2020	0
Negated and Misprimed Probes for Pretrained Language Models: Birds Can Talk, But Cannot Fly.	ACL	2020	0
BERTRAM: Improved Word Embeddings Have Big Impact on Contextualized Model Performance.	ACL	2020	0
An Unsupervised Joint System for Text Generation from Knowledge Graphs and Semantic Parsing.	EMNLP	2020	0
Identifying Elements Essential for BERT's Multilinguality.	EMNLP	2020	0
SherLIiC: A Typed Event-Focused Lexical Inference Benchmark for Evaluating Natural Language Inference.	ACL	2019	11
Automatic Domain Adaptation Outperforms Manual Domain Adaptation for Predicting Financial Outcomes.	ACL	2019	12
Probing for Semantic Classes: Diagnosing the Meaning Content of Word Embeddings.	ACL	2019	28
A Multilingual BPE Embedding Space for Universal Sentiment Lexicon Induction.	ACL	2019	6
Type-aware Convolutional Neural Networks for Slot Filling.	JAIR	2019	4
Document Informed Neural Autoregressive Topic Models with Distributional Prior.	AAAI	2019	0
Neural Relation Extraction within and across Sentence Boundaries.	AAAI	2019	0
Learning Semantic Representations for Novel Words: Leveraging Both Form and Context.	AAAI	2019	0
Neural Transductive Learning and Beyond: Morphological Generation in the Minimal-Resource Setting.	EMNLP	2018	10
Two Methods for Domain Adaptation of Bilingual Tasks: Delightfully Simple and Broadly Applicable.	ACL	2018	17
Embedding Learning Through Multilingual Concept Induction.	ACL	2018	16
End-Task Oriented Textual Entailment via Deep Explorations of Inter-Sentence Interactions.	ACL	2018	21
Recurrent One-Hop Predictions for Reasoning over Knowledge Graphs.	COLING	2018	19
Evaluating neural network explanation methods using hybrid documents and morphosyntactic agreement.	ACL	2018	67
Multi-View Learning: Multilingual and Multi-Representation Entity Typing.	EMNLP	2018	0
Corpus-Level Fine-Grained Entity Typing.	JAIR	2018	0
Global Normalization of Convolutional Neural Networks for Joint Entity and Relation Classification.	EMNLP	2017	71
Task-Specific Attentive Pooling of Phrase Alignments Contributes to Sentence Matching.	EACL	2017	18
Past, Present, Future: A Computational Investigation of the Typology of Tense in 1000 Languages.	EMNLP	2017	38
Multi-level Representations for Fine-Grained Typing of Knowledge Base Entities.	EACL	2017	37
End-to-End Trainable Attentive Decoder for Hierarchical Entity Classification.	EACL	2017	14
One-Shot Neural Cross-Lingual Transfer for Paradigm Completion.	ACL	2017	39
Noise Mitigation for Neural Entity Typing and Relation Extraction.	EACL	2017	0
Nonsymbolic Text Representation.	EACL	2017	0
Neural Multi-Source Morphological Reinflection.	EACL	2017	0
Exploring Different Dimensions of Attention for Uncertainty Detection.	EACL	2017	0
A Piggyback System for Joint Entity Mention Detection and Linking in Web Queries.	WWW	2016	35
Learning Word Meta-Embeddings.	ACL	2016	90
Intrinsic Subspace Evaluation of Word Embedding Representations.	ACL	2016	33
Morphological Segmentation Inside-Out.	EMNLP	2016	10
Morphological Smoothing and Extrapolation of Word Embeddings.	ACL	2016	63
Simple Question Answering by Attentive Convolutional Neural Network.	COLING	2016	147
Neural Morphological Analysis: Encoding-Decoding Canonical Segments.	EMNLP	2016	40
Single-Model Encoder-Decoder with Explicit Morphological Representation for Reinflection.	ACL	2016	83
LAMB: A Good Shepherd of Morphologically Rich Languages.	EMNLP	2016	8
Table Filling Multi-Task Recurrent Neural Network for Joint Entity and Relation Extraction.	COLING	2016	168
Word Embedding Calculus in Meaningful Ultradense Subspaces.	ACL	2016	45
Joint Lemmatization and Morphological Tagging with Lemming.	EMNLP	2015	100
Labeled Morphological Segmentation with Semi-Markov Models.	CoNLL	2015	43
Corpus-level Fine-grained Entity Typing Using Contextual Information.	EMNLP	2015	67
Online Updating of Word Representations for Part-of-Speech Tagging.	EMNLP	2015	18
AutoExtend: Extending Word Embeddings to Embeddings for Synsets and Lexemes.	ACL	2015	290
MultiGranCNN: An Architecture for General Matching of Text Chunks on Multiple Levels of Granularity.	ACL	2015	69
Multichannel Variable-Size Convolution for Sentence Classification.	CoNLL	2015	142
Learning Better Embeddings for Rare Words Using Distributional Representations.	EMNLP	2015	9
Using Mined Coreference Chains as a Resource for a Semantic Task.	EMNLP	2014	25
Improving Citation Polarity Classification with Product Reviews.	ACL	2014	12
CoSimRank: A Flexible & Efficient Graph-Theoretic Similarity Measure.	ACL	2014	34
Unsupervised Training Set Generation for Automatic Acquisition of Technical Terminology in Patents.	COLING	2014	40
Fine-Grained Contextual Predictions for Hard Sentiment Words.	EMNLP	2014	0
Multi-Domain Sentiment Relevance Classification with Automatic Representation Learning.	EACL	2014	2
Dependency parsing with latent refinements of part-of-speech tags.	EMNLP	2014	4
Picking the Amateur's Mind - Predicting Chess Player Strength from Game Annotations.	COLING	2014	0
Efficient Higher-Order CRFs for Morphological Tagging.	EMNLP	2013	209
Sentiment Relevance.	ACL	2013	23
Towards Robust Cross-Domain Domain Adaptation for Part-of-Speech Tagging.	IJCNLP	2013	12
The Topology of Semantic Knowledge.	EMNLP	2013	3
Multilingual Lexicon Bootstrapping - Improving a Lexicon Induction System Using a Parallel Corpus.	IJCNLP	2013	5
Bootstrapping Semantic Lexicons for Technical Domains.	IJCNLP	2013	9
Automatic generation of short informative sentiment summaries.	EACL	2012	13
Towards a Generic and Flexible Citation Classifier Based on a Faceted Classification Scheme.	COLING	2012	51
Preliminary study of technical terminology for the retrieval of scientific book metadata records.	SIGIR	2012	8
Automatic Detection of Point of View Differences in Wikipedia.	COLING	2012	14
Crosslingual distant supervision for extracting relations of different complexity.	CIKM	2012	12
Sense discrimination for physics retrieval.	SIGIR	2011	10
A Cascaded Classification Approach to Semantic Head Recognition.	EMNLP	2011	10
Integrating history-length interpolation and classes in language modeling.	ACL	2011	9
Piggyback: Using Search Engines for Robust Cross-Domain Named Entity Recognition.	ACL	2011	38
Bootstrapping coreference resolution using word associations.	ACL	2011	30
Active Learning with Amazon Mechanical Turk.	EMNLP	2011	86
Improved Modeling of Out-Of-Vocabulary Words Using Morphological Classes.	ACL	2011	10
Relational feature engineering of natural language processing.	CIKM	2010	15
IR, NLP, and Visualization.	ECIR	2010	1
Self-Annotation for fine-grained geospatial relation extraction.	COLING	2010	13
Frequency Matters: Pitch Accents and Information Status.	EACL	2009	11
Rich Bitext Projection Features for Parse Reranking.	EACL	2009	10
Stopping Criteria for Active Learning of Named Entity Recognition.	COLING	2008	84
A Graph-theoretic Model of Lexical Syntactic Acquisition.	EMNLP	2008	10
Disorder inequality: a combinatorial approach to nearest neighbor search.	WSDM	2008	44
Improving active learning recall via disjunctive boolean constraints.	SIGIR	2007	1
The Effect of Corpus Size in Combining Supervised and Unsupervised Training for Disambiguation.	ACL	2006	9
Performance thresholding in practical text classification.	CIKM	2006	54
A Lattice-Based Framework for Enhancing Statistical Parsers with Information from Unlabeled Corpora.	CoNLL	2006	2
Inclusion of Textual Documentation in the Analysis of Multidimensional Data Sets: Application to Gene Expression Data.	MLJ	2003	14
Automatic Detection of Text Genre.	ACL	1997	491
Projections for Efficient Document Clustering.	SIGIR	1997	7
Method Combination For Document Filtering.	SIGIR	1996	133
A Comparison of Classifiers and Document Representations for the Routing Problem.	SIGIR	1995	580
Part-of-Speech Tagging using a Variable Memory Markov Model.	ACL	1994	76
Part-of-Speech Induction from Scratch.	ACL	1993	151
Word Space.	NIPS/NeurIPS	1992	315
Communication and Inference through Situations.	IJCAI	1991	17