Alexander G. Schwing

129 publications

14 venues

H Index 50

Name Venue Year citations
Variational Rectified Flow Matching. ICML 2025 18
LIFe-GoM: Generalizable Human Rendering with Learned Iterative Feedback Over Multi-Resolution Gaussians-on-Mesh. ICLR 2025 8
Towards Hierarchical Rectified Flow. ICLR 2025 10
MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis. CVPR 2025 0
MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds. CVPR 2025 0
RELOCATE: A Simple Training-Free Baseline for Visual Query Localization Using Region-Based Representations. CVPR 2025 0
OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and Captioning. NIPS/NeurIPS 2024 5
GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh. CVPR 2024 56
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers. NIPS/NeurIPS 2024 5
NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows. CVPR 2024 2
Layer Collaboration in the Forward-Forward Algorithm. AAAI 2024 0
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching. ICML 2024 0
Putting the Object Back into Video Object Segmentation. CVPR 2024 0
Pseudo-Generalized Dynamic View Synthesis from a Video. ICLR 2024 0
Robust Model-Based Optimization for Challenging Fitness Landscapes. ICLR 2024 0
AutoFocusFormer: Image Segmentation off the Grid. CVPR 2023 17
Tracking Anything with Decoupled Video Segmentation. ICCV 2023 222
Diffusion Probabilistic Fields. ICLR 2023 32
A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories. NIPS/NeurIPS 2023 7
Context-Aware Relative Object Queries to Unify Video Instance and Panoptic Segmentation. CVPR 2023 10
Surface Snapping Optimization Layer for Single Image Object Shape Reconstruction. ICML 2023 1
Occupancy Planes for Single-View RGB-D Human Reconstruction. AAAI 2023 0
RGB-Only Reconstruction of Tabletop Scenes for Collision-Free Manipulator Control. ICRA 2023 0
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation. CVPR 2023 0
Learning to Decompose Visual Features with Latent Textual Prompts. ICLR 2023 0
Total Variation Optimization Layers for Computer Vision. CVPR 2022 17
DigGAN: Discriminator gradIent Gap Regularization for GAN Training with Limited Data. NIPS/NeurIPS 2022 25
CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning with Demonstrations. NIPS/NeurIPS 2022 3
Equivariance Discovery by Learned Parameter-Sharing. AISTATS 2022 20
Neural Volumetric Object Selection. CVPR 2022 71
Joint Forecasting of Panoptic Segmentations with Difference Attention. CVPR 2022 0
Initialization and Alignment for Adversarial Texture Optimization. ECCV 2022 2
Learnable Polyphase Sampling for Shift Invariant and Equivariant Convolutional Networks. NIPS/NeurIPS 2022 19
Generative Multiplane Images: Making a 2D GAN 3D-Aware. ECCV 2022 74
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model. ECCV 2022 590
On the Importance of Gradient Norm in PAC-Bayesian Bounds. NIPS/NeurIPS 2022 6
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding. AAAI 2022 0
Asking for Knowledge (AFK): Training RL Agents to Query External Knowledge Using Language. ICML 2022 0
Masked-attention Mask Transformer for Universal Image Segmentation. CVPR 2022 0
Enjoy Your Editing: Controllable GANs for Image Editing via Latent Space Navigation. ICLR 2021 86
GridToPix: Training Embodied Agents with Minimal Supervision. ICCV 2021 24
Panoptic Segmentation Forecasting. CVPR 2021 15
SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction From Video Data. CVPR 2021 44
The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation. ICCV 2021 50
Class-agnostic Reconstruction of Dynamic Objects from Videos. NIPS/NeurIPS 2021 9
FuseRec: fusing user and item homophily modeling with temporal recommender systems. DMKD 2021 7
Per-Pixel Classification is Not All You Need for Semantic Segmentation. NIPS/NeurIPS 2021 1917
DeepQAMVS: Query-Aware Hierarchical Pointer Networks for Multi-Video Summarization. SIGIR 2021 14
Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents. ICCV 2021 30
Perceptual Score: What Data Modalities Does Your Model Perceive? NIPS/NeurIPS 2021 45
Assignment-Space-based Multi-Object Tracking and Segmentation. ICCV 2021 9
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning. ICML 2021 131
3D Spatial Recognition Without Spatially Labeled 3D. CVPR 2021 67
Semantic Tracklets: An Object-Centric Representation for Visual Multi-Agent Reinforcement Learning. IROS 2021 21
Bridging the Imitation Gap by Adaptive Insubordination. NIPS/NeurIPS 2021 0
A Contrastive Learning Approach for Training Variational Autoencoder Priors. NIPS/NeurIPS 2021 0
Towards a Better Global Loss Landscape of GANs. NIPS/NeurIPS 2020 36
Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning. NIPS/NeurIPS 2020 101
Agriculture-Vision: A Large Aerial Image Database for Agricultural Pattern Analysis. CVPR 2020 198
Dynamic Neural Relational Inference. CVPR 2020 71
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies. NIPS/NeurIPS 2020 103
Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection. CVPR 2020 223
Proposal-Based Video Completion. ECCV 2020 43
Spatially Aware Multimodal Transformers for TextVQA. ECCV 2020 95
High-Throughput Synchronous Deep RL. NIPS/NeurIPS 2020 13
Can We Learn Heuristics for Graphical Model Inference Using Reinforcement Learning? CVPR 2020 5
A Cordial Sync: Going Beyond Marginal Policies for Multi-agent Embodied Tasks. ECCV 2020 65
UFO ECCV 2020 0
Chirality Nets for Human Pose Regression. NIPS/NeurIPS 2019 59
Diverse Generation for Multi-Agent Sports Games. CVPR 2019 109
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning. ICCV 2019 71
Factor Graph Attention. CVPR 2019 112
ViCo: Word Embeddings From Visual Co-Occurrences. ICCV 2019 27
TAB-VCR: Tags and Attributes based VCR Baselines. NIPS/NeurIPS 2019 25
PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning. CoRL 2019 90
Graph Structured Prediction Energy Networks. NIPS/NeurIPS 2019 19
Two Body Problem: Collaborative Visual Task Completion. CVPR 2019 75
Max-Sliced Wasserstein Distance and Its Use for GANs. CVPR 2019 228
Co-Generation with GANs using AIS based HMC. NIPS/NeurIPS 2019 2
A Simple Baseline for Audio-Visual Scene-Aware Dialog. CVPR 2019 85
Fast, Diverse and Accurate Image Captioning Guided by Part-Of-Speech. CVPR 2019 0
SAIL-VOS: Semantic Amodal Instance Level Video Object Segmentation - A Synthetic Dataset and Baselines. CVPR 2019 0
No-Frills Human-Object Interaction Detection: Factorization, Layout Encodings, and Training Techniques. ICCV 2019 0
VideoMatch: Matching Based Video Object Segmentation. ECCV 2018 295
Diverse and Coherent Paragraph Generation from Images. ECCV 2018 67
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering. ECCV 2018 116
Generative Modeling Using the Sliced Wasserstein Distance. CVPR 2018 249
Pipe-SGD: A Decentralized Pipelined SGD Framework for Distributed Deep Net Training. NIPS/NeurIPS 2018 109
Two Can Play This Game: Visual Dialog With Discriminative Question Generation and Answering. CVPR 2018 82
Deep Structured Prediction with Nonlinear Output Transformations. NIPS/NeurIPS 2018 26
Unsupervised Video Object Segmentation Using Motion Saliency-Guided Spatio-Temporal Propagation. ECCV 2018 103
Structural Consistency and Controllability for Diverse Colorization. ECCV 2018 52
Unsupervised Textual Grounding: Linking Words to Image Concepts. CVPR 2018 44
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering. NIPS/NeurIPS 2018 11
GradiVeQ: Vector Quantization for Bandwidth-Efficient Gradient Aggregation in Distributed CNN Training. NIPS/NeurIPS 2018 69
Convolutional Image Captioning. CVPR 2018 0
High-Order Attention Models for Visual Question Answering. NIPS/NeurIPS 2017 105
Asynchronous Parallel Coordinate Minimization for MAP Inference. NIPS/NeurIPS 2017 6
Dualing GANs. NIPS/NeurIPS 2017 21
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space. NIPS/NeurIPS 2017 180
Creativity: Generating Diverse Questions Using Variational Autoencoders. CVPR 2017 158
Semantic Image Inpainting with Deep Generative Models. CVPR 2017 0
MaskRNN: Instance Level Video Object Segmentation. NIPS/NeurIPS 2017 0
Interpretable and Globally Optimal Prediction for Textual Grounding using Image Concepts. NIPS/NeurIPS 2017 0
Efficient Deep Learning for Stereo Matching. CVPR 2016 769
Blending Learning and Inference in Conditional Random Fields. JMLR 2016 9
Constraints Based Convex Belief Propagation. NIPS/NeurIPS 2016 0
Learning Deep Parsimonious Representations. NIPS/NeurIPS 2016 100
Training Deep Neural Networks via Direct Loss Minimization. ICML 2016 0
Rent3D: Floor-plan priors for monocular layout estimation. CVPR 2015 89
Monocular Object Instance Segmentation and Depth Ordering with CNNs. ICCV 2015 164
Smooth and Strong: MAP Inference with Linear Convergence. NIPS/NeurIPS 2015 33
Learning to segment under various forms of weak supervision. CVPR 2015 188
Learning Deep Structured Models. ICML 2015 0
Computational Education using Latent Structured Prediction. AISTATS 2014 5
Efficient Structured Parsing of Facades Using Dynamic Programming. CVPR 2014 64
Globally Convergent Parallel MAP LP Relaxation Solver using the Frank-Wolfe Algorithm. ICML 2014 20
Message Passing Inference for Large Scale Graphical Models with High Order Potentials. NIPS/NeurIPS 2014 4
Efficient Inference of Continuous Markov Random Fields with Polynomial Potentials. NIPS/NeurIPS 2014 20
Tell Me What You See and I Will Show You Where It Is. CVPR 2014 98
Latent Structured Active Learning. NIPS/NeurIPS 2013 118
Estimating the 3D Layout of Indoor Scenes and Its Clutter from Depth Sensors. ICCV 2013 78
Box in the Box: Joint 3D Layout and Object Reasoning from Single Images. ICCV 2013 135
Efficient Structured Prediction with Latent Variables for General Graphical Models. ICML 2012 71
Efficient Exact Inference for 3D Indoor Scene Understanding. ECCV 2012 122
Globally Convergent Dual MAP LP Relaxation Solvers using Fenchel-Young Margins. NIPS/NeurIPS 2012 34
Efficient structured prediction for 3D indoor scene understanding. CVPR 2012 136
Distributed message passing for large scale graphical models. CVPR 2011 83
Adaptive random forest - How many "experts" to ask before making a decision? CVPR 2011 0
Copyright ©2019 Universität Würzburg

Impressum | Privacy | FAQ