Layer Collaboration in the Forward-Forward Algorithm.
|
AAAI |
2024 |
0 |
Offline Imitation from Observation via Primal Wasserstein State Occupancy Matching.
|
ICML |
2024 |
0 |
Putting the Object Back into Video Object Segmentation.
|
CVPR |
2024 |
0 |
GoMAvatar: Efficient Animatable Human Modeling from Monocular Video Using Gaussians-on-Mesh.
|
CVPR |
2024 |
0 |
NeRFDeformer: NeRF Transformation from a Single View via 3D Scene Flows.
|
CVPR |
2024 |
0 |
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers.
|
NIPS/NeurIPS |
2024 |
0 |
OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and Captioning.
|
NIPS/NeurIPS |
2024 |
0 |
Pseudo-Generalized Dynamic View Synthesis from a Video.
|
ICLR |
2024 |
0 |
Robust Model-Based Optimization for Challenging Fitness Landscapes.
|
ICLR |
2024 |
0 |
Occupancy Planes for Single-View RGB-D Human Reconstruction.
|
AAAI |
2023 |
0 |
Surface Snapping Optimization Layer for Single Image Object Shape Reconstruction.
|
ICML |
2023 |
0 |
RGB-Only Reconstruction of Tabletop Scenes for Collision-Free Manipulator Control.
|
ICRA |
2023 |
0 |
Context-Aware Relative Object Queries to Unify Video Instance and Panoptic Segmentation.
|
CVPR |
2023 |
0 |
SDFusion: Multimodal 3D Shape Completion, Reconstruction, and Generation.
|
CVPR |
2023 |
0 |
AutoFocusFormer: Image Segmentation off the Grid.
|
CVPR |
2023 |
0 |
A Simple Solution for Offline Imitation from Observations and Examples with Possibly Incomplete Trajectories.
|
NIPS/NeurIPS |
2023 |
0 |
Learning to Decompose Visual Features with Latent Textual Prompts.
|
ICLR |
2023 |
0 |
Diffusion Probabilistic Fields.
|
ICLR |
2023 |
0 |
Tracking Anything with Decoupled Video Segmentation.
|
ICCV |
2023 |
0 |
Initialization and Alignment for Adversarial Texture Optimization.
|
ECCV |
2022 |
0 |
Joint Forecasting of Panoptic Segmentations with Difference Attention.
|
CVPR |
2022 |
0 |
Equivariance Discovery by Learned Parameter-Sharing.
|
AISTATS |
2022 |
3 |
Generative Multiplane Images: Making a 2D GAN 3D-Aware.
|
ECCV |
2022 |
18 |
Total Variation Optimization Layers for Computer Vision.
|
CVPR |
2022 |
0 |
Neural Volumetric Object Selection.
|
CVPR |
2022 |
5 |
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model.
|
ECCV |
2022 |
9 |
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding.
|
AAAI |
2022 |
0 |
Asking for Knowledge (AFK): Training RL Agents to Query External Knowledge Using Language.
|
ICML |
2022 |
0 |
Masked-attention Mask Transformer for Universal Image Segmentation.
|
CVPR |
2022 |
0 |
CEIP: Combining Explicit and Implicit Priors for Reinforcement Learning with Demonstrations.
|
NIPS/NeurIPS |
2022 |
0 |
DigGAN: Discriminator gradIent Gap Regularization for GAN Training with Limited Data.
|
NIPS/NeurIPS |
2022 |
0 |
On the Importance of Gradient Norm in PAC-Bayesian Bounds.
|
NIPS/NeurIPS |
2022 |
0 |
Learnable Polyphase Sampling for Shift Invariant and Equivariant Convolutional Networks.
|
NIPS/NeurIPS |
2022 |
0 |
Panoptic Segmentation Forecasting.
|
CVPR |
2021 |
0 |
GridToPix: Training Embodied Agents with Minimal Supervision.
|
ICCV |
2021 |
12 |
FuseRec: fusing user and item homophily modeling with temporal recommender systems.
|
DMKD |
2021 |
4 |
Interpretation of Emergent Communication in Heterogeneous Collaborative Embodied Agents.
|
ICCV |
2021 |
8 |
Per-Pixel Classification is Not All You Need for Semantic Segmentation.
|
NIPS/NeurIPS |
2021 |
307 |
3D Spatial Recognition Without Spatially Labeled 3D.
|
CVPR |
2021 |
21 |
Enjoy Your Editing: Controllable GANs for Image Editing via Latent Space Navigation.
|
ICLR |
2021 |
37 |
The Surprising Effectiveness of Visual Odometry Techniques for Embodied PointGoal Navigation.
|
ICCV |
2021 |
16 |
SAIL-VOS 3D: A Synthetic Dataset and Baselines for Object Detection and 3D Mesh Reconstruction From Video Data.
|
CVPR |
2021 |
10 |
Class-agnostic Reconstruction of Dynamic Objects from Videos.
|
NIPS/NeurIPS |
2021 |
3 |
Cooperative Exploration for Multi-Agent Deep Reinforcement Learning.
|
ICML |
2021 |
31 |
Semantic Tracklets: An Object-Centric Representation for Visual Multi-Agent Reinforcement Learning.
|
IROS |
2021 |
6 |
DeepQAMVS: Query-Aware Hierarchical Pointer Networks for Multi-Video Summarization.
|
SIGIR |
2021 |
3 |
Perceptual Score: What Data Modalities Does Your Model Perceive?
|
NIPS/NeurIPS |
2021 |
9 |
Assignment-Space-based Multi-Object Tracking and Segmentation.
|
ICCV |
2021 |
0 |
Bridging the Imitation Gap by Adaptive Insubordination.
|
NIPS/NeurIPS |
2021 |
0 |
A Contrastive Learning Approach for Training Variational Autoencoder Priors.
|
NIPS/NeurIPS |
2021 |
0 |
Instance-Aware, Context-Focused, and Memory-Efficient Weakly Supervised Object Detection.
|
CVPR |
2020 |
104 |
A Cordial Sync: Going Beyond Marginal Policies for Multi-agent Embodied Tasks.
|
ECCV |
2020 |
32 |
Not All Unlabeled Data are Equal: Learning to Weight Data in Semi-supervised Learning.
|
NIPS/NeurIPS |
2020 |
55 |
Agriculture-Vision: A Large Aerial Image Database for Agricultural Pattern Analysis.
|
CVPR |
2020 |
81 |
Dynamic Neural Relational Inference.
|
CVPR |
2020 |
30 |
Can We Learn Heuristics for Graphical Model Inference Using Reinforcement Learning?
|
CVPR |
2020 |
3 |
Spatially Aware Multimodal Transformers for TextVQA.
|
ECCV |
2020 |
50 |
Towards a Better Global Loss Landscape of GANs.
|
NIPS/NeurIPS |
2020 |
15 |
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies.
|
NIPS/NeurIPS |
2020 |
37 |
Proposal-Based Video Completion.
|
ECCV |
2020 |
9 |
High-Throughput Synchronous Deep RL.
|
NIPS/NeurIPS |
2020 |
10 |
UFO
|
ECCV |
2020 |
0 |
A Simple Baseline for Audio-Visual Scene-Aware Dialog.
|
CVPR |
2019 |
32 |
Diverse Generation for Multi-Agent Sports Games.
|
CVPR |
2019 |
55 |
Sequential Latent Spaces for Modeling the Intention During Diverse Image Captioning.
|
ICCV |
2019 |
40 |
PIC: Permutation Invariant Critic for Multi-Agent Deep Reinforcement Learning.
|
CoRL |
2019 |
55 |
ViCo: Word Embeddings From Visual Co-Occurrences.
|
ICCV |
2019 |
19 |
Two Body Problem: Collaborative Visual Task Completion.
|
CVPR |
2019 |
47 |
Factor Graph Attention.
|
CVPR |
2019 |
86 |
TAB-VCR: Tags and Attributes based VCR Baselines.
|
NIPS/NeurIPS |
2019 |
16 |
Graph Structured Prediction Energy Networks.
|
NIPS/NeurIPS |
2019 |
11 |
Max-Sliced Wasserstein Distance and Its Use for GANs.
|
CVPR |
2019 |
113 |
Co-Generation with GANs using AIS based HMC.
|
NIPS/NeurIPS |
2019 |
2 |
Chirality Nets for Human Pose Regression.
|
NIPS/NeurIPS |
2019 |
27 |
Fast, Diverse and Accurate Image Captioning Guided by Part-Of-Speech.
|
CVPR |
2019 |
0 |
SAIL-VOS: Semantic Amodal Instance Level Video Object Segmentation - A Synthetic Dataset and Baselines.
|
CVPR |
2019 |
0 |
No-Frills Human-Object Interaction Detection: Factorization, Layout Encodings, and Training Techniques.
|
ICCV |
2019 |
0 |
VideoMatch: Matching Based Video Object Segmentation.
|
ECCV |
2018 |
184 |
Deep Structured Prediction with Nonlinear Output Transformations.
|
NIPS/NeurIPS |
2018 |
22 |
Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering.
|
NIPS/NeurIPS |
2018 |
153 |
Unsupervised Video Object Segmentation Using Motion Saliency-Guided Spatio-Temporal Propagation.
|
ECCV |
2018 |
81 |
Structural Consistency and Controllability for Diverse Colorization.
|
ECCV |
2018 |
34 |
Pipe-SGD: A Decentralized Pipelined SGD Framework for Distributed Deep Net Training.
|
NIPS/NeurIPS |
2018 |
74 |
Generative Modeling Using the Sliced Wasserstein Distance.
|
CVPR |
2018 |
147 |
Two Can Play This Game: Visual Dialog With Discriminative Question Generation and Answering.
|
CVPR |
2018 |
70 |
Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering.
|
ECCV |
2018 |
70 |
Diverse and Coherent Paragraph Generation from Images.
|
ECCV |
2018 |
57 |
Unsupervised Textual Grounding: Linking Words to Image Concepts.
|
CVPR |
2018 |
30 |
GradiVeQ: Vector Quantization for Bandwidth-Efficient Gradient Aggregation in Distributed CNN Training.
|
NIPS/NeurIPS |
2018 |
53 |
Convolutional Image Captioning.
|
CVPR |
2018 |
0 |
Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space.
|
NIPS/NeurIPS |
2017 |
141 |
Asynchronous Parallel Coordinate Minimization for MAP Inference.
|
NIPS/NeurIPS |
2017 |
5 |
Creativity: Generating Diverse Questions Using Variational Autoencoders.
|
CVPR |
2017 |
126 |
Dualing GANs.
|
NIPS/NeurIPS |
2017 |
20 |
High-Order Attention Models for Visual Question Answering.
|
NIPS/NeurIPS |
2017 |
72 |
Semantic Image Inpainting with Deep Generative Models.
|
CVPR |
2017 |
0 |
MaskRNN: Instance Level Video Object Segmentation.
|
NIPS/NeurIPS |
2017 |
0 |
Interpretable and Globally Optimal Prediction for Textual Grounding using Image Concepts.
|
NIPS/NeurIPS |
2017 |
0 |
Efficient Deep Learning for Stereo Matching.
|
CVPR |
2016 |
661 |
Learning Deep Parsimonious Representations.
|
NIPS/NeurIPS |
2016 |
87 |
Blending Learning and Inference in Conditional Random Fields.
|
JMLR |
2016 |
9 |
Constraints Based Convex Belief Propagation.
|
NIPS/NeurIPS |
2016 |
0 |
Training Deep Neural Networks via Direct Loss Minimization.
|
ICML |
2016 |
0 |
Smooth and Strong: MAP Inference with Linear Convergence.
|
NIPS/NeurIPS |
2015 |
31 |
Monocular Object Instance Segmentation and Depth Ordering with CNNs.
|
ICCV |
2015 |
149 |
Rent3D: Floor-plan priors for monocular layout estimation.
|
CVPR |
2015 |
86 |
Learning to segment under various forms of weak supervision.
|
CVPR |
2015 |
174 |
Learning Deep Structured Models.
|
ICML |
2015 |
0 |
Globally Convergent Parallel MAP LP Relaxation Solver using the Frank-Wolfe Algorithm.
|
ICML |
2014 |
19 |
Efficient Structured Parsing of Facades Using Dynamic Programming.
|
CVPR |
2014 |
56 |
Efficient Inference of Continuous Markov Random Fields with Polynomial Potentials.
|
NIPS/NeurIPS |
2014 |
19 |
Computational Education using Latent Structured Prediction.
|
AISTATS |
2014 |
5 |
Message Passing Inference for Large Scale Graphical Models with High Order Potentials.
|
NIPS/NeurIPS |
2014 |
5 |
Tell Me What You See and I Will Show You Where It Is.
|
CVPR |
2014 |
93 |
Estimating the 3D Layout of Indoor Scenes and Its Clutter from Depth Sensors.
|
ICCV |
2013 |
79 |
Latent Structured Active Learning.
|
NIPS/NeurIPS |
2013 |
62 |
Box in the Box: Joint 3D Layout and Object Reasoning from Single Images.
|
ICCV |
2013 |
131 |
Efficient Structured Prediction with Latent Variables for General Graphical Models.
|
ICML |
2012 |
69 |
Globally Convergent Dual MAP LP Relaxation Solvers using Fenchel-Young Margins.
|
NIPS/NeurIPS |
2012 |
31 |
Efficient Exact Inference for 3D Indoor Scene Understanding.
|
ECCV |
2012 |
122 |
Efficient structured prediction for 3D indoor scene understanding.
|
CVPR |
2012 |
127 |
Distributed message passing for large scale graphical models.
|
CVPR |
2011 |
83 |
Adaptive random forest - How many "experts" to ask before making a decision?
|
CVPR |
2011 |
0 |