Scene-Centric Unsupervised Panoptic Segmentation.
|
CVPR |
2025 |
5 |
AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos.
|
CVPR |
2025 |
7 |
Unbiasing through Textual Descriptions: Mitigating Representation Bias in Video Benchmarks.
|
CVPR |
2025 |
1 |
FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views.
|
CVPR |
2025 |
109 |
VGGT: Visual Geometry Grounded Transformer.
|
CVPR |
2025 |
729 |
Scaling Backwards: Minimal Synthetic Pre-Training?
|
ECCV |
2024 |
9 |
DragAPart: Learning a Part-Level Motion Prior for Articulated Objects.
|
ECCV |
2024 |
30 |
Rethinking Image Super-Resolution from Training Data Perspectives.
|
ECCV |
2024 |
6 |
IM-3D: Iterative Multiview Diffusion and Reconstruction for High-Quality 3D Generation.
|
ICML |
2024 |
88 |
SHIC: Shape-Image Correspondences with No Keypoint Supervision.
|
ECCV |
2024 |
9 |
Dataset Enhancement with Instance-Level Augmentations.
|
ECCV |
2024 |
19 |
Learning the 3D Fauna of the Web.
|
CVPR |
2024 |
46 |
HowToCaption: Prompting LLMs to Transform Video Annotations at Scale.
|
ECCV |
2024 |
0 |
CoTracker: It Is Better to Track Together.
|
ECCV |
2024 |
0 |
Diffusion Models for Open-Vocabulary Segmentation.
|
ECCV |
2024 |
0 |
Scene-Conditional 3D Object Stylization and Composition.
|
ECCV |
2024 |
0 |
VGGSfM: Visual Geometry Grounded Deep Structure from Motion.
|
CVPR |
2024 |
0 |
Cache Me if You Can: Accelerating Diffusion Models through Block Caching.
|
CVPR |
2024 |
0 |
Splatter Image: Ultra-Fast Single-View 3D Reconstruction.
|
CVPR |
2024 |
0 |
Learning Segmentation from Point Trajectories.
|
NIPS/NeurIPS |
2024 |
0 |
What does CLIP know about a red circle? Visual prompt engineering for VLMs.
|
ICCV |
2023 |
245 |
Temperature Schedules for self-supervised contrastive methods on long-tail data.
|
ICLR |
2023 |
63 |
Viewset Diffusion: (0-)Image-Conditioned 3D Generative Models from 2D Data.
|
ICCV |
2023 |
128 |
DynamicStereo: Consistent Dynamic Depth from Stereo Videos.
|
CVPR |
2023 |
104 |
Continual Detection Transformer for Incremental Object Detection.
|
CVPR |
2023 |
100 |
Behind the Scenes: Density Fields for Single View Reconstruction.
|
CVPR |
2023 |
68 |
RealFusion 360° Reconstruction of Any Object from a Single Image.
|
CVPR |
2023 |
357 |
PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment.
|
ICCV |
2023 |
143 |
PC
|
CVPR |
2023 |
0 |
MagicPony: Learning Articulated 3D Animals in the Wild.
|
CVPR |
2023 |
0 |
Unsupervised Learning of Probably Symmetric Deformable 3D Objects From Images in the Wild (Invited Paper).
|
TPAMI |
2023 |
0 |
De-rendering 3D Objects in the Wild.
|
CVPR |
2022 |
37 |
Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization.
|
CVPR |
2022 |
193 |
VTC: Improving Video-Text Retrieval with User Comments.
|
ECCV |
2022 |
8 |
Unsupervised Multi-Object Segmentation by Predicting Probable Motion Patterns.
|
NIPS/NeurIPS |
2022 |
23 |
Finding an Unsupervised Image Segmenter in each of your Deep Generative Models.
|
ICLR |
2022 |
0 |
Unsupervised Part Discovery from Contrastive Reconstruction.
|
NIPS/NeurIPS |
2021 |
72 |
Neural Response Interpretation Through the Lens of Critical Pathways.
|
CVPR |
2021 |
41 |
Unsupervised Learning of Probably Symmetric Deformable 3D Objects from Images in the Wild (Extended Abstract).
|
IJCAI |
2021 |
0 |
Semantic Image Manipulation Using Scene Graphs.
|
CVPR |
2020 |
137 |
Labelling unlabelled videos from scratch with multi-modal self-supervision.
|
NIPS/NeurIPS |
2020 |
163 |
Unsupervised Learning of Probably Symmetric Deformable 3D Objects From Images in the Wild.
|
CVPR |
2020 |
0 |
Self-labelling via simultaneous clustering and representation learning.
|
ICLR |
2020 |
0 |
A critical analysis of self-supervision, or what we can learn from a single image.
|
ICLR |
2020 |
0 |
Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents.
|
ICLR |
2020 |
0 |
Explaining the Ambiguity of Object Detection and 6D Pose From Visual Data.
|
ICCV |
2019 |
0 |
Towards Unsupervised Image Captioning With Shared Multimodal Embeddings.
|
ICCV |
2019 |
113 |
Dealing with Ambiguity in Robotic Grasping via Multiple Predictions.
|
ACCV |
2018 |
19 |
Guide Me: Interacting With Deep Networks.
|
CVPR |
2018 |
39 |
Learning in an Uncertain World: Representing Ambiguity Through Multiple Hypotheses.
|
ICCV |
2017 |
0 |
Sensor substitution for video-based action recognition.
|
IROS |
2016 |
26 |
Image segmentation in Twenty Questions.
|
CVPR |
2015 |
29 |
Robust Optimization for Deep Regression.
|
ICCV |
2015 |
188 |