ADA-Track++: End-to-End Multi-Camera 3D Multi-Object Tracking With Alternating Detection and Association. | TPAMI | 2026 | 0 |
MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Action Anticipation. | CVPR | 2025 | 3 |
Canonical Rank Adaptation: An Efficient Fine-Tuning Strategy for Vision Transformers. | ICML | 2025 | 2 |
SyncVP: Joint Diffusion for Synchronous Multi-Modal Video Prediction. | CVPR | 2025 | 6 |
Hierarchical Vector Quantization for Unsupervised Action Segmentation. | AAAI | 2025 | 0 |
Video-Panda: Parameter-efficient Alignment for Encoder-free Video-Language Models. | CVPR | 2025 | 0 |
STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection. | CVPR | 2025 | 0 |
GroupMamba: Efficient Group-Based Visual State Space Model. | CVPR | 2025 | 0 |
Fréchet Wavelet Distance: A Domain-Agnostic Metric for Image Generation. | ICLR | 2025 | 0 |
A Multimodal Handover Failure Detection Dataset and Baselines. | ICRA | 2024 | 5 |
Gated Temporal Diffusion for Stochastic Long-Term Dense Anticipation. | ECCV | 2024 | 10 |
Identifying Spatio-Temporal Drivers of Extreme Events. | NIPS/NeurIPS | 2024 | 0 |
ADA-Track: End-to-End Multi-Camera 3D Multi-Object Tracking with Alternating Detection and Association. | CVPR | 2024 | 0 |
PowerBEV: A Powerful Yet Lightweight Framework for Instance Prediction in Bird's-Eye View. | IJCAI | 2023 | 34 |
Smoothness Similarity Regularization for Few-Shot GAN Adaptation. | ICCV | 2023 | 3 |
Social Diffusion: Long-term Multiple Human Motion Anticipation. | ICCV | 2023 | 36 |
3DMOTFormer: Graph Transformer for Online 3D Multi-Object Tracking. | ICCV | 2023 | 35 |
How Much Temporal Long-Term Context is Needed for Action Segmentation? | ICCV | 2023 | 48 |
Humans in Kitchens: A Dataset for Multi-Person Human Motion Forecasting with Scene Context. | NIPS/NeurIPS | 2023 | 9 |
MS-TCN++: Multi-Stage Temporal Convolutional Network for Action Segmentation. | TPAMI | 2023 | 0 |
PoseTrack21: A Dataset for Person Search, Multi-Object Tracking and Multi-Person Pose Tracking. | CVPR | 2022 | 48 |
TAVA: Template-free Animatable Volumetric Actors. | ECCV | 2022 | 188 |
Ranking Info Noise Contrastive Estimation: Boosting Contrastive Learning via Ranked Positives. | AAAI | 2022 | 59 |
Unified Fully and Timestamp Supervised Temporal Action Segmentation via Sequence to Sequence Translation. | ECCV | 2022 | 110 |
Keypoint Message Passing for Video-Based Person Re-identification. | AAAI | 2022 | 0 |
Adaptive Token Sampling for Efficient Vision Transformers. | ECCV | 2022 | 0 |
Multi-Scale Interaction for Real-Time LiDAR Data Segmentation on an Embedded Platform. | IEEE Robotics and Automation Letters | 2022 | 0 |
Fast Weakly Supervised Action Segmentation Using Mutual Consistency. | TPAMI | 2022 | 0 |
Moving Object Segmentation in 3D LiDAR Data: A Learning-Based Approach Exploiting Sequential Data. | IEEE Robotics and Automation Letters | 2021 | 223 |
Long Short View Feature Decomposition via Contrastive Video Representation Learning. | ICCV | 2021 | 38 |
Temporal Action Segmentation From Timestamp Supervision. | CVPR | 2021 | 100 |
Using Visual Anomaly Detection for Task Execution Monitoring. | IROS | 2021 | 13 |
Towards 3D LiDAR-based semantic scene understanding of 3D point cloud sequences: The SemanticKITTI Dataset. | IJRR | 2021 | 159 |
Spatial-Temporal Consistency Network for Low-Latency Trajectory Forecasting. | ICCV | 2021 | 23 |
3D CNNs With Adaptive Temporal Feature Resolutions. | CVPR | 2021 | 0 |
You Only Need Adversarial Supervision for Semantic Image Synthesis. | ICLR | 2021 | 0 |
Pose Refinement Graph Convolutional Network for Skeleton-Based Action Recognition. | IEEE Robotics and Automation Letters | 2021 | 0 |
Self-supervised Keypoint Correspondences for Multi-person Pose Estimation and Tracking in Videos. | ECCV | 2020 | 51 |
Recursive Bayesian Filtering for Multiple Human Pose Tracking from Multiple Cameras. | ACCV | 2020 | 13 |
SCT: Set Constrained Temporal Transformer for Set Supervised Action Segmentation. | CVPR | 2020 | 78 |
Large Scale Holistic Video Understanding. | ECCV | 2020 | 0 |
Discovering Multi-label Actor-Action Association in a Weakly Supervised Setting. | ACCV | 2020 | 0 |
Sequence Prediction Using Spectral RNNs. | ICANN | 2020 | 0 |
Open Set Domain Adaptation for Image and Action Recognition. | TPAMI | 2020 | 0 |
A Hybrid RNN-HMM Approach for Weakly Supervised Temporal Action Segmentation. | TPAMI | 2020 | 0 |
MS-TCN: Multi-Stage Temporal Convolutional Network for Action Segmentation. | CVPR | 2019 | 806 |
SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences. | ICCV | 2019 | 2262 |
Unsupervised Learning of Action Classes With Continuous Temporal Embedding. | CVPR | 2019 | 127 |
What Object Should I Use? - Task Driven Object Detection. | CVPR | 2019 | 32 |
Human Motion Prediction via Spatio-Temporal Inpainting. | ICCV | 2019 | 0 |
When Will You Do What? - Anticipating Temporal Occurrences of Activities. | CVPR | 2018 | 210 |
Hand Pose Estimation via Latent 2.5D Heatmap Regression. | ECCV | 2018 | 346 |
Spatio-temporal Channel Correlation Networks for Action Classification. | ECCV | 2018 | 194 |
NeuralNetwork-Viterbi: A Framework for Weakly Supervised Video Learning. | CVPR | 2018 | 151 |
AVID: Adversarial Visual Irregularity Detection. | ACCV | 2018 | 95 |
PoseTrack: A Benchmark for Human Pose Estimation and Tracking. | CVPR | 2018 | 0 |
Action Sets: Weakly Supervised Action Segmentation Without Ordering Constraints. | CVPR | 2018 | 0 |
Weakly Supervised Affordance Detection. | CVPR | 2017 | 98 |
Weakly Supervised Action Learning with RNN Based Fine-to-Coarse Modeling. | CVPR | 2017 | 213 |
Open Set Domain Adaptation. | ICCV | 2017 | 617 |
SurfaceNet: An End-to-End 3D Neural Network for Multiview Stereopsis. | ICCV | 2017 | 448 |
PoseTrack: Joint Multi-person Pose Estimation and Tracking. | CVPR | 2017 | 0 |
Incremental Learning of Random Forests for Large-Scale Image Classification. | TPAMI | 2016 | 147 |
Temporal Action Detection Using a Statistical Language Model. | CVPR | 2016 | 219 |
A Dual-Source Approach for 3D Pose Estimation from a Single Image. | CVPR | 2016 | 0 |
From categories to subcategories: Large-scale image classification with partial class label refinement. | CVPR | 2015 | 59 |
3D Object Reconstruction from Hand-Object Interactions. | ICCV | 2015 | 86 |
Material Classification Based on Training Data Synthesized Using a BTF Database. | ECCV | 2014 | 94 |
Incremental Learning of NCM Forests for Large-Scale Image Classification. | CVPR | 2014 | 93 |
Discovering Object Classes from Activities. | ECCV | 2014 | 15 |
Efficient Pose-Based Action Recognition. | ACCV | 2014 | 66 |
Body Parts Dependent Joint Regressors for Human Pose Estimation in Still Images. | TPAMI | 2014 | 77 |
Markerless Motion Capture of Multiple Characters Using Multiview Image Segmentation. | TPAMI | 2013 | 141 |
Human Pose Estimation Using Body Parts Dependent Joint Regressors. | CVPR | 2013 | 303 |
Towards Understanding Action Recognition. | ICCV | 2013 | 948 |
Motion Capture of Hands in Action Using Discriminative Salient Points. | ECCV | 2012 | 332 |
Interactive object detection. | CVPR | 2012 | 101 |
Latent Hough Transform for Object Detection. | ECCV | 2012 | 52 |
Local Context Priors for Object Proposal Generation. | ACCV | 2012 | 13 |
Real-time facial feature detection using conditional regression forests. | CVPR | 2012 | 435 |
Hough Forests for Object Detection, Tracking, and Action Recognition. | TPAMI | 2011 | 650 |
Scalable multi-class object detection. | CVPR | 2011 | 69 |
Real time head pose estimation with random regression forests. | CVPR | 2011 | 505 |
Functional categorization of objects using real-time markerless motion capture. | CVPR | 2011 | 69 |
What makes a chair a chair? | CVPR | 2011 | 285 |
Outdoor human motion capture using inverse kinematics and von mises-fisher sampling. | ICCV | 2011 | 104 |
Learning Probabilistic Non-Linear Latent Variable Models for Tracking Complex Activities. | NIPS/NeurIPS | 2011 | 67 |
Markerless motion capture of interacting characters using multi-view image segmentation. | CVPR | 2011 | 164 |
Fast articulated motion tracking using a sums of Gaussians body model. | ICCV | 2011 | 242 |
A Hough transform-based voting framework for action recognition. | CVPR | 2010 | 303 |
Backprojection Revisited: Scalable Multi-view Object Detection and Similarity Metrics for Detections. | ECCV | 2010 | 39 |
2D Action Recognition Serves 3D Human Pose Estimation. | ECCV | 2010 | 66 |
An object-dependent hand pose prior from sparse training data. | CVPR | 2010 | 76 |
Combined Region and Motion-Based 3D Tracking of Rigid and Articulated Objects. | TPAMI | 2010 | 0 |
Markerless Motion Capture with unsynchronized moving cameras. | CVPR | 2009 | 222 |
Class-specific Hough forests for object detection. | CVPR | 2009 | 666 |
Motion capture using joint skeleton tracking and surface estimation. | CVPR | 2009 | 465 |
Drift-free tracking of rigid and articulated objects. | CVPR | 2008 | 52 |