|Evaluation of ImageNet-CoG Benchmark
[ICCV 2021 paper][Project page][Code]
|Software for Multimodal Transformer
[ECCV 2020 paper][Project page][Code]
|Software for Motion-Augmented RGB for Action Recognition
[CVPR 2019 paper][Project page][Code]
|Software for end-to-end incremental learning|
[ECCV 2018 paper][Code]
[CVPR 2018 paper][Project page][Dataset]
|Software for joint modeling of first and third-person videos|
[CVPR 2018 paper][Code]
|Software for learning video object segmentation
[ICCV 2017 paper][Journal version][Project page][Code]
|Software for learning motion patterns
[CVPR 2017 paper][Project page][Code]
|Software for online object tracking
[ICCV 2015 paper][Project page][Code]
|Software for weakly-supervised semantic segmentation
[ECCV 2016 paper][Project page][Code]
|Software for pose estimation and segmentation of multiple people
[TPAMI 2015 paper][Project page][Code]
|Software for estimating human pose in videos
[CVPR 2014 paper][Project page][Code]
|Poses in the Wild Dataset
This dataset has 30 video sequences generated from three Hollywood movies. Each sequence has approximately 30 frames and is annotated for human upper-body keypoints.
[CVPR 2014 paper][Project page][Dataset]
|Inria 3DMovie Dataset
This dataset contains all the stereo pairs and their annotations used in our ICCV 2013 paper. Most of this data was extracted from the "StreetDance 3D" [Giwa and Pasquini, 2010] and "Pina" [Wenders, 2011] stereo movies.
[ICCV 2013 paper][Project page][Dataset]
|Software for learning graphs
[ICCV 2013 paper][Code]
| WILLOW-ObjectClass dataset for evaluating graph matching
A dataset containing object class images and their part annotations.
[ICCV 2013 paper][Dataset]
|IIIT-STR, Sports-10K, TV Series-1M scene text retrieval datasets
The IIIT STR dataset is harvested from Google and Flickr image search. Sports-10K dataset is from sports video clips, containing many advertisement signboards, and the TV Series-1M is from four popular TV series: Friends, Buffy, Mr. Bean, and Open All Hours.
[ICCV 2013 paper][Project page][Image & Video datasets]
|Software for solving detection and segmentation problems jointly
[ECCV 2010 paper][Code]
|IIIT 5K-Word dataset
This dataset is harvested from Google image search. It contains 5000 cropped word images from Scene Texts and born-digital images.
[BMVC 2012 paper][Project page][Dataset]
|SVT-CHAR: Annotated character dataset
This dataset contains character level bounding boxes and ground truth annotation of SVT-WORD dataset.
[CVPR 2012 paper][Project page][Dataset]
|Alpha-expansion Beta-shrink Moves Software for MRFs
[UAI 2011 paper][Code]
A dataset containing short video clips and annotation of a subset of frames for evaluating video segmentation.
[CVPR 2011 paper][Project page][Dataset]
|Dataset for Vision labelling problems
Data used in our TPAMI 2010 paper for evaluating (colour/object) segmentation, stereo estimation problems defined on MRF/CRF.
[TPAMI 2010 paper][Data]
|Efficient Solvers for Multi-label MRFs
[TPAMI 2010 paper][Code]