A Spatio-Temporal Descriptor Based on 3D-Gradients
British Machine Vision Conference - sep 2008
Download the publication :
In this work, we present a novel local descriptor for video sequences. The
proposed descriptor is based on histograms of oriented 3D spatio-temporal
gradients. Our contribution is four-fold. (i) To compute 3D gradients for
arbitrary scales, we develop a memory-efficient algorithm based on integral
videos. (ii) We propose a generic 3D orientation quantization which is based
on regular polyhedrons. (iii) We perform an in-depth evaluation of all
descriptor parameters and optimize them for action recognition. (iv) We apply
our descriptor to various action datasets (KTH, Weizmann, Hollywood) and
show that we outperform the state-of-the-art.
Images and movies
See also
BibTex references
@InProceedings{KMS08,
author = "Alexander Kl{\"a}ser and Marcin Marsza{\l}ek and Cordelia Schmid",
title = "A Spatio-Temporal Descriptor Based on 3D-Gradients",
booktitle = "British Machine Vision Conference",
pages = "995--1004",
month = "sep",
year = "2008",
keywords = "LEAR, CLASS, action recognition, BOW, 3D, SIFT, descriptor, gradients, videos, KTH, Weizmann, Hollywood",
url = "http://lear.inrialpes.fr/pubs/2008/KMS08"
}
Other publications by...