Learning Realistic Human Actions from Movies
Conference on Computer Vision & Pattern Recognition - jun 2008
Download the publication :
The aim of this paper is to address recognition of natural human actions in
diverse and realistic video settings. This challenging but important subject
has mostly been ignored in the past due to several problems one of which is
the lack of realistic and annotated video datasets. Our first contribution is
to address this limitation and to investigate the use of movie scripts for
automatic annotation of human actions in videos. We evaluate alternative
methods for action retrieval from scripts and show benefits of a text-based
classifier. Using the retrieved action samples for visual learning, we next
turn to the problem of action classification in video. We present a new method
for video classification that builds upon and extends several recent ideas
including local space-time features, space-time pyramids and multi-channel
non-linear SVMs. The method is shown to improve state-of-the-art results on
the standard KTH action dataset by achieving 91.8% accuracy. Given the inherent
problem of noisy labels in automatic annotation, we particularly investigate
and show high tolerance of our method to annotation errors in the training set.
We finally apply the method to the learning and classification of challenging
action classes in movies and show promising results.
Images and movies
See also
BibTex references
@InProceedings{LMSR08,
author = "Ivan Laptev and Marcin Marsza{\l}ek and Cordelia Schmid and Benjamin Rozenfeld",
title = "Learning Realistic Human Actions from Movies",
booktitle = "Conference on Computer Vision \& Pattern Recognition",
month = "jun",
year = "2008",
keywords = "LEAR",
url = "http://lear.inrialpes.fr/pubs/2008/LMSR08"
}
Other publications by...