Internship 2009-2010
Robust face descriptors in uncontrolled settings
Keywords:
Computer vision, machine learning, face description
and recognition, missing data, metric learning.
Supervisors : Matthieu Guillaumin, Jakob Verbeek and Cordelia Schmid.
General information:
Minimum internship duration: 4 months. Salary depending on intern's status.
The internship will take place in the LEAR team at INRIA, Grenoble,
France.
2009/12/14 Update: This internship offer is
closed. The position is filled.
Context:
LEAR's main focus is to use machine learning approaches to tackle
computer vision related tasks. Among them, the detection and analysis of
human faces is still a major challenge despite the large research
community and literature. Significant advances are still necessary
to make face recognition robust in uncontrolled
environments. Compared to settings that deal only with varying lighting
conditions, and small expression or pose changes, uncontrolled environments
are subject to major pose and expression changes, as well as occlusions
(hats, hands, hair, other persons, ...), and changes in age, glasses,
hair and facial hair styles. To achieve recognition in this setting, we
need to design and develop robust face descriptors and associate
them with powerful machine learning and statistical modeling techniques.
Current processing pipeline for face
recognition in the LEAR team
|
Goal and approach:
Following recent successes [2,3,4,5,6], we will adopt an approach based on
face description, which implies representing faces as vectors in a
high-dimensional space. The first goal of the internship will be to
perform an experimental comparative study of existing descriptors, most
of which will be provided [1,2,5,7] but some will have to be re-implemented
[3,4,6]. This study will be conducted using standard face data sets and
benchmarks [10,11]. Following the lessons drawn from this study, the
intern will build an efficient processing pipeline for extracting a
robust face descriptors from real-world images, as shown in the
illustration.
Another approach to consider, following [9], consists in building a
recognition system by combining several binary classifiers that give
specific information about
visual attributes of a face image: is
this a man or woman? is it a child or adult? does this person wear
glasses? does (s)he wear a hat? is (s)he blonde? ... The collected
(real-valued) answers to those questions can be used as inputs for other
machine learning techniques.
The second goal of the internship is to deal with occlusions.
Occlusions that affect face images translate into noise in the face
descriptors discussed above, and this noise is usually not random.
Therefore, these occlusions can be modeled, rejected and considered as
missing data in the descriptor at recognition step. We propose to adapt
metric learning frameworks, which have shown great performance in a
number of computer vision related tasks [2,13], with an
Expectation-Maximization derivation that can handle missing data in a
principled way [12].
Requirement:
Excellent academic records, applied mathematics and
scientific programming skills are essential. Previous experience in
image processing is an asset.
Application:
Applicants should send a CV and contact information of
a referee by email to
matthieu<dot>guillaumin<at>inria<dot>fr
References:
[1] A. Kläser,
Human detection and character recognition in
TV-style movies, In Informatiktage, 2007.
[2] M. Guillaumin, J.Verbeek, and
C.Schmid,
Is that you? Metric Learning Approaches for
Face Identification, Proceedings of the IEEE International Conference on Computer Vision 2009.
[3] J. Sivic, M. Everingham, and
A.Zisserman,
'Who are you?' - Learning person specific classifiers
from video, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2009.
[4] N. Pinto, J. DiCarlo, and D. Cox,
How far can
you get with a modern face recognition test set using only simple
features?, Proceedings of the IEEE Conference on Computer Vision and
Pattern Recognition 2009.
[5] Y. Taigman, L. Wolf, and T. Hassner,
Multiple One-Shots for Utilizing Class Label Information,
Proceedings of the British Machine Vision Conference
2009.
[6] N. Kumar, A. C. Berg, P. N. Belhumeur, and S. K. Nayar,
Attribute
and Simile Classifiers for Face Verification, Proceedings of the IEEE International Conference on Computer Vision 2009.
[7] M. Guillaumin, T. Mensink, J. Verbeek, and C. Schmid,
Automatic
Face Naming with Caption-based Supervision, Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition 2008.
[8]
A comprehensive face recognition
reference site [9]
Columbia University face verification
project
[10]
Labeled Faces in the Wild dataset, protocol and
results
[11]
Face Recognition Grand
Challenge
[12] S. Roweis,
EM Algorithms for PCA and SPCA, Neural Information Processing Systems 1997.
[13] P. Jain, B. Kulis and K. Grauman,
Fast image search for learned
metrics, Proceedings of the IEEE Conference on Computer Vision and
Pattern Recognition, 2008.