openface/docs/index.md

161 lines
6.5 KiB
Markdown
Raw Normal View History

2015-11-03 06:30:14 +08:00
# OpenFace <iframe src="https://ghbtns.com/github-btn.html?user=cmusatyalab&repo=openface&type=star&count=true&size=large" frameborder="0" scrolling="0" width="160px" height="30px"></iframe>
2015-11-01 20:52:46 +08:00
2015-11-03 06:30:14 +08:00
<center>
*Free and open source face recognition with
2015-11-12 00:51:56 +08:00
deep neural networks.*
2015-11-03 06:30:14 +08:00
</center>
2015-11-01 20:52:46 +08:00
---
2015-11-12 00:51:56 +08:00
OpenFace is a Python and [Torch](http://torch.ch) implementation of
face recognition with deep neural networks and is based on
the CVPR 2015 paper
2015-11-01 20:52:46 +08:00
[FaceNet: A Unified Embedding for Face Recognition and Clustering](http://www.cv-foundation.org/openaccess/content_cvpr_2015/app/1A_089.pdf)
2015-11-07 03:49:42 +08:00
by Florian Schroff, Dmitry Kalenichenko, and James Philbin at Google.
2015-11-01 20:52:46 +08:00
Torch allows the network to be executed on a CPU or with CUDA.
2016-01-21 02:18:27 +08:00
**Crafted by [Brandon Amos](http://bamos.github.io) in
[Satya's](https://www.cs.cmu.edu/~satya/) research group at
2015-11-01 20:52:46 +08:00
Carnegie Mellon University.**
---
2015-11-03 06:41:09 +08:00
+ The code is available on GitHub at
[cmusatyalab/openface](https://github.com/cmusatyalab/openface).
2015-12-25 07:48:06 +08:00
+ [API Documentation](http://openface-api.readthedocs.org/en/latest/index.html)
2015-11-03 06:41:09 +08:00
+ Join the
[cmu-openface group](https://groups.google.com/forum/#!forum/cmu-openface)
or the
[gitter chat](https://gitter.im/cmusatyalab/openface)
for discussions and installation issues.
+ Development discussions and bugs reports are on the
[issue tracker](https://github.com/cmusatyalab/openface/issues).
---
This research was supported by the National Science Foundation (NSF)
under grant number CNS-1518865. Additional support
was provided by the Intel Corporation, Google, Vodafone, NVIDIA, and the
Conklin Kistler family fund. Any opinions, findings, conclusions or
recommendations expressed in this material are those of the authors
and should not be attributed to their employers or funding sources.
---
2015-11-03 06:41:09 +08:00
2015-11-01 20:52:46 +08:00
### Isn't face recognition a solved problem?
No! Accuracies from research papers have just begun to surpass
human accuracies on some benchmarks.
The accuracies of open source face recognition systems lag
behind the state-of-the-art.
2016-01-13 21:36:16 +08:00
See [our accuracy comparisons](http://cmusatyalab.github.io/openface/models-and-accuracies/)
2015-12-04 16:14:37 +08:00
on the famous LFW benchmark.
2015-11-01 20:52:46 +08:00
---
### Please use responsibly!
We do not support the use of this project in applications
that violate privacy and security.
2016-01-21 03:39:38 +08:00
We are using this to help cognitively impaired users
2015-11-01 20:52:46 +08:00
sense and understand the world around them.
---
# Overview
The following overview shows the workflow for a single input
image of Sylvestor Stallone from the publicly available
[LFW dataset](http://vis-www.cs.umass.edu/lfw/person/Sylvester_Stallone.html).
1. Detect faces with a pre-trained models from
[dlib](http://blog.dlib.net/2014/02/dlib-186-released-make-your-own-object.html)
or
[OpenCV](http://docs.opencv.org/master/d7/d8b/tutorial_py_face_detection.html).
2. Transform the face for the neural network.
This repository uses dlib's
[real-time pose estimation](http://blog.dlib.net/2014/08/real-time-face-pose-estimation.html)
with OpenCV's
[affine transformation](http://docs.opencv.org/doc/tutorials/imgproc/imgtrans/warp_affine/warp_affine.html)
2015-11-05 21:55:03 +08:00
to try to make the eyes and bottom lip appear in
2015-11-01 20:52:46 +08:00
the same location on each image.
3. Use a deep neural network to represent (or embed) the face on
a 128-dimensional unit hypersphere.
The embedding is a generic representation for anybody's face.
Unlike other face representations, this embedding has the nice property
that a larger distance between two face embeddings means
that the faces are likely not of the same person.
This property makes clustering, similarity detection,
and classification tasks easier than other face recognition
techniques where the Euclidean distance between
features is not meaningful.
4. Apply your favorite clustering or classification techniques
to the features to complete your recognition task.
See below for our examples for classification and
similarity detection, including an online web demo.
2015-11-03 06:30:14 +08:00
![](https://raw.githubusercontent.com/cmusatyalab/openface/master/images/summary.jpg)
2015-11-01 20:52:46 +08:00
# Citations
2015-11-12 00:51:56 +08:00
The following is a [BibTeX](http://www.bibtex.org/)
and plaintext reference
for the OpenFace GitHub repository.
The reference may change in the future.
The BibTeX entry requires the `url` LaTeX package.
2015-11-01 20:52:46 +08:00
```
2016-01-12 00:03:11 +08:00
@misc{amos2016openface,
2015-11-12 00:51:56 +08:00
title = {{OpenFace: Face Recognition with Deep Neural Networks}},
2016-01-12 00:03:11 +08:00
author = {Amos, Brandon and Ludwiczuk, Bartosz and Harkes, Jan and
Pillai, Padmanabhan and Elgazzar, Khalid and Satyanarayanan, Mahadev},
2015-11-12 00:51:56 +08:00
howpublished = {\url{http://github.com/cmusatyalab/openface}},
2016-01-12 00:03:11 +08:00
note = {Accessed: 2016-01-11}
2015-11-01 20:52:46 +08:00
}
2015-11-12 00:51:56 +08:00
2016-01-12 00:03:11 +08:00
Brandon Amos, Bartosz Ludwiczuk, Jan Harkes, Padmanabhan Pillai,
Khalid Elgazzar, and Mahadev Satyanarayanan.
2015-11-12 00:51:56 +08:00
OpenFace: Face Recognition with Deep Neural Networks.
http://github.com/cmusatyalab/openface.
2016-01-12 00:03:11 +08:00
Accessed: 2016-01-11
2015-11-01 20:52:46 +08:00
```
# Acknowledgements
+ The fantastic Torch ecosystem and community.
+ [Alfredo Canziani's](https://github.com/Atcold)
implementation of FaceNet's loss function in
[torch-TripletEmbedding](https://github.com/Atcold/torch-TripletEmbedding).
+ [Nicholas Léonard](https://github.com/nicholas-leonard)
for quickly merging my pull requests to
[nicholas-leonard/dpnn](https://github.com/nicholas-leonard/dpnn)
modifying the inception layer.
+ [Francisco Massa](https://github.com/fmassa)
and
[Andrej Karpathy](http://cs.stanford.edu/people/karpathy/)
for
quickly releasing [nn.Normalize](https://github.com/torch/nn/pull/341)
after I expressed interest in using it.
+ [Soumith Chintala](https://github.com/soumith) for
help with the [fbcunn](https://github.com/facebook/fbcunn)
example code.
+ [Davis King's](https://github.com/davisking) [dlib](https://github.com/davisking/dlib)
library for face detection and alignment.
+ Zhuo Chen, Kiryong Ha, Wenlu Hu,
[Rahul Sukthankar](http://www.cs.cmu.edu/~rahuls/), and
Junjue Wang for insightful discussions.
# Licensing
2016-01-13 21:36:16 +08:00
Unless otherwise stated, the source code and trained models
`nn4.v*.t7` and `celeb-classifier.nn4.v*.t7` are copyright
2015-11-01 20:52:46 +08:00
Carnegie Mellon University and licensed under the
2016-01-20 03:07:37 +08:00
[Apache 2.0 License](https://github.com/cmusatyalab/openface/blob/master/LICENSE).
2015-11-01 20:52:46 +08:00
Portions from the following third party sources have
been modified and are included in this repository.
These portions are noted in the source files and are
copyright their respective authors with
the licenses listed.
Project | Modified | License
---|---|---|
[Atcold/torch-TripletEmbedding](https://github.com/Atcold/torch-TripletEmbedding) | No | MIT
[facebook/fbnn](https://github.com/facebook/fbnn) | Yes | BSD