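The PASCAL VOC dataset is the dataset of the PASCAL VOC Challenge. It can be used for image recognition tasks such as object classification, object detection, object segmentation, person layout, and action recognition. PASCAL VOC was updated repeatedly; the PASCAL VOC 2007 and PASCAL VOC 2012 editions are the most widely used. PASCAL VOC 201...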
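As an illustration of what the detection annotations look like in practice, PASCAL VOC stores one XML file per image, with an `object` element (class `name` plus `bndbox` corners) for each labeled instance. The sketch below parses such a file with the Python standard library. The sample XML, its file name, and the helper name `parse_voc_annotation` are made up for this example, and the code assumes integer box coordinates.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation in the PASCAL VOC XML layout (values are invented).
SAMPLE_XML = """<annotation>
  <filename>example.jpg</filename>
  <size><width>500</width><height>375</height><depth>3</depth></size>
  <object>
    <name>dog</name>
    <bndbox><xmin>48</xmin><ymin>240</ymin><xmax>195</xmax><ymax>371</ymax></bndbox>
  </object>
  <object>
    <name>person</name>
    <bndbox><xmin>8</xmin><ymin>12</ymin><xmax>352</xmax><ymax>370</ymax></bndbox>
  </object>
</annotation>"""


def parse_voc_annotation(xml_text):
    """Return (filename, [(class_name, (xmin, ymin, xmax, ymax)), ...])."""
    root = ET.fromstring(xml_text)
    objects = []
    for obj in root.findall("object"):
        box = obj.find("bndbox")
        # Assumption: coordinates are written as integers.
        coords = tuple(
            int(box.findtext(tag)) for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        objects.append((obj.findtext("name"), coords))
    return root.findtext("filename"), objects


if __name__ == "__main__":
    filename, objects = parse_voc_annotation(SAMPLE_XML)
    print(filename)
    for name, coords in objects:
        print(name, coords)
```

Real annotation files carry additional fields (for example `pose`, `truncated`, `difficult`) that this sketch ignores.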
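CIFAR-10 contains 60,000 32x32 color images in 10 classes: 50,000 training images and 10,000 test images. CIFAR-100 has 100 classes with 600 images each, of which 500 are for training and 100 for testing; the 100 classes are grouped into 20 superclasses. Every image carries an explicit class label. For testing image classification algorithms, CIFAR is a very...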
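The CIFAR-100 figures above can be cross-checked with a few lines of arithmetic. This is only a sketch of the bookkeeping: the constant and function names are chosen for this example, and it assumes the 20 superclasses are all the same size.

```python
# Per-class counts for CIFAR-100 as stated in the description above.
NUM_CLASSES = 100
IMAGES_PER_CLASS = 600
TRAIN_PER_CLASS = 500
TEST_PER_CLASS = 100
NUM_SUPERCLASSES = 20


def cifar100_summary():
    """Derive dataset-level totals from the per-class counts."""
    # The train/test split must account for every image in a class.
    assert TRAIN_PER_CLASS + TEST_PER_CLASS == IMAGES_PER_CLASS
    # Assumption: superclasses are equally sized.
    classes_per_superclass = NUM_CLASSES // NUM_SUPERCLASSES
    return {
        "total_images": NUM_CLASSES * IMAGES_PER_CLASS,
        "train_images": NUM_CLASSES * TRAIN_PER_CLASS,
        "test_images": NUM_CLASSES * TEST_PER_CLASS,
        "classes_per_superclass": classes_per_superclass,
        "images_per_superclass": classes_per_superclass * IMAGES_PER_CLASS,
    }


if __name__ == "__main__":
    for key, value in cifar100_summary().items():
        print(f"{key}: {value}")
```

The totals come out to the same 50,000/10,000 train/test split as CIFAR-10, spread over ten times as many classes.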
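ImageNet is currently one of the most widely used datasets in deep learning for images; it supports research on image classification, localization, and detection. The dataset contains more than 14 million images covering more than 20,000 categories, of which over a million images have explicit class labels and annotations of object positions within the image.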
Open Source Computer Vision Library
deepgaze: Computer Vision library for human-computer interaction. It implements Head Pose and Gaze Direction Estimation Using Convolutional Neural Netwo...
Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a Keras CNN model and OpenCV.
Automatic License Plate Recognition library
Keras implementation of 'LipNet: End-to-End Sentence-level Lipreading'
An MXNet implementation of Mask R-CNN
Keras version of Realtime Multi-Person Pose Estimation project
Translate darknet to TensorFlow. Load trained weights, retrain/fine-tune using TensorFlow, export constant graph def to mobile devices
The world's simplest facial recognition api for Python and the command line
Deep Pose Estimation implemented using TensorFlow with Custom Architecture for fast inference.
Super Resolution for images using deep learning.
Text to image synthesis using thought vectors
OpenFace – a state-of-the-art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimati...
Image Super-Resolution for Anime-Style Art
A TensorFlow implementation of "Deep Convolutional Generative Adversarial Networks"
205 images collected from Flickr, containing 468 faces in total, with complex background variation and face pose variation.
3,837 images, with 68 landmark points annotated on each face.