- RIDE: Reversal Invariant Descriptor Enhancement
- VQA: Visual Question Answering
- Convex Optimization with Abstract Linear Operators
- Multiple Hypothesis Tracking Revisited
- P-CNN: Pose-Based CNN Features for Action Recognition
- HARF: Hierarchy-Associated Rich Features for Salient Object Detection
- Sequence to Sequence -- Video to Text
- Deep Learning Strong Parts for Pedestrian Detection
- Understanding and Diagnosing Visual Tracking Systems
- Temporal Perception and Prediction in Ego-Centric Video
- Matrix Backpropagation for Deep Networks with Structured Layers
- Aligning Books and Movies: Towards Story-Like Visual Explanations by Watching Movies and Reading Books
- Multi-view Domain Generalization for Visual Recognition
- Holistically-Nested Edge Detection
- Where to Buy It: Matching Street Clothing Photos in Online Shops
- Infinite Feature Selection
- Personalized Age Progression with Aging Dictionary
- Secrets of GrabCut and Kernel K-Means
- Bilinear CNN Models for Fine-Grained Visual Recognition
- Hierarchical Convolutional Features for Visual Tracking
- Context Aware Active Learning of Activity Recognition Models
- General Dynamic Scene Reconstruction from Multiple View Video
- Differential Recurrent Neural Networks for Action Recognition
- Multi-view Convolutional Neural Networks for 3D Shape Recognition
- Delving Deep into Rectifiers: Surpassing Human-Level Performance on ImageNet Classification
- HICO: A Benchmark for Recognizing Human-Object Interactions in Images
- Bi-Shifting Auto-Encoder for Unsupervised Domain Adaptation
- Deep Networks for Image Super-Resolution with Sparse Prior
- Domain Generalization for Object Recognition with Multi-task Autoencoders
- Learning Complexity-Aware Cascades for Deep Pedestrian Detection
- Simpler Non-Parametric Methods Provide as Good or Better Results to Multiple-Instance Learning
- Deep Learning Face Attributes in the Wild
- Robust Non-rigid Motion Tracking and Surface Reconstruction Using L0 Regularization
- Learning Image and User Features for Recommendation in Social Networks
- Actions and Attributes from Wholes and Parts
- Simultaneous Deep Transfer Across Domains and Tasks
- Naive Bayes Super-Resolution Forest
- Minimal Solvers for 3D Geometry from Satellite Imagery
- Learning Deconvolution Network for Semantic Segmentation
- An Efficient Statistical Method for Image Noise Level Estimation
- Deep Colorization
- [ieee.org] Learning Discriminative Reconstructions for Unsupervised Outlier Removal
- Learning Deep Object Detectors from 3D Models
- Aggregating Local Deep Features for Image Retrieval
- An Accurate Iris Segmentation Framework Under Relaxed Imaging Constraints Using Total Variation Model
- Joint Fine-Tuning in Deep Neural Networks for Facial Expression Recognition
- Pairwise Conditional Random Forests for Facial Expression Recognition
- Integrating Dashcam Views through Inter-Video Mapping
- Learning Large-Scale Automatic Image Colorization
- Dense Continuous-Time Tracking and Mapping with Rolling Shutter RGB-D Cameras