2019.12.01 Vision papers

 

11-28-2019

ASR is all you need: cross-modal distillation for lip reading
by Triantafyllos Afouras et al

11-27-2019

Fully Unsupervised Probabilistic Noise2Void
by Mangal Prakash et al

11-27-2019

Fair DARTS: Eliminating Unfair Advantages in Differentiable Architecture Search
by Xiangxiang Chu et al

11-28-2019

Enhancing Passive Non-Line-of-Sight Imaging Using Polarization Cues
by Kenichiro Tanaka et al

11-28-2019

Land Cover Change Detection via Semantic Segmentation
by Renee Su et al

11-27-2019

Deep Image Harmonization via Domain Verification
by Wenyan Cong et al

11-26-2019

Domain-Aware Dynamic Networks
by Tianyuan Zhang et al

11-27-2019

Towards Reliable Evaluation of Road Network Reconstructions
by Leonardo Citraro et al

11-27-2019

Multi-view shape estimation of transparent containers
by Alessio Xompero et al

11-27-2019

Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models
by Giannis Daras et al

11-26-2019

Revisiting Deep Architectures for Head Motion Prediction in 360° Videos
by Miguel Fabian Romero Rondon et al

11-26-2019

Edge-Guided Occlusion Fading Reduction for a Light-Weighted Self-Supervised Monocular Depth Estimation
by Kuo-Shiuan Peng et al

11-28-2019

Learning Generalizable Representations via Diverse Supervision
by Ziqi Pang et al

11-26-2019

Multi-Task Driven Feature Models for Thermal Infrared Tracking
by Qiao Liu et al

11-26-2019

LaFIn: Generative Landmark Guided Face Inpainting
by Yang Yang et al

11-26-2019

A Two-stream End-to-End Deep Learning Network for Recognizing Atypical Visual Attention in Autism Spectrum Disorder
by Jin Xie et al

11-26-2019

Efficient Attention Mechanism for Handling All the Interactions between Many Inputs with Application to Visual Dialog
by Van-Quang Nguyen et al

11-26-2019

Transfer Learning in Visual and Relational Reasoning
by T. S. Jayram et al

11-26-2019

A Neural Rendering Framework for Free-Viewpoint Relighting
by Zhang Chen et al

11-26-2019

Decoupling Features and Coordinates for Few-shot RGB Relocalization
by Siyan Dong et al

11-28-2019

Mixture-Model-based Bounding Box Density Estimation for Object Detection
by Jaeyoung Yoo et al

11-27-2019

Detecting total hip replacement prosthesis design on preoperative radiographs using deep convolutional neural network
by Alireza Borjali et al

11-29-2019

Bi-Directional Domain Translation for Zero-Shot Sketch-Based Image Retrieval
by Jiangtong Li et al

11-28-2019

Geometric Feedback Network for Point Cloud Classification
by Qiu Shi et al

11-26-2019

Towards Fairness in Visual Recognition: Effective Strategies for Bias Mitigation
by Zeyu Wang et al

11-28-2019

Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow
by Mingyu Ding et al

11-27-2019

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning
by Ximeng Sun et al

11-27-2019

Learning with less data via Weakly Labeled Patch Classification in Digital Pathology
by Eu Wern Teh et al

11-29-2019

Detecting anthropogenic cloud perturbations with deep learning
by Duncan Watson-Parris et al

11-28-2019

Self-Supervised Unconstrained Illumination Invariant Representation
by Damian Kaliroff et al

11-29-2019

Domain-invariant Stereo Matching Networks
by Feihu Zhang et al

11-29-2019

Learning Modular Representations for Long-Term Multi-Agent Motion Predictions
by Todor Davchev et al

11-29-2019

Color inference from semantic labeling for person search in videos
by Jules Simon et al

11-26-2019

Distraction-Aware Feature Learning for Human Attribute Recognition via Coarse-to-Fine Attention Mechanism
by Mingda Wu et al

11-26-2019

Semantic Bottleneck Scene Generation
by Samaneh Azadi et al

11-26-2019

WSOD with PSNet and Box Regression
by Sheng Yi et al

11-26-2019

Noise Robust Generative Adversarial Networks
by Takuhiro Kaneko et al

11-28-2019

Cameras Viewing Cameras Geometry
by Danail Brezov et al

11-28-2019

Continuous Adaptation for Interactive Object Segmentation by Learning from Corrections
by Theodora Kontogianni et al

11-28-2019

Siam R-CNN: Visual Tracking by Re-Detection
by Paul Voigtlaender et al

11-27-2019

High- and Low-level image component decomposition using VAEs for improved reconstruction and anomaly detection
by David Zimmerer et al

11-29-2019

Confidence Calibration and Predictive Uncertainty Estimation for Deep Medical Image Segmentation
by Alireza Mehrtash et al

11-29-2019

Unpaired Image Translation via Adaptive Convolution-based Normalization
by Wonwoong Cho et al

11-29-2019

Transflow Learning: Repurposing Flow Models Without Retraining
by Andrew Gambardella et al

11-28-2019

Continuous Dropout
by Xu Shen et al

11-28-2019

xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation
by Maximilian Jaritz et al

11-29-2019

On the Benefits of Attributional Robustness
by Mayank Singh et al

11-29-2019

Weakly Supervised Cell Instance Segmentation by Propagating from Detection Response
by Kazuya Nishimura et al

11-27-2019

Recovering Facial Reflectance and Geometry from Multi-view Images
by Guoxian Song et al

11-26-2019

Super-Resolution for Practical Automated Plant Disease Diagnosis System
by Quan Huu Cap et al

11-26-2019

Skeleton based Zero Shot Action Recognition in Joint Pose-Language Semantic Space
by Bhavan Jasani et al

11-26-2019

Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers
by Ya Zhao et al

11-27-2019

Literature Review of Action Recognition in the Wild
by Asket Kaur et al

11-28-2019

Fruit Detection, Segmentation and 3D Visualisation of Environments in Apple Orchards
by Hanwen Kang et al

11-28-2019

Applying Artificial Intelligence to Glioma Imaging: Advances and Challenges
by Weina Jin et al

11-27-2019

PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition
by Kun Su et al

11-27-2019

PointPWC-Net: A Coarse-to-Fine Network for Supervised and Self-Supervised Scene Flow Estimation on 3D Point Clouds
by Wenxuan Wu et al

11-27-2019

Leveraging Self-supervised Denoising for Image Segmentation
by Mangal Prakash et al

11-28-2019

Self-Supervised Learning by Cross-Modal Audio-Video Clustering
by Humam Alwassel et al

11-27-2019

SpoC: Spoofing Camera Fingerprints
by Davide Cozzolino et al

11-26-2019

GhostNet: More Features from Cheap Operations
by Kai Han et al

11-29-2019

Using Fully Convolutional Neural Networks to detect manipulated images in videos
by Michail Tarasiou et al

11-29-2019

X-Ray Sobolev Variational Auto-Encoders
by Gabriel Turinici

11-26-2019

Text2FaceGAN: Face Generation from Fine Grained Textual Descriptions
by Osaid Rehman Nasir et al

11-26-2019

Content-based image retrieval speedup
by Sadegh Fadaei et al

11-26-2019

MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation
by Yuheng Li et al

11-26-2019

Multi-person Spatial Interaction in a Large Immersive Display Using Smartphones as Touchpads
by Gyanendra Sharma et al

11-29-2019

Semi-Relaxed Quantization with DropBits: Training Low-Bit Neural Networks via Bit-wise Regularization
by Jihun Yun et al

11-28-2019

Region segmentation via deep learning and convex optimization
by Matthias Sonntag et al

11-26-2019

Password-conditioned Anonymization and Deanonymization with Face Identity Transformers
by Xiuye Gu et al

11-27-2019

All you need is a good representation: A multi-level and classifier-centric representation for few-shot learning
by Shaoli Huang et al

11-29-2019

Correlation-aware Adversarial Domain Adaptation and Generalization
by Mohammad Mahfujur Rahman et al

11-29-2019

Online Structured Sparsity-based Moving Object Detection from Satellite Videos
by Zhang Junpeng et al

11-29-2019

Blockwisely Supervised Neural Architecture Search with Knowledge Distillation
by Changlin Li et al

11-27-2019

An End-to-end Framework for Unconstrained Monocular 3D Hand Pose Estimation
by Sanjeev Sharma et al

11-27-2019

Error Resilient Deep Compressive Sensing
by Thuong et al

11-27-2019

Action Recognition via Pose-Based Graph Convolutional Networks with Intermediate Dense Supervision
by Lei Shi et al

11-27-2019

Residual Bi-Fusion Feature Pyramid Network for Accurate Single-shot Object Detection
by Ping-Yang Chen et al

11-27-2019

Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing
by Haoyu He et al

11-27-2019

Class-Conditional Domain Adaptation on Semantic Segmentation
by Yue Wang et al

11-27-2019

GRIm-RePR: Prioritising Generating Important Features for Pseudo-Rehearsal
by Craig Atkinson et al

11-29-2019

An adaptive and fully automatic method for estimating the 3D position of bendable instruments using endoscopic images
by Paolo Cabras et al

11-26-2019

Imitation Learning of Robot Policies by Combining Language, Vision and Demonstration
by Simon Stepputtis et al

11-26-2019

Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning
by Kekai Sheng et al

11-28-2019

SEAN: Image Synthesis with Semantic Region-Adaptive Normalization
by Peihao Zhu et al

11-27-2019

Discriminative Adversarial Domain Adaptation
by Hui Tang et al

11-27-2019

3D Shape Completion with Multi-view Consistent Inference
by Tao Hu et al

11-29-2019

Investigations on the inference optimization techniques and their impact on multiple hardware platforms for Semantic Segmentation
by Sethu Hareesh Kolluru

11-28-2019

Learning Semantic Correspondence Exploiting an Object-level Prior
by Junghyup Lee et al

11-28-2019

Patch Reordering: a Novel Way to Achieve Rotation and Translation Invariance in Convolutional Neural Networks
by Xu Shen et al

11-28-2019

A novel classification-selection approach for the self updating of template-based face recognition systems
by Giulia Orrù et al

11-27-2019

Rethinking Temporal Fusion for Video-based Person Re-identification on Semantic and Time Aspect
by Xinyang Jiang et al

11-27-2019

Palmprint Recognition in Uncontrolled and Uncooperative Environment
by Wojciech Michal Matkowski et al

11-27-2019

A Discriminative Learned CNN Embedding For Remote Sensing Image Scene Classification
by Wen Wang et al

11-27-2019

Exploring Frequency Domain Interpretation of Convolutional Neural Networks
by Zhongfan Jia et al

11-26-2019

CSPNet: A New Backbone that can Enhance Learning Capability of CNN
by Chien-Yao Wang et al

11-27-2019

Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation
by Federico Landi et al

11-28-2019

Quality analysis of DCGAN-generated mammography lesions
by Basel Alyafi et al

11-27-2019

Empirical Upper-bound in Object Detection and More
by Ali Borji et al

11-29-2019

Indirect Local Attacks for Context-aware Semantic Segmentation Networks
by Krishna Kanth Nakka et al

11-27-2019

Locality Aware Appearance Metric for Multi-Target Multi-Camera Tracking
by Yunzhong Hou et al

11-27-2019

Sparse-GAN: Sparsity-constrained Generative Adversarial Network for Anomaly Detection in Retinal OCT Image
by Kang Zhou et al

11-27-2019

Unbiased Evaluation of Deep Metric Learning Algorithms
by Istvan Fehervari et al

11-28-2019

One-Shot Object Detection with Co-Attention and Co-Excitation
by Ting-I Hsieh et al

11-27-2019

Towards Precise End-to-end Weakly Supervised Object Detection Network
by Ke Yang et al

11-26-2019

AttentionGAN: Unpaired Image-to-Image Translation using Attention-Guided Generative Adversarial Networks
by Hao Tang et al

11-26-2019

Visual Physics: Discovering Physical Laws from Videos
by Pradyumna Chari et al

11-26-2019

Novelty Detection Via Blurring
by Sungik Choi et al

11-26-2019

SuperGlue: Learning Feature Matching with Graph Neural Networks
by Paul-Edouard Sarlin et al

11-26-2019

Can Attention Masks Improve Adversarial Robustness?
by Pratik Vaishnavi et al

11-26-2019

Image2StyleGAN++: How to Edit the Embedded Images?
by Rameen Abdal et al

11-26-2019

You might also like this model: Data Driven Approach for Recommending Deep Learning Models for Unknown Image Datasets
by Ameya Prabhu et al

11-26-2019

Procrustes registration of two-dimensional statistical shape models without correspondences
by Alma Eguizabal et al

11-27-2019

Orthogonal Convolutional Neural Networks
by Jiayun Wang et al

11-27-2019

Multi-View Matching Network for 6D Pose Estimation
by Daniel Mas Montserrat et al

11-27-2019

Soft Anchor-Point Object Detection
by Chenchen Zhu et al

11-29-2019

Collaborative Attention Network for Person Re-identification
by Wenpeng Li et al

11-27-2019

Methods of Weighted Combination for Text Field Recognition in a Video Stream
by Olga Petrova et al

11-27-2019

AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization
by Xiao-Yu Zhang et al

11-27-2019

Shearlets as Feature Extractor for Semantic Edge Detection: The Model-Based and Data-Driven Realm
by Héctor Andrade-Loarca et al

11-29-2019

Mean Shift Rejection: Training Deep Neural Networks Without Minibatch Statistics or Normalization
by Brendan Ruff et al

11-29-2019

DIFAR: Deep Image Formation and Retouching
by Sean Moran et al

11-26-2019

Learning to Match Templates for Unseen Instance Detection
by Jean-Philippe Mercier et al

11-26-2019

F3Net: Fusion, Feedback and Focus for Salient Object Detection
by Jun Wei et al

11-26-2019

Occluded Pedestrian Detection with Visible IoU and Box Sign Predictor
by Ruiqi Lu et al

11-27-2019

PointRGCN: Graph Convolution Networks for 3D Vehicles Detection Refinement
by Jesus Zarzar et al

11-26-2019

ViewAL: Active Learning with Viewpoint Entropy for Semantic Segmentation
by Yawar Siddiqui et al

11-27-2019

QKD: Quantization-aware Knowledge Distillation
by Jangho Kim et al

11-27-2019

Non-Autoregressive Video Captioning with Iterative Refinement
by Bang Yang et al

11-27-2019

Deep Stereo using Adaptive Thin Volume Representation with Uncertainty Awareness
by Shuo Cheng et al

11-28-2019

Detection and Mitigation of Rare Subclasses in Neural Network Classifiers
by Colin Paterson et al

11-27-2019

Semantic Head Enhanced Pedestrian Detection in a Crowd
by Ruiqi Lu et al

11-29-2019

Learning from Irregularly Sampled Data for Endomicroscopy Super-resolution: A Comparative Study of Sparse and Dense Approaches
by Agnieszka Barbara Szczotka et al

11-29-2019

CAGNet: Content-Aware Guidance for Salient Object Detection
by Sina Mohammadi et al

11-29-2019

Deep autofocus with cone-beam CT consistency constraint
by Alexander Preuhs et al

11-27-2019

Graph Representation for Face Analysis in Image Collections
by Domingo Mery et al

11-26-2019

In Perfect Shape: Certifiably Optimal 3D Shape Reconstruction from 2D Landmarks
by Heng Yang et al

11-26-2019

Compressed MRI Reconstruction Exploiting a Rotation-Invariant Total Variation Discretization
by Erfan Ebrahim Esfahani et al

11-27-2019

PanDA: Panoptic Data Augmentation
by Yang Liu et al

11-28-2019

Deep Object Co-segmentation via Spatial-Semantic Network Modulation
by Kaihua Zhang et al

11-28-2019

An Efficient Multi-Domain Framework for Image-to-Image Translation
by Ye Lin et al

11-28-2019

Light-weight Calibrator: a Separable Component for Unsupervised Domain Adaptation
by Shaokai Ye et al

11-27-2019

Document Structure Extraction for Forms using Very High Resolution Semantic Segmentation
by Mausoom Sarkar et al

11-26-2019

Data Augmentation Using Adversarial Training for Construction-Equipment Classification
by Francis Baek et al

11-29-2019

What's Hidden in a Randomly Weighted Neural Network?
by Vivek Ramanujan et al

11-26-2019

FAN: Feature Adaptation Network for Surveillance Face Recognition and Normalization
by Xi Yin et al

11-26-2019

Multi-Level Network for High-Speed Multi-Person Pose Estimation
by Ying Huang et al

11-26-2019

G-TAD: Sub-Graph Localization for Temporal Action Detection
by Mengmeng Xu et al

11-27-2019

LucidDream: Controlled Temporally-Consistent DeepDream on Videos
by Joel Ruben Antony Moniz et al

11-27-2019

Example-Guided Scene Image Synthesis using Masked Spatial-Channel Attention and Patch-Based Self-Supervision
by Haitian Zheng et al

11-27-2019

GLA in MediaEval 2018 Emotional Impact of Movies Task
by Jennifer J. Sun et al

11-26-2019

Multi-Object Portion Tracking in 4D Fluorescence Microscopy Imagery with Deep Feature Maps
by Yang Jiao et al

11-28-2019

Dividing and Conquering Cross-Modal Recipe Retrieval: from Nearest Neighbours Baselines to SoTA
by Mikhail Fain et al

11-28-2019

AutoRemover: Automatic Object Removal for Autonomous Driving Videos
by Rong Zhang et al

11-27-2019

Adaptive Initialization Method for K-means Algorithm
by Jie Yang et al

11-27-2019

Decision Propagation Networks for Image Classification
by Keke Tang et al

11-26-2019

Potential of deep features for opinion-unaware, distortion-unaware, no-reference image quality assessment
by Subhayan Mukherjee et al

11-26-2019

Artificial Intelligence for Diagnosis of Skin Cancer: Challenges and Opportunities
by Manu Goyal et al

11-26-2019

DDNet: Dual-path Decoder Network for Occlusion Relationship Reasoning
by Panhe Feng et al

11-28-2019

Motion Equivariance of Event-based Camera Data with the Temporal Normalization Transform
by Ziyun Wang

11-28-2019

Lidar-Camera Co-Training for Semi-Supervised Road Detection
by Luca Caltagirone et al

11-27-2019

Analysis of Explainers of Black Box Deep Neural Networks for Computer Vision: A Survey
by Vanessa Buhrmester et al

11-27-2019

AdaSample: Adaptive Sampling of Hard Positives for Descriptor Learning
by Xin-Yu Zhang et al

11-29-2019

DIST: Rendering Deep Implicit Signed Distance Function with Differentiable Sphere Tracing
by Shaohui Liu et al

11-26-2019

Using Depth for Pixel-Wise Detection of Adversarial Attacks in Crowd Counting
by Weizhe Liu et al

 
Craig Smith