2019.12.01 Vision papers

11-28-2019	ASR is all you need: cross-modal distillation for lip reading by Triantafyllos Afouras et al
11-27-2019	Fully Unsupervised Probabilistic Noise2Void by Mangal Prakash et al
11-27-2019	Fair DARTS: Eliminating Unfair Advantages in Differentiable Architecture Search by Xiangxiang Chu et al
11-28-2019	Enhancing Passive Non-Line-of-Sight Imaging Using Polarization Cues by Kenichiro Tanaka et al
11-28-2019	Land Cover Change Detection via Semantic Segmentation by Renee Su et al
11-27-2019	Deep Image Harmonization via Domain Verification by Wenyan Cong et al
11-26-2019	Domain-Aware Dynamic Networks by Tianyuan Zhang et al
11-27-2019	Towards Reliable Evaluation of Road Network Reconstructions by Leonardo Citraro et al
11-27-2019	Multi-view shape estimation of transparent containers by Alessio Xompero et al
11-27-2019	Your Local GAN: Designing Two Dimensional Local Attention Mechanisms for Generative Models by Giannis Daras et al
11-26-2019	Revisiting Deep Architectures for Head Motion Prediction in 360° Videos by Miguel Fabian Romero Rondon et al
11-26-2019	Edge-Guided Occlusion Fading Reduction for a Light-Weighted Self-Supervised Monocular Depth Estimation by Kuo-Shiuan Peng et al
11-28-2019	Learning Generalizable Representations via Diverse Supervision by Ziqi Pang et al
11-26-2019	Multi-Task Driven Feature Models for Thermal Infrared Tracking by Qiao Liu et al
11-26-2019	LaFIn: Generative Landmark Guided Face Inpainting by Yang Yang et al
11-26-2019	A Two-stream End-to-End Deep Learning Network for Recognizing Atypical Visual Attention in Autism Spectrum Disorder by Jin Xie et al
11-26-2019	Efficient Attention Mechanism for Handling All the Interactions between Many Inputs with Application to Visual Dialog by Van-Quang Nguyen et al
11-26-2019	Transfer Learning in Visual and Relational Reasoning by T. S. Jayram et al
11-26-2019	A Neural Rendering Framework for Free-Viewpoint Relighting by Zhang Chen et al
11-26-2019	Decoupling Features and Coordinates for Few-shot RGB Relocalization by Siyan Dong et al
11-28-2019	Mixture-Model-based Bounding Box Density Estimation for Object Detection by Jaeyoung Yoo et al
11-27-2019	Detecting total hip replacement prosthesis design on preoperative radiographs using deep convolutional neural network by Alireza Borjali et al
11-29-2019	Bi-Directional Domain Translation for Zero-Shot Sketch-Based Image Retrieval by Jiangtong Li et al
11-28-2019	Geometric Feedback Network for Point Cloud Classification by Qiu Shi et al
11-26-2019	Towards Fairness in Visual Recognition: Effective Strategies for Bias Mitigation by Zeyu Wang et al
11-28-2019	Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow by Mingyu Ding et al
11-27-2019	AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning by Ximeng Sun et al
11-27-2019	Learning with less data via Weakly Labeled Patch Classification in Digital Pathology by Eu Wern Teh et al
11-29-2019	Detecting anthropogenic cloud perturbations with deep learning by Duncan Watson-Parris et al
11-28-2019	Self-Supervised Unconstrained Illumination Invariant Representation by Damian Kaliroff et al
11-29-2019	Domain-invariant Stereo Matching Networks by Feihu Zhang et al
11-29-2019	Learning Modular Representations for Long-Term Multi-Agent Motion Predictions by Todor Davchev et al
11-29-2019	Color inference from semantic labeling for person search in videos by Jules Simon et al
11-26-2019	Distraction-Aware Feature Learning for Human Attribute Recognition via Coarse-to-Fine Attention Mechanism by Mingda Wu et al
11-26-2019	Semantic Bottleneck Scene Generation by Samaneh Azadi et al
11-26-2019	WSOD with PSNet and Box Regression by Sheng Yi et al
11-26-2019	Noise Robust Generative Adversarial Networks by Takuhiro Kaneko et al
11-28-2019	Cameras Viewing Cameras Geometry by Danail Brezov et al
11-28-2019	Continuous Adaptation for Interactive Object Segmentation by Learning from Corrections by Theodora Kontogianni et al
11-28-2019	Siam R-CNN: Visual Tracking by Re-Detection by Paul Voigtlaender et al
11-27-2019	High- and Low-level image component decomposition using VAEs for improved reconstruction and anomaly detection by David Zimmerer et al
11-29-2019	Confidence Calibration and Predictive Uncertainty Estimation for Deep Medical Image Segmentation by Alireza Mehrtash et al
11-29-2019	Unpaired Image Translation via Adaptive Convolution-based Normalization by Wonwoong Cho et al
11-29-2019	Transflow Learning: Repurposing Flow Models Without Retraining by Andrew Gambardella et al
11-28-2019	Continuous Dropout by Xu Shen et al
11-28-2019	xMUDA: Cross-Modal Unsupervised Domain Adaptation for 3D Semantic Segmentation by Maximilian Jaritz et al
11-29-2019	On the Benefits of Attributional Robustness by Mayank Singh et al
11-29-2019	Weakly Supervised Cell Instance Segmentation by Propagating from Detection Response by Kazuya Nishimura et al
11-27-2019	Recovering Facial Reflectance and Geometry from Multi-view Images by Guoxian Song et al
11-26-2019	Super-Resolution for Practical Automated Plant Disease Diagnosis System by Quan Huu Cap et al
11-26-2019	Skeleton based Zero Shot Action Recognition in Joint Pose-Language Semantic Space by Bhavan Jasani et al
11-26-2019	Hearing Lips: Improving Lip Reading by Distilling Speech Recognizers by Ya Zhao et al
11-27-2019	Literature Review of Action Recognition in the Wild by Asket Kaur et al
11-28-2019	Fruit Detection, Segmentation and 3D Visualisation of Environments in Apple Orchards by Hanwen Kang et al
11-28-2019	Applying Artificial Intelligence to Glioma Imaging: Advances and Challenges by Weina Jin et al
11-27-2019	PREDICT & CLUSTER: Unsupervised Skeleton Based Action Recognition by Kun Su et al
11-27-2019	PointPWC-Net: A Coarse-to-Fine Network for Supervised and Self-Supervised Scene Flow Estimation on 3D Point Clouds by Wenxuan Wu et al
11-27-2019	Leveraging Self-supervised Denoising for Image Segmentation by Mangal Prakash et al
11-28-2019	Self-Supervised Learning by Cross-Modal Audio-Video Clustering by Humam Alwassel et al
11-27-2019	SpoC: Spoofing Camera Fingerprints by Davide Cozzolino et al
11-26-2019	GhostNet: More Features from Cheap Operations by Kai Han et al
11-29-2019	Using Fully Convolutional Neural Networks to detect manipulated images in videos by Michail Tarasiou et al
11-29-2019	X-Ray Sobolev Variational Auto-Encoders by Gabriel Turinici
11-26-2019	Text2FaceGAN: Face Generation from Fine Grained Textual Descriptions by Osaid Rehman Nasir et al
11-26-2019	Content-based image retrieval speedup by Sadegh Fadaei et al
11-26-2019	MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation by Yuheng Li et al
11-26-2019	Multi-person Spatial Interaction in a Large Immersive Display Using Smartphones as Touchpads by Gyanendra Sharma et al
11-29-2019	Semi-Relaxed Quantization with DropBits: Training Low-Bit Neural Networks via Bit-wise Regularization by Jihun Yun et al
11-28-2019	Region segmentation via deep learning and convex optimization by Matthias Sonntag et al
11-26-2019	Password-conditioned Anonymization and Deanonymization with Face Identity Transformers by Xiuye Gu et al
11-27-2019	All you need is a good representation: A multi-level and classifier-centric representation for few-shot learning by Shaoli Huang et al
11-29-2019	Correlation-aware Adversarial Domain Adaptation and Generalization by Mohammad Mahfujur Rahman et al
11-29-2019	Online Structured Sparsity-based Moving Object Detection from Satellite Videos by Zhang Junpeng et al
11-29-2019	Blockwisely Supervised Neural Architecture Search with Knowledge Distillation by Changlin Li et al
11-27-2019	An End-to-end Framework for Unconstrained Monocular 3D Hand Pose Estimation by Sanjeev Sharma et al
11-27-2019	Error Resilient Deep Compressive Sensing by Thuong et al
11-27-2019	Action Recognition via Pose-Based Graph Convolutional Networks with Intermediate Dense Supervision by Lei Shi et al
11-27-2019	Residual Bi-Fusion Feature Pyramid Network for Accurate Single-shot Object Detection by Ping-Yang Chen et al
11-27-2019	Grapy-ML: Graph Pyramid Mutual Learning for Cross-dataset Human Parsing by Haoyu He et al
11-27-2019	Class-Conditional Domain Adaptation on Semantic Segmentation by Yue Wang et al
11-27-2019	GRIm-RePR: Prioritising Generating Important Features for Pseudo-Rehearsal by Craig Atkinson et al
11-29-2019	An adaptive and fully automatic method for estimating the 3D position of bendable instruments using endoscopic images by Paolo Cabras et al
11-26-2019	Imitation Learning of Robot Policies by Combining Language, Vision and Demonstration by Simon Stepputtis et al
11-26-2019	Revisiting Image Aesthetic Assessment via Self-Supervised Feature Learning by Kekai Sheng et al
11-28-2019	SEAN: Image Synthesis with Semantic Region-Adaptive Normalization by Peihao Zhu et al
11-27-2019	Discriminative Adversarial Domain Adaptation by Hui Tang et al
11-27-2019	3D Shape Completion with Multi-view Consistent Inference by Tao Hu et al
11-29-2019	Investigations on the inference optimization techniques and their impact on multiple hardware platforms for Semantic Segmentation by Sethu Hareesh Kolluru
11-28-2019	Learning Semantic Correspondence Exploiting an Object-level Prior by Junghyup Lee et al
11-28-2019	Patch Reordering: a Novel Way to Achieve Rotation and Translation Invariance in Convolutional Neural Networks by Xu Shen et al
11-28-2019	A novel classification-selection approach for the self updating of template-based face recognition systems by Giulia Orrù et al
11-27-2019	Rethinking Temporal Fusion for Video-based Person Re-identification on Semantic and Time Aspect by Xinyang Jiang et al
11-27-2019	Palmprint Recognition in Uncontrolled and Uncooperative Environment by Wojciech Michal Matkowski et al
11-27-2019	A Discriminative Learned CNN Embedding For Remote Sensing Image Scene Classification by Wen Wang et al
11-27-2019	Exploring Frequency Domain Interpretation of Convolutional Neural Networks by Zhongfan Jia et al
11-26-2019	CSPNet: A New Backbone that can Enhance Learning Capability of CNN by Chien-Yao Wang et al
11-27-2019	Perceive, Transform, and Act: Multi-Modal Attention Networks for Vision-and-Language Navigation by Federico Landi et al
11-28-2019	Quality analysis of DCGAN-generated mammography lesions by Basel Alyafi et al
11-27-2019	Empirical Upper-bound in Object Detection and More by Ali Borji et al
11-29-2019	Indirect Local Attacks for Context-aware Semantic Segmentation Networks by Krishna Kanth Nakka et al
11-27-2019	Locality Aware Appearance Metric for Multi-Target Multi-Camera Tracking by Yunzhong Hou et al
11-27-2019	Sparse-GAN: Sparsity-constrained Generative Adversarial Network for Anomaly Detection in Retinal OCT Image by Kang Zhou et al
11-27-2019	Unbiased Evaluation of Deep Metric Learning Algorithms by Istvan Fehervari et al
11-28-2019	One-Shot Object Detection with Co-Attention and Co-Excitation by Ting-I Hsieh et al
11-27-2019	Towards Precise End-to-end Weakly Supervised Object Detection Network by Ke Yang et al
11-26-2019	AttentionGAN: Unpaired Image-to-Image Translation using Attention-Guided Generative Adversarial Networks by Hao Tang et al
11-26-2019	Visual Physics: Discovering Physical Laws from Videos by Pradyumna Chari et al
11-26-2019	Novelty Detection Via Blurring by Sungik Choi et al
11-26-2019	SuperGlue: Learning Feature Matching with Graph Neural Networks by Paul-Edouard Sarlin et al
11-26-2019	Can Attention Masks Improve Adversarial Robustness? by Pratik Vaishnavi et al
11-26-2019	Image2StyleGAN++: How to Edit the Embedded Images? by Rameen Abdal et al
11-26-2019	You might also like this model: Data Driven Approach for Recommending Deep Learning Models for Unknown Image Datasets by Ameya Prabhu et al
11-26-2019	Procrustes registration of two-dimensional statistical shape models without correspondences by Alma Eguizabal et al
11-27-2019	Orthogonal Convolutional Neural Networks by Jiayun Wang et al
11-27-2019	Multi-View Matching Network for 6D Pose Estimation by Daniel Mas Montserrat et al
11-27-2019	Soft Anchor-Point Object Detection by Chenchen Zhu et al
11-29-2019	Collaborative Attention Network for Person Re-identification by Wenpeng Li et al
11-27-2019	Methods of Weighted Combination for Text Field Recognition in a Video Stream by Olga Petrova et al
11-27-2019	AdapNet: Adaptability Decomposing Encoder-Decoder Network for Weakly Supervised Action Recognition and Localization by Xiao-Yu Zhang et al
11-27-2019	Shearlets as Feature Extractor for Semantic Edge Detection: The Model-Based and Data-Driven Realm by Héctor Andrade-Loarca et al
11-29-2019	Mean Shift Rejection: Training Deep Neural Networks Without Minibatch Statistics or Normalization by Brendan Ruff et al
11-29-2019	DIFAR: Deep Image Formation and Retouching by Sean Moran et al
11-26-2019	Learning to Match Templates for Unseen Instance Detection by Jean-Philippe Mercier et al
11-26-2019	F3Net: Fusion, Feedback and Focus for Salient Object Detection by Jun Wei et al
11-26-2019	Occluded Pedestrian Detection with Visible IoU and Box Sign Predictor by Ruiqi Lu et al
11-27-2019	PointRGCN: Graph Convolution Networks for 3D Vehicles Detection Refinement by Jesus Zarzar et al
11-26-2019	ViewAL: Active Learning with Viewpoint Entropy for Semantic Segmentation by Yawar Siddiqui et al
11-27-2019	QKD: Quantization-aware Knowledge Distillation by Jangho Kim et al
11-27-2019	Non-Autoregressive Video Captioning with Iterative Refinement by Bang Yang et al
11-27-2019	Deep Stereo using Adaptive Thin Volume Representation with Uncertainty Awareness by Shuo Cheng et al
11-28-2019	Detection and Mitigation of Rare Subclasses in Neural Network Classifiers by Colin Paterson et al
11-27-2019	Semantic Head Enhanced Pedestrian Detection in a Crowd by Ruiqi Lu et al
11-29-2019	Learning from Irregularly Sampled Data for Endomicroscopy Super-resolution: A Comparative Study of Sparse and Dense Approaches by Agnieszka Barbara Szczotka et al
11-29-2019	CAGNet: Content-Aware Guidance for Salient Object Detection by Sina Mohammadi et al
11-29-2019	Deep autofocus with cone-beam CT consistency constraint by Alexander Preuhs et al
11-27-2019	Graph Representation for Face Analysis in Image Collections by Domingo Mery et al
11-26-2019	In Perfect Shape: Certifiably Optimal 3D Shape Reconstruction from 2D Landmarks by Heng Yang et al
11-26-2019	Compressed MRI Reconstruction Exploiting a Rotation-Invariant Total Variation Discretization by Erfan Ebrahim Esfahani et al
11-27-2019	PanDA: Panoptic Data Augmentation by Yang Liu et al
11-28-2019	Deep Object Co-segmentation via Spatial-Semantic Network Modulation by Kaihua Zhang et al
11-28-2019	An Efficient Multi-Domain Framework for Image-to-Image Translation by Ye Lin et al
11-28-2019	Light-weight Calibrator: a Separable Component for Unsupervised Domain Adaptation by Shaokai Ye et al
11-27-2019	Document Structure Extraction for Forms using Very High Resolution Semantic Segmentation by Mausoom Sarkar et al
11-26-2019	Data Augmentation Using Adversarial Training for Construction-Equipment Classification by Francis Baek et al
11-29-2019	What's Hidden in a Randomly Weighted Neural Network? by Vivek Ramanujan et al
11-26-2019	FAN: Feature Adaptation Network for Surveillance Face Recognition and Normalization by Xi Yin et al
11-26-2019	Multi-Level Network for High-Speed Multi-Person Pose Estimation by Ying Huang et al
11-26-2019	G-TAD: Sub-Graph Localization for Temporal Action Detection by Mengmeng Xu et al
11-27-2019	LucidDream: Controlled Temporally-Consistent DeepDream on Videos by Joel Ruben Antony Moniz et al
11-27-2019	Example-Guided Scene Image Synthesis using Masked Spatial-Channel Attention and Patch-Based Self-Supervision by Haitian Zheng et al
11-27-2019	GLA in MediaEval 2018 Emotional Impact of Movies Task by Jennifer J. Sun et al
11-26-2019	Multi-Object Portion Tracking in 4D Fluorescence Microscopy Imagery with Deep Feature Maps by Yang Jiao et al
11-28-2019	Dividing and Conquering Cross-Modal Recipe Retrieval: from Nearest Neighbours Baselines to SoTA by Mikhail Fain et al
11-28-2019	AutoRemover: Automatic Object Removal for Autonomous Driving Videos by Rong Zhang et al
11-27-2019	Adaptive Initialization Method for K-means Algorithm by Jie Yang et al
11-27-2019	Decision Propagation Networks for Image Classification by Keke Tang et al
11-26-2019	Potential of deep features for opinion-unaware, distortion-unaware, no-reference image quality assessment by Subhayan Mukherjee et al
11-26-2019	Artificial Intelligence for Diagnosis of Skin Cancer: Challenges and Opportunities by Manu Goyal et al
11-26-2019	DDNet: Dual-path Decoder Network for Occlusion Relationship Reasoning by Panhe Feng et al
11-28-2019	Motion Equivariance of Event-based Camera Data with the Temporal Normalization Transform by Ziyun Wang
11-28-2019	Lidar-Camera Co-Training for Semi-Supervised Road Detection by Luca Caltagirone et al
11-27-2019	Analysis of Explainers of Black Box Deep Neural Networks for Computer Vision: A Survey by Vanessa Buhrmester et al
11-27-2019	AdaSample: Adaptive Sampling of Hard Positives for Descriptor Learning by Xin-Yu Zhang et al
11-29-2019	DIST: Rendering Deep Implicit Signed Distance Function with Differentiable Sphere Tracing by Shaohui Liu et al
11-26-2019	Using Depth for Pixel-Wise Detection of Adversarial Attacks in Crowd Counting by Weizhe Liu et al

Craig Smith, December 3, 2019