2020.7.27 Vision papers

07-22-2020	Contact and Human Dynamics from Monocular Video by Davis Rempe et al
07-21-2020	Towards Nonlinear Disentanglement in Natural Data with Temporal Sparse Coding by David Klindt et al
07-22-2020	DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation by Alexandre Carlier et al
07-21-2020	Accelerating Deep Learning Applications in Space by Martina Lofqvist et al
07-21-2020	Shape and Viewpoint without Keypoints by Shubham Goel et al
07-22-2020	CrossTransformers: spatially-aware few-shot transfer by Carl Doersch et al
07-21-2020	PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding by Saining Xie et al
07-23-2020	Whole-Body Human Pose Estimation in the Wild by Sheng Jin et al
07-22-2020	Neural Sparse Voxel Fields by Lingjie Liu et al
07-22-2020	Unsupervised Shape and Pose Disentanglement for 3D Meshes by Keyang Zhou et al
07-21-2020	Learning Monocular Visual Odometry via Self-Supervised Long-Term Modeling by Yuliang Zou et al
07-21-2020	Neural Mesh Flow: 3D Manifold Mesh Generationvia Diffeomorphic Flows by Kunal Gupta et al
07-23-2020	TSIT: A Simple and Versatile Framework for Image-to-Image Translation by Liming Jiang et al
07-23-2020	Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval by Andrew Brown et al
07-23-2020	Bridging the Imitation Gap by Adaptive Insubordination by Luca Weihs et al
07-23-2020	Spatially Aware Multimodal Transformers for TextVQA by Yash Kant et al
07-24-2020	The Surprising Effectiveness of Linear Unsupervised Image-to-Image Translation by Eitan Richardson et al
07-23-2020	PP-YOLO: An Effective and Efficient Implementation of Object Detector by Xiang Long et al
07-22-2020	Analogical Reasoning for Visually Grounded Language Acquisition by Bo Wu et al
07-22-2020	Adversarial Training Reduces Information and Improves Transferability by Matteo Terzi et al
07-23-2020	Funnel Activation for Visual Recognition by Ningning Ma et al
07-22-2020	Threat of Adversarial Attacks on Face Recognition: A Comprehensive Survey by Fatemeh Vakhshiteh et al
07-21-2020	Garment Design with Generative Adversarial Networks by Chenxi Yuan et al
07-22-2020	PareCO: Pareto-aware Channel Optimization for Slimmable Neural Networks by Ting-Wu Chin et al
07-23-2020	SBAT: Video Captioning with Sparse Boundary-Aware Transformer by Tao Jin et al
07-22-2020	Integrating Image Captioning with Rule-based Entity Masking by Aditya Mogadala et al
07-22-2020	Cloud Transformers by Kirill Mazur et al
07-21-2020	Foley Music: Learning to Generate Music from Videos by Chuang Gan et al
07-22-2020	Deep Learning Based Segmentation of Various Brain Lesions for Radiosurgery by Siang-Ruei Wu et al
07-22-2020	Darwins Neural Network: AI-based Strategies for Rapid and Scalable Cell and Coronavirus Screening by Sang Won Lee et al
07-21-2020	Deep Preset: Blending and Retouching Photos with Color Style Transfer by Man M. Ho et al
07-23-2020	WeightNet: Revisiting the Design Space of Weight Networks by Ningning Ma et al
07-23-2020	Sound2Sight: Generating Visual Dynamics from Sound and Context by Anoop Cherian et al
07-23-2020	Enhanced Transfer Learning for Autonomous Driving with Systematic Accident Simulation by Shivam Akhauri et al
07-23-2020	Body2Hands: Learning to Infer 3D Hands from Conversational Gesture Body Dynamics by Evonne Ng et al
07-23-2020	HITNet: Hierarchical Iterative Tile Refinement Network for Real-time Stereo Matching by Vladimir Tankovich et al
07-22-2020	History Repeats Itself: Human Motion Prediction via Motion Attention by Wei Mao et al
07-23-2020	Right for the Right Reason: Making Image Classification Robust by Anna Nguyen et al
07-22-2020	Tiny Transfer Learning: Towards Memory-Efficient On-Device Learning by Han Cai et al
07-21-2020	Backdoor Attacks and Countermeasures on Deep Learning: A Comprehensive Review by Yansong Gao et al
07-22-2020	Guided Deep Decoder: Unsupervised Image Pair Fusion by Tatsumi Uezato et al
07-23-2020	Neural Geometric Parser for Single Image Camera Calibration by Jinwoo Lee et al
07-23-2020	Weakly Supervised 3D Object Detection from Lidar Point Cloud by Qinghao Meng et al
07-23-2020	PIPAL: a Large-Scale Image Quality Assessment Dataset for Perceptual Image Restoration by Jinjin Gu et al
07-22-2020	Multi-Task Curriculum Framework for Open-Set Semi-Supervised Learning by Qing Yu et al
07-23-2020	Accurate RGB-D Salient Object Detection via Collaborative Learning by Wei Ji et al
07-23-2020	Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild by Yang Xiao et al
07-23-2020	Zero-Shot Recognition through Image-Guided Semantic Classification by Mei-Chen Yeh et al
07-22-2020	SIZER: A Dataset and Model for Parsing 3D Clothing and Learning Size Sensitive 3D Clothing by Garvita Tiwari et al
07-23-2020	Implicit Latent Variable Model for Scene-Consistent Motion Forecasting by Sergio Casas et al
07-22-2020	Comprehensive Image Captioning via Scene Graph Decomposition by Yiwu Zhong et al
07-23-2020	Harnessing spatial homogeneity of neuroimaging data: patch individual filter layers for CNNs by Fabian Eitel et al
07-23-2020	The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation by Tao Wang et al
07-23-2020	Representation Sharing for Fast Object Detector Search and Beyond by Yujie Zhong et al
07-22-2020	End-to-End Optimization of Scene Layout by Andrew Luo et al
07-23-2020	End-to-end Learning of Compressible Features by Saurabh Singh et al
07-23-2020	CAD-Deform: Deformable Fitting of CAD Models to 3D Scans by Vladislav Ishimtsev et al
07-22-2020	Subjective and Objective Quality Assessment of High Frame Rate Videos by Pavan C. Madhusudana et al
07-23-2020	ReLaB: Reliable Label Bootstrapping for Semi-Supervised Learning by Paul Albert et al
07-21-2020	CVR-Net: A deep convolutional neural network for coronavirus recognition from chest radiography images by Md. Kamrul Hasan et al
07-23-2020	A Study on Evaluation Standard for Automatic Crack Detection Regard the Random Fractal by Hongyu Li et al
07-22-2020	Multi-modality imaging with structure-promoting regularisers by Matthias J. Ehrhardt
07-21-2020	MovieNet: A Holistic Dataset for Movie Understanding by Qingqiu Huang et al
07-23-2020	BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues by Samuel Albanie et al
07-23-2020	MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution by Wenbo Li et al
07-23-2020	Autonomous Removal of Perspective Distortion based on Detection Results of Robotic Elevator Button Corner by Nachuan Ma
07-23-2020	Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation by Sheng Jin et al
07-22-2020	Attention based Multiple Instance Learning for Classification of Blood Cell Disorders by Ario Sadafi et al
07-23-2020	Pixel-Pair Occlusion Relationship Map(P2ORM): Formulation, Inference & Application by Xuchong Qiu et al
07-21-2020	Rethinking CNN Models for Audio Classification by Kamalesh Palanisamy et al
07-24-2020	Artificial Intelligence in the Creative Industries: A Review by Nantheera Anantrasirichai et al
07-21-2020	A Framework based on Deep Neural Networks to Extract Anatomy of Mosquitoes from Images by Mona Minakshi et al
07-23-2020	AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification by Xiaofang Wang et al
07-22-2020	All at Once: Temporally Adaptive Multi-Frame Interpolation with Advanced Motion Modeling by Zhixiang Chi et al
07-22-2020	Illumination invariant hyperspectral image unmixing based on a digital surface model by Tatsumi Uezato et al
07-24-2020	Interpreting Spatially Infinite Generative Models by Chaochao Lu et al
07-23-2020	Leveraging Bottom-Up and Top-Down Attention for Few-Shot Object Detection by Xianyu Chen et al
07-24-2020	Unsupervised Discovery of 3D Physical Objects from Video by Yilun Du et al
07-23-2020	Regularization of Building Boundaries in Satellite Images using Adversarial and Regularized Losses by Stefano Zorzi et al
07-23-2020	A Solution to Product detection in Densely Packed Scenes by Tianze Rong et al
07-21-2020	Sparse Nonnegative Tensor Factorization and Completion with Noisy Observations by Xiongjun Zhang et al
07-21-2020	Who Left the Dogs Out? 3D Animal Reconstruction with Expectation Maximization in the Loop by Benjamin Biggs et al
07-22-2020	Multi-Metric Evaluation of Thermal-to-Visual Face Recognition by Kenneth Lai et al
07-22-2020	Unsupervised Deep Representation Learning for Real-Time Tracking by Ning Wang et al
07-21-2020	Balanced Meta-Softmax for Long-Tailed Visual Recognition by Jiawei Ren et al
07-21-2020	CyCNN: A Rotation Invariant CNN using Polar Mapping and Cylindrical Convolution Layers by Jinpyo Kim et al
07-22-2020	Edge-aware Graph Representation Learning and Reasoning for Face Parsing by Gusi Te et al
07-23-2020	Real-time CNN-based Segmentation Architecture for Ball Detection in a Single View Setup by Gabriel Van Zandycke et al
07-21-2020	Movement Assessment from Skeleton Videos: A Review by Tal Hakim
07-22-2020	Wasserstein Routed Capsule Networks by Alexander Fuchs et al
07-23-2020	CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending by Hang Xu et al
07-21-2020	Towards Visual Distortion in Black-Box Attacks by Nannan Li et al
07-21-2020	AinnoSeg: Panoramic Segmentation with High Perfomance by Jiahong Wu et al
07-21-2020	SLNSpeech: solving extended speech separation problem by the help of sign language by Jiasong Wu et al
07-23-2020	Polylidar3D -- Fast Polygon Extraction from 3D Data by Jeremy Castagno et al
07-21-2020	IITK at SemEval-2020 Task 8: Unimodal and Bimodal Sentiment Analysis of Internet Memes by Vishal Keswani et al
07-22-2020	DEAL: Deep Evidential Active Learning for Image Classification by Patrick Hemmer et al
07-21-2020	Self-supervised Feature Learning via Exploiting Multi-modal Data for Retinal Disease Diagnosis by Xiaomeng Li et al
07-24-2020	What and Where: Learn to Plug Adapters via NAS for Multi-Domain Learning by Hanbin Zhao et al
07-22-2020	Improving Monocular Depth Estimation by Leveraging Structural Awareness and Complementary Datasets by Tian Chen et al
07-21-2020	Shape-aware Semi-supervised 3D Semantic Segmentation for Medical Images by Shuailin Li et al
07-21-2020	Creating a Large-scale Synthetic Dataset for Human Activity Recognition by Ollie Matthews et al
07-24-2020	Deforming the Loss Surface by Liangming Chen et al
07-24-2020	A Lightweight Neural Network for Monocular View Generation with Occlusion Handling by Simon Evain et al
07-24-2020	CelebA-Spoof: Large-Scale Face Anti-Spoofing Dataset with Rich Annotations by Yuanhan Zhang et al
07-22-2020	Deep Variational Instance Segmentation by Jialin Yuan et al
07-21-2020	Complementing Representation Deficiency in Few-shot Image Classification: A Meta-Learning Approach by Xian Zhong et al
07-21-2020	An Image Analogies Approach for Multi-Scale Contour Detection by Slimane Larabi et al
07-21-2020	Feature-metric Loss for Self-supervised Learning of Depth and Egomotion by Chang Shu et al
07-21-2020	A Computation-Efficient CNN System for High-Quality Brain Tumor Segmentation by Yanming Sun et al
07-22-2020	Deep-VFX: Deep Action Recognition Driven VFX for Short Video by Ao Luo et al
07-22-2020	CNN+RNN Depth and Skeleton based Dynamic Hand Gesture Recognition by Kenneth Lai et al
07-22-2020	Dog Identification using Soft Biometrics and Neural Networks by Kenneth Lai et al
07-21-2020	Learning to Compose Hypercolumns for Visual Correspondence by Juhong Min et al
07-23-2020	Are Visual Explanations Useful? A Case Study in Model-in-the-Loop Prediction by Eric Chu et al
07-21-2020	Exploiting Temporal Coherence for Self-Supervised One-shot Video Re-identification by Dripta S. Raychaudhuri et al
07-21-2020	Deep Semi-supervised Knowledge Distillation for Overlapping Cervical Cell Instance Segmentation by Yanning Zhou et al
07-22-2020	Learnable Descent Algorithm for Nonsmooth Nonconvex Image Reconstruction by Yunmei Chen et al
07-22-2020	Risk Assessment in the Face-based Watchlist Screening in e-Border by Kenneth Lai et al
07-22-2020	Video-ception Network: Towards Multi-Scale Efficient Asymmetric Spatial-Temporal Interactions by Yuan Tian et al
07-22-2020	Attend and Segment: Attention Guided Active Semantic Segmentation by Soroush Seifi et al
07-21-2020	Instance-aware Self-supervised Learning for Nuclei Segmentation by Xinpeng Xie et al
07-24-2020	Self-Supervised Monocular 3D Face Reconstruction by Occlusion-Aware Multi-view Geometry Consistency by Jiaxiang Shang et al
07-22-2020	Real-Time Instrument Segmentation in Robotic Surgery using Auxiliary Supervised Deep Adversarial Learning by Mobarakol Islam et al
07-22-2020	FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition by Wenqing Zhang et al
07-21-2020	Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement by Jian Wang et al
07-21-2020	Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos by Anurag Arnab et al
07-22-2020	A weakly supervised registration-based framework for prostate segmentation via the combination of statistical shape model and CNN by Chunxia Qin et al
07-22-2020	Adma: A Flexible Loss Function for Neural Networks by Aditya Shrivastava
07-22-2020	Combining Implicit Function Learning and Parametric Models for 3D Human Reconstruction by Bharat Lal Bhatnagar et al

07-24-2020	An LSTM Approach to Temporal 3D Object Detection in LiDAR Point Clouds by Rui Huang et al
07-21-2020	Novel View Synthesis on Unpaired Data by Conditional Deformable Variational Auto-Encoder by Mingyu Yin et al
07-21-2020	Video Super-resolution with Temporal Group Attention by Takashi Isobe et al
07-21-2020	Learning Person Re-identification Models from Videos with Weak Supervision by Xueping Wang et al
07-21-2020	MI^2GAN: Generative Adversarial Network for Medical Image Domain Adaptation using Mutual Information Constraint by Xinpeng Xie et al
07-24-2020	Multi-view adaptive graph convolutions for graph classification by Nikolas Adaloglou et al
07-21-2020	Directional Temporal Modeling for Action Recognition by Xinyu Li et al
07-21-2020	Multi-modal Transformer for Video Retrieval by Valentin Gabeur et al
07-21-2020	BorderDet: Border Feature for Dense Object Detection by Han Qiu et al
07-24-2020	Reparameterizing Convolutions for Incremental Multi-Task Learning without Task Interference by Menelaos Kanakis et al
07-21-2020	Soft Expert Reward Learning for Vision-and-Language Navigation by Hu Wang et al
07-21-2020	Procrustean Regression Networks: Learning 3D Structure of Non-Rigid Objects from 2D Annotations by Sungheon Park et al
07-21-2020	Fine-Grained Image Captioning with Global-Local Discriminative Objective by Jie Wu et al
07-22-2020	Leveraging Undiagnosed Data for Glaucoma Classification with Teacher-Student Learning by Junde Wu et al
07-21-2020	Camera On-boarding for Person Re-identification using Hypothesis Transfer Learning by Sk Miraj Ahmed et al
07-22-2020	Greenhouse Segmentation on High-Resolution Optical Satellite Imagery using Deep Learning Techniques by Orkhan Baghirli et al
07-22-2020	Fragments-Expert: A Graphical User Interface MATLAB Toolbox for Classification of File Fragments by Mehdi Teimouri et al
07-21-2020	Lymphocyte counting -- Error Analysis of Regression versus Bounding Box Detection Approaches by Lin Geng Foo et al
07-22-2020	Watchlist Risk Assessment using Multiparametric Cost and Relative Entropy by K. Lai et al
07-22-2020	Multi-Spectral Facial Biometrics in Access Control by K. Lai et al
07-21-2020	Video Representation Learning by Recognizing Temporal Transformations by Simon Jenni et al
07-21-2020	Recurrent Exposure Generation for Low-Light Face Detection by Jinxiu Liang et al
07-21-2020	Multi-person 3D Pose Estimation in Crowded Scenes Based on Multi-View Geometry by He Chen et al
07-21-2020	Optimization of data-driven filterbank for automatic speaker verification by Susanta Sarangi et al
07-21-2020	Joint Visual and Temporal Consistency for Unsupervised Domain Adaptive Person Re-Identification by Jianing Li et al
07-21-2020	Multi-label Thoracic Disease Image Classification with Cross-Attention Networks by Congbo Ma et al
07-21-2020	Balance Scene Learning Mechanism for Offshore and Inshore Ship Detection in SAR Images by Tianwen Zhang et al
07-24-2020	Hallucinating Saliency Maps for Fine-Grained Image Classification for Limited Data Domains by Carola Figueroa-Flores et al
07-24-2020	Visual Compositional Learning for Human-Object Interaction Detection by Zhi Hou et al
07-22-2020	DeepCLR: Correspondence-Less Architecture for Deep End-to-End Point Cloud Registration by Markus Horn et al
07-21-2020	FLOT: Scene Flow on Point Clouds Guided by Optimal Transport by Gilles Puy et al
07-22-2020	Endo-Sim2Real: Consistency learning-based domain adaptation for instrument segmentation by Manish Sahu et al
07-21-2020	One Click Lesion RECIST Measurement and Segmentation on CT Scans by Youbao Tang et al
07-24-2020	Approximately Optimal Binning for the Piecewise Constant Approximation of the Normalized Unexplained Variance (nUV) Dissimilarity Measure by Attila Fazekas et al
07-24-2020	KPRNet: Improving projection-based LiDAR semantic segmentation by Deyvid Kochanov et al
07-22-2020	Depthwise Spatio-Temporal STFT Convolutional Neural Networks for Human Action Recognition by Sudhakar Kumawat et al
07-22-2020	End-to-End Trainable Deep Active Contour Models for Automated Image Segmentation: Delineating Buildings in Aerial Imagery by Ali Hatamizadeh et al
07-24-2020	Fully Convolutional Networks for Continuous Sign Language Recognition by Ka Leong Cheng et al
07-21-2020	Fully Automated Segmentation of the Left Ventricle in Magnetic Resonance Images by ZiHao Wang et al
07-23-2020	COVID TV-UNet: Segmenting COVID-19 Chest CT Images Using Connectivity Imposed U-Net by Narges Saeedizadeh et al
07-21-2020	Relative Pose Estimation for Multi-Camera Systems from Affine Correspondences by Banglei Guan et al
07-24-2020	Micro-expression spotting: A new benchmark by Thuong-Khanh Tran et al
07-21-2020	A Deep Ordinal Distortion Estimation Approach for Distortion Rectification by Kang Liao et al
07-24-2020	MiCo: Mixup Co-Training for Semi-Supervised Domain Adaptation by Luyu Yang et al
07-21-2020	Learning Object Relation Graph and Tentative Policy for Visual Navigation by Heming Du et al
07-23-2020	ZSCRGAN: A GAN-based Expectation Maximization Model for Zero-Shot Retrieval of Images from Textual Descriptions by Anurag Roy et al
07-21-2020	Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking by Jianfeng Yan et al
07-22-2020	Deep Models and Shortwave Infrared Information to Detect Face Presentation Attacks by Guillaume Heusch et al
07-22-2020	Feature based Sequential Classifier with Attention Mechanism by Sudhir Sornapudi et al
07-24-2020	On the Effectiveness of Image Rotation for Open Set Domain Adaptation by Silvia Bucci et al
07-24-2020	Self-Supervised Learning Across Domains by Silvia Bucci et al
07-21-2020	Representative-Discriminative Learning for Open-set Land Cover Classification of Satellite Imagery by Razieh Kaviani Baghbaderani et al
07-21-2020	Enhancement of damaged-image prediction through Cahn-Hilliard Image Inpainting by José A. Carrillo et al
07-24-2020	Learning Crisp Edge Detector Using Logical Refinement Network by Luyan Liu et al
07-24-2020	Study of Different Deep Learning Approach with Explainable AI for Screening Patients with COVID-19 Symptoms: Using CT Scan and Chest X-ray Image Dataset by Md Manjurul Ahsan et al
07-24-2020	HEU Emotion: A Large-scale Database for Multi-modal Emotion Recognition in the Wild by Jing Chen et al
07-24-2020	Map-Repair: Deep Cadastre Maps Alignment and Temporal Inconsistencies Fix in Satellite Images by Stefano Zorzi et al
07-24-2020	Real-World Multi-Domain Data Applications for Generalizations to Clinical Settings by Nooshin Mojab et al
07-24-2020	Machine-learned Regularization and Polygonization of Building Segmentation Masks by Stefano Zorzi et al
07-24-2020	Stain Style Transfer of Histopathology Images Via Structure-Preserved Generative Learning by Hanwen Liang et al
07-22-2020	Learning Disentangled Feature Representation for Hybrid-distorted Image Restoration by Xin Li et al
07-24-2020	Commonality-Parsing Network across Shape and Appearance for Partially Supervised Instance Segmentation by Qi Fan et al
07-24-2020	Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach by Chaitanya Ahuja et al
07-22-2020	Human-Centered Unsupervised Segmentation Fusion by Gregor Koporec et al
07-22-2020	Learning Directional Feature Maps for Cardiac MRI Segmentation by Feng Cheng et al
07-23-2020	Towards Recognizing Unseen Categories in Unseen Domains by Massimiliano Mancini et al
07-24-2020	Performance analysis of weighted low rank model with sparse image histograms for face recognition under lowlevel illumination and occlusion by K. V. Sridhar et al
07-23-2020	Parkinsons Disease Detection with Ensemble Architectures based on ILSVRC Models by Tahjid Ashfaque Mostafa et al
07-23-2020	SeismoGlow -- Data augmentation for the class imbalance problem by Ruy Luiz Milidiú et al
07-23-2020	Locality-Aware Rotated Ship Detection in High-Resolution Remote Sensing Imagery Based on Multi-Scale Convolutional Network by Lingyi Liu et al
07-22-2020	Learning One Class Representations for Face Presentation Attack Detection using Multi-channel Convolutional Neural Networks by Anjith George et al
07-23-2020	Frequency Domain-based Perceptual Loss for Super Resolution by Shane D. Sims
07-21-2020	A Hybrid Neuromorphic Object Tracking and Classification Framework for Real-time Systems by Andres Ussa et al
07-23-2020	Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating Back-Propagation for Saliency Detection by Jing Zhang et al
07-24-2020	A Comprehensive Study on Sign Language Recognition Methods by Nikolas Adaloglou et al

Craig SmithJuly 27, 2020