2020.5.4 Vision papers

04-30-2020	Consistent Video Depth Estimation by Xuan Luo et al
04-30-2020	CNN Explainer: Learning Convolutional Neural Networks with Interactive Visualization by Zijie J. Wang et al
04-28-2020	Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels by Ilya Kostrikov et al
04-28-2020	Learning Feature Descriptors using Camera Pose Supervision by Qianqian Wang et al
04-30-2020	Improving Vision-and-Language Navigation with Image-Text Pairs from the Web by Arjun Majumdar et al
04-29-2020	Editing in Style: Uncovering the Local Semantics of GANs by Edo Collins et al
04-29-2020	MobileDets: Searching for Object Detection Architectures for Mobile Accelerators by Yunyang Xiong et al
04-28-2020	DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning by Timo Milbich et al
04-28-2020	VD-BERT: A Unified Vision and Dialog Transformer with BERT by Yue Wang et al
04-30-2020	SS3D: Single Shot 3D Object Detector by Aniket Limaye et al
04-29-2020	VGGSound: A Large-scale Audio-Visual Dataset by Honglie Chen et al
04-29-2020	Interactive Video Stylization Using Few-Shot Patch-Based Training by Ondřej Texler et al
04-28-2020	Neural Hair Rendering by Menglei Chai et al
04-29-2020	Pragmatic Issue-Sensitive Image Captioning by Allen Nie et al
05-01-2020	Adversarial Synthesis of Human Pose from Text by Yifei Zhang et al
04-29-2020	UAV and Machine Learning Based Refinement of a Satellite-Driven Vegetation Index for Precision Agriculture by Vittorio Mazzia et al
04-30-2020	MuSe 2020 -- The First International Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop by Lukas Stappen et al
04-30-2020	EXACT: A collaboration toolset for algorithm-aided annotation of almost everything by Christian Marzahl et al
04-30-2020	Progressive Transformers for End-to-End Sign Language Production by Ben Saunders et al
04-30-2020	Out-of-the-box channel pruned networks by Ragav Venkatesan et al
04-29-2020	Physarum Powered Differentiable Linear Programming Layers and Applications by Zihang Meng et al
04-29-2020	Detecting Deep-Fake Videos from Appearance and Behavior by Shruti Agarwal et al
04-28-2020	Multi-task Learning with Crowdsourced Features Improves Skin Lesion Diagnosis by Ralf Raumanns et al
04-28-2020	Do We Need Fully Connected Output Layers in Convolutional Networks? by Zhongchao Qian et al
04-28-2020	Pyramid Attention Networks for Image Restoration by Yiqun Mei et al
04-30-2020	DIABLO: Dictionary-based Attention Block for Deep Metric Learning by Pierre Jacob et al
04-30-2020	Polarization Human Shape and Pose Dataset by Shihao Zou et al
04-30-2020	Improving Semantic Segmentation via Self-Training by Yi Zhu et al
04-30-2020	HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training by Linjie Li et al
04-29-2020	APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals by Jiangning Zhang et al
04-30-2020	PreCNet: Next Frame Video Prediction Based on Predictive Coding by Zdenek Straka et al
04-28-2020	Exploring Self-attention for Image Recognition by Hengshuang Zhao et al
04-29-2020	Salient Object Detection Combining a Self-attention Module and a Feature Pyramid Network by Guangyu Ren et al
04-30-2020	Polygonal Building Segmentation by Frame Field Learning by Nicolas Girard et al
04-30-2020	Towards Embodied Scene Description by Sinan Tan et al
04-30-2020	The 4th AI City Challenge by Milind Naphade et al
04-29-2020	Bias-corrected estimator for intrinsic dimension and differential entropy--a visual multiscale approach by Jugurta Montalvão et al
04-30-2020	Generative Adversarial Networks in Digital Pathology: A Survey on Trends and Future Potential by Maximilian Ernst Tschuchnig et al
04-28-2020	The Immersion of Directed Multi-graphs in Embedding Fields. Generalisations by Bogdan Bocse et al
04-30-2020	Multi-View Spectral Clustering Tailored Tensor Low-Rank Representation by Yuheng Jia et al
05-01-2020	The AVA-Kinetics Localized Human Actions Video Dataset by Ang Li et al
04-29-2020	Multiresolution and Multimodal Speech Recognition with Transformers by Georgios Paraskevopoulos et al
04-29-2020	Rethinking Class-Discrimination Based CNN Channel Pruning by Yuchen Liu et al
04-29-2020	Assessing Car Damage using Mask R-CNN by Sarath P et al
04-29-2020	TRP: Trained Rank Pruning for Efficient Deep Neural Networks by Yuhui Xu et al
04-30-2020	Dynamic Language Binding in Relational Visual Reasoning by Thao Minh Le et al
04-28-2020	Span-based Localizing Network for Natural Language Video Localization by Hao Zhang et al
04-29-2020	Teaching Cameras to Feel: Estimating Tactile Physical Properties of Surfaces From Images by Matthew Purri et al
04-30-2020	Inability of spatial transformations of CNN feature maps to support invariant recognition by Ylva Jansson et al
04-30-2020	Feedback U-net for Cell Image Segmentation by Eisuke Shibuya et al
04-30-2020	SimPropNet: Improved Similarity Propagation for Few-shot Image Segmentation by Siddhartha Gairola et al
04-29-2020	A Multi-scale Optimization Learning Framework for Diffeomorphic Deformable Registration by Risheng Liu et al
04-29-2020	Deep Transfer Learning For Plant Center Localization by Enyu Cai et al
04-30-2020	Bilateral Attention Network for RGB-D Salient Object Detection by Zhao Zhang et al
05-01-2020	Diverse Visuo-Lingustic Question Answering (DVLQA) Challenge by Shailaja Sampat et al
04-29-2020	The EPIC-KITCHENS Dataset: Collection, Challenges and Baselines by Dima Damen et al
04-29-2020	Image Morphing with Perceptual Constraints and STN Alignment by Noa Fish et al
04-28-2020	Minority Reports Defense: Defending Against Adversarial Patches by Michael McCoyd et al
04-28-2020	Event-based Robotic Grasping Detection with Neuromorphic Vision Sensor and Event-Stream Dataset by Bin Li et al
04-29-2020	Effective Human Activity Recognition Based on Small Datasets by Bruce X. B. Yu et al
04-29-2020	Zero-Shot Learning and its Applications from Autonomous Vehicles to COVID-19 Diagnosis: A Review by Mahdi Rezaei et al
04-29-2020	Video Contents Understanding using Deep Neural Networks by Mohammadhossein Toutiaee et al
04-29-2020	Beyond Instructional Videos: Probing for More Diverse Visual-Textual Grounding on YouTube by Jack Hessel et al
04-28-2020	Cross-modal Speaker Verification and Recognition: A Multilingual Perspective by Muhammad Saad Saeed et al
04-28-2020	An Auto-Encoder Strategy for Adaptive Image Segmentation by Evan M. Yu et al
05-01-2020	PCA-SRGAN: Incremental Orthogonal Projection Discrimination for Face Super-resolution by Hao Dou et al
04-29-2020	Informative Scene Decomposition for Crowd Analysis, Comparison and Simulation Guidance by Feixiang He et al
04-28-2020	Unifying Neural Learning and Symbolic Reasoning for Spinal Medical Report Generation by Zhongyi Han et al
04-29-2020	Minimal Rolling Shutter Absolute Pose with Unknown Focal Length and Radial Distortion by Zuzana Kukelova et al
05-01-2020	Distilling Spikes: Knowledge Distillation in Spiking Neural Networks by Ravi Kumar Kushawaha et al
04-28-2020	Revisiting Multi-Task Learning in the Deep Learning Era by Simon Vandenhende et al
04-28-2020	Less is More: Sample Selection and Label Conditioning Improve Skin Lesion Segmentation by Vinicius Ribeiro et al
04-29-2020	Retinal vessel segmentation by probing adaptive to lighting variations by Guillaume Noyel et al
04-28-2020	Deflating Dataset Bias Using Synthetic Data Augmentation by Nikita Jaipuria et al
04-28-2020	Identification of Cervical Pathology using Adversarial Neural Networks by Abhilash Nandy et al
04-28-2020	A novel Region of Interest Extraction Layer for Instance Segmentation by Leonardo Rossi et al
04-28-2020	Leveraging Photometric Consistency over Time for Sparsely Supervised Hand-Object Reconstruction by Yana Hasson et al
04-29-2020	Motion Guided 3D Pose Estimation from Videos by Jingbo Wang et al
04-28-2020	Transferable Active Grasping and Real Embodied Dataset by Xiangyu Chen et al
04-28-2020	Residual Channel Attention Generative Adversarial Network for Image Super-Resolution and Noise Reduction by Jie Cai et al
04-29-2020	Skeleton Focused Human Activity Recognition in RGB Video by Bruce X. B. Yu et al
04-28-2020	Gradient-Induced Co-Saliency Detection by Zhao Zhang et al
04-28-2020	Multi-Scale Boosted Dehazing Network with Dense Feature Fusion by Hang Dong et al
05-01-2020	A Naturalness Evaluation Database for Video Prediction Models by Nagabhushan Somraj et al
04-29-2020	DR-SPAAM: A Spatial-Attention and Auto-regressive Model for Person Detection in 2D Range Data by Dan Jia et al
04-28-2020	Small-Task Incremental Learning by Arthur Douillard et al
04-28-2020	Addressing Artificial Intelligence Bias in Retinal Disease Diagnostics by Philippe Burlina et al
04-30-2020	Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness by Pu Zhao et al
04-28-2020	FU-net: Multi-class Image Segmentation Using Feedback Weighted U-net by Mina Jafari et al
04-28-2020	Histogram-based Auto Segmentation: A Novel Approach to Segmenting Integrated Circuit Structures from SEM Images by Ronald Wilson et al
05-01-2020	Cross-modal Language Generation using Pivot Stabilization for Web-scale Language Coverage by Ashish V. Thapliyal et al
04-28-2020	Visual Grounding of Learned Physical Models by Yunzhu Li et al
04-29-2020	Single-Side Domain Generalization for Face Anti-Spoofing by Yunpei Jia et al
04-29-2020	Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervision by Soo-Whan Chung et al
05-01-2020	HLVU : A New Challenge to Test Deep Understanding of Movies the Way Humans do by Keith Curtis et al
04-29-2020	Image Captioning through Image Transformer by Sen He et al
04-29-2020	Deepfake Video Forensics based on Transfer Learning by Rahul U et al
04-28-2020	3D Solid Spherical Bispectrum CNNs for Biomedical Texture Analysis by Valentin Oreiller et al
04-29-2020	Counting of Grapevine Berries in Images via Semantic Segmentation using Convolutional Neural Networks by Laura Zabawa et al
05-01-2020	A Comprehensive Study on Visual Explanations for Spatio-temporal Networks by Zhenqiang Li et al
04-28-2020	Multivariate Confidence Calibration for Object Detection by Fabian Küppers et al
04-28-2020	Unmanned Aerial Systems for Wildland and Forest Fires: Sensing, Perception, Cooperation and Assistance by Moulay A. Akhloufi et al
04-28-2020	DRU-net: An Efficient Deep Convolutional Neural Network for Medical Image Segmentation by Mina Jafari et al
04-29-2020	Action Sequence Predictions of Vehicles in Urban Environments using Map and Social Context by Jan-Nico Zaech et al
04-28-2020	SSIM-Based CTU-Level Joint Optimal Bit Allocation and Rate Distortion Optimization by Yang Li et al
04-29-2020	Tensor train rank minimization with nonlocal self-similarity for tensor completion by Meng Ding et al
04-30-2020	Importance Driven Continual Learning for Segmentation Across Domains by Sinan Özgür Özgün et al

04-29-2020	A Fast 3D CNN for Hyperspectral Image Classification by Muhammad Ahmad
05-01-2020	Computing the Testing Error without a Testing Set by Ciprian Corneanu et al
05-01-2020	ACCL: Adversarial constrained-CNN loss for weakly supervised medical image segmentation by Pengyi Zhang et al
04-28-2020	Style-transfer GANs for bridging the domain gap in synthetic pose estimator training by Pavel Rojtberg et al
04-30-2020	A Novel Perspective to Zero-shot Learning: Towards an Alignment of Manifold Structures via Semantic Feature Expansion by Jingcai Guo et al
05-01-2020	Deeply Cascaded U-Net for Multi-Task Image Processing by Ilja Gubins et al
05-01-2020	Deepfake Forensics Using Recurrent Neural Networks by Rahul U et al
04-30-2020	M^3VSNet: Unsupervised Multi-metric Multi-view Stereo Network by Baichuan Huang et al
04-28-2020	Hybrid Attention for Automatic Segmentation of Whole Fetal Head in Prenatal Ultrasound Volumes by Xin Yang et al
05-01-2020	Multi-Camera Trajectory Forecasting: Pedestrian Trajectory Prediction in a Network of Cameras by Olly Styles et al
04-30-2020	Conceptual Design of Human-Drone Communication in Collaborative Environments by Hans Dermot Doran et al
04-30-2020	Survey on Reliable Deep Learning-Based Person Re-Identification Models: Are We There Yet? by Bahram Lavi et al
05-01-2020	MOPS-Net: A Matrix Optimization-driven Network forTask-Oriented 3D Point Cloud Downsampling by Yue Qian et al
04-30-2020	Attentive Weakly Supervised land cover mapping for object-based satellite image time series data with spatial interpretation by Dino Ienco et al
05-01-2020	Recognizing American Sign Language Nonmanual Signal Grammar Errors in Continuous Videos by Elahe Vahdani et al
05-01-2020	Aggregation and Finetuning for Clothes Landmark Detection by Tzu-Heng Lin
04-30-2020	Occlusion resistant learning of intuitive physics from videos by Ronan Riochet et al
04-28-2020	Real-Time Apple Detection System Using Embedded Systems With Hardware Accelerators: An Edge AI Application by Vittorio Mazzia et al
04-30-2020	CP-NAS: Child-Parent Neural Architecture Search for 1-bit CNNs by Li'an Zhuo et al
04-28-2020	Classifying Image Sequences of Astronomical Transients with Deep Neural Networks by Catalina Gómez et al
05-01-2020	Defocus Deblurring Using Dual-Pixel Data by Abdullah Abuolaim et al
04-30-2020	Generative Adversarial Data Programming by Arghya Pal et al
04-30-2020	Pedestrian Path, Pose and Intention Prediction through Gaussian Process Dynamical Models and Pedestrian Activity Recognition by Raul Quintero et al
04-30-2020	Sequence Information Channel Concatenation for Improving Camera Trap Image Burst Classification by Bhuvan Malladihalli Shashidhara et al
05-01-2020	Investigating Class-level Difficulty Factors in Multi-label Classification Problems by Mark Marsden et al
04-30-2020	Domain Siamese CNNs for Sparse Multispectral Disparity Estimation by David-Alexandre Beaupre et al
04-28-2020	SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing by Xue Yang et al
04-30-2020	Unsupervised Lesion Detection via Image Restoration with a Normative Prior by Xiaoran Chen et al
04-29-2020	The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge: A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset by Arjun D. Desai et al
05-01-2020	An Efficient Integration of Disentangled Attended Expression and Identity FeaturesFor Facial Expression Transfer andSynthesis by Kamran Ali et al

Craig SmithMay 4, 2020