2020.5.4 Vision papers

 

04-30-2020

Consistent Video Depth Estimation
by Xuan Luo et al

04-30-2020

CNN Explainer: Learning Convolutional Neural Networks with Interactive Visualization
by Zijie J. Wang et al

04-28-2020

Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels
by Ilya Kostrikov et al

04-28-2020

Learning Feature Descriptors using Camera Pose Supervision
by Qianqian Wang et al

04-30-2020

Improving Vision-and-Language Navigation with Image-Text Pairs from the Web
by Arjun Majumdar et al

04-29-2020

Editing in Style: Uncovering the Local Semantics of GANs
by Edo Collins et al

04-29-2020

MobileDets: Searching for Object Detection Architectures for Mobile Accelerators
by Yunyang Xiong et al

04-28-2020

DiVA: Diverse Visual Feature Aggregation for Deep Metric Learning
by Timo Milbich et al

04-28-2020

VD-BERT: A Unified Vision and Dialog Transformer with BERT
by Yue Wang et al

04-30-2020

SS3D: Single Shot 3D Object Detector
by Aniket Limaye et al

04-29-2020

VGGSound: A Large-scale Audio-Visual Dataset
by Honglie Chen et al

04-29-2020

Interactive Video Stylization Using Few-Shot Patch-Based Training
by Ondřej Texler et al

04-28-2020

Neural Hair Rendering
by Menglei Chai et al

04-29-2020

Pragmatic Issue-Sensitive Image Captioning
by Allen Nie et al

05-01-2020

Adversarial Synthesis of Human Pose from Text
by Yifei Zhang et al

04-29-2020

UAV and Machine Learning Based Refinement of a Satellite-Driven Vegetation Index for Precision Agriculture
by Vittorio Mazzia et al

04-30-2020

MuSe 2020 -- The First International Multimodal Sentiment Analysis in Real-life Media Challenge and Workshop
by Lukas Stappen et al

04-30-2020

EXACT: A collaboration toolset for algorithm-aided annotation of almost everything
by Christian Marzahl et al

04-30-2020

Progressive Transformers for End-to-End Sign Language Production
by Ben Saunders et al

04-30-2020

Out-of-the-box channel pruned networks
by Ragav Venkatesan et al

04-29-2020

Physarum Powered Differentiable Linear Programming Layers and Applications
by Zihang Meng et al

04-29-2020

Detecting Deep-Fake Videos from Appearance and Behavior
by Shruti Agarwal et al

04-28-2020

Multi-task Learning with Crowdsourced Features Improves Skin Lesion Diagnosis
by Ralf Raumanns et al

04-28-2020

Do We Need Fully Connected Output Layers in Convolutional Networks?
by Zhongchao Qian et al

04-28-2020

Pyramid Attention Networks for Image Restoration
by Yiqun Mei et al

04-30-2020

DIABLO: Dictionary-based Attention Block for Deep Metric Learning
by Pierre Jacob et al

04-30-2020

Polarization Human Shape and Pose Dataset
by Shihao Zou et al

04-30-2020

Improving Semantic Segmentation via Self-Training
by Yi Zhu et al

04-30-2020

HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training
by Linjie Li et al

04-29-2020

APB2Face: Audio-guided face reenactment with auxiliary pose and blink signals
by Jiangning Zhang et al

04-30-2020

PreCNet: Next Frame Video Prediction Based on Predictive Coding
by Zdenek Straka et al

04-28-2020

Exploring Self-attention for Image Recognition
by Hengshuang Zhao et al

04-29-2020

Salient Object Detection Combining a Self-attention Module and a Feature Pyramid Network
by Guangyu Ren et al

04-30-2020

Polygonal Building Segmentation by Frame Field Learning
by Nicolas Girard et al

04-30-2020

Towards Embodied Scene Description
by Sinan Tan et al

04-30-2020

The 4th AI City Challenge
by Milind Naphade et al

04-29-2020

Bias-corrected estimator for intrinsic dimension and differential entropy--a visual multiscale approach
by Jugurta Montalvão et al

04-30-2020

Generative Adversarial Networks in Digital Pathology: A Survey on Trends and Future Potential
by Maximilian Ernst Tschuchnig et al

04-28-2020

The Immersion of Directed Multi-graphs in Embedding Fields. Generalisations
by Bogdan Bocse et al

04-30-2020

Multi-View Spectral Clustering Tailored Tensor Low-Rank Representation
by Yuheng Jia et al

05-01-2020

The AVA-Kinetics Localized Human Actions Video Dataset
by Ang Li et al

04-29-2020

Multiresolution and Multimodal Speech Recognition with Transformers
by Georgios Paraskevopoulos et al

04-29-2020

Rethinking Class-Discrimination Based CNN Channel Pruning
by Yuchen Liu et al

04-29-2020

Assessing Car Damage using Mask R-CNN
by Sarath P et al

04-29-2020

TRP: Trained Rank Pruning for Efficient Deep Neural Networks
by Yuhui Xu et al

04-30-2020

Dynamic Language Binding in Relational Visual Reasoning
by Thao Minh Le et al

04-28-2020

Span-based Localizing Network for Natural Language Video Localization
by Hao Zhang et al

04-29-2020

Teaching Cameras to Feel: Estimating Tactile Physical Properties of Surfaces From Images
by Matthew Purri et al

04-30-2020

Inability of spatial transformations of CNN feature maps to support invariant recognition
by Ylva Jansson et al

04-30-2020

Feedback U-net for Cell Image Segmentation
by Eisuke Shibuya et al

04-30-2020

SimPropNet: Improved Similarity Propagation for Few-shot Image Segmentation
by Siddhartha Gairola et al

04-29-2020

A Multi-scale Optimization Learning Framework for Diffeomorphic Deformable Registration
by Risheng Liu et al

04-29-2020

Deep Transfer Learning For Plant Center Localization
by Enyu Cai et al

04-30-2020

Bilateral Attention Network for RGB-D Salient Object Detection
by Zhao Zhang et al

05-01-2020

Diverse Visuo-Lingustic Question Answering (DVLQA) Challenge
by Shailaja Sampat et al

04-29-2020

The EPIC-KITCHENS Dataset: Collection, Challenges and Baselines
by Dima Damen et al

04-29-2020

Image Morphing with Perceptual Constraints and STN Alignment
by Noa Fish et al

04-28-2020

Minority Reports Defense: Defending Against Adversarial Patches
by Michael McCoyd et al

04-28-2020

Event-based Robotic Grasping Detection with Neuromorphic Vision Sensor and Event-Stream Dataset
by Bin Li et al

04-29-2020

Effective Human Activity Recognition Based on Small Datasets
by Bruce X. B. Yu et al

04-29-2020

Zero-Shot Learning and its Applications from Autonomous Vehicles to COVID-19 Diagnosis: A Review
by Mahdi Rezaei et al

04-29-2020

Video Contents Understanding using Deep Neural Networks
by Mohammadhossein Toutiaee et al

04-29-2020

Beyond Instructional Videos: Probing for More Diverse Visual-Textual Grounding on YouTube
by Jack Hessel et al

04-28-2020

Cross-modal Speaker Verification and Recognition: A Multilingual Perspective
by Muhammad Saad Saeed et al

04-28-2020

An Auto-Encoder Strategy for Adaptive Image Segmentation
by Evan M. Yu et al

05-01-2020

PCA-SRGAN: Incremental Orthogonal Projection Discrimination for Face Super-resolution
by Hao Dou et al

04-29-2020

Informative Scene Decomposition for Crowd Analysis, Comparison and Simulation Guidance
by Feixiang He et al

04-28-2020

Unifying Neural Learning and Symbolic Reasoning for Spinal Medical Report Generation
by Zhongyi Han et al

04-29-2020

Minimal Rolling Shutter Absolute Pose with Unknown Focal Length and Radial Distortion
by Zuzana Kukelova et al

05-01-2020

Distilling Spikes: Knowledge Distillation in Spiking Neural Networks
by Ravi Kumar Kushawaha et al

04-28-2020

Revisiting Multi-Task Learning in the Deep Learning Era
by Simon Vandenhende et al

04-28-2020

Less is More: Sample Selection and Label Conditioning Improve Skin Lesion Segmentation
by Vinicius Ribeiro et al

04-29-2020

Retinal vessel segmentation by probing adaptive to lighting variations
by Guillaume Noyel et al

04-28-2020

Deflating Dataset Bias Using Synthetic Data Augmentation
by Nikita Jaipuria et al

04-28-2020

Identification of Cervical Pathology using Adversarial Neural Networks
by Abhilash Nandy et al

04-28-2020

A novel Region of Interest Extraction Layer for Instance Segmentation
by Leonardo Rossi et al

04-28-2020

Leveraging Photometric Consistency over Time for Sparsely Supervised Hand-Object Reconstruction
by Yana Hasson et al

04-29-2020

Motion Guided 3D Pose Estimation from Videos
by Jingbo Wang et al

04-28-2020

Transferable Active Grasping and Real Embodied Dataset
by Xiangyu Chen et al

04-28-2020

Residual Channel Attention Generative Adversarial Network for Image Super-Resolution and Noise Reduction
by Jie Cai et al

04-29-2020

Skeleton Focused Human Activity Recognition in RGB Video
by Bruce X. B. Yu et al

04-28-2020

Gradient-Induced Co-Saliency Detection
by Zhao Zhang et al

04-28-2020

Multi-Scale Boosted Dehazing Network with Dense Feature Fusion
by Hang Dong et al

05-01-2020

A Naturalness Evaluation Database for Video Prediction Models
by Nagabhushan Somraj et al

04-29-2020

DR-SPAAM: A Spatial-Attention and Auto-regressive Model for Person Detection in 2D Range Data
by Dan Jia et al

04-28-2020

Small-Task Incremental Learning
by Arthur Douillard et al

04-28-2020

Addressing Artificial Intelligence Bias in Retinal Disease Diagnostics
by Philippe Burlina et al

04-30-2020

Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness
by Pu Zhao et al

04-28-2020

FU-net: Multi-class Image Segmentation Using Feedback Weighted U-net
by Mina Jafari et al

04-28-2020

Histogram-based Auto Segmentation: A Novel Approach to Segmenting Integrated Circuit Structures from SEM Images
by Ronald Wilson et al

05-01-2020

Cross-modal Language Generation using Pivot Stabilization for Web-scale Language Coverage
by Ashish V. Thapliyal et al

04-28-2020

Visual Grounding of Learned Physical Models
by Yunzhu Li et al

04-29-2020

Single-Side Domain Generalization for Face Anti-Spoofing
by Yunpei Jia et al

04-29-2020

Seeing voices and hearing voices: learning discriminative embeddings using cross-modal self-supervision
by Soo-Whan Chung et al

05-01-2020

HLVU : A New Challenge to Test Deep Understanding of Movies the Way Humans do
by Keith Curtis et al

04-29-2020

Image Captioning through Image Transformer
by Sen He et al

04-29-2020

Deepfake Video Forensics based on Transfer Learning
by Rahul U et al

04-28-2020

3D Solid Spherical Bispectrum CNNs for Biomedical Texture Analysis
by Valentin Oreiller et al

04-29-2020

Counting of Grapevine Berries in Images via Semantic Segmentation using Convolutional Neural Networks
by Laura Zabawa et al

05-01-2020

A Comprehensive Study on Visual Explanations for Spatio-temporal Networks
by Zhenqiang Li et al

04-28-2020

Multivariate Confidence Calibration for Object Detection
by Fabian Küppers et al

04-28-2020

Unmanned Aerial Systems for Wildland and Forest Fires: Sensing, Perception, Cooperation and Assistance
by Moulay A. Akhloufi et al

04-28-2020

DRU-net: An Efficient Deep Convolutional Neural Network for Medical Image Segmentation
by Mina Jafari et al

04-29-2020

Action Sequence Predictions of Vehicles in Urban Environments using Map and Social Context
by Jan-Nico Zaech et al

04-28-2020

SSIM-Based CTU-Level Joint Optimal Bit Allocation and Rate Distortion Optimization
by Yang Li et al

04-29-2020

Tensor train rank minimization with nonlocal self-similarity for tensor completion
by Meng Ding et al

04-30-2020

Importance Driven Continual Learning for Segmentation Across Domains
by Sinan Özgür Özgün et al

04-29-2020

A Fast 3D CNN for Hyperspectral Image Classification
by Muhammad Ahmad

05-01-2020

Computing the Testing Error without a Testing Set
by Ciprian Corneanu et al

05-01-2020

ACCL: Adversarial constrained-CNN loss for weakly supervised medical image segmentation
by Pengyi Zhang et al

04-28-2020

Style-transfer GANs for bridging the domain gap in synthetic pose estimator training
by Pavel Rojtberg et al

04-30-2020

A Novel Perspective to Zero-shot Learning: Towards an Alignment of Manifold Structures via Semantic Feature Expansion
by Jingcai Guo et al

05-01-2020

Deeply Cascaded U-Net for Multi-Task Image Processing
by Ilja Gubins et al

05-01-2020

Deepfake Forensics Using Recurrent Neural Networks
by Rahul U et al

04-30-2020

M^3VSNet: Unsupervised Multi-metric Multi-view Stereo Network
by Baichuan Huang et al

04-28-2020

Hybrid Attention for Automatic Segmentation of Whole Fetal Head in Prenatal Ultrasound Volumes
by Xin Yang et al

05-01-2020

Multi-Camera Trajectory Forecasting: Pedestrian Trajectory Prediction in a Network of Cameras
by Olly Styles et al

04-30-2020

Conceptual Design of Human-Drone Communication in Collaborative Environments
by Hans Dermot Doran et al

04-30-2020

Survey on Reliable Deep Learning-Based Person Re-Identification Models: Are We There Yet?
by Bahram Lavi et al

05-01-2020

MOPS-Net: A Matrix Optimization-driven Network forTask-Oriented 3D Point Cloud Downsampling
by Yue Qian et al

04-30-2020

Attentive Weakly Supervised land cover mapping for object-based satellite image time series data with spatial interpretation
by Dino Ienco et al

05-01-2020

Recognizing American Sign Language Nonmanual Signal Grammar Errors in Continuous Videos
by Elahe Vahdani et al

05-01-2020

Aggregation and Finetuning for Clothes Landmark Detection
by Tzu-Heng Lin

04-30-2020

Occlusion resistant learning of intuitive physics from videos
by Ronan Riochet et al

04-28-2020

Real-Time Apple Detection System Using Embedded Systems With Hardware Accelerators: An Edge AI Application
by Vittorio Mazzia et al

04-30-2020

CP-NAS: Child-Parent Neural Architecture Search for 1-bit CNNs
by Li'an Zhuo et al

04-28-2020

Classifying Image Sequences of Astronomical Transients with Deep Neural Networks
by Catalina Gómez et al

05-01-2020

Defocus Deblurring Using Dual-Pixel Data
by Abdullah Abuolaim et al

04-30-2020

Generative Adversarial Data Programming
by Arghya Pal et al

04-30-2020

Pedestrian Path, Pose and Intention Prediction through Gaussian Process Dynamical Models and Pedestrian Activity Recognition
by Raul Quintero et al

04-30-2020

Sequence Information Channel Concatenation for Improving Camera Trap Image Burst Classification
by Bhuvan Malladihalli Shashidhara et al

05-01-2020

Investigating Class-level Difficulty Factors in Multi-label Classification Problems
by Mark Marsden et al

04-30-2020

Domain Siamese CNNs for Sparse Multispectral Disparity Estimation
by David-Alexandre Beaupre et al

04-28-2020

SCRDet++: Detecting Small, Cluttered and Rotated Objects via Instance-Level Feature Denoising and Rotation Loss Smoothing
by Xue Yang et al

04-30-2020

Unsupervised Lesion Detection via Image Restoration with a Normative Prior
by Xiaoran Chen et al

04-29-2020

The International Workshop on Osteoarthritis Imaging Knee MRI Segmentation Challenge: A Multi-Institute Evaluation and Analysis Framework on a Standardized Dataset
by Arjun D. Desai et al

05-01-2020

An Efficient Integration of Disentangled Attended Expression and Identity FeaturesFor Facial Expression Transfer andSynthesis
by Kamran Ali et al

 
Craig Smith