2020.7.27 Vision papers

 

07-22-2020

Contact and Human Dynamics from Monocular Video
by Davis Rempe et al

07-21-2020

Towards Nonlinear Disentanglement in Natural Data with Temporal Sparse Coding
by David Klindt et al

07-22-2020

DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation
by Alexandre Carlier et al

07-21-2020

Accelerating Deep Learning Applications in Space
by Martina Lofqvist et al

07-21-2020

Shape and Viewpoint without Keypoints
by Shubham Goel et al

07-22-2020

CrossTransformers: spatially-aware few-shot transfer
by Carl Doersch et al

07-21-2020

PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding
by Saining Xie et al

07-23-2020

Whole-Body Human Pose Estimation in the Wild
by Sheng Jin et al

07-22-2020

Neural Sparse Voxel Fields
by Lingjie Liu et al

07-22-2020

Unsupervised Shape and Pose Disentanglement for 3D Meshes
by Keyang Zhou et al

07-21-2020

Learning Monocular Visual Odometry via Self-Supervised Long-Term Modeling
by Yuliang Zou et al

07-21-2020

Neural Mesh Flow: 3D Manifold Mesh Generation via Diffeomorphic Flows
by Kunal Gupta et al

07-23-2020

TSIT: A Simple and Versatile Framework for Image-to-Image Translation
by Liming Jiang et al

07-23-2020

Smooth-AP: Smoothing the Path Towards Large-Scale Image Retrieval
by Andrew Brown et al

07-23-2020

Bridging the Imitation Gap by Adaptive Insubordination
by Luca Weihs et al

07-23-2020

Spatially Aware Multimodal Transformers for TextVQA
by Yash Kant et al

07-24-2020

The Surprising Effectiveness of Linear Unsupervised Image-to-Image Translation
by Eitan Richardson et al

07-23-2020

PP-YOLO: An Effective and Efficient Implementation of Object Detector
by Xiang Long et al

07-22-2020

Analogical Reasoning for Visually Grounded Language Acquisition
by Bo Wu et al

07-22-2020

Adversarial Training Reduces Information and Improves Transferability
by Matteo Terzi et al

07-23-2020

Funnel Activation for Visual Recognition
by Ningning Ma et al

07-22-2020

Threat of Adversarial Attacks on Face Recognition: A Comprehensive Survey
by Fatemeh Vakhshiteh et al

07-21-2020

Garment Design with Generative Adversarial Networks
by Chenxi Yuan et al

07-22-2020

PareCO: Pareto-aware Channel Optimization for Slimmable Neural Networks
by Ting-Wu Chin et al

07-23-2020

SBAT: Video Captioning with Sparse Boundary-Aware Transformer
by Tao Jin et al

07-22-2020

Integrating Image Captioning with Rule-based Entity Masking
by Aditya Mogadala et al

07-22-2020

Cloud Transformers
by Kirill Mazur et al

07-21-2020

Foley Music: Learning to Generate Music from Videos
by Chuang Gan et al

07-22-2020

Deep Learning Based Segmentation of Various Brain Lesions for Radiosurgery
by Siang-Ruei Wu et al

07-22-2020

Darwin's Neural Network: AI-based Strategies for Rapid and Scalable Cell and Coronavirus Screening
by Sang Won Lee et al

07-21-2020

Deep Preset: Blending and Retouching Photos with Color Style Transfer
by Man M. Ho et al

07-23-2020

WeightNet: Revisiting the Design Space of Weight Networks
by Ningning Ma et al

07-23-2020

Sound2Sight: Generating Visual Dynamics from Sound and Context
by Anoop Cherian et al

07-23-2020

Enhanced Transfer Learning for Autonomous Driving with Systematic Accident Simulation
by Shivam Akhauri et al

07-23-2020

Body2Hands: Learning to Infer 3D Hands from Conversational Gesture Body Dynamics
by Evonne Ng et al

07-23-2020

HITNet: Hierarchical Iterative Tile Refinement Network for Real-time Stereo Matching
by Vladimir Tankovich et al

07-22-2020

History Repeats Itself: Human Motion Prediction via Motion Attention
by Wei Mao et al

07-23-2020

Right for the Right Reason: Making Image Classification Robust
by Anna Nguyen et al

07-22-2020

Tiny Transfer Learning: Towards Memory-Efficient On-Device Learning
by Han Cai et al

07-21-2020

Backdoor Attacks and Countermeasures on Deep Learning: A Comprehensive Review
by Yansong Gao et al

07-22-2020

Guided Deep Decoder: Unsupervised Image Pair Fusion
by Tatsumi Uezato et al

07-23-2020

Neural Geometric Parser for Single Image Camera Calibration
by Jinwoo Lee et al

07-23-2020

Weakly Supervised 3D Object Detection from Lidar Point Cloud
by Qinghao Meng et al

07-23-2020

PIPAL: a Large-Scale Image Quality Assessment Dataset for Perceptual Image Restoration
by Jinjin Gu et al

07-22-2020

Multi-Task Curriculum Framework for Open-Set Semi-Supervised Learning
by Qing Yu et al

07-23-2020

Accurate RGB-D Salient Object Detection via Collaborative Learning
by Wei Ji et al

07-23-2020

Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild
by Yang Xiao et al

07-23-2020

Zero-Shot Recognition through Image-Guided Semantic Classification
by Mei-Chen Yeh et al

07-22-2020

SIZER: A Dataset and Model for Parsing 3D Clothing and Learning Size Sensitive 3D Clothing
by Garvita Tiwari et al

07-23-2020

Implicit Latent Variable Model for Scene-Consistent Motion Forecasting
by Sergio Casas et al

07-22-2020

Comprehensive Image Captioning via Scene Graph Decomposition
by Yiwu Zhong et al

07-23-2020

Harnessing spatial homogeneity of neuroimaging data: patch individual filter layers for CNNs
by Fabian Eitel et al

07-23-2020

The Devil is in Classification: A Simple Framework for Long-tail Instance Segmentation
by Tao Wang et al

07-23-2020

Representation Sharing for Fast Object Detector Search and Beyond
by Yujie Zhong et al

07-22-2020

End-to-End Optimization of Scene Layout
by Andrew Luo et al

07-23-2020

End-to-end Learning of Compressible Features
by Saurabh Singh et al

07-23-2020

CAD-Deform: Deformable Fitting of CAD Models to 3D Scans
by Vladislav Ishimtsev et al

07-22-2020

Subjective and Objective Quality Assessment of High Frame Rate Videos
by Pavan C. Madhusudana et al

07-23-2020

ReLaB: Reliable Label Bootstrapping for Semi-Supervised Learning
by Paul Albert et al

07-21-2020

CVR-Net: A deep convolutional neural network for coronavirus recognition from chest radiography images
by Md. Kamrul Hasan et al

07-23-2020

A Study on Evaluation Standard for Automatic Crack Detection Regard the Random Fractal
by Hongyu Li et al

07-22-2020

Multi-modality imaging with structure-promoting regularisers
by Matthias J. Ehrhardt

07-21-2020

MovieNet: A Holistic Dataset for Movie Understanding
by Qingqiu Huang et al

07-23-2020

BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues
by Samuel Albanie et al

07-23-2020

MuCAN: Multi-Correspondence Aggregation Network for Video Super-Resolution
by Wenbo Li et al

07-23-2020

Autonomous Removal of Perspective Distortion based on Detection Results of Robotic Elevator Button Corner
by Nachuan Ma

07-23-2020

Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation
by Sheng Jin et al

07-22-2020

Attention based Multiple Instance Learning for Classification of Blood Cell Disorders
by Ario Sadafi et al

07-23-2020

Pixel-Pair Occlusion Relationship Map (P2ORM): Formulation, Inference & Application
by Xuchong Qiu et al

07-21-2020

Rethinking CNN Models for Audio Classification
by Kamalesh Palanisamy et al

07-24-2020

Artificial Intelligence in the Creative Industries: A Review
by Nantheera Anantrasirichai et al

07-21-2020

A Framework based on Deep Neural Networks to Extract Anatomy of Mosquitoes from Images
by Mona Minakshi et al

07-23-2020

AttentionNAS: Spatiotemporal Attention Cell Search for Video Classification
by Xiaofang Wang et al

07-22-2020

All at Once: Temporally Adaptive Multi-Frame Interpolation with Advanced Motion Modeling
by Zhixiang Chi et al

07-22-2020

Illumination invariant hyperspectral image unmixing based on a digital surface model
by Tatsumi Uezato et al

07-24-2020

Interpreting Spatially Infinite Generative Models
by Chaochao Lu et al

07-23-2020

Leveraging Bottom-Up and Top-Down Attention for Few-Shot Object Detection
by Xianyu Chen et al

07-24-2020

Unsupervised Discovery of 3D Physical Objects from Video
by Yilun Du et al

07-23-2020

Regularization of Building Boundaries in Satellite Images using Adversarial and Regularized Losses
by Stefano Zorzi et al

07-23-2020

A Solution to Product detection in Densely Packed Scenes
by Tianze Rong et al

07-21-2020

Sparse Nonnegative Tensor Factorization and Completion with Noisy Observations
by Xiongjun Zhang et al

07-21-2020

Who Left the Dogs Out? 3D Animal Reconstruction with Expectation Maximization in the Loop
by Benjamin Biggs et al

07-22-2020

Multi-Metric Evaluation of Thermal-to-Visual Face Recognition
by Kenneth Lai et al

07-22-2020

Unsupervised Deep Representation Learning for Real-Time Tracking
by Ning Wang et al

07-21-2020

Balanced Meta-Softmax for Long-Tailed Visual Recognition
by Jiawei Ren et al

07-21-2020

CyCNN: A Rotation Invariant CNN using Polar Mapping and Cylindrical Convolution Layers
by Jinpyo Kim et al

07-22-2020

Edge-aware Graph Representation Learning and Reasoning for Face Parsing
by Gusi Te et al

07-23-2020

Real-time CNN-based Segmentation Architecture for Ball Detection in a Single View Setup
by Gabriel Van Zandycke et al

07-21-2020

Movement Assessment from Skeleton Videos: A Review
by Tal Hakim

07-22-2020

Wasserstein Routed Capsule Networks
by Alexander Fuchs et al

07-23-2020

CurveLane-NAS: Unifying Lane-Sensitive Architecture Search and Adaptive Point Blending
by Hang Xu et al

07-21-2020

Towards Visual Distortion in Black-Box Attacks
by Nannan Li et al

07-21-2020

AinnoSeg: Panoramic Segmentation with High Performance
by Jiahong Wu et al

07-21-2020

SLNSpeech: solving extended speech separation problem by the help of sign language
by Jiasong Wu et al

07-23-2020

Polylidar3D -- Fast Polygon Extraction from 3D Data
by Jeremy Castagno et al

07-21-2020

IITK at SemEval-2020 Task 8: Unimodal and Bimodal Sentiment Analysis of Internet Memes
by Vishal Keswani et al

07-22-2020

DEAL: Deep Evidential Active Learning for Image Classification
by Patrick Hemmer et al

07-21-2020

Self-supervised Feature Learning via Exploiting Multi-modal Data for Retinal Disease Diagnosis
by Xiaomeng Li et al

07-24-2020

What and Where: Learn to Plug Adapters via NAS for Multi-Domain Learning
by Hanbin Zhao et al

07-22-2020

Improving Monocular Depth Estimation by Leveraging Structural Awareness and Complementary Datasets
by Tian Chen et al

07-21-2020

Shape-aware Semi-supervised 3D Semantic Segmentation for Medical Images
by Shuailin Li et al

07-21-2020

Creating a Large-scale Synthetic Dataset for Human Activity Recognition
by Ollie Matthews et al

07-24-2020

Deforming the Loss Surface
by Liangming Chen et al

07-24-2020

A Lightweight Neural Network for Monocular View Generation with Occlusion Handling
by Simon Evain et al

07-24-2020

CelebA-Spoof: Large-Scale Face Anti-Spoofing Dataset with Rich Annotations
by Yuanhan Zhang et al

07-22-2020

Deep Variational Instance Segmentation
by Jialin Yuan et al

07-21-2020

Complementing Representation Deficiency in Few-shot Image Classification: A Meta-Learning Approach
by Xian Zhong et al

07-21-2020

An Image Analogies Approach for Multi-Scale Contour Detection
by Slimane Larabi et al

07-21-2020

Feature-metric Loss for Self-supervised Learning of Depth and Egomotion
by Chang Shu et al

07-21-2020

A Computation-Efficient CNN System for High-Quality Brain Tumor Segmentation
by Yanming Sun et al

07-22-2020

Deep-VFX: Deep Action Recognition Driven VFX for Short Video
by Ao Luo et al

07-22-2020

CNN+RNN Depth and Skeleton based Dynamic Hand Gesture Recognition
by Kenneth Lai et al

07-22-2020

Dog Identification using Soft Biometrics and Neural Networks
by Kenneth Lai et al

07-21-2020

Learning to Compose Hypercolumns for Visual Correspondence
by Juhong Min et al

07-23-2020

Are Visual Explanations Useful? A Case Study in Model-in-the-Loop Prediction
by Eric Chu et al

07-21-2020

Exploiting Temporal Coherence for Self-Supervised One-shot Video Re-identification
by Dripta S. Raychaudhuri et al

07-21-2020

Deep Semi-supervised Knowledge Distillation for Overlapping Cervical Cell Instance Segmentation
by Yanning Zhou et al

07-22-2020

Learnable Descent Algorithm for Nonsmooth Nonconvex Image Reconstruction
by Yunmei Chen et al

07-22-2020

Risk Assessment in the Face-based Watchlist Screening in e-Border
by Kenneth Lai et al

07-22-2020

Video-ception Network: Towards Multi-Scale Efficient Asymmetric Spatial-Temporal Interactions
by Yuan Tian et al

07-22-2020

Attend and Segment: Attention Guided Active Semantic Segmentation
by Soroush Seifi et al

07-21-2020

Instance-aware Self-supervised Learning for Nuclei Segmentation
by Xinpeng Xie et al

07-24-2020

Self-Supervised Monocular 3D Face Reconstruction by Occlusion-Aware Multi-view Geometry Consistency
by Jiaxiang Shang et al

07-22-2020

Real-Time Instrument Segmentation in Robotic Surgery using Auxiliary Supervised Deep Adversarial Learning
by Mobarakol Islam et al

07-22-2020

FedOCR: Communication-Efficient Federated Learning for Scene Text Recognition
by Wenqing Zhang et al

07-21-2020

Graph-PCNN: Two Stage Human Pose Estimation with Graph Pose Refinement
by Jian Wang et al

07-21-2020

Uncertainty-Aware Weakly Supervised Action Detection from Untrimmed Videos
by Anurag Arnab et al

07-22-2020

A weakly supervised registration-based framework for prostate segmentation via the combination of statistical shape model and CNN
by Chunxia Qin et al

07-22-2020

Adma: A Flexible Loss Function for Neural Networks
by Aditya Shrivastava

07-22-2020

Combining Implicit Function Learning and Parametric Models for 3D Human Reconstruction
by Bharat Lal Bhatnagar et al

07-24-2020

An LSTM Approach to Temporal 3D Object Detection in LiDAR Point Clouds
by Rui Huang et al

07-21-2020

Novel View Synthesis on Unpaired Data by Conditional Deformable Variational Auto-Encoder
by Mingyu Yin et al

07-21-2020

Video Super-resolution with Temporal Group Attention
by Takashi Isobe et al

07-21-2020

Learning Person Re-identification Models from Videos with Weak Supervision
by Xueping Wang et al

07-21-2020

MI^2GAN: Generative Adversarial Network for Medical Image Domain Adaptation using Mutual Information Constraint
by Xinpeng Xie et al

07-24-2020

Multi-view adaptive graph convolutions for graph classification
by Nikolas Adaloglou et al

07-21-2020

Directional Temporal Modeling for Action Recognition
by Xinyu Li et al

07-21-2020

Multi-modal Transformer for Video Retrieval
by Valentin Gabeur et al

07-21-2020

BorderDet: Border Feature for Dense Object Detection
by Han Qiu et al

07-24-2020

Reparameterizing Convolutions for Incremental Multi-Task Learning without Task Interference
by Menelaos Kanakis et al

07-21-2020

Soft Expert Reward Learning for Vision-and-Language Navigation
by Hu Wang et al

07-21-2020

Procrustean Regression Networks: Learning 3D Structure of Non-Rigid Objects from 2D Annotations
by Sungheon Park et al

07-21-2020

Fine-Grained Image Captioning with Global-Local Discriminative Objective
by Jie Wu et al

07-22-2020

Leveraging Undiagnosed Data for Glaucoma Classification with Teacher-Student Learning
by Junde Wu et al

07-21-2020

Camera On-boarding for Person Re-identification using Hypothesis Transfer Learning
by Sk Miraj Ahmed et al

07-22-2020

Greenhouse Segmentation on High-Resolution Optical Satellite Imagery using Deep Learning Techniques
by Orkhan Baghirli et al

07-22-2020

Fragments-Expert: A Graphical User Interface MATLAB Toolbox for Classification of File Fragments
by Mehdi Teimouri et al

07-21-2020

Lymphocyte counting -- Error Analysis of Regression versus Bounding Box Detection Approaches
by Lin Geng Foo et al

07-22-2020

Watchlist Risk Assessment using Multiparametric Cost and Relative Entropy
by K. Lai et al

07-22-2020

Multi-Spectral Facial Biometrics in Access Control
by K. Lai et al

07-21-2020

Video Representation Learning by Recognizing Temporal Transformations
by Simon Jenni et al

07-21-2020

Recurrent Exposure Generation for Low-Light Face Detection
by Jinxiu Liang et al

07-21-2020

Multi-person 3D Pose Estimation in Crowded Scenes Based on Multi-View Geometry
by He Chen et al

07-21-2020

Optimization of data-driven filterbank for automatic speaker verification
by Susanta Sarangi et al

07-21-2020

Joint Visual and Temporal Consistency for Unsupervised Domain Adaptive Person Re-Identification
by Jianing Li et al

07-21-2020

Multi-label Thoracic Disease Image Classification with Cross-Attention Networks
by Congbo Ma et al

07-21-2020

Balance Scene Learning Mechanism for Offshore and Inshore Ship Detection in SAR Images
by Tianwen Zhang et al

07-24-2020

Hallucinating Saliency Maps for Fine-Grained Image Classification for Limited Data Domains
by Carola Figueroa-Flores et al

07-24-2020

Visual Compositional Learning for Human-Object Interaction Detection
by Zhi Hou et al

07-22-2020

DeepCLR: Correspondence-Less Architecture for Deep End-to-End Point Cloud Registration
by Markus Horn et al

07-21-2020

FLOT: Scene Flow on Point Clouds Guided by Optimal Transport
by Gilles Puy et al

07-22-2020

Endo-Sim2Real: Consistency learning-based domain adaptation for instrument segmentation
by Manish Sahu et al

07-21-2020

One Click Lesion RECIST Measurement and Segmentation on CT Scans
by Youbao Tang et al

07-24-2020

Approximately Optimal Binning for the Piecewise Constant Approximation of the Normalized Unexplained Variance (nUV) Dissimilarity Measure
by Attila Fazekas et al

07-24-2020

KPRNet: Improving projection-based LiDAR semantic segmentation
by Deyvid Kochanov et al

07-22-2020

Depthwise Spatio-Temporal STFT Convolutional Neural Networks for Human Action Recognition
by Sudhakar Kumawat et al

07-22-2020

End-to-End Trainable Deep Active Contour Models for Automated Image Segmentation: Delineating Buildings in Aerial Imagery
by Ali Hatamizadeh et al

07-24-2020

Fully Convolutional Networks for Continuous Sign Language Recognition
by Ka Leong Cheng et al

07-21-2020

Fully Automated Segmentation of the Left Ventricle in Magnetic Resonance Images
by ZiHao Wang et al

07-23-2020

COVID TV-UNet: Segmenting COVID-19 Chest CT Images Using Connectivity Imposed U-Net
by Narges Saeedizadeh et al

07-21-2020

Relative Pose Estimation for Multi-Camera Systems from Affine Correspondences
by Banglei Guan et al

07-24-2020

Micro-expression spotting: A new benchmark
by Thuong-Khanh Tran et al

07-21-2020

A Deep Ordinal Distortion Estimation Approach for Distortion Rectification
by Kang Liao et al

07-24-2020

MiCo: Mixup Co-Training for Semi-Supervised Domain Adaptation
by Luyu Yang et al

07-21-2020

Learning Object Relation Graph and Tentative Policy for Visual Navigation
by Heming Du et al

07-23-2020

ZSCRGAN: A GAN-based Expectation Maximization Model for Zero-Shot Retrieval of Images from Textual Descriptions
by Anurag Roy et al

07-21-2020

Dense Hybrid Recurrent Multi-view Stereo Net with Dynamic Consistency Checking
by Jianfeng Yan et al

07-22-2020

Deep Models and Shortwave Infrared Information to Detect Face Presentation Attacks
by Guillaume Heusch et al

07-22-2020

Feature based Sequential Classifier with Attention Mechanism
by Sudhir Sornapudi et al

07-24-2020

On the Effectiveness of Image Rotation for Open Set Domain Adaptation
by Silvia Bucci et al

07-24-2020

Self-Supervised Learning Across Domains
by Silvia Bucci et al

07-21-2020

Representative-Discriminative Learning for Open-set Land Cover Classification of Satellite Imagery
by Razieh Kaviani Baghbaderani et al

07-21-2020

Enhancement of damaged-image prediction through Cahn-Hilliard Image Inpainting
by José A. Carrillo et al

07-24-2020

Learning Crisp Edge Detector Using Logical Refinement Network
by Luyan Liu et al

07-24-2020

Study of Different Deep Learning Approach with Explainable AI for Screening Patients with COVID-19 Symptoms: Using CT Scan and Chest X-ray Image Dataset
by Md Manjurul Ahsan et al

07-24-2020

HEU Emotion: A Large-scale Database for Multi-modal Emotion Recognition in the Wild
by Jing Chen et al

07-24-2020

Map-Repair: Deep Cadastre Maps Alignment and Temporal Inconsistencies Fix in Satellite Images
by Stefano Zorzi et al

07-24-2020

Real-World Multi-Domain Data Applications for Generalizations to Clinical Settings
by Nooshin Mojab et al

07-24-2020

Machine-learned Regularization and Polygonization of Building Segmentation Masks
by Stefano Zorzi et al

07-24-2020

Stain Style Transfer of Histopathology Images Via Structure-Preserved Generative Learning
by Hanwen Liang et al

07-22-2020

Learning Disentangled Feature Representation for Hybrid-distorted Image Restoration
by Xin Li et al

07-24-2020

Commonality-Parsing Network across Shape and Appearance for Partially Supervised Instance Segmentation
by Qi Fan et al

07-24-2020

Style Transfer for Co-Speech Gesture Animation: A Multi-Speaker Conditional-Mixture Approach
by Chaitanya Ahuja et al

07-22-2020

Human-Centered Unsupervised Segmentation Fusion
by Gregor Koporec et al

07-22-2020

Learning Directional Feature Maps for Cardiac MRI Segmentation
by Feng Cheng et al

07-23-2020

Towards Recognizing Unseen Categories in Unseen Domains
by Massimiliano Mancini et al

07-24-2020

Performance analysis of weighted low rank model with sparse image histograms for face recognition under low-level illumination and occlusion
by K. V. Sridhar et al

07-23-2020

Parkinson's Disease Detection with Ensemble Architectures based on ILSVRC Models
by Tahjid Ashfaque Mostafa et al

07-23-2020

SeismoGlow -- Data augmentation for the class imbalance problem
by Ruy Luiz Milidiú et al

07-23-2020

Locality-Aware Rotated Ship Detection in High-Resolution Remote Sensing Imagery Based on Multi-Scale Convolutional Network
by Lingyi Liu et al

07-22-2020

Learning One Class Representations for Face Presentation Attack Detection using Multi-channel Convolutional Neural Networks
by Anjith George et al

07-23-2020

Frequency Domain-based Perceptual Loss for Super Resolution
by Shane D. Sims

07-21-2020

A Hybrid Neuromorphic Object Tracking and Classification Framework for Real-time Systems
by Andres Ussa et al

07-23-2020

Learning Noise-Aware Encoder-Decoder from Noisy Labels by Alternating Back-Propagation for Saliency Detection
by Jing Zhang et al

07-24-2020

A Comprehensive Study on Sign Language Recognition Methods
by Nikolas Adaloglou et al

 
Craig Smith