2020.8.17 Vision papers

 

08-12-2020

Compression of Deep Learning Models for Text: A Survey
by Manish Gupta et al

08-13-2020

Full-Body Awareness from Partial Observations
by Chris Rockwell et al

08-13-2020

Black Magic in Deep Learning: How Human Skill Impacts Network Training
by Kanav Anand et al

08-11-2020

BREEDS: Benchmarks for Subpopulation Shift
by Shibani Santurkar et al

08-13-2020

Lift, Splat, Shoot: Encoding Images From Arbitrary Camera Rigs by Implicitly Unprojecting to 3D
by Jonah Philion et al

08-12-2020

What Should Not Be Contrastive in Contrastive Learning
by Tete Xiao et al

08-11-2020

Audio- and Gaze-driven Facial Animation of Codec Avatars
by Alexander Richard et al

08-11-2020

Visual Imitation Made Easy
by Sarah Young et al

08-11-2020

Learning to Caricature via Semantic Shape Transform
by Wenqing Chu et al

08-13-2020

Motion Similarity Modeling -- A State of the Art Report
by Anna Sebernegg et al

08-13-2020

3D Bird Reconstruction: a Dataset, Model, and Shape Recovery from a Single View
by Marc Badger et al

08-13-2020

Powers of layers for image-to-image translation
by Hugo Touvron et al

08-12-2020

Generating Person-Scene Interactions in 3D Scenes
by Siwei Zhang et al

08-13-2020

Perceive, Predict, and Plan: Safe Motion Planning Through Interpretable Semantic Representations
by Abbas Sadat et al

08-12-2020

DSM-Net: Disentangled Structured Mesh Net for Controllable Generation of Fine Geometry
by Jie Yang et al

08-12-2020

Procedural Urban Forestry
by Till Niese et al

08-13-2020

DF-GAN: Deep Fusion Generative Adversarial Networks for Text-to-Image Synthesis
by Ming Tao et al

08-13-2020

AdaIN-Switchable CycleGAN for Efficient Unsupervised Low-Dose CT Denoising
by Jawook Gu et al

08-11-2020

DTVNet: Dynamic Time-lapse Video Generation via Single Still Image
by Jiangning Zhang et al

08-13-2020

Towards Visually Explaining Similarity Models
by Meng Zheng et al

08-13-2020

Deep Learning to Quantify Pulmonary Edema in Chest Radiographs
by Steven Horng et al

08-13-2020

Robust Image Matching By Dynamic Feature Selection
by Hao Huang et al

08-13-2020

Network Architecture Search for Domain Adaptation
by Yichen Li et al

08-13-2020

Multi-Mask Self-Supervised Learning for Physics-Guided Neural Networks in Highly Accelerated MRI
by Burhaneddin Yaman et al

08-12-2020

Mitigating Dataset Imbalance via Joint Generation and Classification
by Aadarsh Sahoo et al

08-13-2020

ExplAIn: Explanatory Artificial Intelligence for Diabetic Retinopathy Diagnosis
by Gwenolé Quellec et al

08-11-2020

SAFRON: Stitching Across the Frontier for Generating Colorectal Cancer Histology Images
by Srijay Deshpande et al

08-11-2020

Text as Neural Operator: Image Manipulation by Text Instruction
by Tianhao Zhang et al

08-13-2020

Unsupervised Image Restoration Using Partially Linear Denoisers
by Rihuan Ke et al

08-11-2020

SIDOD: A Synthetic Image Dataset for 3D Object Pose Recognition with Distractors
by Mona Jalal et al

08-12-2020

Visual Localization for Autonomous Driving: Mapping the Accurate Location in the City Maze
by Dongfang Liu et al

08-13-2020

Look, Listen, and Attend: Co-Attention Network for Self-Supervised Audio-Visual Representation Learning
by Ying Cheng et al

08-14-2020

Self-Sampling for Neural Point Cloud Consolidation
by Gal Metzer et al

08-12-2020

Feature Binding with Category-Dependant MixUp for Semantic Segmentation and Adversarial Robustness
by Md Amirul Islam et al

08-11-2020

Image segmentation via Cellular Automata
by Mark Sandler et al

08-13-2020

Predicting Visual Overlap of Images Through Interpretable Non-Metric Box Embeddings
by Anita Rau et al

08-12-2020

Object Detection for Graphical User Interface: Old Fashioned or Deep Learning or a Combination?
by Jieshan Chen et al

08-12-2020

Forest R-CNN: Large-Vocabulary Long-Tailed Object Detection and Instance Segmentation
by Jialian Wu et al

08-12-2020

Free View Synthesis
by Gernot Riegler et al

08-12-2020

Towards Modality Transferable Visual Information Representation with Optimal Model Compression
by Rongqun Lin et al

08-13-2020

End-to-end Contextual Perception and Prediction with Interaction Transformer
by Lingyun Luke Li et al

08-12-2020

ISIA Food-500: A Dataset for Large-Scale Food Recognition via Stacked Global-Local Attention Network
by Weiqing Min et al

08-13-2020

SkeletonNet: A Topology-Preserving Solution for Learning Mesh Reconstruction of Object Surfaces from RGB Images
by Jiapeng Tang et al

08-11-2020

Rethinking Pseudo-LiDAR Representation
by Xinzhu Ma et al

08-13-2020

What leads to generalization of object proposals?
by Rui Wang et al

08-13-2020

Modeling Caricature Expressions by 3D Blendshape and Dynamic Texture
by Keyu Chen et al

08-11-2020

GeLaTO: Generative Latent Textured Objects
by Ricardo Martin-Brualla et al

08-14-2020

Unsupervised vs. transfer learning for multimodal one-shot matching of speech and images
by Leanne Nortje et al

08-12-2020

Few shot clustering for indoor occupancy detection with extremely low-quality images from battery free cameras
by Homagni Saha et al

08-13-2020

Shift Equivariance in Object Detection
by Marco Manfredi et al

08-13-2020

CycleMorph: Cycle Consistent Unsupervised Deformable Image Registration
by Boah Kim et al

08-11-2020

Learning to See Through Obstructions with Layered Decomposition
by Yu-Lun Liu et al

08-13-2020

An Ensemble of Knowledge Sharing Models for Dynamic Hand Gesture Recognition
by Kenneth Lai et al

08-12-2020

Attention-based Fully Gated Conventional Recurrent Neural Network for Russian Handwritten Text
by Abdelrahman Abdallah et al

08-12-2020

Co-training for On-board Deep Object Detection
by Gabriel Villalonga et al

08-12-2020

Open Set Recognition with Conditional Probabilistic Generative Models
by Xin Sun et al

08-13-2020

Localizing the Common Action Among a Few Videos
by Pengwan Yang et al

08-12-2020

An Overview of Deep Learning Architectures in Few-Shots Learning Domain
by Shruti Jadon

08-13-2020

LGNN: a Context-aware Line Segment Detector
by Quan Meng et al

08-13-2020

Self-supervised Video Representation Learning by Pace Prediction
by Jiangliu Wang et al

08-12-2020

Self-Path: Self-supervision for Classification of Pathology Images with Limited Annotations
by Navid Alemi Koohbanani et al

08-13-2020

Weight Equalizing Shift Scaler-Coupled Post-training Quantization
by Jihun Oh et al

08-12-2020

Sparse Coding Driven Deep Decision Tree Ensembles for Nuclear Segmentation in Digital Pathology Images
by Jie Song et al

08-13-2020

Learning Temporally Invariant and Localizable Features via Data Augmentation for Video Recognition
by Taeoh Kim et al

08-13-2020

Contextual Diversity for Active Learning
by Sharat Agarwal et al

08-13-2020

Recurrent Deconvolutional Generative Adversarial Networks with Application to Text Guided Video Generation
by Hongyuan Yu et al

08-12-2020

More Diverse Means Better: Multimodal Deep Learning Meets Remote Sensing Imagery Classification
by Danfeng Hong et al

08-12-2020

Multi-level Stress Assessment Using Multi-domain Fusion of ECG Signal
by Zeeshan Ahmad et al

08-12-2020

Continual Class Incremental Learning for CT Thoracic Segmentation
by Abdelrahman Elskhawy et al

08-11-2020

Retrieval Guided Unsupervised Multi-domain Image-to-Image Translation
by Raul Gomez et al

08-13-2020

Testing the Safety of Self-driving Vehicles by Simulating Perception and Prediction
by Kelvin Wong et al

08-12-2020

We Have So Much In Common: Modeling Semantic Relational Set Abstractions in Videos
by Alex Andonian et al

08-11-2020

Dynamic Object Removal and Spatio-Temporal RGB-D Inpainting via Geometry-Aware Adversarial Learning
by Borna Bešić et al

08-13-2020

On failures of RGB cameras and their effects in autonomous driving applications
by Francesco Secci et al

08-12-2020

Local Temperature Scaling for Probability Calibration
by Zhipeng Ding et al

08-13-2020

Pose Estimation for Vehicle-mounted Cameras via Horizontal and Vertical Planes
by Istan Gergo Gal et al

08-13-2020

Revisiting Temporal Modeling for Video Super-resolution
by Takashi Isobe et al

08-12-2020

Towards Geometry Guided Neural Relighting with Flash Photography
by Di Qiu et al

08-13-2020

Reliability of Decision Support in Cross-spectral Biometric-enabled Systems
by Kenneth Lai et al

08-13-2020

DFEW: A Large-Scale Database for Recognizing Dynamic Facial Expressions in the Wild
by Xingxun Jiang et al

08-14-2020

Survey of XAI in digital pathology
by Milda Pocevičiūtė et al

08-13-2020

DSDNet: Deep Structured self-Driving Network
by Wenyuan Zeng et al

08-11-2020

Online Graph Completion: Multivariate Signal Recovery in Computer Vision
by Won Hwa Kim et al

08-13-2020

Adversarial Knowledge Transfer from Unlabeled Data
by Akash Gupta et al

08-12-2020

Facial Expression Recognition Under Partial Occlusion from Virtual Reality Headsets based on Transfer Learning
by Bita Houshmand et al

08-13-2020

Weakly Supervised Generative Network for Multiple 3D Human Pose Hypotheses
by Chen Li et al

08-13-2020

Hybrid Dynamic-static Context-aware Attention Network for Action Assessment in Long Videos
by Ling-An Zeng et al

08-13-2020

Alleviating Human-level Shift : A Robust Domain Adaptation Method for Multi-person Pose Estimation
by Xixia Xu et al

08-12-2020

FATNN: Fast and Accurate Ternary Neural Networks
by Peng Chen et al

08-12-2020

Facial Expression Retargeting from Human to Avatar Made Easy
by Juyong Zhang et al

08-12-2020

Pixel-level Corrosion Detection on Metal Constructions by Fusion of Deep Learning Semantic and Contour Segmentation
by Iason Katsamenis et al

08-13-2020

Multi-Modality Pathology Segmentation Framework: Application to Cardiac Magnetic Resonance Images
by Zhen Zhang et al

08-12-2020

Guided Collaborative Training for Pixel-wise Semi-Supervised Learning
by Zhanghan Ke et al

08-11-2020

Select Good Regions for Deblurring based on Convolutional Neural Networks
by Hang Yang et al

08-14-2020

Abstracting Deep Neural Networks into Concept Graphs for Concept Level Interpretability
by Avinash Kori et al

08-11-2020

Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene
by Xinke Li et al

08-12-2020

Large-Scale Analysis of Iliopsoas Muscle Volumes in the UK Biobank
by Julie Fitzpatrick et al

08-13-2020

Can weight sharing outperform random architecture search? An investigation with TuNAS
by Gabriel Bender et al

08-12-2020

DXSLAM: A Robust and Efficient Visual SLAM System with Deep Features
by Dongjiang Li et al

08-12-2020

Look here! A parametric learning based approach to redirect visual attention
by Youssef Alami Mejjati et al

08-14-2020

Machine learning for COVID-19 detection and prognostication using chest radiographs and CT scans: a systematic methodological review
by Michael Roberts et al

08-14-2020

Optimized Deep Encoder-Decoder Methods for Crack Segmentation
by Jacob König et al

08-12-2020

Learning to Learn from Mistakes: Robust Optimization for Adversarial Noise
by Alex Serban et al

08-14-2020

RODEO: Replay for Online Object Detection
by Manoj Acharya et al

08-14-2020

SPINN: Synergistic Progressive Inference of Neural Networks over Device and Cloud
by Stefanos Laskaridis et al

08-13-2020

Apparel-invariant Feature Learning for Apparel-changed Person Re-identification
by Zhengxu Yu et al

08-11-2020

Real-Time Sign Language Detection using Human Pose Estimation
by Amit Moryossef et al

08-11-2020

Adversarial Generative Grammars for Human Activity Prediction
by AJ Piergiovanni et al

08-12-2020

An Inter- and Intra-Band Loss for Pansharpening Convolutional Neural Networks
by Jiajun Cai et al

08-12-2020

HOSE-Net: Higher Order Structure Embedded Network for Scene Graph Generation
by Meng Wei et al

08-12-2020

Balanced Depth Completion between Dense Depth Inference and Sparse Range Measurements via KISS-GP
by Sungho Yoon et al

08-12-2020

Representative Graph Neural Network
by Changqian Yu et al

08-12-2020

Factor Graph based 3D Multi-Object Tracking in Point Clouds
by Johannes Pöschmann et al

08-13-2020

Integrating uncertainty in deep neural networks for MRI based stroke analysis
by Lisa Herzog et al

08-12-2020

Inter-Image Communication for Weakly Supervised Localization
by Xiaolin Zhang et al

08-14-2020

Structure-Aware Network for Lane Marker Extraction with Dynamic Vision Sensor
by Wensheng Cheng et al

08-14-2020

Feedback Attention for Cell Image Segmentation
by Hiroki Tsuda et al

08-11-2020

TextRay: Contour-based Geometric Modeling for Arbitrary-shaped Scene Text Detection
by Fangfang Wang et al

08-14-2020

Self-adapting confidence estimation for stereo
by Matteo Poggi et al

08-12-2020

Improving the Performance of Fine-Grain Image Classifiers via Generative Data Augmentation
by Shashank Manjunath et al

08-12-2020

A Longitudinal Method for Simultaneous Whole-Brain and Lesion Segmentation in Multiple Sclerosis
by Stefano Cerri et al

08-11-2020

PiNet: Attention Pooling for Graph Classification
by Peter Meltzer et al

08-13-2020

Geometric Deep Learning for Post-Menstrual Age Prediction based on the Neonatal White Matter Cortical Surface
by Vitalis Vosylius et al

08-11-2020

Extension of JPEG XS for Two-Layer Lossless Coding
by Hiroyuki Kobayashi et al

08-14-2020

Rb-PaStaNet: A Few-Shot Human-Object Interaction Detection Based on Rules and Part States
by Shenyu Zhang et al

08-14-2020

Homotopic Gradients of Generative Density Priors for MR Image Reconstruction
by Cong Quan et al

08-11-2020

Learning to Cluster under Domain Shift
by Willi Menapace et al

08-12-2020

Fine-grained Visual Textual Alignment for Cross-Modal Retrieval using Transformer Encoders
by Nicola Messina et al

08-11-2020

3D FLAT: Feasible Learned Acquisition Trajectories for Accelerated MRI
by Jonathan Alush-Aben et al

08-12-2020

Stable Low-rank Tensor Decomposition for Compression of Convolutional Neural Network
by Anh-Huy Phan et al

08-12-2020

Renal Cell Carcinoma Detection and Subtyping with Minimal Point-Based Annotation in Whole-Slide Images
by Zeyu Gao et al

08-12-2020

TF-NAS: Rethinking Three Search Freedoms of Latency-Constrained Differentiable Neural Architecture Search
by Yibo Hu et al

08-14-2020

WAN: Watermarking Attack Network
by Seung-Hun Nam et al

08-13-2020

MIXCAPS: A Capsule Network-based Mixture of Experts for Lung Nodule Malignancy Prediction
by Parnian Afshar et al

08-14-2020

Not 3D Re-ID: a Simple Single Stream 2D Convolution for Robust Video Re-identification
by Toby P. Breckon et al

08-11-2020

PX-NET: Simple, Efficient Pixel-Wise Training of Photometric Stereo Networks
by Fotios Logothetis et al

08-12-2020

PAM:Point-wise Attention Module for 6D Object Pose Estimation
by Myoungha Song et al

08-11-2020

Reinforced Wasserstein Training for Severity-Aware Semantic Segmentation in Autonomous Driving
by Xiaofeng Liu et al

08-11-2020

The Umbrella software suite for automated asteroid detection
by Malin Stanescu et al

08-11-2020

Surgical Mask Detection with Convolutional Neural Networks and Data Augmentations on Spectrograms
by Steffen Illium et al

08-14-2020

An Improved Deep Convolutional Neural Network-Based Autonomous Road Inspection Scheme Using Unmanned Aerial Vehicles
by Syed Ali Hassan et al

08-12-2020

Anomaly localization by modeling perceptual features
by David Dehaene et al

08-14-2020

Parameters Sharing Exploration and Hetero-Center based Triplet Loss for Visible-Thermal Person Re-Identification
by Haijun Liu et al

08-14-2020

BriNet: Towards Bridging the Intra-class and Inter-class Gaps in One-Shot Segmentation
by Xianghui Yang et al

08-11-2020

Learning Stereo Matchability in Disparity Regression Networks
by Jingyang Zhang et al

08-11-2020

R-MNet: A Perceptual Adversarial Network for Image Inpainting
by Jireh Jam et al

08-13-2020

Landmark detection in Cardiac Magnetic Resonance Imaging Using A Convolutional Neural Network
by Hui Xue et al

08-12-2020

ASAP-Net: Attention and Structure Aware Point Cloud Sequence Segmentation
by Hanwen Cao et al

08-14-2020

ConsNet: Learning Consistency Graph for Zero-Shot Human-Object Interaction Detection
by Ye Liu et al

08-14-2020

GeoLayout: Geometry Driven Room Layout Estimation Based on Depth Maps of Planes
by Weidong Zhang et al

08-13-2020

BioMetricNet: deep unconstrained face verification through learning of metrics regularized onto Gaussian distributions
by Arslan Ali et al

08-11-2020

VI-Net: View-Invariant Quality of Human Movement Assessment
by Faegheh Sardari et al

08-12-2020

A Zero-Shot Sketch-based Inter-Modal Object Retrieval Scheme for Remote Sensing Images
by Ushasi Chaudhuri et al

08-13-2020

Effect of Architectures and Training Methods on the Performance of Learned Video Frame Prediction
by M. Akin Yilmaz et al

08-14-2020

Deep Atrous Guided Filter for Image Restoration in Under Display Cameras
by Varun Sundar et al

08-11-2020

ClimAlign: Unsupervised statistical downscaling of climate variables via normalizing flows
by Brian Groenke et al

08-12-2020

Identity-Aware Attribute Recognition via Real-Time Distributed Inference in Mobile Edge Clouds
by Zichuan Xu et al

08-11-2020

Deep UAV Localization with Reference View Rendering
by Timo Hinzmann et al

08-11-2020

Hardware-Centric AutoML for Mixed-Precision Quantization
by Kuan Wang et al

08-14-2020

A Learning-based Method for Online Adjustment of C-arm Cone-Beam CT Source Trajectories for Artifact Avoidance
by Mareike Thies et al

08-11-2020

Fully-Automated Packaging Structure Recognition in Logistics Environments
by Laura Dörr et al

08-11-2020

Self-supervised Light Field View Synthesis Using Cycle Consistency
by Yang Chen et al

08-11-2020

HydraMix-Net: A Deep Multi-task Semi-supervised Learning Approach for Cell Detection and Classification
by R. M. Saad Bashir et al

08-11-2020

Transfer Learning for Protein Structure Classification and Function Inference at Low Resolution
by Alexander Hudson et al

08-12-2020

Image-based Portrait Engraving
by Paul L. Rosin et al

08-12-2020

DAWN: Vehicle Detection in Adverse Weather Nature Dataset
by Mourad A. Kenk et al

08-11-2020

Multi-modal segmentation of 3D brain scans using neural networks
by Jonathan Zopes et al

08-11-2020

Sharp Multiple Instance Learning for DeepFake Video Detection
by Xiaodan Li et al

08-12-2020

Automatic assembly of aero engine low pressure turbine shaft based on 3D vision measurement
by Jiaxiang Wang et al

08-13-2020

Interpretation of Brain Morphology in Association to Alzheimers Disease Dementia Classification Using Graph Convolutional Networks on Triangulated Meshes
by Emanuel A. Azcona et al

08-12-2020

Towards Unsupervised Crowd Counting via Regression-Detection Bi-knowledge Transfer
by Yuting Liu et al

08-11-2020

Detecting Urban Dynamics Using Deep Siamese Convolutional Neural Networks
by Ephrem Admasu Yekun et al

08-11-2020

Robust Long-Term Object Tracking via Improved Discriminative Model Prediction
by Seokeon Choi et al

08-11-2020

Left Ventricular Wall Motion Estimation by Active Polynomials for Acute Myocardial Infarction Detection
by Serkan Kiranyaz et al

08-12-2020

Defending Adversarial Examples via DNN Bottleneck Reinforcement
by Wenqing Liu et al

08-12-2020

RAF-AU Database: In-the-Wild Facial Expressions with Subjective Emotion Judgement and Objective AU Annotations
by Wenjing Yan et al

08-14-2020

PointMixup: Augmentation for Point Clouds
by Yunlu Chen et al

08-11-2020

Exposing Deep-faked Videos by Anomalous Co-motion Pattern Detection
by Gengxing Wang et al

08-11-2020

BiHand: Recovering Hand Mesh with Multi-stage Bisected Hourglass Networks
by Lixin Yang et al

08-11-2020

Thick Cloud Removal of Remote Sensing Images Using Temporal Smoothness and Sparsity-Regularized Tensor Optimization
by Chenxi Duan et al

08-11-2020

Fast and Accurate Optical Flow based Depth Map Estimation from Light Fields
by Yang Chen et al

08-13-2020

A Multimodal Late Fusion Model for E-Commerce Product Classification
by Ye Bi et al

08-11-2020

Learned Proximal Networks for Quantitative Susceptibility Mapping
by Kuo-Wei Lai et al

08-11-2020

TCL: an ANN-to-SNN Conversion with Trainable Clipping Layers
by Nguyen-Dong Ho et al

08-11-2020

TransNet V2: An effective deep network architecture for fast shot transition detection
by Tomáš Souček et al

08-13-2020

Semantically Adversarial Learnable Filters
by Ali Shahin Shamsabadi et al

08-12-2020

LogoDet-3K: A Large-Scale Image Dataset for Logo Detection
by Jing Wang et al

08-14-2020

Renormalization for Initialization of Rolling Shutter Visual-Inertial Odometry
by Branislav Micusik et al

08-13-2020

Novelty Detection Through Model-Based Characterization of Neural Networks
by Gukyeong Kwon et al

08-13-2020

Deep Domain Adaptation for Ordinal Regression of Pain Intensity Estimation Using Weakly-Labelled Videos
by Gnana Praveen R et al

08-11-2020

Unified Representation Learning for Cross Model Compatibility
by Chien-Yi Wang et al

08-11-2020

Attention-based 3D Object Reconstruction from a Single Image
by Andrey Salvi et al

08-11-2020

KBGN: Knowledge-Bridge Graph Network for Adaptive Vision-Text Reasoning in Visual Dialogue
by Xiaoze Jiang et al

08-11-2020

End-to-End Rate-Distortion Optimization for Bi-Directional Learned Video Compression
by M. Akin Yilmaz et al

08-11-2020

PROFIT: A Novel Training Method for sub-4-bit MobileNet Models
by Eunhyeok Park et al

08-11-2020

Implanting Synthetic Lesions for Improving Liver Lesion Segmentation in CT Exams
by Dario Augusto Borges Oliveira

08-11-2020

A Study of Efficient Light Field Subsampling and Reconstruction Strategies
by Yang Chen et al

08-13-2020

Automated detection and quantification of COVID-19 airspace disease on chest radiographs: A novel approach achieving radiologist-level performance using a CNN trained on digital reconstructed radiographs (DRRs) from CT-based ground-truth
by Eduardo Mortani Barbosa et al

08-11-2020

AtrialJSQnet: A New Framework for Joint Segmentation and Quantification of Left Atrium and Scars Incorporating Spatial and Shape Information
by Lei Li et al

08-11-2020

Estimating Magnitude and Phase of Automotive Radar Signals under Multiple Interference Sources with Fully Convolutional Networks
by Nicolae-Cătălin Ristea et al

 
Craig Smith