2020.12.28 Vision papers

 

12-23-2020

Training data-efficient image transformers & distillation through attention
by Hugo Touvron et al

12-22-2020

YolactEdge: Real-time Instance Segmentation on the Edge (Jetson AGX Xavier: 30 FPS, RTX 2080 Ti: 170 FPS)
by Haotian Liu et al

12-23-2020

Focal Frequency Loss for Generative Models
by Liming Jiang et al

12-22-2020

Time-Travel Rephotography
by Xuan Luo et al

12-24-2020

Soft-IntroVAE: Analyzing and Improving the Introspective Variational Autoencoder
by Tal Daniel et al

12-23-2020

A Survey on Visual Transformer
by Kai Han et al

12-23-2020

Vid2Actor: Free-viewpoint Animatable Person Synthesis from Video in the Wild
by Chung-Yi Weng et al

12-24-2020

Deep Learning-Based Human Pose Estimation: A Survey
by Ce Zheng et al

12-22-2020

AudioViewer: Learning to Visualize Sound
by Yuchi Zhang et al

12-23-2020

Learning by Self-Explanation, with Application to Neural Architecture Search
by Ramtin Hosseini et al

12-24-2020

Global Convergence of Model Function Based Bregman Proximal Minimization Algorithms
by Mahesh Chandra Mukkamala et al

12-24-2020

SubICap: Towards Subword-informed Image Captioning
by Naeha Sharif et al

12-24-2020

Person Re-Identification using Deep Learning Networks: A Systematic Review
by Ankit Yadav et al

12-23-2020

Union-net: A deep neural network model adapted to small data sets
by Qingfang He et al

12-23-2020

MobileSal: Extremely Efficient RGB-D Salient Object Detection
by Yu-Huan Wu et al

12-24-2020

WEmbSim: A Simple yet Effective Metric for Image Captioning
by Naeha Sharif et al

12-23-2020

Learning from Crowds by Modeling Common Confusions
by Zhendong Chu et al

12-24-2020

Detecting Hateful Memes Using a Multimodal Deep Ensemble
by Vlad Sandulescu

12-22-2020

Latent Feature Representation via Unsupervised Learning for Pattern Discovery in Massive Electron Microscopy Image Volumes
by Gary B Huang et al

12-24-2020

Interpolating Points on a Non-Uniform Grid using a Mixture of Gaussians
by Ivan Skorokhodov

12-24-2020

Control of computer pointer using hand gesture recognition in motion pictures
by Yalda Foroutan et al

12-24-2020

Unsupervised deep clustering and reinforcement learning can accurately segment MRI brain tumors with very small training sets
by Joseph Stember et al

12-23-2020

Efficient video annotation with visual interpolation and frame selection guidance
by A. Kuznetsova et al

12-22-2020

Non-Rigid Neural Radiance Fields: Reconstruction and Novel View Synthesis of a Deforming Scene from Monocular Video
by Edgar Tretschk et al

12-23-2020

Semantic Segmentation on Swiss3DCities: A Benchmark Study on Aerial Photogrammetric 3D Pointcloud Dataset
by Gülcan Can et al

12-24-2020

FracTrain: Fractionally Squeezing Bit Savings Both Temporally and Spatially for Efficient DNN Training
by Yonggan Fu et al

12-23-2020

General Domain Adaptation Through Proportional Progressive Pseudo Labeling
by Mohammad J. Hashemi et al

12-24-2020

MRDet: A Multi-Head Network for Accurate Oriented Object Detection in Aerial Images
by Ran Qin et al

12-23-2020

Low-latency Perception in Off-Road Dynamical Low Visibility Environments
by Nelson Alves et al

12-22-2020

RAP-Net: Coarse-to-Fine Multi-Organ Segmentation with Single Random Anatomical Prior
by Ho Hin Lee et al

12-24-2020

Dynamic Facial Expression Recognition under Partial Occlusion with Optical Flow Reconstruction
by Delphine Poux et al

12-24-2020

Global Context Networks
by Yue Cao et al

12-22-2020

Multi-Contrast Computed Tomography Healthy Kidney Atlas
by Ho Hin Lee et al

12-23-2020

Physics-based Shadow Image Decomposition for Shadow Removal
by Hieu Le et al

12-23-2020

Towards Overcoming False Positives in Visual Relationship Detection
by Daisheng Jin et al

12-22-2020

Unadversarial Examples: Designing Objects for Robust Vision
by Hadi Salman et al

12-24-2020

Adversarial Momentum-Contrastive Pre-Training
by Cong Xu et al

12-24-2020

Parallel-beam X-ray CT datasets of apples with internal defects and label balancing for machine learning
by Sophia Bethany Coban et al

12-22-2020

Pit30M: A Benchmark for Global Localization in the Age of Self-Driving Cars
by Julieta Martinez et al

12-24-2020

Improving the Certified Robustness of Neural Networks via Consistency Regularization
by Mengting Xu et al

12-22-2020

FcaNet: Frequency Channel Attention Networks
by Zequn Qin et al

12-24-2020

Seed Phenotyping on Neural Networks using Domain Randomization and Transfer Learning
by Venkat Margapuri et al

12-23-2020

EDN: Salient Object Detection via Extremely-Downsampled Network
by Yu-Huan Wu et al

12-24-2020

Hausdorff Point Convolution with Geometric Priors
by Pengdi Huang et al

12-22-2020

Dual-encoder Bidirectional Generative Adversarial Networks for Anomaly Detection
by Teguh Budianto et al

12-23-2020

Private-Shared Disentangled Multimodal VAE for Learning of Hybrid Latent Representations
by Mihee Lee et al

12-24-2020

Memory-Efficient Hierarchical Neural Architecture Search for Image Restoration
by Haokui Zhang et al

12-22-2020

HDR Denoising and Deblurring by Learning Spatio-temporal Distortion Models
by Uğur Çoğalan et al

12-24-2020

Effective Deployment of CNNs for 3DoF Pose Estimation and Grasping in Industrial Settings
by Daniele De Gregorio et al

12-22-2020

Residual Matrix Product State for Machine Learning
by Ye-Ming Meng et al

12-23-2020

ANR: Articulated Neural Rendering for Virtual Avatars
by Amit Raj et al

12-24-2020

A non-alternating graph hashing algorithm for large scale image search
by Sobhan Hemati et al

12-23-2020

Multiclass Spinal Cord Tumor Segmentation on MRI with Deep Learning
by Andreanne Lemay et al

12-23-2020

An Efficient Recurrent Adversarial Framework for Unsupervised Real-Time Video Enhancement
by Dario Fuoli et al

12-24-2020

Spatio-temporal Multi-task Learning for Cardiac MRI Left Ventricle Quantification
by Sulaiman Vesal et al

12-22-2020

Seeing past words: Testing the cross-modal capabilities of pretrained V&L models
by Letitia Parcalabescu et al

12-23-2020

P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding
by Yunze Liu et al

12-23-2020

Rotation Equivariant Siamese Networks for Tracking
by Deepak K. Gupta et al

12-24-2020

Unveiling Real-Life Effects of Online Photo Sharing
by Van-Khoa Nguyen et al

12-24-2020

UMLE: Unsupervised Multi-discriminator Network for Low Light Enhancement
by Yangyang Qu et al

12-23-2020

SyNet: An Ensemble Network for Object Detection in UAV Images
by Berat Mert Albaba et al

12-23-2020

Convolutional Neural Network for Elderly Wandering Prediction in Indoor Scenarios
by Rafael F. C. Oliveira et al

12-22-2020

GuidedStyle: Attribute Knowledge Guided Style Manipulation for Semantic Face Editing
by Xianxu Hou et al

12-24-2020

Appearance-Invariant 6-DoF Visual Localization using Generative Adversarial Networks
by Yimin Lin et al

12-23-2020

Noisy Labels Can Induce Good Representations
by Jingling Li et al

12-22-2020

IIRC: Incremental Implicitly-Refined Classification
by Mohamed Abdelsalam et al

12-24-2020

LEUGAN:Low-Light Image Enhancement by Unsupervised Generative Attentional Networks
by Yangyang Qu et al

12-22-2020

Video Influencers: Unboxing the Mystique
by Prashant Rajaram et al

12-22-2020

A Feasibility study for Deep learning based automated brain tumor segmentation using Magnetic Resonance Images
by Shanaka Ramesh Gunasekara et al

12-23-2020

Task-Adaptive Negative Class Envision for Few-Shot Open-Set Recognition
by Shiyuan Huang et al

12-22-2020

Deep learning-based virtual refocusing of images using an engineered point-spread function
by Xilin Yang et al

12-22-2020

Skeleton-based Approaches based on Machine Vision: A Survey
by Jie Li et al

12-22-2020

QuickTumorNet: Fast Automatic Multi-Class Segmentation of Brain Tumors
by Benjamin Maas et al

12-22-2020

Efficient and Visualizable Convolutional Neural Networks for COVID-19 Classification Using Chest CT
by Aksh Garg et al

12-22-2020

Comparison of Classification Algorithms Towards Subject-Specific and Subject-Independent BCI
by Parisa Ghane et al

12-23-2020

Diabetic Retinopathy Grading System Based on Transfer Learning
by Eman AbdelMaksoud et al

12-23-2020

White matter hyperintensities volume and cognition: Assessment of a deep learning based lesion detection and quantification algorithm on the Alzheimers Disease Neuroimaging Initiative
by Lavanya Umapathy et al

12-23-2020

Detecting Hate Speech in Memes Using Multimodal Deep Learning Approaches: Prize-winning solution to Hateful Memes Challenge
by Riza Velioglu et al

12-22-2020

Predicting Online Video Advertising Effects with Multimodal Deep Learning
by Jun Ikeda et al

12-23-2020

Small-Group Learning, with Application to Neural Architecture Search
by Xuefeng Du et al

12-23-2020

Deep manifold learning reveals hidden dynamics of proteasome autoregulation
by Zhaolong Wu et al

12-22-2020

Multi-Task Multi-Sensor Fusion for 3D Object Detection
by Ming Liang et al

12-22-2020

Open source software for automatic subregional assessment of knee cartilage degradation using quantitative T2 relaxometry and deep learning
by Kevin A. Thomas et al

12-23-2020

GANDA: A deep generative adversarial network predicts the spatial distribution of nanoparticles in tumor pixelly
by Jiulou Zhang et al

12-22-2020

This is not the Texture you are looking for! Introducing Novel Counterfactual Explanations for Non-Experts using Generative Adversarial Learning
by Silvan Mertes et al

12-22-2020

Hierarchical Recurrent Attention Networks for Structured Online Maps
by Namdar Homayounfar et al

12-23-2020

Direct Estimation of Spinal Cobb Angles by Structured Multi-Output Regression
by Haoliang Sun et al

12-23-2020

Unsupervised Domain Adaptation for Semantic Segmentation by Content Transfer
by Suhyeon Lee et al

12-23-2020

On Calibration of Scene-Text Recognition Models
by Ron Slossberg et al

12-22-2020

FracBNN: Accurate and FPGA-Efficient Binary Neural Networks with Fractional Activations
by Yichi Zhang et al

12-23-2020

Analyzing Representations inside Convolutional Neural Networks
by Uday Singh Saini et al

12-22-2020

Adversarial Multiscale Feature Learning for Overlapping Chromosome Segmentation
by Liye Mei et al

12-23-2020

The Translucent Patch: A Physical and Universal Attack on Object Detectors
by Alon Zolfi et al

12-22-2020

Stochastic Gradient Variance Reduction by Solving a Filtering Problem
by Xingyi Yang

12-22-2020

Fast and Furious: Real Time End-to-End 3D Detection, Tracking and Motion Forecasting with a Single Convolutional Net
by Wenjie Luo et al

12-23-2020

Warping of Radar Data into Camera Image for Cross-Modal Supervision in Automotive Applications
by Christopher Grimm et al

12-22-2020

MG-SAGC: A multiscale graph and its self-adaptive graph convolution network for 3D point clouds
by Bo Wu et al

12-22-2020

CholecSeg8k: A Semantic Segmentation Dataset for Laparoscopic Cholecystectomy Based on Cholec80
by W. -Y. Hong et al

12-22-2020

Deep Unsupervised Image Hashing by Maximizing Bit Entropy
by Yunqiang Li et al

12-22-2020

Image to Bengali Caption Generation Using Deep CNN and Bidirectional Gated Recurrent Unit
by Al Momin Faruk et al

12-22-2020

Towards Boosting the Channel Attention in Real Image Denoising : Sub-band Pyramid Attention
by Huayu Li et al

12-23-2020

Multi-grained Trajectory Graph Convolutional Networks for Habit-unrelated Human Motion Prediction
by Jin Liu et al

12-22-2020

Do We Really Need Scene-specific Pose Encoders?
by Yoli Shavit et al

12-23-2020

Coarse-to-Fine Object Tracking Using Deep Features and Correlation Filters
by Ahmed Zgaren et al

12-22-2020

A Structure-Aware Method for Direct Pose Estimation
by Hunter Blanton et al

12-22-2020

On Frank-Wolfe Optimization for Adversarial Robustness and Interpretability
by Theodoros Tsiligkaridis et al

12-23-2020

Principled network extraction from images
by Diego Baptista et al

12-23-2020

Vehicle Re-identification Based on Dual Distance Center Loss
by Zhijun Hu et al

12-23-2020

Blur More To Deblur Better: Multi-Blur2Deblur For Efficient Video Deblurring
by Dongwon Park et al

12-22-2020

Optical Braille Recognition Using Object Detection CNN
by Ilya G. Ovodov

12-23-2020

StainNet: a fast and robust stain normalization network
by Hongtao Kang et al

12-22-2020

Towards Histopathological Stain Invariance by Unsupervised Domain Augmentation using Generative Adversarial Networks
by Jelica Vasiljević et al

12-22-2020

Prediction of Chronic Kidney Disease Using Deep Neural Network
by Iliyas Ibrahim Iliyas et al

12-23-2020

SWA Object Detection
by Haoyang Zhang et al

12-23-2020

Exploring Instance-Level Uncertainty for Medical Detection
by Jiawei Yang et al

12-22-2020

Objective Evaluation of Deep Uncertainty Predictions for COVID-19 Detection
by Hamzeh Asgharnezhad et al

12-22-2020

Turn Signal Prediction: A Federated Learning Case Study
by Sonal Doomra et al

12-23-2020

Active Sampling for Accelerated MRI with Low-Rank Tensors
by Zichang He et al

12-22-2020

3D Point-to-Keypoint Voting Network for 6D Pose Estimation
by Weitong Hua et al

12-22-2020

A Hybrid VDV Model for Automatic Diagnosis of Pneumothorax using Class-Imbalanced Chest X-rays Dataset
by Tahira Iqbal et al

12-23-2020

ConvMath: A Convolutional Sequence Network for Mathematical Expression Recognition
by Zuoyu Yan et al

12-22-2020

Localization in the Crowd with Topological Constraints
by Shahira Abousamra et al

12-22-2020

DAGMapper: Learning to Map by Discovering Lane Topology
by Namdar Homayounfar et al

12-22-2020

Correspondence Learning for Controllable Person Image Generation
by Shilong Shen

12-24-2020

Objective Class-based Micro-Expression Recognition through Simultaneous Action Unit Detection and Feature Aggregation
by Ling Zhou et al

12-23-2020

Estimation of Drivers Gaze Region from Head Position and Orientation using Probabilistic Confidence Regions
by Sumit Jha et al

12-22-2020

Multiple Instance Segmentation in Brachial Plexus Ultrasound Image Using BPMSegNet
by Yi Ding et al

12-23-2020

Deep Semantic Dictionary Learning for Multi-label Image Classification
by Fengtao Zhou et al

12-23-2020

Multi-Modality Cut and Paste for 3D Object Detection
by Wenwei Zhang et al

12-22-2020

Limitation of Acyclic Oriented Graphs Matching as Cell Tracking Accuracy Measure when Evaluating Mitosis
by Ye Chen et al

12-22-2020

Human Action Recognition from Various Data Modalities: A Review
by Zehua Sun et al

12-23-2020

Prognostic Power of Texture Based Morphological Operations in a Radiomics Study for Lung Cancer
by Paul Desbordes et al

12-22-2020

Learning Joint 2D-3D Representations for Depth Completion
by Yun Chen et al

12-22-2020

Cloud removal in remote sensing images using generative adversarial networks and SAR-to-optical image translation
by Faramarz Naderi Darbaghshahi et al

12-23-2020

ICMSC: Intra- and Cross-modality Semantic Consistency for Unsupervised Domain Adaptation on Hip Joint Bone Segmentation
by Guodong Zeng et al

12-22-2020

Geometric robust descriptor for 3D point cloud
by Seung Hwan Jung et al

12-22-2020

Underwater image filtering: methods, datasets and evaluation
by Chau Yi Li et al

12-22-2020

Generative Interventions for Causal Learning
by Chengzhi Mao et al

12-23-2020

Chest x-ray automated triage: a semiologic approach designed for clinical implementation, exploiting different types of labels through a combination of four Deep Learning architectures
by Candelaria Mosquera et al

12-22-2020

Flexible deep transfer learning by separate feature embeddings and manifold alignment
by Samuel Rivera et al

12-22-2020

Training Convolutional Neural Networks With Hebbian Principal Component Analysis
by Gabriele Lagani et al

12-24-2020

Joint super-resolution and synthesis of 1 mm isotropic MP-RAGE volumes from clinical MRI exams with scans of different orientation, resolution and contrast
by Juan Eugenio Iglesias et al

 
Craig Smith