2021.3.22 Vision papers

 

03-18-2021

FastNeRF: High-Fidelity Neural Rendering at 200FPS
by Stephan J. Garbin et al

03-17-2021

Learning to Resize Images for Computer Vision Tasks
by Hossein Talebi et al

03-16-2021

Back to the Feature: Learning Robust Camera Localization from Pixels to Pose
by Paul-Edouard Sarlin et al

03-17-2021

Training GANs with Stronger Augmentations via Contrastive Discriminator
by Jongheon Jeong et al

03-18-2021

Large Scale Image Completion via Co-Modulated Generative Adversarial Networks
by Shengyu Zhao et al

03-18-2021

Using latent space regression to analyze and leverage compositionality in GANs
by Lucy Chai et al

03-17-2021

You Only Look One-level Feature
by Qiang Chen et al

03-16-2021

Is it Enough to Optimize CNN Architectures on ImageNet?
by Lukas Tuggener et al

03-16-2021

Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models
by Po-Yao Huang et al

03-18-2021

On Semantic Similarity in Video Retrieval
by Michael Wray et al

03-16-2021

Spatial Dependency Networks: Neural Layers for Improved Generative Image Modeling
by Đorđe Miladinović et al

03-19-2021

ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases
by Stéphane d'Ascoli et al

03-19-2021

Paint by Word
by David Bau et al

03-18-2021

Robust Vision-Based Cheat Detection in Competitive Gaming
by Aditya Jonnalagadda et al

03-18-2021

Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks
by Despoina Paschalidou et al

03-18-2021

How I failed machine learning in medical imaging -- shortcomings and recommendations
by Gaël Varoquaux et al

03-18-2021

CDFI: Compression-Driven Network Design for Frame Interpolation
by Tianyu Ding et al

03-17-2021

PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning
by Yunbo Wang et al

03-18-2021

The Case for High-Accuracy Classification: Think Small, Think Many!
by Mohammad Hosseini et al

03-18-2021

Consistency-based Active Learning for Object Detection
by Weiping Yu et al

03-18-2021

UNETR: Transformers for 3D Medical Image Segmentation
by Ali Hatamizadeh et al

03-17-2021

Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual Descriptions
by Sebastian Bujwid et al

03-18-2021

Deep Online Correction for Monocular Visual Odometry
by Jiaxin Zhang et al

03-18-2021

Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE
by Jialun Peng et al

03-18-2021

Challenges of 3D Surface Reconstruction in Capsule Endoscopy
by Olivier Rukundo

03-18-2021

Dementia Severity Classification under Small Sample Size and Weak Supervision in Thick Slice MRI
by Reza Shirkavand et al

03-18-2021

RangeDet: In Defense of Range View for LiDAR-based 3D Object Detection
by Lue Fan et al

03-18-2021

Spectral Reconstruction and Disparity from Spatio-Spectrally Coded Light Fields via Multi-Task Deep Learning
by Maximilian Schambach et al

03-18-2021

Data-free mixed-precision quantization using novel sensitivity metric
by Donghyun Lee et al

03-17-2021

The Untapped Potential of Off-the-Shelf Convolutional Neural Networks
by Matthew Inkawhich et al

03-17-2021

Revisiting the Loss Weight Adjustment in Object Detection
by Wenxin Yu et al

03-18-2021

MSMatch: Semi-Supervised Multispectral Scene Classification with Few Labels
by Pablo Gómez et al

03-17-2021

Pose-GNN: Camera Pose Estimation System Using Graph Neural Networks
by Ahmed Elmoogy et al

03-18-2021

A Location-Sensitive Local Prototype Network for Few-Shot Medical Image Segmentation
by Qinji Yu et al

03-17-2021

CheXbreak: Misclassification Identification for Deep Learning Models Interpreting Chest X-rays
by Emma Chen et al

03-17-2021

The Invertible U-Net for Optical-Flow-free Video Interframe Generation
by Saem Park et al

03-18-2021

Bayesian Imaging With Data-Driven Priors Encoded by Neural Networks: Theory, Methods, and Algorithms
by Matthew Holden et al

03-18-2021

The Low-Rank Simplicity Bias in Deep Networks
by Minyoung Huh et al

03-18-2021

Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations
by Pau Rodriguez et al

03-16-2021

Dense Interaction Learning for Video-based Person Re-identification
by Tianyu He et al

03-16-2021

PC-HMR: Pose Calibration for 3D Human Mesh Recovery from 2D Images/Videos
by Tianyu Luan et al

03-18-2021

Pseudo-ISP: Learning Pseudo In-camera Signal Processing Pipeline from A Color Image Denoiser
by Yue Cao et al

03-18-2021

Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning
by Mandela Patrick et al

03-18-2021

DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer
by Buyu Li et al

03-18-2021

Higher Performance Visual Tracking with Dual-Modal Localization
by Jinghao Zhou et al

03-17-2021

Topology-Aware Segmentation Using Discrete Morse Theory
by Xiaoling Hu et al

03-17-2021

COVIDx-US -- An open-access benchmark dataset of ultrasound imaging data for AI-driven COVID-19 analytics
by Ashkan Ebadi et al

03-18-2021

Enhancing Transformer for Video Understanding Using Gated Multi-Level Attention and Temporal Adversarial Training
by Saurabh Sahu et al

03-18-2021

Learning Multimodal Affinities for Textual Editing in Images
by Or Perel et al

03-18-2021

Real-Time Visual Object Tracking via Few-Shot Learning
by Jinghao Zhou et al

03-17-2021

Bias-Free FedGAN
by Vaikkunth Mugunthan et al

03-18-2021

Danish Fungi 2020 -- Not Just Another Image Recognition Dataset
by Lukáš Picek et al

03-17-2021

Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA
by Yonatan Bitton et al

03-18-2021

KoDF: A Large-scale Korean DeepFake Detection Dataset
by Patrick Kwon et al

03-17-2021

ALADIN: All Layer Adaptive Instance Normalization for Fine-grained Style Similarity
by Dan Ruta et al

03-17-2021

Deep Wiener Deconvolution: Wiener Meets Deep Learning for Image Deblurring
by Jiangxin Dong et al

03-17-2021

On the Whitney extension problem for near isometries and beyond
by Steven B. Damelin

03-18-2021

Reading Isn't Believing: Adversarial Attacks On Multi-Modal Neurons
by David A. Noever et al

03-18-2021

Equivariant Filters for Efficient Tracking in 3D Imaging
by Daniel Moyer et al

03-18-2021

Discriminative and Semantic Feature Selection for Place Recognition towards Dynamic Environments
by Yuxin Tian et al

03-18-2021

Computer Vision Aided URLL Communications: Proactive Service Identification and Coexistence
by Muhammad Alrabeiah et al

03-18-2021

Real-Time, Deep Synthetic Aperture Sonar (SAS) Autofocus
by Isaac D. Gerg et al

03-18-2021

RP-VIO: Robust Plane-based Visual-Inertial Odometry for Dynamic Environments
by Karnik Ram et al

03-18-2021

Scalable Visual Transformers with Hierarchical Pooling
by Zizheng Pan et al

03-18-2021

Collective Decision of One-vs-Rest Networks for Open Set Recognition
by Jaeyeon Jang et al

03-18-2021

OmniPose: A Multi-Scale Framework for Multi-Person Pose Estimation
by Bruno Artacho et al

03-18-2021

Spatio-temporal Crop Classification On Volumetric Data
by Muhammad Usman Qadeer et al

03-18-2021

Impressions2Font: Generating Fonts by Specifying Impressions
by Seiya Matsuda et al

03-18-2021

Efficient Algorithms for Rotation Averaging Problems
by Yihong Dong et al

03-17-2021

Improved Deep Classwise Hashing With Centers Similarity Learning for Image Retrieval
by Ming Zhang et al

03-17-2021

Adversarial Attacks on Camera-LiDAR Models for 3D Car Detection
by Mazen Abdelfattah et al

03-17-2021

On the Role of Images for Analyzing Claims in Social Media
by Gullal S. Cheema et al

03-18-2021

TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation
by Samuel G. Müller et al

03-18-2021

Which to Match? Selecting Consistent GT-Proposal Assignment for Pedestrian Detection
by Yan Luo et al

03-18-2021

Self-Supervised Adaptation for Video Super-Resolution
by Jinsu Yoo et al

03-16-2021

Leveraging Recent Advances in Deep Learning for Audio-Visual Emotion Recognition
by Liam Schoneveld et al

03-18-2021

Learning to Amend Facial Expression Representation via De-albino and Affinity
by Jiawei Shi et al

03-18-2021

Similarity Transfer for Knowledge Distillation
by Haoran Zhao et al

03-17-2021

Single Underwater Image Restoration by Contrastive Learning
by Junlin Han et al

03-19-2021

Beyond Linear Subspace Clustering: A Comparative Study of Nonlinear Manifold Clustering Algorithms
by Maryam Abdolali et al

03-18-2021

Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild
by Zhaoyuan Yin et al

03-17-2021

Learning with Group Noise
by Qizhou Wang et al

03-18-2021

Sequential End-to-end Network for Efficient Person Search
by Zhengjia Li et al

03-17-2021

Machine Vision based Sample-Tube Localization for Mars Sample Return
by Shreyansh Daftry et al

03-18-2021

Future Frame Prediction for Robot-assisted Surgery
by Xiaojie Gao et al

03-18-2021

Lighting Enhancement Aids Reconstruction of Colonoscopic Surfaces
by Yubo Zhang et al

03-18-2021

TPPI-Net: Towards Efficient and Practical Hyperspectral Image Classification
by Hao Chen et al

03-17-2021

Rapid treatment planning for low-dose-rate prostate brachytherapy with TP-GAN
by Tajwar Abrar Aleef et al

03-18-2021

SparsePoint: Fully End-to-End Sparse 3D Object Detector
by Zili Liu et al

03-18-2021

Investigate Indistinguishable Points in Semantic Segmentation of 3D Point Cloud
by Mingye Xu et al

03-17-2021

Fast and High-Quality Blind Multi-Spectral Image Pansharpening
by Lantao Yu et al

03-17-2021

CNN Model & Tuning for Global Road Damage Detection
by Rahul Vishwakarma et al

03-17-2021

Virtual Dress Swap Using Landmark Detection
by Odar Zeynal et al

03-18-2021

Hopper: Multi-hop Transformer for Spatiotemporal Reasoning
by Honglu Zhou et al

03-17-2021

Hierarchical Attention-based Age Estimation and Bias Estimation
by Shakediel Hiba et al

03-18-2021

SG-Net: Spatial Granularity Network for One-Stage Video Instance Segmentation
by Dongfang Liu et al

03-16-2021

Collapsible Linear Blocks for Super-Efficient Super Resolution
by Kartikeya Bhardwaj et al

03-18-2021

Decoupled Spatial Temporal Graphs for Generic Visual Grounding
by Qianyu Feng et al

03-16-2021

Bio-inspired Robustness: A Review
by Harshitha Machiraju et al

03-16-2021

Sparse Curriculum Reinforcement Learning for End-to-End Driving
by Pranav Agarwal et al

03-19-2021

Learning the Superpixel in a Non-iterative and Lifelong Manner
by Lei Zhu et al

03-17-2021

Impact of Facial Tattoos and Paintings on Face Recognition Systems
by Mathias Ibsen et al

03-16-2021

SPICE: Semantic Pseudo-labeling for Image Clustering
by Chuang Niu et al

03-16-2021

Triplet-Watershed for Hyperspectral Image Classification
by Aditya Challa et al

03-19-2021

Cluster-to-Conquer: A Framework for End-to-End Multi-Instance Learning for Whole Slide Image Classification
by Yash Sharma et al

03-16-2021

Pros and Cons of GAN Evaluation Measures: New Developments
by Ali Borji

03-16-2021

Combining Morphological and Histogram based Text Line Segmentation in the OCR Context
by Pit Schneider

03-16-2021

Multi-Prize Lottery Ticket Hypothesis: Finding Accurate Binary Neural Networks by Pruning A Randomly Weighted Network
by James Diffenderfer et al

03-16-2021

Unsupervised Missing Cone Deep Learning in Optical Diffraction Tomography
by Hyungjin Chung et al

03-17-2021

Disentangled Cycle Consistency for Highly-realistic Virtual Try-On
by Chongjian Ge et al

03-17-2021

Learning Discriminative Prototypes with Dynamic Time Warping
by Xiaobin Chang et al

03-16-2021

Invertible Residual Network with Regularization for Effective Medical Image Segmentation
by Kashu Yamazaki et al

03-18-2021

Knowledge-Guided Object Discovery with Acquired Deep Impressions
by Jinyang Yuan et al

03-17-2021

HAMIL: Hierarchical Aggregation-Based Multi-Instance Learning for Microscopy Image Classification
by Yanlun Tu et al

03-17-2021

Gradient Projection Memory for Continual Learning
by Gobinda Saha et al

03-17-2021

Meta-learning of Pooling Layers for Character Recognition
by Takato Otsuzuki et al

03-16-2021

RackLay: Multi-Layer Layout Estimation for Warehouse Racks
by Meher Shashwat Nigam et al

03-16-2021

Anti-Adversarially Manipulated Attributions for Weakly and Semi-Supervised Semantic Segmentation
by Jungbeom Lee et al

03-17-2021

Prediction-assistant Frame Super-Resolution for Video Streaming
by Wang Shen et al

03-16-2021

Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar
by Peike Li et al

03-16-2021

Co-Generation and Segmentation for Generalized Surgical Instrument Segmentation on Unlabelled Data
by Megha Kalia et al

03-16-2021

Hebbian Semi-Supervised Learning in a Sample Efficiency Setting
by Gabriele Lagani et al

03-17-2021

An Efficient Method for the Classification of Croplands in Scarce-Label Regions
by Houtan Ghaffari

03-19-2021

Robustness via Cross-Domain Ensembles
by Teresa Yeo et al

03-16-2021

YOLOStereo3D: A Step Back to 2D for Efficient Stereo 3D Detection
by Yuxuan Liu et al

03-17-2021

Hierarchical Random Walker Segmentation for Large Volumetric Biomedical Data
by Dominik Drees et al

03-17-2021

Theoretical bounds on data requirements for the ray-based classification
by Brian J. Weber et al

03-16-2021

Unsupervised anomaly detection in digital pathology using GANs
by Milda Pocevičiūtė et al

03-16-2021

Repurposing Pretrained Models for Robust Out-of-domain Few-Shot Learning
by Namyeong Kwon et al

03-18-2021

Training image classifiers using Semi-Weak Label Data
by Anxiang Zhang et al

03-19-2021

Improving Image co-segmentation via Deep Metric Learning
by Zhengwen Li et al

03-18-2021

Noise Modulation: Let Your Model Interpret Itself
by Haoyang Li et al

03-17-2021

ShipSRDet: An End-to-End Remote Sensing Ship Detector Using Super-Resolved Feature Representation
by Shitian He et al

03-16-2021

A Large-Scale Dataset for Benchmarking Elevator Button Segmentation and Character Recognition
by Jianbang Liu et al

03-19-2021

MetaLabelNet: Learning to Generate Soft-Labels from Noisy-Labels
by Görkem Algan et al

03-16-2021

WheatNet: A Lightweight Convolutional Neural Network for High-throughput Image-based Wheat Head Detection and Counting
by Saeed Khaki et al

03-17-2021

Temporal Cluster Matching for Change Detection of Structures from Satellite Imagery
by Caleb Robinson et al

03-16-2021

Colorectal Cancer Segmentation using Atrous Convolution and Residual Enhanced UNet
by Nisarg A. Shah et al

03-17-2021

Few-Shot Visual Grounding for Natural Human-Robot Interaction
by Giorgos Tziafas et al

03-17-2021

Multi-channel Deep Supervision for Crowd Counting
by Bo Wei et al

03-16-2021

LRGNet: Learnable Region Growing for Class-Agnostic Point Cloud Segmentation
by Jingdao Chen et al

03-17-2021

What's in My LiDAR Odometry Toolbox?
by Pierre Dellenbach et al

03-17-2021

Trans-SVNet: Accurate Phase Recognition from Surgical Videos via Hybrid Embedding Aggregation Transformer
by Xiaojie Gao et al

03-17-2021

Quantitative Effectiveness Assessment and Role Categorization of Individual Units in Convolutional Neural Networks
by Yang Zhao et al

03-17-2021

Interpretable Distance Metric Learning for Handwritten Chinese Character Recognition
by Boxiang Dong et al

03-16-2021

Semi-Supervised Learning for Eye Image Segmentation
by Aayush K. Chaudhary et al

03-19-2021

Toward Compact Deep Neural Networks via Energy-Aware Pruning
by Seul-Ki Yeom et al

03-17-2021

Fourier Transform of Percoll Gradients Boosts CNN Classification of Hereditary Hemolytic Anemias
by Ario Sadafi et al

03-16-2021

Adversarial YOLO: Defense Human Detection Patch Attacks via Detecting Adversarial Patches
by Nan Ji et al

03-19-2021

Computational Emotion Analysis From Images: Recent Advances and Future Directions
by Sicheng Zhao et al

03-19-2021

Tf-GCZSL: Task-Free Generalized Continual Zero-Shot Learning
by Chandan Gautam et al

03-18-2021

Dynamic Transfer for Multi-Source Domain Adaptation
by Yunsheng Li et al

03-16-2021

Lite-HDSeg: LiDAR Semantic Segmentation Using Lite Harmonic Dense Convolutions
by Ryan Razani et al

03-16-2021

Balancing Biases and Preserving Privacy on Balanced Faces in the Wild
by Joseph P Robinson et al

03-16-2021

BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation
by Jungbeom Lee et al

03-19-2021

LSDAT: Low-Rank and Sparse Decomposition for Decision-based Adversarial Attack
by Ashkan Esmaeili et al

03-16-2021

Unsupervised Anomaly Segmentation using Image-Semantic Cycle Translation
by Chenxin Li et al

03-16-2021

QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection
by Chenhongyi Yang et al

03-19-2021

Deep Label Fusion: A 3D End-to-End Hybrid Multi-Atlas Segmentation and Deep Learning Pipeline
by Long Xie et al

03-19-2021

Degrade is Upgrade: Learning Degradation for Low-light Image Enhancement
by Kui Jiang et al

03-16-2021

Adversarial Driving: Attacking End-to-End Autonomous Driving Systems
by Han Wu et al

03-19-2021

XProtoNet: Diagnosis in Chest Radiography with Global and Local Explanations
by Eunji Kim et al

03-18-2021

Boosting Adversarial Transferability through Enhanced Momentum
by Xiaosen Wang et al

03-16-2021

EADNet: Efficient Asymmetric Dilated Network for Semantic Segmentation
by Qihang Yang et al

03-16-2021

Modulating Localization and Classification for Harmonized Object Detection
by Taiheng Zhang et al

03-16-2021

Conceptual Text Region Network: Cognition-Inspired Accurate Scene Text Detection
by Chenwei Cui et al

03-17-2021

Generating Annotated Training Data for 6D Object Pose Estimation in Operational Environments with Minimal User Interaction
by Paul Koch et al

03-18-2021

Image Synthesis for Data Augmentation in Medical CT using Deep Reinforcement Learning
by Arjun Krishna et al

03-19-2021

CE-FPN: Enhancing Channel Information for Object Detection
by Yihao Luo et al

03-18-2021

Concentric Spherical GNN for 3D Representation Learning
by James Fox et al

03-19-2021

CoordiNet: uncertainty-aware pose regressor for reliable vehicle localization
by Arthur Moreau et al

03-19-2021

MDMMT: Multidomain Multimodal Transformer for Video Retrieval
by Maksim Dzabraev et al

03-18-2021

CLTA: Contents and Length-based Temporal Attention for Few-shot Action Recognition
by Yang Bo et al

03-18-2021

PSCC-Net: Progressive Spatio-Channel Correlation Network for Image Manipulation Detection and Localization
by Xiaohong Liu et al

03-18-2021

Fusion-FlowNet: Energy-Efficient Optical Flow Estimation using Sensor Fusion and Deep Fused Spiking-Analog Network Architectures
by Chankyu Lee et al

03-19-2021

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning
by Zhigang Dai et al

03-18-2021

Generic Perceptual Loss for Modeling Structured Output Dependencies
by Yifan Liu et al

03-16-2021

Frequency-aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection
by Jiaming Li et al

03-16-2021

Simultaneous Multi-View Camera Pose Estimation and Object Tracking with Square Planar Markers
by Hamid Sarmadi et al

03-18-2021

DCF-ASN: Coarse-to-fine Real-time Visual Tracking via Discriminative Correlation Filter and Attentional Siamese Network
by Xizhe Xue et al

03-19-2021

Learning Multiscale Correlations for Human Motion Prediction
by Honghong Zhou et al

03-17-2021

Aggregated Multi-GANs for Controlled 3D Human Motion Prediction
by Zhenguang Liu et al

03-16-2021

Design and Development of Autonomous Delivery Robot
by Aniket Gujarathi et al

03-19-2021

Connecting Images through Time and Sources: Introducing Low-data, Heterogeneous Instance Retrieval
by Dimitri Gominski et al

03-19-2021

Skeleton Merger: an Unsupervised Aligned Keypoint Detector
by Ruoxi Shi et al

03-19-2021

DFS: A Diverse Feature Synthesis Model for Generalized Zero-Shot Learning
by Bonan Li et al

03-18-2021

Neural Networks for Semantic Gaze Analysis in XR Settings
by Lena Stubbemann et al

03-18-2021

Ano-Graph: Learning Normal Scene Contextual Graphs to Detect Video Anomalies
by Masoud Pourreza et al

03-19-2021

Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark
by Joakim Bruslund Haurum et al

03-16-2021

Consistent Posterior Distributions under Vessel-Mixing: A Regularization for Cross-Domain Retinal Artery/Vein Classification
by Chenxin Li et al

03-19-2021

GLOWin: A Flow-based Invertible Generative Framework for Learning Disentangled Feature Representations in Medical Images
by Aadhithya Sankar et al

03-18-2021

Hyperspectral Image Super-Resolution in Arbitrary Input-Output Band Settings
by Zhongyang Zhang et al

03-19-2021

Variational Knowledge Distillation for Disease Classification in Chest X-Rays
by Tom van Sonsbeek et al

03-18-2021

3D Human Pose Estimation with Spatial and Temporal Transformers
by Ce Zheng et al

03-19-2021

ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation
by Chen Liang et al

03-16-2021

A comparative study of deep learning methods for building footprints detection using high spatial resolution aerial images
by Hongjie He et al

03-16-2021

Digital Peter: Dataset, Competition and Handwriting Recognition Methods
by Mark Potanin et al

03-18-2021

Recent Advances in Deep Learning Techniques for Face Recognition
by Md. Tahmid Hasan Fuad et al

03-19-2021

Carton dataset synthesis based on foreground texture replacement
by Lijun Gou et al

03-19-2021

There and Back Again: Self-supervised Multispectral Correspondence Estimation
by Celyn Walters et al

03-18-2021

Localization of Cochlear Implant Electrodes from Cone Beam Computed Tomography using Particle Belief Propagation
by Hendrik Hachmann et al

 
Craig Smith