2021.3.22 Vision papers

03-18-2021	FastNeRF: High-Fidelity Neural Rendering at 200FPS by Stephan J. Garbin et al
03-17-2021	Learning to Resize Images for Computer Vision Tasks by Hossein Talebi et al
03-16-2021	Back to the Feature: Learning Robust Camera Localization from Pixels to Pose by Paul-Edouard Sarlin et al
03-17-2021	Training GANs with Stronger Augmentations via Contrastive Discriminator by Jongheon Jeong et al
03-18-2021	Large Scale Image Completion via Co-Modulated Generative Adversarial Networks by Shengyu Zhao et al
03-18-2021	Using latent space regression to analyze and leverage compositionality in GANs by Lucy Chai et al
03-17-2021	You Only Look One-level Feature by Qiang Chen et al
03-16-2021	Is it Enough to Optimize CNN Architectures on ImageNet? by Lukas Tuggener et al
03-16-2021	Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models by Po-Yao Huang et al
03-18-2021	On Semantic Similarity in Video Retrieval by Michael Wray et al
03-16-2021	Spatial Dependency Networks: Neural Layers for Improved Generative Image Modeling by Đorđe Miladinović et al
03-19-2021	ConViT: Improving Vision Transformers with Soft Convolutional Inductive Biases by Stéphane d'Ascoli et al
03-19-2021	Paint by Word by David Bau et al
03-18-2021	Robust Vision-Based Cheat Detection in Competitive Gaming by Aditya Jonnalagadda et al
03-18-2021	Neural Parts: Learning Expressive 3D Shape Abstractions with Invertible Neural Networks by Despoina Paschalidou et al
03-18-2021	How I failed machine learning in medical imaging -- shortcomings and recommendations by Gaël Varoquaux et al
03-18-2021	CDFI: Compression-Driven Network Design for Frame Interpolation by Tianyu Ding et al
03-17-2021	PredRNN: A Recurrent Neural Network for Spatiotemporal Predictive Learning by Yunbo Wang et al
03-18-2021	The Case for High-Accuracy Classification: Think Small, Think Many! by Mohammad Hosseini et al
03-18-2021	Consistency-based Active Learning for Object Detection by Weiping Yu et al
03-18-2021	UNETR: Transformers for 3D Medical Image Segmentation by Ali Hatamizadeh et al
03-17-2021	Large-Scale Zero-Shot Image Classification from Rich and Diverse Textual Descriptions by Sebastian Bujwid et al
03-18-2021	Deep Online Correction for Monocular Visual Odometry by Jiaxin Zhang et al
03-18-2021	Generating Diverse Structure for Image Inpainting With Hierarchical VQ-VAE by Jialun Peng et al
03-18-2021	Challenges of 3D Surface Reconstruction in Capsule Endoscopy by Olivier Rukundo
03-18-2021	Dementia Severity Classification under Small Sample Size and Weak Supervision in Thick Slice MRI by Reza Shirkavand et al
03-18-2021	RangeDet:In Defense of Range View for LiDAR-based 3D Object Detection by Lue Fan et al
03-18-2021	Spectral Reconstruction and Disparity from Spatio-Spectrally Coded Light Fields via Multi-Task Deep Learning by Maximilian Schambach et al
03-18-2021	Data-free mixed-precision quantization using novel sensitivity metric by Donghyun Lee et al
03-17-2021	The Untapped Potential of Off-the-Shelf Convolutional Neural Networks by Matthew Inkawhich et al
03-17-2021	Revisiting the Loss Weight Adjustment in Object Detection by Wenxin Yu et al
03-18-2021	MSMatch: Semi-Supervised Multispectral Scene Classification with Few Labels by Pablo Gómez et al
03-17-2021	Pose-GNN : Camera Pose Estimation System Using Graph Neural Networks by Ahmed Elmoogy et al
03-18-2021	A Location-Sensitive Local Prototype Network for Few-Shot Medical Image Segmentation by Qinji Yu et al
03-17-2021	CheXbreak: Misclassification Identification for Deep Learning Models Interpreting Chest X-rays by Emma Chen et al
03-17-2021	The Invertible U-Net for Optical-Flow-free Video Interframe Generation by Saem Park et al
03-18-2021	Bayesian Imaging With Data-Driven Priors Encoded by Neural Networks: Theory, Methods, and Algorithms by Matthew Holden et al
03-18-2021	The Low-Rank Simplicity Bias in Deep Networks by Minyoung Huh et al
03-18-2021	Beyond Trivial Counterfactual Explanations with Diverse Valuable Explanations by Pau Rodriguez et al
03-16-2021	Dense Interaction Learning for Video-based Person Re-identification by Tianyu He et al
03-16-2021	PC-HMR: Pose Calibration for 3D Human Mesh Recovery from 2D Images/Videos by Tianyu Luan et al
03-18-2021	Pseudo-ISP: Learning Pseudo In-camera Signal Processing Pipeline from A Color Image Denoiser by Yue Cao et al
03-18-2021	Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning by Mandela Patrick et al
03-18-2021	DanceNet3D: Music Based Dance Generation with Parametric Motion Transformer by Buyu Li et al
03-18-2021	Higher Performance Visual Tracking with Dual-Modal Localization by Jinghao Zhou et al
03-17-2021	Topology-Aware Segmentation Using Discrete Morse Theory by Xiaoling Hu et al
03-17-2021	COVIDx-US -- An open-access benchmark dataset of ultrasound imaging data for AI-driven COVID-19 analytics by Ashkan Ebadi et al
03-18-2021	Enhancing Transformer for Video Understanding Using Gated Multi-Level Attention and Temporal Adversarial Training by Saurabh Sahu et al
03-18-2021	Learning Multimodal Affinities for Textual Editing in Images by Or Perel et al
03-18-2021	Real-Time Visual Object Tracking via Few-Shot Learning by Jinghao Zhou et al
03-17-2021	Bias-Free FedGAN by Vaikkunth Mugunthan et al
03-18-2021	Danish Fungi 2020 -- Not Just Another Image Recognition Dataset by Lukáš Picek et al
03-17-2021	Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA by Yonatan Bitton et al
03-18-2021	KoDF: A Large-scale Korean DeepFake Detection Dataset by Patrick Kwon et al
03-17-2021	ALADIN: All Layer Adaptive Instance Normalization for Fine-grained Style Similarity by Dan Ruta et al
03-17-2021	Deep Wiener Deconvolution: Wiener Meets Deep Learning for Image Deblurring by Jiangxin Dong et al
03-17-2021	On the Whitney extension problem for near isometries and beyond by Steven B. Damelin
03-18-2021	Reading Isnt Believing: Adversarial Attacks On Multi-Modal Neurons by David A. Noever et al
03-18-2021	Equivariant Filters for Efficient Tracking in 3D Imaging by Daniel Moyer et al
03-18-2021	Discriminative and Semantic Feature Selection for Place Recognition towards Dynamic Environments by Yuxin Tian et al
03-18-2021	Computer Vision Aided URLL Communications: Proactive Service Identification and Coexistence by Muhammad Alrabeiah et al
03-18-2021	Real-Time, Deep Synthetic Aperture Sonar (SAS) Autofocus by Isaac D. Gerg et al
03-18-2021	RP-VIO: Robust Plane-based Visual-Inertial Odometry for Dynamic Environments by Karnik Ram et al
03-18-2021	Scalable Visual Transformers with Hierarchical Pooling by Zizheng Pan et al
03-18-2021	Collective Decision of One-vs-Rest Networks for Open Set Recognition by Jaeyeon Jang et al
03-18-2021	OmniPose: A Multi-Scale Framework for Multi-Person Pose Estimation by Bruno Artacho et al
03-18-2021	Spatio-temporal Crop Classification On Volumetric Data by Muhammad Usman Qadeer et al
03-18-2021	Impressions2Font: Generating Fonts by Specifying Impressions by Seiya Matsuda et al
03-18-2021	Efficient Algorithms for Rotation Averaging Problems by Yihong Dong et al
03-17-2021	Improved Deep Classwise Hashing With Centers Similarity Learning for Image Retrieval by Ming Zhang et al
03-17-2021	Adversarial Attacks on Camera-LiDAR Models for 3D Car Detection by Mazen Abdelfattah et al
03-17-2021	On the Role of Images for Analyzing Claims in Social Media by Gullal S. Cheema et al
03-18-2021	TrivialAugment: Tuning-free Yet State-of-the-Art Data Augmentation by Samuel G. Müller et al
03-18-2021	Which to Match? Selecting Consistent GT-Proposal Assignment for Pedestrian Detection by Yan Luo et al
03-18-2021	Self-Supervised Adaptation for Video Super-Resolution by Jinsu Yoo et al
03-16-2021	Leveraging Recent Advances in Deep Learning for Audio-Visual Emotion Recognition by Liam Schoneveld et al
03-18-2021	Learning to Amend Facial Expression Representation via De-albino and Affinity by Jiawei Shi et al
03-18-2021	Similarity Transfer for Knowledge Distillation by Haoran Zhao et al
03-17-2021	Single Underwater Image Restoration by Contrastive Learning by Junlin Han et al
03-19-2021	Beyond Linear Subspace Clustering: A Comparative Study of Nonlinear Manifold Clustering Algorithms by Maryam Abdolali et al
03-18-2021	Learning to Recommend Frame for Interactive Video Object Segmentation in the Wild by Zhaoyuan Yin et al
03-17-2021	Learning with Group Noise by Qizhou Wang et al
03-18-2021	Sequential End-to-end Network for Efficient Person Search by Zhengjia Li et al
03-17-2021	Machine Vision based Sample-Tube Localization for Mars Sample Return by Shreyansh Daftry et al
03-18-2021	Future Frame Prediction for Robot-assisted Surgery by Xiaojie Gao et al
03-18-2021	Lighting Enhancement Aids Reconstruction of Colonoscopic Surfaces by Yubo Zhang et al
03-18-2021	TPPI-Net: Towards Efficient and Practical Hyperspectral Image Classification by Hao Chen et al
03-17-2021	Rapid treatment planning for low-dose-rate prostate brachytherapy with TP-GAN by Tajwar Abrar Aleef et al
03-18-2021	SparsePoint: Fully End-to-End Sparse 3D Object Detector by Zili Liu et al
03-18-2021	Investigate Indistinguishable Points in Semantic Segmentation of 3D Point Cloud by Mingye Xu et al
03-17-2021	Fast and High-Quality Blind Multi-Spectral Image Pansharpening by Lantao Yu et al
03-17-2021	CNN Model & Tuning for Global Road Damage Detection by Rahul Vishwakarma et al
03-17-2021	Virtual Dress Swap Using Landmark Detection by Odar Zeynal et al
03-18-2021	Hopper: Multi-hop Transformer for Spatiotemporal Reasoning by Honglu Zhou et al
03-17-2021	Hierarchical Attention-based Age Estimation and Bias Estimation by Shakediel Hiba et al
03-18-2021	SG-Net: Spatial Granularity Network for One-Stage Video Instance Segmentation by Dongfang Liu et al
03-16-2021	Collapsible Linear Blocks for Super-Efficient Super Resolution by Kartikeya Bhardwaj et al
03-18-2021	Decoupled Spatial Temporal Graphs for Generic Visual Grounding by Qianyu Feng et al
03-16-2021	Bio-inspired Robustness: A Review by Harshitha Machiraju et al
03-16-2021	Sparse Curriculum Reinforcement Learning for End-to-End Driving by Pranav Agarwal et al
03-19-2021	Learning the Superpixel in a Non-iterative and Lifelong Manner by Lei Zhu et al
03-17-2021	Impact of Facial Tattoos and Paintings on Face Recognition Systems by Mathias Ibsen et al
03-16-2021	SPICE: Semantic Pseudo-labeling for Image Clustering by Chuang Niu et al
03-16-2021	Triplet-Watershed for Hyperspectral Image Classification by Aditya Challa et al
03-19-2021	Cluster-to-Conquer: A Framework for End-to-End Multi-Instance Learning for Whole Slide Image Classification by Yash Sharma et al
03-16-2021	Pros and Cons of GAN Evaluation Measures: New Developments by Ali Borji
03-16-2021	Combining Morphological and Histogram based Text Line Segmentation in the OCR Context by Pit Schneider
03-16-2021	Multi-Prize Lottery Ticket Hypothesis: Finding Accurate Binary Neural Networks by Pruning A Randomly Weighted Network by James Diffenderfer et al
03-16-2021	Unsupervised Missing Cone Deep Learning in Optical Diffraction Tomography by Hyungjin Chung et al
03-17-2021	Disentangled Cycle Consistency for Highly-realistic Virtual Try-On by Chongjian Ge et al

03-17-2021	Learning Discriminative Prototypes with Dynamic Time Warping by Xiaobin Chang et al
03-16-2021	Invertible Residual Network with Regularization for Effective Medical Image Segmentation by Kashu Yamazaki et al
03-18-2021	Knowledge-Guided Object Discovery with Acquired Deep Impressions by Jinyang Yuan et al
03-17-2021	HAMIL: Hierarchical Aggregation-Based Multi-Instance Learning for Microscopy Image Classification by Yanlun Tu et al
03-17-2021	Gradient Projection Memory for Continual Learning by Gobinda Saha et al
03-17-2021	Meta-learning of Pooling Layers for Character Recognition by Takato Otsuzuki et al
03-16-2021	RackLay: Multi-Layer Layout Estimation for Warehouse Racks by Meher Shashwat Nigam et al
03-16-2021	Anti-Adversarially Manipulated Attributions for Weakly and Semi-Supervised Semantic Segmentation by Jungbeom Lee et al
03-17-2021	Prediction-assistant Frame Super-Resolution for Video Streaming by Wang Shen et al
03-16-2021	Super-Resolving Cross-Domain Face Miniatures by Peeking at One-Shot Exemplar by Peike Li et al
03-16-2021	Co-Generation and Segmentation for Generalized Surgical Instrument Segmentation on Unlabelled Data by Megha Kalia et al
03-16-2021	Hebbian Semi-Supervised Learning in a Sample Efficiency Setting by Gabriele Lagani et al
03-17-2021	An Efficient Method for the Classification of Croplands in Scarce-Label Regions by Houtan Ghaffari
03-19-2021	Robustness via Cross-Domain Ensembles by Teresa Yeo et al
03-16-2021	YOLOStereo3D: A Step Back to 2D for Efficient Stereo 3D Detection by Yuxuan Liu et al
03-17-2021	Hierarchical Random Walker Segmentation for Large Volumetric Biomedical Data by Dominik Drees et al
03-17-2021	Theoretical bounds on data requirements for the ray-based classification by Brian J. Weber et al
03-16-2021	Unsupervised anomaly detection in digital pathology using GANs by Milda Pocevičiūtė et al
03-16-2021	Repurposing Pretrained Models for Robust Out-of-domain Few-Shot Learning by Namyeong Kwon et al
03-18-2021	Training image classifiers using Semi-Weak Label Data by Anxiang Zhang et al
03-19-2021	Improving Image co-segmentation via Deep Metric Learning by Zhengwen Li et al
03-18-2021	Noise Modulation: Let Your Model Interpret Itself by Haoyang Li et al
03-17-2021	ShipSRDet: An End-to-End Remote Sensing Ship Detector Using Super-Resolved Feature Representation by Shitian He et al
03-16-2021	A Large-Scale Dataset for Benchmarking Elevator Button Segmentation and Character Recognition by Jianbang Liu et al
03-19-2021	MetaLabelNet: Learning to Generate Soft-Labels from Noisy-Labels by Görkem Algan et al
03-16-2021	WheatNet: A Lightweight Convolutional Neural Network for High-throughput Image-based Wheat Head Detection and Counting by Saeed Khaki et al
03-17-2021	Temporal Cluster Matching for Change Detection of Structures from Satellite Imagery by Caleb Robinson et al
03-16-2021	Colorectal Cancer Segmentation using Atrous Convolution and Residual Enhanced UNet by Nisarg A. Shah et al
03-17-2021	Few-Shot Visual Grounding for Natural Human-Robot Interaction by Giorgos Tziafas et al
03-17-2021	Multi-channel Deep Supervision for Crowd Counting by Bo Wei et al
03-16-2021	LRGNet: Learnable Region Growing for Class-Agnostic Point Cloud Segmentation by Jingdao Chen et al
03-17-2021	What s in My LiDAR Odometry Toolbox? by Pierre Dellenbach et al
03-17-2021	Trans-SVNet: Accurate Phase Recognition from Surgical Videos via Hybrid Embedding Aggregation Transformer by Xiaojie Gao et al
03-17-2021	Quantitative Effectiveness Assessment and Role Categorization of Individual Units in Convolutional Neural Networks by Yang Zhao et al
03-17-2021	Interpretable Distance Metric Learning for Handwritten Chinese Character Recognition by Boxiang Dong et al
03-16-2021	Semi-Supervised Learning for Eye Image Segmentation by Aayush K. Chaudhary et al
03-19-2021	Toward Compact Deep Neural Networks via Energy-Aware Pruning by Seul-Ki Yeom et al
03-17-2021	Fourier Transform of Percoll Gradients Boosts CNN Classification of Hereditary Hemolytic Anemias by Ario Sadafi et al
03-16-2021	Adversarial YOLO: Defense Human Detection Patch Attacks via Detecting Adversarial Patches by Nan Ji et al
03-19-2021	Computational Emotion Analysis From Images: Recent Advances and Future Directions by Sicheng Zhao et al
03-19-2021	Tf-GCZSL: Task-Free Generalized Continual Zero-Shot Learning by Chandan Gautam et al
03-18-2021	Dynamic Transfer for Multi-Source Domain Adaptation by Yunsheng Li et al
03-16-2021	Lite-HDSeg: LiDAR Semantic Segmentation Using Lite Harmonic Dense Convolutions by Ryan Razani et al
03-16-2021	Balancing Biases and Preserving Privacy on Balanced Faces in the Wild by Joseph P Robinson et al
03-16-2021	BBAM: Bounding Box Attribution Map for Weakly Supervised Semantic and Instance Segmentation by Jungbeom Lee et al
03-19-2021	LSDAT: Low-Rank and Sparse Decomposition for Decision-based Adversarial Attack by Ashkan Esmaeili et al
03-16-2021	Unsupervised Anomaly Segmentation using Image-Semantic Cycle Translation by Chenxin Li et al
03-16-2021	QueryDet: Cascaded Sparse Query for Accelerating High-Resolution Small Object Detection by Chenhongyi Yang et al
03-19-2021	Deep Label Fusion: A 3D End-to-End Hybrid Multi-Atlas Segmentation and Deep Learning Pipeline by Long Xie et al
03-19-2021	Degrade is Upgrade: Learning Degradation for Low-light Image Enhancement by Kui Jiang et al
03-16-2021	Adversarial Driving: Attacking End-to-End Autonomous Driving Systems by Han Wu et al
03-19-2021	XProtoNet: Diagnosis in Chest Radiography with Global and Local Explanations by Eunji Kim et al
03-18-2021	Boosting Adversarial Transferability through Enhanced Momentum by Xiaosen Wang et al
03-16-2021	EADNet: Efficient Asymmetric Dilated Network for Semantic Segmentation by Qihang Yang et al
03-16-2021	Modulating Localization and Classification for Harmonized Object Detection by Taiheng Zhang et al
03-16-2021	Conceptual Text Region Network: Cognition-Inspired Accurate Scene Text Detection by Chenwei Cui et al
03-17-2021	Generating Annotated Training Data for 6D Object Pose Estimation in Operational Environments with Minimal User Interaction by Paul Koch et al
03-18-2021	Image Synthesis for Data Augmentation in Medical CT using DeepReinforcement Learning by Arjun Krishna et al
03-19-2021	CE-FPN: Enhancing Channel Information for Object Detection by Yihao Luo et al
03-18-2021	Concentric Spherical GNN for 3D Representation Learning by James Fox et al
03-19-2021	CoordiNet: uncertainty-aware pose regressor for reliable vehicle localization by Arthur Moreau et al
03-19-2021	MDMMT: Multidomain Multimodal Transformer for Video Retrieval by Maksim Dzabraev et al
03-18-2021	CLTA: Contents and Length-based Temporal Attention for Few-shot Action Recognition by Yang Bo et al
03-18-2021	PSCC-Net: Progressive Spatio-Channel Correlation Network for Image Manipulation Detection and Localization by Xiaohong Liu et al
03-18-2021	Fusion-FlowNet: Energy-Efficient Optical Flow Estimation using Sensor Fusion and Deep Fused Spiking-Analog Network Architectures by Chankyu Lee et al
03-19-2021	UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning by Zhigang Dai et al
03-18-2021	Generic Perceptual Loss for Modeling Structured Output Dependencies by Yifan Liu et al
03-16-2021	Frequency-aware Discriminative Feature Learning Supervised by Single-Center Loss for Face Forgery Detection by Jiaming Li et al
03-16-2021	Simultaneous Multi-View Camera Pose Estimation and Object Tracking with Square Planar Markers by Hamid Sarmadi et al
03-18-2021	DCF-ASN: Coarse-to-fine Real-time Visual Tracking via Discriminative Correlation Filter and Attentional Siamese Network by Xizhe Xue et al
03-19-2021	Learning Multiscale Correlations for Human Motion Prediction by Honghong Zhou et al
03-17-2021	Aggregated Multi-GANs for Controlled 3D Human Motion Prediction by Zhenguang Liu et al
03-16-2021	Design and Development of Autonomous Delivery Robot by Aniket Gujarathi et al
03-19-2021	Connecting Images through Time and Sources: Introducing Low-data, Heterogeneous Instance Retrieval by Dimitri Gominski et al
03-19-2021	Skeleton Merger: an Unsupervised Aligned Keypoint Detector by Ruoxi Shi et al
03-19-2021	DFS: A Diverse Feature Synthesis Model for Generalized Zero-Shot Learning by Bonan Li et al
03-18-2021	Neural Networks for Semantic Gaze Analysis in XR Settings by Lena Stubbemann et al
03-18-2021	Ano-Graph: Learning Normal Scene Contextual Graphs to Detect Video Anomalies by Masoud Pourreza et al
03-19-2021	Sewer-ML: A Multi-Label Sewer Defect Classification Dataset and Benchmark by Joakim Bruslund Haurum et al
03-16-2021	Consistent Posterior Distributions under Vessel-Mixing: A Regularization for Cross-Domain Retinal Artery/Vein Classification by Chenxin Li et al
03-19-2021	GLOWin: A Flow-based Invertible Generative Framework for Learning Disentangled Feature Representations in Medical Images by Aadhithya Sankar et al
03-18-2021	Hyperspectral Image Super-Resolution in Arbitrary Input-Output Band Settings by Zhongyang Zhang et al
03-19-2021	Variational Knowledge Distillation for Disease Classification in Chest X-Rays by Tom van Sonsbeek et al
03-18-2021	3D Human Pose Estimation with Spatial and Temporal Transformers by Ce Zheng et al
03-19-2021	ClawCraneNet: Leveraging Object-level Relation for Text-based Video Segmentation by Chen Liang et al
03-16-2021	A comparative study of deep learning methods for building footprints detection using high spatial resolution aerial images by Hongjie He et al
03-16-2021	Digital Peter: Dataset, Competition and Handwriting Recognition Methods by Mark Potanin et al
03-18-2021	Recent Advances in Deep Learning Techniques for Face Recognition by Md. Tahmid Hasan Fuad et al
03-19-2021	Carton dataset synthesis based on foreground texture replacement by Lijun Gou et al
03-19-2021	There and Back Again: Self-supervised Multispectral Correspondence Estimation by Celyn Walters et al
03-18-2021	Localization of Cochlear Implant Electrodes from Cone Beam Computed Tomography using Particle Belief Propagation by Hendrik Hachmann et al

Craig SmithMarch 22, 2021