2021.6.28 Vision papers

06-24-2021	Federated Noisy Client Learning by Li Li et al
06-25-2021	Building Intelligent Autonomous Navigation Agents by Devendra Singh Chaplot
06-25-2021	Image-to-image Transformation with Auxiliary Condition by Robert Leer et al
06-23-2021	What makes visual place recognition easy or hard? by Stefan Schubert et al
06-23-2021	Conditional Deformable Image Registration with Convolutional Neural Network by Tony C. W. Mok et al
06-24-2021	ChaLearn Looking at People: Inpainting and Denoising challenges by Sergio Escalera et al
06-24-2021	VOLO: Vision Outlooker for Visual Recognition by Li Yuan et al
06-23-2021	Transformer Meets Convolution: A Bilateral Awareness Net-work for Semantic Segmentation of Very Fine Resolution Ur-ban Scene Images by Libo Wang et al
06-24-2021	When Differential Privacy Meets Interpretability: A Case Study by Rakshit Naidu et al
06-24-2021	Driver-centric Risk Object Identification by Chengxi Li et al
06-23-2021	3D human tongue reconstruction from single in-the-wild images by Stylianos Ploumpis et al
06-24-2021	Advancing biological super-resolution microscopy through deep learning: a brief review by Tianjie Yang et al
06-25-2021	Semantic annotation for computational pathology: Multidisciplinary experience and best practice recommendations by Noorul Wahab et al
06-23-2021	Multi-modal and frequency-weighted tensor nuclear norm for hyperspectral image denoising by Sheng Liu et al
06-24-2021	DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval by Giorgos Kordopatis-Zilos et al
06-23-2021	A Review of Assistive Technologies for Activities of Daily Living of Elderly by Nirmalya Thakur et al
06-24-2021	Towards Automatic Speech to Sign Language Generation by Parul Kapoor et al
06-24-2021	FaDIV-Syn: Fast Depth-Independent View Synthesis by Andre Rochow et al
06-23-2021	Alias-Free Generative Adversarial Networks by Tero Karras et al
06-22-2021	Euro-PVI: Pedestrian Vehicle Interactions in Dense Urban Centers by Apratim Bhattacharyya et al
06-23-2021	How Well do Feature Visualizations Support Causal Understanding of CNN Activations? by Roland S. Zimmermann et al
06-24-2021	Generalized One-Class Learning Using Pairs of Complementary Classifiers by Anoop Cherian et al
06-25-2021	Partially fake it till you make it: mixing real and fake thermal images for improved object detection by Francesco Bongini et al
06-23-2021	Real-time Instance Segmentation with Discriminative Orientation Maps by Wentao Du et al
06-24-2021	Physics perception in sloshing scenes with guaranteed thermodynamic consistency by Beatriz Moya et al
06-22-2021	Give Me Your Trained Model: Domain Adaptive Semantic Segmentation without Source Data by Yuxi Wang et al
06-22-2021	A Survey on Human-aware Robot Navigation by Ronja Möller et al
06-22-2021	Universal Domain Adaptation in Ordinal Regression by Chidlovskii Boris et al
06-22-2021	Automatic Head Overcoat Thickness Measure with NASNet-Large-Decoder Net by Youshan Zhang et al
06-22-2021	Deep3DPose: Realtime Reconstruction of Arbitrarily Posed Human Bodies from Single RGB Images by Liguo Jiang et al
06-25-2021	Interactive Multi-level Stroke Control for Neural Style Transfer by Max Reimann et al
06-24-2021	Fast Monte Carlo Rendering via Multi-Resolution Sampling by Qiqi Hou et al
06-22-2021	Data Augmentation for Opcode Sequence Based Malware Detection by Niall McLaughlin et al
06-22-2021	Evaluation of a Region Proposal Architecture for Multi-task Document Layout Analysis by Lorenzo Quirós et al
06-23-2021	Sentinel-1 and Sentinel-2 Spatio-Temporal Data Fusion for Clouds Removal by Alessandro Sebastianelli et al
06-24-2021	Generalized Unsupervised Clustering of Hyperspectral Images of Geological Targets in the Near Infrared by Angela F. Gao et al
06-22-2021	MEAL: Manifold Embedding-based Active Learning by Deepthi Sreenivasaiah et al
06-22-2021	Confidence-Aware Learning for Camouflaged Object Detection by Jiawei Liu et al
06-22-2021	PatentNet: A Large-Scale Incomplete Multiview, Multimodal, Multilabel Industrial Goods Image Database by Fangyuan Lei et al
06-22-2021	Self-Supervised Iterative Contextual Smoothing for Efficient Adversarial Defense against Gray- and Black-Box Attack by Sungmin Cha et al
06-23-2021	Continuous-Time Deep Glioma Growth Models by Jens Petersen et al
06-24-2021	Handling Data Heterogeneity with Generative Replay in Collaborative Learning for Medical Imaging by Liangqiong Qu et al
06-24-2021	CausalCity: Complex Simulations with Agency for Causal Discovery and Reasoning by Daniel McDuff et al
06-22-2021	Towards Reducing Labeling Cost in Deep Object Detection by Ismail Elezi et al
06-22-2021	nuPlan: A closed-loop ML-based planning benchmark for autonomous vehicles by Holger Caesar et al
06-23-2021	Deformed2Self: Self-Supervised Denoising for Dynamic Medical Imaging by Junshen Xu et al
06-24-2021	Domain-guided Machine Learning for Remotely Sensed In-Season Crop Growth Estimation by George Worrall et al
06-24-2021	FOVQA: Blind Foveated Video Quality Assessment by Yize Jin et al
06-23-2021	Behavior Mimics Distribution: Combining Individual and Group Behaviors for Federated Learning by Hua Huang et al
06-23-2021	Estimating the Robustness of Classification Models by the Structure of the Learned Feature-Space by Kalun Ho et al
06-22-2021	Exploiting Negative Learning for Implicit Pseudo Label Rectification in Source-Free Domain Adaptive Semantic Segmentation by Xin Luo et al
06-25-2021	Efficient Document Image Classification Using Region-Based Graph Neural Network by Jaya Krishna Mandivarapu et al
06-24-2021	Attention Toward Neighbors: A Context Aware Framework for High Resolution Image Segmentation by Fahim Faisal Niloy et al
06-22-2021	Long-term Cross Adversarial Training: A Robust Meta-learning Method for Few-shot Classification Tasks by Fan Liu et al
06-25-2021	Animatable Neural Radiance Fields from Monocular RGB Video by Jianchuan Chen et al
06-22-2021	Learning-Based Practical Light Field Image Compression Using A Disparity-Aware Model by Mohana Singh et al
06-23-2021	APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores by Boyuan Feng et al
06-23-2021	Open Images V5 Text Annotation and Yet Another Mask Text Spotter by Ilya Krylov et al
06-24-2021	Video Super-Resolution with Long-Term Self-Exemplars by Guotao Meng et al
06-25-2021	Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training by Hongwei Xue et al
06-22-2021	Transfer Learning of Deep Spatiotemporal Networks to Model Arbitrarily Long Videos of Seizures by Fernando Pérez-García et al
06-24-2021	Generative Modeling for Multi-task Visual Learning by Zhipeng Bao et al
06-22-2021	A Comparison for Patch-level Classification of Deep Learning Methods on Transparent Images: from Convolutional Neural Networks to Visual Transformers by Hechen Yang et al
06-25-2021	PVTv2: Improved Baselines with Pyramid Vision Transformer by Wenhai Wang et al
06-22-2021	Part-Aware Measurement for Robust Multi-View Multi-Human 3D Pose Estimation and Tracking by Hau Chu et al
06-24-2021	A Systematic Collection of Medical Image Datasets for Deep Learning by Johann Li et al
06-23-2021	IA-RED22: Interpretability-Aware Redundancy Reduction for Vision Transformers by Bowen Pan et al
06-23-2021	Co-advise: Cross Inductive Bias Distillation by Sucheng Ren et al
06-22-2021	PALMAR: Towards Adaptive Multi-inhabitant Activity Recognition in Point-Cloud Technology by Mohammad Arif Ul Alam et al
06-23-2021	Image-to-Image Translation of Synthetic Samples for Rare Classes by Edoardo Lanzini et al
06-24-2021	Semi-supervised Meta-learning with Disentanglement for Domain-generalised Medical Image Segmentation by Xiao Liu et al
06-24-2021	Q-space Conditioned Translation Networks for Directional Synthesis of Diffusion Weighted Images from Multi-modal Structural MRI by Mengwei Ren et al
06-24-2021	Continual Novelty Detection by Rahaf Aljundi et al
06-24-2021	Class agnostic moving target detection by color and location prediction of moving area by Zhuang He et al
06-24-2021	VinDr-SpineXR: A deep learning framework for spinal lesions detection and classification from radiographs by Hieu T. Nguyen et al
06-25-2021	Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering by Long Hoang Dang et al
06-22-2021	Multi-layered Semantic Representation Network for Multi-label Image Classification by Xiwen Qu et al
06-23-2021	A Global Appearance and Local Coding Distortion based Fusion Framework for CNN based Filtering in Video Coding by Jian Yue et al
06-23-2021	Adapting Off-the-Shelf Source Segmenter for Target Medical Image Segmentation by Xiaofeng Liu et al
06-22-2021	LegoFormer: Transformers for Block-by-Block Multi-view 3D Reconstruction by Farid Yagubbayli et al
06-23-2021	Gradient-Based Interpretability Methods and Binarized Neural Networks by Amy Widdicombe et al
06-23-2021	Feature Alignment for Approximated Reversibility in Neural Networks by Tiago de Souza Farias et al
06-24-2021	FitVid: Overfitting in Pixel-Level Video Prediction by Mohammad Babaeizadeh et al
06-25-2021	Video Moment Retrieval with Text Query Considering Many-to-Many Correspondence Using Potentially Relevant Pair by Sho Maeoki et al
06-24-2021	Free-viewpoint Indoor Neural Relighting from Multi-view Stereo by Julien Philip et al
06-23-2021	FoldIt: Haustral Folds Detection and Segmentation in Colonoscopy Videos by Shawn Mathew et al
06-25-2021	A Picture May Be Worth a Hundred Words for Visual Question Answering by Yusuke Hirota et al
06-22-2021	MIMIR: Deep Regression for Automated Analysis of UK Biobank Body MRI by Taro Langner et al
06-24-2021	Self-Supervised Monocular Depth Estimation of Untextured Indoor Rotated Scenes by Benjamin Keltjens et al
06-24-2021	Rate Distortion Characteristic Modeling for Neural Image Compression by Chuanmin Jia et al
06-24-2021	Regularisation for PCA- and SVD-type matrix factorisations by Abdolrahman Khoshrou et al
06-22-2021	Unsupervised Object-Level Representation Learning from Scene Images by Jiahao Xie et al
06-24-2021	AVHYAS: A Free and Open Source QGIS Plugin for Advanced Hyperspectral Image Analysis by Rosly Boy Lyngdoh et al
06-22-2021	RGB2Hands: Real-Time Tracking of 3D Hand Interactions from Monocular RGB Video by Jiayi Wang et al
06-22-2021	HybVIO: Pushing the Limits of Real-time Visual-inertial Odometry by Otto Seiskari et al
06-22-2021	G-VAE, a Geometric Convolutional VAE for ProteinStructure Generation by Hao Huang et al
06-24-2021	RSN: Range Sparse Net for Efficient, Accurate LiDAR 3D Object Detection by Pei Sun et al
06-24-2021	Towards Fully Interpretable Deep Neural Networks: Are We There Yet? by Sandareka Wickramanayake et al
06-23-2021	Region-Aware Network: Model Humans Top-Down Visual Perception Mechanism for Crowd Counting by Yuehai Chen et al
06-22-2021	P2T: Pyramid Pooling Transformer for Scene Understanding by Yu-Huan Wu et al
06-22-2021	On Matrix Factorizations in Subspace Clustering by Reeshad Arian et al
06-24-2021	Energy-Based Generative Cooperative Saliency Prediction by Jing Zhang et al

06-24-2021	To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels by Yuning Chai et al
06-24-2021	Bayesian Eye Tracking by Qiang Ji et al
06-24-2021	Detection of Deepfake Videos Using Long Distance Attention by Wei Lu et al
06-24-2021	MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction by Guozhi Tang et al
06-23-2021	Generative Self-training for Cross-domain Unsupervised Tagged-to-Cine MRI Synthesis by Xiaofeng Liu et al
06-25-2021	Vision Transformer Architecture Search by Xiu Su et al
06-22-2021	Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval by Zhipeng Wang et al
06-23-2021	Fairness in Cardiac MR Image Analysis: An Investigation of Bias Due to Data Imbalance in Deep Learning Based Segmentation by Esther Puyol-Anton et al
06-22-2021	Diabetic Retinopathy Detection using Ensemble Machine Learning by Israa Odeh et al
06-24-2021	Exploring Stronger Feature for Temporal Action Localization by Zhiwu Qing et al
06-22-2021	On the importance of cross-task features for class-incremental learning by Albin Soutif--Cormerais et al
06-24-2021	Differential Morph Face Detection using Discriminative Wavelet Sub-bands by Baaria Chaudhary et al
06-23-2021	Neural Fashion Image Captioning : Accounting for Data Diversity by Gilles Hacheme et al
06-22-2021	The Neurally-Guided Shape Parser: A Monte Carlo Method for Hierarchical Labeling of Over-segmented 3D Shapes by R. Kenny Jones et al
06-22-2021	Team PyKale (xy9) Submission to the EPIC-Kitchens 2021 Unsupervised Domain Adaptation Challenge for Action Recognition by Xianyuan Liu et al
06-24-2021	HAN: An Efficient Hierarchical Self-Attention Network for Skeleton-Based Gesture Recognition by Jianbo Liu et al
06-24-2021	Interpreting Depression From Question-wise Long-term Video Recording of SDS Evaluation by Wanqing Xie et al
06-24-2021	Countering Adversarial Examples: Combining Input Transformation and Noisy Training by Cheng Zhang et al
06-22-2021	Kernel Clustering with Sigmoid-based Regularization for Efficient Segmentation of Sequential Data by Tung Doan et al
06-22-2021	Winning the CVPR2021 Kinetics-GEBD Challenge: Contrastive Learning Approach by Hyolim Kang et al
06-25-2021	Multiview Video Compression Using Advanced HEVC Screen Content Coding by Jarosław Samelak et al
06-23-2021	Florida Wildlife Camera Trap Dataset by Crystal Gagne et al
06-23-2021	STRESS: Super-Resolution for Dynamic Fetal MRI using Self-Supervised Learning by Junshen Xu et al
06-23-2021	Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation by Stephen James et al
06-23-2021	Human Activity Recognition using Continuous Wavelet Transform and Convolutional Neural Networks by Anna Nedorubova et al
06-22-2021	MetaAvatar: Learning Animatable Clothed Human Models from Few Depth Images by Shaofei Wang et al
06-22-2021	RootPainter3D: Interactive-machine-learning enables rapid and accurate contouring for radiotherapy by Abraham George Smith et al
06-22-2021	Fine-Tuning StyleGAN2 For Cartoon Face Generation by Jihye Back
06-22-2021	A Latent Transformer for Disentangled and Identity-Preserving Face Editing by Xu Yao et al
06-24-2021	Unsupervised Deep Image Stitching: Reconstructing Stitched Features to Images by Lang Nie et al
06-22-2021	Hand-Drawn Electrical Circuit Recognition using Object Detection and Node Recognition by Rachala Rohith Reddy et al
06-23-2021	Bootstrap Representation Learning for Segmentation on Medical Volumes and Sequences by Zejian Chen et al
06-25-2021	Connecting Sphere Manifolds Hierarchically for Regularization by Damien Scieur et al
06-23-2021	CxSE: Chest X-ray Slow Encoding CNN forCOVID-19 Diagnosis by Thangarajah Akilan
06-23-2021	Mutual-Information Based Few-Shot Classification by Malik Boudiaf et al
06-23-2021	Topological Semantic Mapping by Consolidation of Deep Visual Features by Ygor C. N. Sousa et al
06-24-2021	Evaluation of deep lift pose models for 3D rodent pose estimation based on geometrically triangulated data by Indrani Sarkar et al
06-25-2021	On the Robustness of Pretraining and Self-Supervision for a Deep Learning-based Analysis of Diabetic Retinopathy by Vignesh Srinivasan et al
06-25-2021	Diversifying Semantic Image Synthesis and Editing via Class- and Layer-wise VAEs by Yuki Endo et al
06-24-2021	A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021 by Ke-Han Lu et al
06-25-2021	Zero Shot Point Cloud Upsampling by Kaiyue Zhou et al
06-22-2021	Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation by Lei Ke et al
06-24-2021	High-resolution Image Registration of Consecutive and Re-stained Sections in Histopathology by Johannes Lotz et al
06-24-2021	Learning by Planning: Language-Guided Global Image Editing by Jing Shi et al
06-23-2021	A Circular-Structured Representation for Visual Emotion Distribution Learning by Jingyuan Yang et al
06-23-2021	Deep unsupervised 3D human body reconstruction from a sparse set of landmarks by Meysam Madadi et al
06-23-2021	A Label Management Mechanism for Retinal Fundus Image Classification of Diabetic Retinopathy by Mengdi Gao et al
06-25-2021	Single Image Texture Translation for Data Augmentation by Boyi Li et al
06-22-2021	Zero-Shot Chinese Character Recognition with Stroke-Level Decomposition by Jingye Chen et al
06-23-2021	Vision Permutator: A Permutable MLP-Like Architecture for Visual Recognition by Qibin Hou et al
06-22-2021	SSUL: Semantic Segmentation with Unknown Label for Exemplar-based Class-Incremental Learning by Sungmin Cha. Beomyoung Kim et al
06-22-2021	Creating A New Color Space utilizing PSO and FCM to Perform Skin Detection by using Neural Network and ANFIS by Kobra Nazaria et al
06-23-2021	A new Video Synopsis Based Approach Using Stereo Camera by Talha Dilber et al
06-24-2021	SGTBN: Generating Dense Depth Maps from Single-Line LiDAR by Hengjie Lu et al
06-24-2021	Relationship between pulmonary nodule malignancy and surrounding pleurae, airways and vessels: a quantitative study using the public LIDC-IDRI dataset by Yulei Qin et al
06-25-2021	SRPN: similarity-based region proposal networks for nuclei and cells detection in histology images by Yibao Sun et al
06-25-2021	Graph Pattern Loss based Diversified Attention Network for Cross-Modal Retrieval by Xueying Chen et al
06-25-2021	Circumpapillary OCT-Focused Hybrid Learning for Glaucoma Grading Using Tailored Prototypical Neural Networks by Gabriel García et al
06-25-2021	A Novel Self-Learning Framework for Bladder Cancer Grading Using Histopathological Images by Gabriel García et al
06-23-2021	Feature Completion for Occluded Person Re-Identification by Ruibing Hou et al
06-23-2021	Multi-Modal 3D Object Detection in Autonomous Driving: a Survey by Yingjie Wang et al
06-23-2021	Frequency Domain Convolutional Neural Network: Accelerated CNN for Large Diabetic Retinopathy Image Classification by Ee Fey Goh et al
06-23-2021	Planetary UAV localization based on Multi-modal Registration with Pre-existing Digital Terrain Model by Xue Wan et al
06-25-2021	Re-parameterizing VAEs for stability by David Dehaene et al
06-25-2021	Projection-wise Disentangling for Fair and Interpretable Representation Learning: Application to 3D Facial Shape Analysis by Xianjing Liu et al
06-23-2021	High-Throughput Precision Phenotyping of Left Ventricular Hypertrophy with Cardiovascular Deep Learning by Grant Duffy et al
06-24-2021	Symmetric Wasserstein Autoencoders by Sun Sun et al
06-23-2021	Multi-Class Classification of Blood Cells -- End to End Computer Vision based diagnosis case study by Sai Sukruth Bezugam
06-22-2021	DocFormer: End-to-End Transformer for Document Understanding by Srikar Appalaraju et al
06-22-2021	Residual Networks as Flows of Velocity Fields for Diffeomorphic Time Series Alignment by Hao Huang et al
06-22-2021	Enhanced Separable Disentanglement for Unsupervised Domain Adaptation by Youshan Zhang et al
06-25-2021	NP-DRAW: A Non-Parametric Structured Latent Variable Modelfor Image Generation by Xiaohui Zeng et al
06-24-2021	Video Swin Transformer by Ze Liu et al
06-24-2021	Depth Confidence-aware Camouflaged Object Detection by Jing Zhang et al
06-22-2021	Volume Rendering of Neural Implicit Surfaces by Lior Yariv et al
06-23-2021	Deep Fake Detection: Survey of Facial Manipulation Detection Solutions by Samay Pashine et al
06-23-2021	Vision-based Behavioral Recognition of Novelty Preference in Pigs by Aniket Shirke et al
06-23-2021	Collaborative Visual Inertial SLAM for Multiple Smart Phones by Jialing Liu et al
06-23-2021	All You Need is a Second Look: Towards Arbitrary-Shaped Text Detection by Meng Cao et al
06-24-2021	DCoM: A Deep Column Mapper for Semantic Data Type Detection by Subhadip Maji et al
06-22-2021	Differentiable Architecture Search Without Training Nor Labels: A Pruning Perspective by Miao Zhang et al
06-23-2021	Handwritten Digit Recognition using Machine and Deep Learning Algorithms by Samay Pashine et al
06-23-2021	ATP-Net: An Attention-based Ternary Projection Network For Compressed Sensing by Guanxiong Nie et al
06-24-2021	AutoAdapt: Automated Segmentation Network Search for Unsupervised Domain Adaptation by Xueqing Deng et al
06-24-2021	HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields by Keunhong Park et al
06-24-2021	A Simple and Strong Baseline: Progressively Region-based Scene Text Removal Networks by Yuxin Wang et al
06-23-2021	Instance-based Vision Transformer for Subtyping of Papillary Renal Cell Carcinoma in Histopathological Image by Zeyu Gao et al
06-25-2021	Shape registration in the time of transformers by Giovanni Trappolini et al
06-22-2021	Tracking Instances as Queries by Shusheng Yang et al
06-24-2021	Exploring Corruption Robustness: Inductive Biases in Vision Transformers and MLP-Mixers by Katelyn Morrison et al
06-24-2021	Unsupervised Learning of Depth and Depth-of-Field Effect from Natural Images with Aperture Rendering Generative Adversarial Networks by Takuhiro Kaneko
06-24-2021	AudioCLIP: Extending CLIP to Image, Text and Audio by Andrey Guzhov et al
06-22-2021	Reachability Analysis of Convolutional Neural Networks by Xiaodong Yang et al
06-22-2021	The Hitchhikers Guide to Prior-Shift Adaptation by Tomas Sipka et al
06-24-2021	GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes by Youssef A. Mejjati et al
06-23-2021	Learning from Pseudo Lesion: A Self-supervised Framework for COVID-19 Diagnosis by Zhongliang Li et al
06-23-2021	FusionPainting: Multimodal Fusion with Adaptive Attention for 3D Object Detection by Shaoqing Xu et al
06-22-2021	Towards Consistent Predictive Confidence through Fitted Ensembles by Navid Kardan et al
06-24-2021	Sparse Needlets for Lighting Estimation with Spherical Transport Loss by Fangneng Zhan et al

Craig SmithJune 28, 2021