2021.9.20 Vision papers

09-16-2021	DisUnknown: Distilling Unknown Factors for Disentanglement Learning by Sitao Xiang et al
09-15-2021	Integrating Sensing and Communication in Cellular Networks via NR Sidelink by Dariush Salami et al
09-14-2021	Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning by Da Yin et al
09-17-2021	Cross Modification Attention Based Deliberation Model for Image Captioning by Zheng Lian et al
09-17-2021	Transformer-Unet: Raw Image Processing with Unet by Youyang Sha et al
09-14-2021	Improved Few-shot Segmentation by Redefinition of the Roles of Multi-level CNN Features by Zhijie Wang et al
09-16-2021	Adaptive Hierarchical Dual Consistency for Semi-Supervised Left Atrium Segmentation on Cross-Domain Data by Jun Chen et al
09-16-2021	MHFC: Multi-Head Feature Collaboration for Few-Shot Learning by Shuai Shao et al
09-16-2021	Towards Non-Line-of-Sight Photography by Jiayong Peng et al
09-16-2021	Marginal MAP Estimation for Inverse RL under Occlusion with Observer Noise by Prasanth Sengadu Suresh et al
09-17-2021	Pointly-supervised 3D Scene Parsing with Viewpoint Bottleneck by Liyi Luo et al
09-16-2021	Stereo Video Reconstruction Without Explicit Depth Maps for Endoscopic Surgery by Annika Brundyn et al
09-14-2021	Multi-Scale Aligned Distillation for Low-Resolution Detection by Lu Qi et al
09-14-2021	The pitfalls of using open data to develop deep learning solutions for COVID-19 detection in chest X-rays by Rachael Harkness et al
09-14-2021	Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer by Fushun Zhu et al
09-15-2021	Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers by Angel Martínez-González et al
09-16-2021	Raising context awareness in motion forecasting by Hédi Ben-Younes et al
09-16-2021	A Machine Learning Framework for Automatic Prediction of Human Semen Motility by Sandra Ottl et al
09-16-2021	Eformer: Edge Enhancement based Transformer for Medical Image Denoising by Achleshwar Luthra et al
09-16-2021	Mass Segmentation in Automated 3-D Breast Ultrasound Using Dual-Path U-net by Hamed Fayyaz et al
09-15-2021	MD-CSDNetwork: Multi-Domain Cross Stitched Network for Deepfake Detection by Aayushi Agarwal et al
09-14-2021	ZFlow: Gated Appearance Flow-based Virtual Try-on with 3D Priors by Ayush Chopra et al
09-14-2021	Identifying partial mouse brain microscopy images from Allen reference atlas using a contrastively learned semantic space by Justinas Antanavicius et al
09-16-2021	Semi-Supervised Visual Representation Learning for Fashion Compatibility by Ambareesh Revanur et al
09-16-2021	Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning by Shikha Dubey et al
09-16-2021	A computationally efficient framework for vector representation of persistence diagrams by Kit C. Chan et al
09-14-2021	Luminance Attentive Networks for HDR Image and Panorama Reconstruction by Hanning Yu et al
09-15-2021	OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication by Runsheng Xu et al
09-16-2021	Resolution based Feature Distillation for Cross Resolution Person Re-Identification by Asad Munir et al
09-15-2021	Anchor DETR: Query Design for Transformer-Based Detector by Yingming Wang et al
09-17-2021	Messing Up 3D Virtual Environments: Transferable Adversarial 3D Objects by Enrico Meloni et al
09-15-2021	DSOR: A Scalable Statistical Filter for Removing Falling Snow from LiDAR Point Clouds in Severe Winter Weather by Akhil Kurup et al
09-15-2021	Hybrid Local-Global Transformer for Image Dehazing by Dong Zhao et al
09-14-2021	High-Resolution Image Harmonization via Collaborative Dual Transformations by Wenyan Cong et al
09-16-2021	LoGG3D-Net: Locally Guided Global Descriptor Learning for 3D Place Recognition by Kavisha Vidanapathirana et al
09-16-2021	Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs by Gabriel Moreira et al
09-14-2021	One-Class Meta-Learning: Towards Generalizable Few-Shot Open-Set Classification by Jedrzej Kozerawski et al
09-16-2021	Torch.manual_seed(3407) is all you need: On the influence of random seeds in deep learning architectures for computer vision by David Picard
09-14-2021	LRWR: Large-Scale Benchmark for Lip Reading in Russian language by Evgeniy Egorov et al
09-17-2021	Diverse Generation from a Single Video Made Possible by Niv Haim et al
09-16-2021	Urdu text in natural scene images: a new dataset and preliminary text detection by Hazrat Ali et al
09-16-2021	Invertable Frowns: Video-to-Video Facial Emotion Translation by Ian Magnusson et al
09-15-2021	Navigation-Oriented Scene Understanding for Robotic Autonomy: Learning to Segment Driveability in Egocentric Images by Galadrielle Humblot-Renaux et al
09-15-2021	F-CAM: Full Resolution CAM via Guided Parametric Upscaling by Soufiane Belharbi et al
09-16-2021	Neural Network Based Lidar Gesture Recognition for Realtime Robot Teleoperation by Simon Chamorro et al
09-16-2021	An End-to-End Transformer Model for 3D Object Detection by Ishan Misra et al
09-16-2021	Compact Binary Fingerprint for Image Copy Re-Ranking by Nazar Mohammad et al
09-17-2021	Self-Supervised Neural Architecture Search for Imbalanced Datasets by Aleksandr Timofeev et al
09-16-2021	Aesthetics and neural network image representations by Romuald A. Janik
09-17-2021	GoG: Relation-aware Graph-over-Graph Network for Visual Dialog by Feilong Chen et al
09-15-2021	Distract Your Attention: Multi-head Cross Attention Network for Facial Expression Recognition by Zhengyao Wen et al
09-14-2021	Image Synthesis via Semantic Composition by Yi Wang et al
09-15-2021	FFAVOD: Feature Fusion Architecture for Video Object Detection by Hughes Perreault et al
09-15-2021	New Perspective on Progressive GANs Distillationfor One-class Novelty Detection by Zhiwei Zhang et al
09-16-2021	Label Assignment Distillation for Object Detection by Minghao Gao et al
09-14-2021	Spiking Neural Networks for Visual Place Recognition via Weighted Neuronal Assignments by Somayeh Hussaini et al
09-16-2021	A Medical Pre-Diagnosis System for Histopathological Image of Breast Cancer by Shiyu Fan et al
09-16-2021	SketchHairSalon: Deep Sketch-based Hair Image Synthesis by Chufeng Xiao et al
09-14-2021	Space Time Recurrent Memory Network by Hung Nguyen et al
09-15-2021	PointManifoldCut: Point-wise Augmentation in the Manifold for Point Clouds by Tianfang Zhu et al
09-15-2021	UCP-Net: Unstructured Contour Points for Instance Segmentation by Camille Dupont et al
09-16-2021	Are we ready for beyond-application high-volume data? The Reeds robot perception benchmark dataset by Ola Benderius et al
09-14-2021	Uncertainty Quantification in Medical Image Segmentation with Multi-decoder U-Net by Yanwu Yang et al
09-16-2021	Dense Pruning of Pointwise Convolutions in the Frequency Domain by Mark Buckler et al
09-14-2021	Image-Based Alignment of 3D Scans by Dolores Messer et al
09-14-2021	Cross-Region Domain Adaptation for Class-level Alignment by Zhijie Wang et al
09-14-2021	COVID-Net MLSys: Designing COVID-Net for the Clinical Workflow by Audrey G. Chung et al
09-16-2021	Humanly Certifying Superhuman Classifiers by Qiongkai Xu et al
09-14-2021	Dense Deep Unfolding Network with 3D-CNN Prior for Snapshot Compressive Imaging by Zhuoyuan Wu et al
09-15-2021	FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition by Bonaventure F. P. Dossou et al
09-15-2021	RGB-D Saliency Detection via Cascaded Mutual Information Minimization by Jing Zhang et al
09-15-2021	A Framework for Multisensory Foresight for Embodied Agents by Xiaohui Chen et al
09-17-2021	Including Keyword Position in Image-based Models for Act Segmentation of Historical Registers by Mélodie Boillet et al
09-15-2021	Partner-Assisted Learning for Few-Shot Image Classification by Jiawei Ma et al
09-15-2021	Patch-based medical image segmentation using Quantum Tensor Networks by Raghavendra Selvan et al
09-17-2021	ActionCLIP: A New Paradigm for Video Action Recognition by Mengmeng Wang et al
09-17-2021	CardiSort: a convolutional neural network for cross vendor automated sorting of cardiac MR images by Ruth P Lim et al
09-17-2021	Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation by Feilong Chen et al
09-17-2021	Realistic PointGoal Navigation via Auxiliary Losses and Information Bottleneck by Guillermo Grande et al
09-17-2021	What we see and What we dont see: Imputing Occluded Crowd Structures from Robot Sensing by Javad Amirian et al
09-16-2021	A Comparative Study of Machine Learning Methods for Predicting the Evolution of Brain Connectivity from a Baseline Timepoint by Şeymanur Aktı et al
09-14-2021	Dodging Attack Using Carefully Crafted Natural Makeup by Nitzan Guetta et al
09-17-2021	Semantic Snapping for Guided Multi-View Visualization Design by Yngve S. Kristiansen et al
09-16-2021	Evaluating Continual Learning Algorithms by Generating 3D Virtual Environments by Enrico Meloni et al
09-16-2021	Explainability Requires Interactivity by Matthias Kirchler et al
09-16-2021	Towards agricultural autonomy: crop row detection under varying field conditions using deep learning by Rajitha de Silva et al
09-16-2021	A Survey on Temporal Sentence Grounding in Videos by Xiaohan Lan et al
09-14-2021	PnP-DETR: Towards Efficient Visual Analysis with Transformers by Tao Wang et al
09-15-2021	Neural Architecture Search in operational context: a remote sensing case-study by Anthony Cazasnoves et al
09-15-2021	Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering by Ander Salaberria et al
09-15-2021	SPIN Road Mapper: Extracting Roads from Aerial Images via Spatial and Interaction Space Graph Reasoning for Autonomous Driving by Wele Gedara Chaminda Bandara et al
09-15-2021	ROS-X-Habitat: Bridging the ROS Ecosystem with Embodied AI by Guanxiong Chen et al
09-15-2021	Neural Human Performer: Learning Generalizable Radiance Fields for Human Performance Rendering by Youngjoong Kwon et al
09-16-2021	Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI by Santhosh K. Ramakrishnan et al
09-14-2021	Multi-Scale Input Strategies for Medulloblastoma Tumor Classification using Deep Transfer Learning by Marcel Bengs et al
09-14-2021	3-Dimensional Deep Learning with Spatial Erasing for Unsupervised Anomaly Segmentation in Brain MRI by Marcel Bengs et al
09-14-2021	Multi-Level Features Contrastive Networks for Unsupervised Domain Adaptation by Le Liu et al
09-17-2021	Bio-Inspired Audio-Visual Cues Integration for Visual Attention Prediction by Yuan Yuan et al
09-14-2021	A trainable monogenic ConvNet layer robust in front of large contrast changes in image classification by E. Ulises Moya-Sánchez et al
09-17-2021	PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering by Yurui Ren et al
09-16-2021	DeepMTS: Deep Multi-task Learning for Survival Prediction in Patients with Advanced Nasopharyngeal Carcinoma using Pretreatment PET/CT by Mingyuan Meng et al
09-15-2021	Contact-Aware Retargeting of Skinned Motion by Ruben Villegas et al
09-14-2021	Sampling Network Guided Cross-Entropy Method for Unsupervised Point Cloud Registration by Haobo Jiang et al
09-16-2021	A Divide-and-Merge Point Cloud Clustering Algorithm for LiDAR Panoptic Segmentation by Yiming Zhao et al
09-15-2021	RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching by Lahav Lipson et al
09-15-2021	Learning the Regularization in DCE-MR Image Reconstruction for Functional Imaging of Kidneys by Aziz Koçanaoğulları et al
09-15-2021	Resolution-robust Large Mask Inpainting with Fourier Convolutions by Roman Suvorov et al
09-15-2021	3D Annotation Of Arbitrary Objects In The Wild by Kenneth Blomqvist et al
09-14-2021	Tesla-Rapture: A Lightweight Gesture Recognition System from mmWave Radar Point Clouds by Dariush Salami et al

09-16-2021	Lifting 2D Object Locations to 3D by Discounting LiDAR Outliers across Objects and Views by Robert McCraith et al
09-16-2021	Exploiting Activation based Gradient Output Sparsity to Accelerate Backpropagation in CNNs by Anup Sarma et al
09-17-2021	Autonomous Vision-based UAV Landing with Collision Avoidance using Deep Learning by Tianpei Liao et al
09-16-2021	Heterogeneous Relational Complement for Vehicle Re-identification by Jiajian Zhao et al
09-16-2021	Automated risk classification of colon biopsies based on semantic segmentation of histopathology images by John-Melle Bokhorsta et al
09-16-2021	Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged Photos by Ling Chen et al
09-14-2021	Multi-modal Wound Classification using Wound Image and Location by Deep Neural Network by D. M. Anisuzzaman et al
09-16-2021	Detection Accuracy for Evaluating Compositional Explanations of Units by Sayo M. Makinwa et al
09-15-2021	A Wide-area, Low-latency, and Power-efficient 6-DoF Pose Tracking System for Rigid Objects by Young-Ho Kim et al
09-17-2021	Expression Snippet Transformer for Robust Video-based Facial Expression Recognition by Yuanyuan Liu et al
09-15-2021	A Pathology Deep Learning System Capable of Triage of Melanoma Specimens Utilizing Dermatopathologist Consensus as Ground Truth by Sivaramakrishnan Sankarapandian et al
09-15-2021	Hybrid ICP by Kamil Dreczkowski et al
09-14-2021	Anomaly Attribution of Multivariate Time Series using Counterfactual Reasoning by Violeta Teodora Trifunov et al
09-15-2021	Direct and Sparse Deformable Tracking by Jose Lamarca et al
09-15-2021	A Unified Framework for Biphasic Facial Age Translation with Noisy-Semantic Guided Generative Adversarial Networks by Muyi Sun et al
09-15-2021	MISSFormer: An Effective Medical Image Segmentation Transformer by Xiaohong Huang et al
09-16-2021	Few-Shot Object Detection by Attending to Per-Sample-Prototype by Hojun Lee et al
09-16-2021	KATANA: Simple Post-Training Robustness Using Test Time Augmentations by Gilad Cohen et al
09-16-2021	Quality-aware Cine Cardiac MRI Reconstruction and Analysis from Undersampled k-space Data by Ines Machado et al
09-16-2021	Real Time Monocular Vehicle Velocity Estimation using Synthetic Data by Robert McCraith et al
09-16-2021	Overview of Tencent Multi-modal Ads Video Understanding Challenge by Zhenzhi Wang et al
09-16-2021	Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection by Meiling Fang et al
09-14-2021	Focus on Impact: Indoor Exploration with Intrinsic Motivation by Roberto Bigazzi et al
09-14-2021	High-Fidelity GAN Inversion for Image Attribute Editing by Tengfei Wang et al
09-15-2021	Dynamic Fusion Network for RGBT Tracking by Jingchao Peng et al
09-14-2021	MotionHint: Self-Supervised Monocular Visual Odometry with Motion Constraints by Cong Wang et al
09-14-2021	Hardware-aware Real-time Myocardial Segmentation Quality Control in Contrast Echocardiography by Dewen Zeng et al
09-15-2021	Semi-supervised Contrastive Learning for Label-efficient Medical Image Segmentation by Xinrong Hu et al
09-16-2021	Harnessing Perceptual Adversarial Patches for Crowd Counting by Shunchang Liu et al
09-16-2021	TANet: A new Paradigm for Global Face Super-resolution via Transformer-CNN Aggregation Network by Yuanzhi Wang et al
09-15-2021	Predicting 3D shapes, masks, and properties of materials, liquids, and objects inside transparent containers, using the TransProteus CGI dataset by Sagi Eppel et al
09-14-2021	A Semantic Indexing Structure for Image Retrieval by Ying Wang et al
09-15-2021	Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos by Junhao Zhang et al
09-14-2021	ImUnity: a generalizable VAE-GAN solution for multicenter MR image harmonization by Stenzel Cackowski et al
09-14-2021	Dynamic Attentive Graph Learning for Image Restoration by Chong Mou et al
09-16-2021	Dense Semantic Contrast for Self-Supervised Visual Representation Learning by Xiaoni Li et al
09-16-2021	Mask-Guided Feature Extraction and Augmentation for Ultra-Fine-Grained Visual Categorization by Zicheng Pan et al
09-16-2021	End-to-End Partially Observable Visual Navigation in a Diverse Environment by Bo Ai et al
09-16-2021	ObjectFolder: A Dataset of Objects with Implicit Visual, Auditory, and Tactile Representations by Ruohan Gao et al
09-15-2021	Learning to Aggregate and Refine Noisy Labels for Visual Sentiment Analysis by Wei Zhu et al
09-15-2021	Federated Contrastive Learning for Decentralized Unlabeled Medical Images by Nanqing Dong et al
09-15-2021	Progressive Hard-case Mining across Pyramid Levels in Object Detection by Binghong Wu et al
09-15-2021	DeFungi: Direct Mycological Examination of Microscopic Fungi Images by Camilo Javier Pineda Sopo et al
09-16-2021	Context-aware Padding for Semantic Segmentation by Yu-Hui Huang et al
09-15-2021	Sign-MAML: Efficient Model-Agnostic Meta-Learning by SignSGD by Chen Fan et al
09-15-2021	A Multi-Task Cross-Task Learning Architecture for Ad-hoc Uncertainty Estimation in 3D Cardiac MRI Image Segmentation by S. M. Kamrul Hasan et al
09-16-2021	Neural \{E}tendue Expander for Ultra-Wide-Angle High-Fidelity Holographic Display by Seung-Hwan Baek et al
09-17-2021	LOF: Structure-Aware Line Tracking based on Optical Flow by Meixiang Quan et al
09-14-2021	Automatic hippocampal surface generation via 3D U-net and active shape modeling with hybrid particle swarm optimization by Pinyuan Zhong et al
09-15-2021	FCA: Learning a 3D Full-coverage Vehicle Camouflage for Multi-view Physical Adversarial Attack by DonghuaWang et al
09-16-2021	Generating Dataset For Large-scale 3D Facial Emotion Recognition by Faizan Farooq Khan et al
09-17-2021	A review of deep learning methods for MRI reconstruction by Arghya Pal et al
09-15-2021	S3LAM: Structured Scene SLAM by Mathieu Gonzalez et al
09-16-2021	M2RNet: Multi-modal and Multi-scale Refined Network for RGB-D Salient Object Detection by Xian Fang et al
09-14-2021	Seeking an Optimal Approach for Computer-Aided Pulmonary Embolism Detection by Nahid Ul Islam et al
09-15-2021	Deep Bregman Divergence for Contrastive Learning of Visual Representations by Mina Rezaei et al
09-17-2021	GraFormer: Graph Convolution Transformer for 3D Pose Estimation by Weixi Zhao et al
09-15-2021	METEOR: A Massive Dense & Heterogeneous Behavior Dataset for Autonomous Driving by Rohan Chandra et al
09-14-2021	A Deep Learning Approach for Masking Fetal Gender in Ultrasound Images by Amit Borundiya et al

Craig SmithSeptember 20, 2021