2022.2.28 Vision papers

02-24-2022	Self-Distilled StyleGAN: Towards Generation from Internet Photos by Ron Mokady et al
02-23-2022	Self-Supervised Transformers for Unsupervised Object Discovery using Normalized Cut by Yangtao Wang et al
02-24-2022	FreeSOLO: Learning to Segment Objects without Annotations by Xinlong Wang et al
02-23-2022	Near Perfect GAN Inversion by Qianli Feng et al
02-23-2022	Diffractive optical system design by cascaded propagation by Boris Ferdman et al
02-23-2022	CAISE: Conversational Agent for Image Search and Editing by Hyounghun Kim et al
02-24-2022	Auto-scaling Vision Transformers without Training by Wuyang Chen et al
02-23-2022	Paying U-Attention to Textures: Multi-Stage Hourglass Vision Transformer for Universal Texture Synthesis by Shouchang Guo et al
02-24-2022	Learning to Merge Tokens in Vision Transformers by Cedric Renggli et al
02-24-2022	Phrase-Based Affordance Detection via Cyclic Bilateral Interaction by Liangsheng Lu et al
02-22-2022	Retrieval Augmented Classification for Long-Tail Visual Recognition by Alexander Long et al
02-23-2022	Commonsense Reasoning for Identifying and Understanding the Implicit Need of Help and Synthesizing Assistive Actions by Maëlic Neau et al
02-24-2022	Retriever: Learning Content-Style Representation as a Token-Level Bipartite Graph by Dacheng Yin et al
02-23-2022	Explanatory Paradigms in Neural Networks by Ghassan AlRegib et al
02-23-2022	Are We Ready for Robust and Resilient SLAM? A Framework For Quantitative Characterization of SLAM Datasets by Islam Ali et al
02-23-2022	A spectral-spatial fusion anomaly detection method for hyperspectral imagery by Zengfu Hou et al
02-23-2022	When do GANs replicate? On the choice of dataset size by Qianli Feng et al
02-23-2022	Thermal hand image segmentation for biometric recognition by Xavier Font-Aragones et al
02-23-2022	Augmentation based unsupervised domain adaptation by Mauricio Orbes-Arteaga et al
02-23-2022	Improving Robustness of Convolutional Neural Networks Using Element-Wise Activation Scaling by Zhi-Yuan Zhang et al
02-23-2022	Weakly-supervised learning for image-based classification of primary melanomas into genomic immune subgroups by Lucy Godson et al
02-24-2022	When Transformer Meets Robotic Grasping: Exploits Context for Efficient Grasp Detection by Shaochen Wang et al
02-24-2022	Assessing generalisability of deep learning-based polyp detection and segmentation methods through a computer vision challenge by Sharib Ali et al
02-23-2022	Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation by Shizhe Chen et al
02-23-2022	A Note on Machine Learning Approach for Computational Imaging by Bin Dong
02-23-2022	M2I: From Factored Marginal Trajectory Prediction to Interactive Prediction by Qiao Sun et al
02-23-2022	Controlling Memorability of Face Images by Mohammad Younesi et al
02-23-2022	New Benchmark for Household Garbage Image Recognition by Zhize Wu et al
02-24-2022	Rare Gems: Finding Lottery Tickets at Initialization by Kartik Sreenivasan et al
02-23-2022	Discovering Multiple and Diverse Directions for Cognitive Image Properties by Umut Kocasari et al
02-23-2022	Learning Multi-Object Dynamics with Compositional Neural Radiance Fields by Danny Driess et al
02-23-2022	MITI: SLAM Benchmark for Laparoscopic Surgery by Regine Hartwig et al
02-23-2022	ISDA: Position-Aware Instance Segmentation with Deformable Attention by Kaining Ying et al
02-22-2022	Learning from the Pros: Extracting Professional Goalkeeper Technique from Broadcast Footage by Matthew Wear et al
02-23-2022	EcoFusion: Energy-Aware Adaptive Sensor Fusion for Efficient Autonomous Vehicle Perception by Arnav Vaibhav Malawade et al
02-23-2022	Reconstruction Task Finds Universal Winning Tickets by Ruichen Li et al
02-24-2022	Towards Effective and Robust Neural Trojan Defenses via Input Filtering by Kien Do et al
02-23-2022	Art Creation with Multi-Conditional StyleGANs by Konstantin Dobler et al
02-22-2022	The Winning Solution to the iFLYTEK Challenge 2021 Cultivated Land Extraction from High-Resolution Remote Sensing Image by Zhen Zhao et al
02-23-2022	Visual-tactile sensing for Real-time liquid Volume Estimation in Grasping by Fan Zhu et al
02-23-2022	CG-SSD: Corner Guided Single Stage 3D Object Detection from LiDAR Point Cloud by Ruiqi Ma et al
02-22-2022	Roto-Translation Equivariant Super-Resolution of Two-Dimensional Flows Using Convolutional Neural Networks by Yuki Yasuda
02-23-2022	HMD-EgoPose: Head-Mounted Display-Based Egocentric Marker-Less Tool and Hand Pose Estimation for Augmented Surgical Guidance by Mitchell Doughty et al
02-23-2022	Multi-Teacher Knowledge Distillation for Incremental Implicitly-Refined Classification by Longhui Yu et al
02-25-2022	Local Intensity Order Transformation for Robust Curvilinear Object Segmentation by Tianyi Shi et al
02-25-2022	ARIA: Adversarially Robust Image Attribution for Content Provenance by Maksym Andriushchenko et al
02-24-2022	A Transformer-based Network for Deformable Medical Image Registration by Yibo Wang et al
02-23-2022	A modification of the conjugate direction method for motion estimation by Marcos Faundez-Zanuy et al
02-23-2022	On PAC-Bayesian reconstruction guarantees for VAEs by Badr-Eddine Chérief-Abdellatif et al
02-24-2022	Provable Stochastic Optimization for Global Contrastive Learning: Small Batch Does Not Harm Performance by Zhuoning Yuan et al
02-23-2022	On-line signature verification system with failure to enroll managing by Joan Fabregas et al
02-24-2022	Data variation-aware medical image segmentation by Arkadiy Dushatskiy et al
02-23-2022	RadioTransformer: A Cascaded Global-Focal Transformer for Visual Attention-guided Disease Classification by Moinak Bhattacharya et al
02-25-2022	Learning to Identify Perceptual Bugs in 3D Video Games by Benedict Wilkins et al
02-24-2022	Factorizer: A Scalable Interpretable Approach to Context Modeling for Medical Image Segmentation by Pooya Ashtari et al
02-23-2022	Deep Metric Learning-Based Semi-Supervised Regression With Alternate Learning by Adina Zell et al
02-24-2022	Structure-aware Unsupervised Tagged-to-Cine MRI Synthesis with Self Disentanglement by Xiaofeng Liu et al
02-24-2022	Monogenic Wavelet Scattering Network for Texture Image Classification by Wai Ho Chak et al
02-24-2022	A novel unsupervised covid lung lesion segmentation based on the lung tissue identification by Faeze Gholamian Khah et al
02-24-2022	Slow-Fast Visual Tempo Learning for Video-based Action Recognition by Yuanzhong Liu et al
02-22-2022	LPF-Defense: 3D Adversarial Defense based on Frequency Analysis by Hanieh Naderi et al
02-24-2022	Learn From the Past: Experience Ensemble Knowledge Distillation by Chaofei Wang et al
02-23-2022	Deep Bayesian ICP Covariance Estimation by Andrea De Maio et al
02-23-2022	A Method for Waste Segregation using Convolutional Neural Networks by Jash Shah et al
02-24-2022	Transformers in Medical Image Analysis: A Review by Kelei He et al
02-23-2022	EMOTHAW: A novel database for emotional state recognition from handwriting by Laurence Likforman-Sulem et al
02-24-2022	Interpolation-based Contrastive Learning for Few-Label Semi-Supervised Learning by Xihong Yang et al
02-25-2022	Data refinement for fully unsupervised visual inspection using pre-trained networks by Antoine Cordier et al
02-23-2022	SLRNet: Semi-Supervised Semantic Segmentation Via Label Reuse for Human Decomposition Images by Sara Mousavi et al
02-24-2022	Uncertainty-driven Planner for Exploration and Navigation by Georgios Georgakis et al
02-25-2022	Active Learning for Point Cloud Semantic Segmentation via Spatial-Structural Diversity Reasoning by Feifei Shao et al
02-25-2022	An exploration of the performances achievable by combining unsupervised background subtraction algorithms by Sébastien Piérard et al
02-24-2022	Measuring CLEVRness: Blackbox testing of Visual Reasoning Models by Spyridon Mouselinos et al
02-24-2022	AFFDEX 2.0: A Real-Time Facial Expression Analysis Toolkit by Mina Bishay et al
02-23-2022	Nuclei panoptic segmentation and composition regression with multi-task deep neural networks by Satoshi Kondo et al
02-25-2022	Confidence Calibration for Object Detection and Segmentation by Fabian Küppers et al
02-23-2022	Image Classification on Small Datasets via Masked Feature Mixing by Christoph Reinders et al
02-23-2022	Absolute Zero-Shot Learning by Rui Gao et al
02-25-2022	6D Rotation Representation For Unconstrained Head Pose Estimation by Thorsten Hempel et al
02-24-2022	SMILE: Sequence-to-Sequence Domain Adaption with Minimizing Latent Entropy for Text Image Recognition by Yen-Cheng Chang et al
02-24-2022	Domain Disentangled Generative Adversarial Network for Zero-Shot Sketch-Based 3D Shape Retrieval by Rui Xu et al
02-23-2022	Amodal Panoptic Segmentation by Rohit Mohan et al
02-23-2022	Multi-scale Sparse Representation-Based Shadow Inpainting for Retinal OCT Images by Yaoqi Tang et al
02-23-2022	Localizing Small Apples in Complex Apple Orchard Environments by Christian Wilms et al
02-25-2022	Predicting 4D Liver MRI for MR-guided Interventions by Gino Gulamhussene et al
02-25-2022	An Ensemble Approach for Patient Prognosis of Head and Neck Tumor Using Multimodal Data by Numan Saeed et al
02-24-2022	Fully Self-Supervised Learning for Semantic Segmentation by Yuan Wang et al
02-24-2022	N-QGN: Navigation Map from a Monocular Camera using Quadtree Generating Networks by Daniel Braun et al
02-25-2022	LF-VIO: A Visual-Inertial-Odometry Framework for Large Field-of-View Cameras with Negative Plane by Ze Wang et al
02-25-2022	RRL: Regional Rotation Layer in Convolutional Neural Networks by Zongbo Hao et al
02-24-2022	Computer Aided Diagnosis and Out-of-Distribution Detection in Glaucoma Screening Using Color Fundus Photography by Satoshi Kondo et al
02-25-2022	A Novel Hand Gesture Detection and Recognition system based on ensemble-based Convolutional Neural Network by Abir Sen et al
02-25-2022	TeachAugment: Data Augmentation Optimization Using Teacher Knowledge by Teppei Suzuki
02-23-2022	ProFormer: Learning Data-efficient Representations of Body Movement with Prototype-based Feature Augmentation and Visual Transformers by Kunyu Peng et al
02-23-2022	Mixed-Block Neural Architecture Search for Medical Image Segmentation by Martijn M. A. Bosma et al
02-23-2022	Human Motion Detection Using Sharpened Dimensionality Reduction and Clustering by Jeewon Heo et al
02-23-2022	Anomaly Detection in 3D Point Clouds using Deep Geometric Descriptors by Paul Bergmann et al
02-25-2022	Implicit Optimizer for Diffeomorphic Image Registration by Kun Han et al
02-25-2022	Sensing accident-prone features in urban scenes for proactive driving and accident prevention by Sumit Mishra et al
02-22-2022	An End-to-End Cascaded Image Deraining and Object Detection Neural Network by Kaige Wang et al
02-24-2022	DeepFusionMOT: A 3D Multi-Object Tracking Framework Based on Camera-LiDAR Fusion with Deep Association by Xiyang Wang et al
02-25-2022	On Modality Bias Recognition and Reduction by Yangyang Guo et al
02-25-2022	Faithful learning with sure data for lung nodule diagnosis by Hanxiao Zhang et al
02-22-2022	Reliable Inlier Evaluation for Unsupervised Point Cloud Registration by Yaqi Shen et al
02-23-2022	A comparative study of in-air trajectories at short and long distances in online handwriting by Carlos Alonso-Martinez et al
02-25-2022	Reconstruction of Perceived Images from fMRI Patterns and Semantic Brain Exploration using Instance-Conditioned GANs by Furkan Ozcelik et al
02-23-2022	SLOGAN: Handwriting Style Synthesis for Arbitrary-Length and Out-of-Vocabulary Text by Canjie Luo et al
02-25-2022	RELMOBNET: A Robust Two-Stage End-To-End Training Approach For MOBILENETV3 Based Relative Camera Pose Estimation by Praveen Kumar Rajendran et al
02-24-2022	Light Robust Monocular Depth Estimation For Outdoor Environment Via Monochrome And Color Camera Fusion by Hyeonsoo Jang et al
02-24-2022	TwistSLAM: Constrained SLAM in Dynamic Environment by Mathieu Gonzalez et al
02-25-2022	Improving Amharic Handwritten Word Recognition Using Auxiliary Task by Mesay Samuel Gondere et al
02-25-2022	NeuralFusion: Neural Volumetric Rendering under Human-object Interactions by Yuheng Jiang et al
02-25-2022	Improving generalization with synthetic training data for deep learning based quality inspection by Antoine Cordier et al
02-23-2022	Deepfake Detection for Facial Images with Facemasks by Donggeun Ko et al
02-25-2022	Deep Dirichlet uncertainty for unsupervised out-of-distribution detection of eye fundus photographs in glaucoma screening by Teresa Araújo et al
02-23-2022	Skeleton Sequence and RGB Frame Based Multi-Modality Feature Fusion Network for Action Recognition by Xiaoguang Zhu et al
02-22-2022	Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning by Hao He et al
02-24-2022	Effective Actor-centric Human-object Interaction Detection by Kunlun Xu et al
02-25-2022	Towards Safe, Real-Time Systems: Stereo vs Images and LiDAR for 3D Object Detection by Matthew Levine
02-25-2022	Improved Dual Correlation Reduction Network by Yue Liu et al
02-25-2022	Joint Answering and Explanation for Visual Commonsense Reasoning by Zhenyang Li et al
02-24-2022	Understanding Adversarial Robustness from Feature Maps of Convolutional Layers by Cong Xu et al
02-24-2022	GIAOTracker: A comprehensive framework for MCMOT with global information and optimizing strategies in VisDrone 2021 by Yunhao Du et al
02-23-2022	A Novel Self-Supervised Cross-Modal Image Retrieval Method In Remote Sensing by Gencer Sumbul et al
02-22-2022	FUNQUE: Fusion of Unified Quality Evaluators by Abhinau K. Venkataramanan et al
02-22-2022	Evaluating Feature Attribution Methods in the Image Domain by Arne Gevaert et al
02-23-2022	Synthesizing Photorealistic Images with Deep Generative Learning by Chuanxia Zheng
02-24-2022	The effect of fatigue on the performance of online writer recognition by Enric Sesa-Nogueras et al
02-24-2022	Online handwriting, signature and touch dynamics: tasks and potential applications in the field of security and health by Marcos Faundez-Zanuy et al
02-24-2022	Fourier-Based Augmentations for Improved Robustness and Uncertainty Calibration by Ryan Soklaski et al
02-24-2022	Learning Transferable Reward for Query Object Localization with Policy Adaptation by Tingfeng Li et al
02-24-2022	Efficient Video Segmentation Models with Per-frame Inference by Yifan Liu et al
02-24-2022	Analyzing Human Observer Ability in Morphing Attack Detection -- Where Do We Stand? by Sankini Rancha Godage et al
02-22-2022	Enabling Efficient Deep Convolutional Neural Network-based Sensor Fusion for Autonomous Driving by Xiaoming Zeng et al
02-24-2022	Optimal channel selection with discrete QCQP by Yeonwoo Jeong et al
02-22-2022	Learning with Free Object Segments for Long-Tailed Instance Segmentation by Cheng Zhang et al
02-24-2022	Instantaneous Physiological Estimation using Video Transformers by Ambareesh Revanur et al
02-24-2022	On Monocular Depth Estimation and Uncertainty Quantification using Classification Approaches for Regression by Xuanlong Yu et al
02-24-2022	Highly-Efficient Binary Neural Networks for Visual Place Recognition by Bruno Ferrarini et al
02-22-2022	Arbitrary Shape Text Detection using Transformers by Zobeir Raisi et al
02-24-2022	Time Efficient Training of Progressive Generative Adversarial Network using Depthwise Separable Convolution and Super Resolution Generative Adversarial Network by Atharva Karwande et al
02-24-2022	RescueNet: A High Resolution UAV Semantic Segmentation Benchmark Dataset for Natural Disaster Damage Assessment by Tashnim Chowdhury et al
02-24-2022	StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing Translation by Peter Schaldenbrand et al

Craig Smith
February 28, 2022