2022.5.2 Vision papers

04-26-2022	PolyLoss: A Polynomial Expansion Perspective of Classification Loss Functions by Zhaoqi Leng et al
04-28-2022	CogView2: Faster and Better Text-to-Image Generation via Hierarchical Transformers by Ming Ding et al
04-28-2022	NeurMiPs: Neural Mixture of Planar Experts for View Synthesis by Zhi-Hao Lin et al
04-26-2022	ClothFormer:Taming Video Virtual Try-on in All Module by Jianbin Jiang et al
04-26-2022	Understanding The Robustness in Vision Transformers by Daquan Zhou et al
04-27-2022	Use All The Labels: A Hierarchical Multi-Label Contrastive Learning Framework by Shu Zhang et al
04-28-2022	HuMMan: Multi-Modal 4D Human Dataset for Versatile Sensing and Modeling by Zhongang Cai et al
04-28-2022	Unlocking High-Accuracy Differentially Private Image Classification through Scale by Soham De et al
04-26-2022	From One Hand to Multiple Hands: Imitation Learning for Dexterous Manipulation from Single-Camera Teleoperation by Yuzhe Qin et al
04-27-2022	Dataset for Robust and Accurate Leading Vehicle Velocity Recognition by Genya Ogawa et al
04-27-2022	Few-Shot Head Swapping in the Wild by Changyong Shu et al
04-27-2022	Grasping the Arrow of Time from the Singularity: Decoding Micromotion in Low-dimensional Latent Spaces from StyleGAN by Qiucheng Wu et al
04-29-2022	Flamingo: a Visual Language Model for Few-Shot Learning by Jean-Baptiste Alayrac et al
04-26-2022	Density-preserving Deep Point Cloud Compression by Yun He et al
04-29-2022	PyramidCLIP: Hierarchical Feature Alignment for Vision-language Model Pretraining by Yuting Gao et al
04-26-2022	Expanding the Latent Space of StyleGAN for Real Face Editing by Yin Yu et al
04-28-2022	List-Mode PET Image Reconstruction Using Deep Image Prior by Kibo Ote et al
04-28-2022	Keep the Caption Information: Preventing Shortcut Learning in Contrastive Image-Caption Retrieval by Maurits Bleeker et al
04-28-2022	Articulated Objects in Free-form Hand Interaction by Zicong Fan et al
04-28-2022	Two Decades of Colorization and Decolorization for Images and Videos by Shiguang Liu
04-27-2022	The MeVer DeepFake Detection Service: Lessons Learnt from Developing and Deploying in the Wild by Spyridon Baxevanakis et al
04-28-2022	An Overview of Color Transfer and Style Transfer for Images and Videos by Shiguang Liu
04-27-2022	Adversarial Fine-tune with Dynamically Regulated Adversary by Pengyue Hou et al
04-29-2022	Fix the Noise: Disentangling Source Feature for Transfer Learning of StyleGAN by Dongyeun Lee et al
04-27-2022	Offline Visual Representation Learning for Embodied Navigation by Karmesh Yadav et al
04-29-2022	Leaner and Faster: Two-Stage Model Compression for Lightweight Text-Image Retrieval by Siyu Ren et al
04-26-2022	RadioPathomics: Multimodal Learning in Non-Small Cell Lung Cancer for Adaptive Radiotherapy by Matteo Tortora et al
04-26-2022	On Fragile Features and Batch Normalization in Adversarial Training by Nils Philipp Walter et al
04-28-2022	Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly by Spencer Whitehead et al
04-28-2022	Continual Learning with Bayesian Model based on a Fixed Pre-trained Feature Extractor by Yang Yang et al
04-26-2022	Deeper Insights into ViTs Robustness towards Common Corruptions by Rui Tian et al
04-28-2022	Vision-Language Pre-Training for Boosting Scene Text Detectors by Sibo Song et al
04-28-2022	Mixup-based Deep Metric Learning Approaches for Incomplete Supervision by Luiz H. Buris et al
04-27-2022	An Iterative Labeling Method for Annotating Fisheries Imagery by Zhiyong Zhang et al
04-26-2022	MM-TTA: Multi-Modal Test-Time Adaptation for 3D Semantic Segmentation by Inkyu Shin et al
04-26-2022	Stochastic Coherence Over Attention Trajectory For Continuous Learning In Video Streams by Matteo Tiezzi et al
04-26-2022	Where and What: Driver Attention-based Object Detection by Yao Rong et al
04-26-2022	A survey on attention mechanisms for medical applications: are we moving towards better algorithms? by Tiago Gonçalves et al
04-28-2022	Rotationally Equivariant 3D Object Detection by Hong-Xing Yu et al
04-28-2022	Poly-CAM: High resolution class activation map for convolutional neural networks by Alexandre Englebert et al
04-29-2022	A Challenging Benchmark of Anime Style Recognition by Haotang Li et al
04-26-2022	An Algorithm for the Labeling and Interactive Visualization of the Cerebrovascular System of Ischemic Strokes by Florian Thamm et al
04-28-2022	Unsupervised Spatial-spectral Hyperspectral Image Reconstruction and Clustering with Diffusion Geometry by Kangning Cui et al
04-28-2022	Oracle Guided Image Synthesis with Relative Queries by Alec Helbling et al
04-26-2022	AAU-net: An Adaptive Attention U-net for Breast Lesions Segmentation in Ultrasound Images by Gongping Chen et al
04-26-2022	Unsupervised Segmentation of Hyperspectral Remote Sensing Images with Superpixels by Mirko Paolo Barbato et al
04-27-2022	Interpretable Graph Convolutional Network of Multi-Modality Brain Imaging for Alzheimers Disease Diagnosis by Houliang Zhou et al
04-26-2022	An Overview of Recent Work in Media Forensics: Methods and Threats by Kratika Bhagtani et al
04-28-2022	BAGNet: Bidirectional Aware Guidance Network for Malignant Breast lesions Segmentation by Gongping Chen et al
04-28-2022	Computer Vision for Road Imaging and Pothole Detection: A State-of-the-Art Review of Systems and Algorithms by Nachuan Ma et al
04-26-2022	Understanding the Impact of Edge Cases from Occluded Pedestrians for ML Systems by Jens Henriksson et al
04-28-2022	Learning to Split for Automatic Bias Detection by Yujia Bao et al
04-26-2022	A Comparative Study on Approaches to Acoustic Scene Classification using CNNs by Ishrat Jahan Ananya et al
04-27-2022	Minimizing Client Drift in Federated Learning via Adaptive Bias Estimation by Farshid Varno et al
04-28-2022	TJ4DRadSet: A 4D Radar Dataset for Autonomous Driving by Lianqing Zheng et al
04-28-2022	MMRotate: A Rotated Object Detection Benchmark using Pytorch by Yue Zhou et al
04-27-2022	A Scalable Combinatorial Solver for Elastic Geometrically Consistent 3D Shape Matching by Paul Roetzer et al
04-27-2022	Self-Driving Car Steering Angle Prediction: Let Transformer Be a Car Again by Chingis Oinar et al
04-28-2022	Deep Orientation-Aware Functional Maps: Tackling Symmetry Issues in Shape Matching by Nicolas Donati et al
04-27-2022	Epicardial Adipose Tissue Segmentation from CT Images with A Semi-3D Neural Network by Marin Benčević et al
04-26-2022	SCGC : Self-Supervised Contrastive Graph Clustering by Gayan K. Kulatilleke et al
04-28-2022	Goldilocks-curriculum Domain Randomization and Fractal Perlin Noise with Application to Sim2Real Pneumonia Lesion Detection by Takahiro Suzuki et al
04-28-2022	COVID-Net US-X: Enhanced Deep Neural Network for Detection of COVID-19 Patient Cases from Convex Ultrasound Imaging Through Extended Linear-Convex Ultrasound Augmentation Learning by E. Zhixuan Zeng et al
04-26-2022	Learning Dual-Pixel Alignment for Defocus Deblurring by Yu Li et al
04-27-2022	Human-Centered Prior-Guided and Task-Dependent Multi-Task Representation Learning for Action Recognition Pre-Training by Guanhong Wang et al
04-28-2022	On the Role of Field of View for Occlusion Removal with Airborne Optical Sectioning by Francis Seits et al
04-26-2022	Neural Maximum A Posteriori Estimation on Unpaired Data for Motion Deblurring by Youjian Zhang et al
04-28-2022	Tragedy Plus Time: Capturing Unintended Human Activities from Weakly-labeled Videos by Arnav Chakravarthy et al
04-28-2022	Audio-Visual Contrastive Learning for Self-supervised Action Recognition by Haoyuan Lan et al
04-28-2022	Deep Generalized Unfolding Networks for Image Restoration by Chong Mou et al
04-26-2022	Coarse-to-fine Q-attention with Tree Expansion by Stephen James et al
04-28-2022	Temporal Progressive Attention for Early Action Prediction by Alexandros Stergiou et al
04-27-2022	A Multi-Head Convolutional Neural Network With Multi-path Attention improves Image Denoising by Jiahong Zhang et al
04-28-2022	Morphing Attack Potential by Matteo Ferrara et al
04-28-2022	Lightweight Bimodal Network for Single-Image Super-Resolution via Symmetric CNN and Recursive Transformer by Guangwei Gao et al
04-28-2022	Resource-efficient domain adaptive pre-training for medical images by Yasar Mehmood et al
04-28-2022	Generative Adversarial Networks for Image Super-Resolution: A Survey by Chunwei Tian et al
04-28-2022	SemAttNet: Towards Attention-based Semantic Aware Guided Depth Completion by Danish Nazir et al
04-26-2022	Restricted Black-box Adversarial Attack Against DeepFake Face Swapping by Junhao Dong et al
04-28-2022	Depth Estimation with Simplified Transformer by John Yang et al
04-26-2022	Robust Face Anti-Spoofing with Dual Probabilistic Modeling by Yuanhan Zhang et al
04-26-2022	Sound Localization by Self-Supervised Time Delay Estimation by Ziyang Chen et al
04-28-2022	Discriminative-Region Attention and Orthogonal-View Generation Model for Vehicle Re-Identification by Huadong Li et al
04-27-2022	Mapping suburban bicycle lanes using street scene images and deep learning by Tyler Saxton
04-28-2022	Unified Simulation, Perception, and Generation of Human Behavior by Ye Yuan
04-28-2022	Inverse-Designed Meta-Optics with Spectral-Spatial Engineered Response to Mimic Color Perception by Chris Munley et al
04-28-2022	A Closer Look at Branch Classifiers of Multi-exit Architectures by Shaohui Lin et al
04-28-2022	Semi-MoreGAN: A New Semi-supervised Generative Adversarial Network for Mixture of Rain Removal by Yiyang Shen et al
04-26-2022	Evaluating the Quality of a Synthesized Motion with the Fr\echet Motion Distance by Antoine Maiorca et al
04-26-2022	Focal Sparse Convolutional Networks for 3D Object Detection by Yukang Chen et al
04-26-2022	Meta-free representation learning for few-shot learning via stochastic weight averaging by Kuilin Chen et al
04-26-2022	Optimized latent-code selection for explainable conditional text-to-image GANs by Zhenxing Zhang et al
04-29-2022	Multiple Degradation and Reconstruction Network for Single Image Denoising via Knowledge Distillation by Juncheng Li et al
04-26-2022	Contrastive Language-Action Pre-training for Temporal Localization by Mengmeng Xu et al
04-26-2022	Instance-Specific Feature Propagation for Referring Segmentation by Chang Liu et al
04-28-2022	Equine radiograph classification using deep convolutional neural networks by Raniere Gaia Costa da Silva et al
04-27-2022	BBBD: Bounding Box Based Detector for Occlusion Detection and Order Recovery by Kaziwa Saleh et al
04-27-2022	Collaborative Learning for Hand and Object Reconstruction with Attention-guided Graph Convolution by Tze Ho Elden Tse et al
04-27-2022	PRE-NAS: Predictor-assisted Evolutionary Neural Architecture Search by Yameng Peng et al
04-27-2022	Towards assessing agricultural land suitability with causal machine learning by Georgios Giannarakis et al
04-26-2022	U-Net with ResNet Backbone for Garment Landmarking Purpose by Khay Boon Hong
04-29-2022	Struct-MDC: Mesh-Refined Unsupervised Depth Completion Leveraging Structural Regularities from Visual SLAM by Jinwoo Jeon et al
04-28-2022	Noise-reducing attention cross fusion learning transformer for histological image classification of osteosarcoma by Liangrui Pan et al
04-26-2022	RAPQ: Rescuing Accuracy for Power-of-Two Low-bit Post-training Quantization by Hongyi Yao et al
04-28-2022	Symmetric Transformer-based Network for Unsupervised Image Registration by Mingrui Ma et al
04-26-2022	Differentiable Zooming for Multiple Instance Learning on Whole-Slide Images by Kevin Thandiackal et al
04-26-2022	MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval by Yuying Ge et al
04-27-2022	Power Bundle Adjustment for Large-Scale 3D Reconstruction by Simon Weber et al

04-28-2022	Learning to Extract Building Footprints from Off-Nadir Aerial Images by Jinwang Wang et al
04-26-2022	ROMA: Cross-Domain Region Similarity Matching for Unpaired Nighttime Infrared to Daytime Visible Video Translation by Zhenjie Yu et al
04-28-2022	Learning cosmology and clustering with cosmic graphs by Pablo Villanueva-Domingo et al
04-26-2022	TranSiam: Fusing Multimodal Visual Features Using Transformer for Medical Image Segmentation by Xuejian Li et al
04-27-2022	DearKD: Data-Efficient Early Knowledge Distillation for Vision Transformers by Xianing Chen et al
04-27-2022	Defending Against Person Hiding Adversarial Patch Attack with a Universal White Frame by Youngjoon Yu et al
04-28-2022	Region-level Contrastive and Consistency Learning for Semi-Supervised Semantic Segmentation by Jianrong Zhang et al
04-28-2022	Unsupervised Multi-Modal Medical Image Registration via Discriminator-Free Image-to-Image Translation by Zekang Chen et al
04-28-2022	GRIT: General Robust Image Task Benchmark by Tanmay Gupta et al
04-27-2022	Talking Head Generation Driven by Speech-Related Facial Action Units and Audio- Based on Multimodal Representation Fusion by Sen Chen et al
04-26-2022	ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation by Yufei Xu et al
04-26-2022	Acquiring a Dynamic Light Field through a Single-Shot Coded Image by Ryoya Mizuno et al
04-28-2022	Controllable Image Captioning by Luka Maxwell
04-28-2022	Streaming Multiscale Deep Equilibrium Models by Can Ufuk Ertenli et al
04-27-2022	Conformer and Blind Noisy Students for Improved Image Quality Assessment by Marcos V. Conde et al
04-27-2022	CATrans: Context and Affinity Transformer for Few-Shot Segmentation by Shan Zhang et al
04-26-2022	Causal Transportability for Visual Recognition by Chengzhi Mao et al
04-28-2022	KING: Generating Safety-Critical Driving Scenarios for Robust Imitation via Kinematics Gradients by Niklas Hanselmann et al
04-29-2022	Using 3D Shadows to Detect Object Hiding Attacks on Autonomous Vehicle Perception by Zhongyuan Hau et al
04-27-2022	Forecasting Urban Development from Satellite Images by Nando Metzger
04-28-2022	Learning from Pixel-Level Noisy Label : A New Perspective for Light Field Saliency Detection by Mingtao Feng et al
04-26-2022	Unified GCNs: Towards Connecting GCNs with CNNs by Ziyan Zhang et al
04-28-2022	AE-NeRF: Auto-Encoding Neural Radiance Fields for 3D-Aware Object Manipulation by Mira Kim et al
04-28-2022	Hybrid Relation Guided Set Matching for Few-shot Action Recognition by Xiang Wang et al
04-27-2022	Dropout Inference with Non-Uniform Weight Scaling by Zhaoyuan Yang et al
04-26-2022	Attentive Fine-Grained Structured Sparsity for Image Restoration by Junghun Oh et al
04-28-2022	GenDR: A Generalized Differentiable Renderer by Felix Petersen et al
04-26-2022	Context-Aware Sequence Alignment using 4D Skeletal Augmentation by Taein Kwon et al
04-29-2022	AdaInt: Learning Adaptive Intervals for 3D Lookup Tables on Real-time Image Enhancement by Canqian Yang et al
04-29-2022	Learning Adaptive Warping for Real-World Rolling Shutter Correction by Mingdeng Cao et al
04-27-2022	Self-Supervised Text Erasing with Controllable Image Synthesis by Gangwei Jiang et al
04-26-2022	Intercategorical Label Interpolation for Emotional Face Generation with Conditional Generative Adversarial Networks by Silvan Mertes et al
04-27-2022	Person Re-Identification by Mustafa Ebrahim Chasmai et al
04-27-2022	SSR-GNNs: Stroke-based Sketch Representation with Graph Neural Networks by Sheng Cheng et al
04-29-2022	Privacy-Preserving Model Upgrades with Bidirectional Compatible Training in Image Retrieval by Shupeng Su et al
04-26-2022	Multi stain graph fusion for multimodal integration in pathology by Chaitanya Dwivedi et al
04-27-2022	Attention Consistency on Visual Corruptions for Single-Source Domain Generalization by Ilke Cugu et al
04-27-2022	3D Magic Mirror: Clothing Reconstruction from a Single Image via a Causal Perspective by Zhedong Zheng et al
04-27-2022	An Improved Nearest Neighbour Classifier by Eric Setterqvist et al
04-29-2022	A Deep Learning based No-reference Quality Assessment Model for UGC Videos by Wei Sun et al
04-27-2022	MAPLE-Edge: A Runtime Latency Predictor for Edge Devices by Saeejith Nair et al
04-26-2022	Adaptive Split-Fusion Transformer by Zixuan Su et al
04-27-2022	Ollivier-Ricci Curvature For Head Pose Estimation From a Single Image by Lucia Cascone et al
04-27-2022	Relevance-based Margin for Contrastively-trained Video Retrieval Models by Alex Falcon et al
04-27-2022	Gleo-Det: Deep Convolution Feature-Guided Detector with Local Entropy Optimization for Salient Points by Chao Li et al
04-29-2022	Deep Geometry Post-Processing for Decompressed Point Clouds by Xiaoqing Fan et al
04-29-2022	Preoperative brain tumor imaging: models and software for segmentation and standardized reporting by D. Bouget et al
04-29-2022	SCS-Co: Self-Consistent Style Contrastive Learning for Image Harmonization by Yucheng Hang et al
04-27-2022	Self-Supervised Learning of Object Parts for Semantic Segmentation by Adrian Ziegler et al
04-26-2022	Urban Change Detection Using a Dual-Task Siamese Network and Semi-Supervised Learning by Sebastian Hafner et al
04-26-2022	Boosting Adversarial Transferability of MLP-Mixer by Haoran Lyu et al
04-27-2022	HRDA: Context-Aware High-Resolution Domain-Adaptive Semantic Segmentation by Lukas Hoyer et al
04-28-2022	Where in the World is this Image? Transformer-based Geo-localization in the Wild by Shraman Pramanick et al
04-26-2022	Generating Topological Structure of Floorplans from Room Attributes by Yin Yu et al
04-28-2022	Automatic Detection and Classification of Symbols in Engineering Drawings by Sourish Sarkar et al
04-29-2022	Segmentation of kidney stones in endoscopic video feeds by Zachary A Stoebner et al
04-27-2022	Global Trajectory Helps Person Retrieval in a Camera Network by Xin Zhang et al
04-27-2022	CapOnImage: Context-driven Dense-Captioning on Image by Yiqi Gao et al
04-26-2022	Improving the Transferability of Adversarial Examples with Restructure Embedded Patches by Huipeng Zhou et al
04-29-2022	Hardware Trojan Detection Using Unsupervised Deep Learning on Quantum Diamond Microscope Magnetic Field Images by Maitreyi Ashok et al
04-26-2022	Coupled Iterative Refinement for 6D Multi-Object Pose Estimation by Lahav Lipson et al
04-29-2022	Improving Transferability for Domain Adaptive Detection Transformers by Kaixiong Gong et al
04-26-2022	Building Change Detection using Multi-Temporal Airborne LiDAR Data by Ritu Yadav et al
04-29-2022	SideRT: A Real-time Pure Transformer Architecture for Single Image Depth Estimation by Chang Shu et al
04-29-2022	C3-STISR: Scene Text Image Super-resolution with Triple Clues by Minyi Zhao et al
04-27-2022	Low-rank Meets Sparseness: An Integrated Spatial-Spectral Total Variation Approach to Hyperspectral Denoising by Haijin Zeng et al
04-29-2022	CLIP-Art: Contrastive Pre-training for Fine-Grained Art Classification by Marcos V. Conde et al
04-29-2022	OSSGAN: Open-Set Semi-Supervised Image Generation by Kai Katsumata et al
04-26-2022	Evaluation of Self-taught Learning-based Representations for Facial Emotion Recognition by Bruna Delazeri et al
04-29-2022	Towards Automatic Parsing of Structured Visual Content through the Use of Synthetic Data by Lukas Scholch et al
04-29-2022	Adversarial Distortion Learning for Medical Image Denoising by Morteza Ghahremani et al
04-26-2022	The Influence of the Other-Race Effect on Susceptibility to Face Morphing Attacks by Snipta Mallick et al
04-29-2022	Neural Implicit Representations for Physical Parameter Inference from a Single Video by Florian Hofherr et al
04-29-2022	Seeing without Looking: Analysis Pipeline for Child Sexual Abuse Datasets by Camila Laranjeira et al
04-29-2022	Learning Localization-aware Target Confidence for Siamese Visual Tracking by Jiahao Nie et al
04-28-2022	Understanding the impact of image and input resolution on deep digital pathology patch classifiers by Eu Wern Teh et al
04-29-2022	EndoMapper dataset of complete calibrated endoscopy procedures by Pablo Azagra et al
04-26-2022	A Close Look into Human Activity Recognition Models using Deep Learning by Wei Zhong Tee et al
04-26-2022	Leveraging Unlabeled Data for Sketch-based Understanding by Javier Morales et al
04-26-2022	AccMPEG: Optimizing Video Encoding for Video Analytics by Kuntai Du et al
04-28-2022	Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast by Boqing Zhu et al
04-28-2022	One Model to Synthesize Them All: Multi-contrast Multi-scale Transformer for Missing Data Imputation by Jiang Liu et al
04-26-2022	Unsupervised Learning of Unbiased Visual Representations by Carlo Alberto Barbano et al
04-28-2022	Coupling Deep Imputation with Multitask Learning for Downstream Tasks on Genomics Data by Sophie Peacock et al
04-27-2022	Channel Pruned YOLOv5-based Deep Learning Approach for Rapid and Accurate Outdoor Obstacles Detection by Zeqian Li et al

Craig SmithMay 2, 2022