2022.1.10 Vision papers

01-07-2022	NeROIC: Neural Rendering of Objects from Online Image Collections by Zhengfei Kuang et al
01-05-2022	Robust Self-Supervised Audio-Visual Speech Recognition by Bowen Shi et al
01-07-2022	Generalized Category Discovery by Sagar Vaze et al
01-04-2022	Sound and Visual Representation Learning with Multiple Pretraining Tasks by Arun Balajee Vasudevan et al
01-06-2022	A Light in the Dark: Deep Learning Practices for Industrial Computer Vision by Maximilian Harl et al
01-06-2022	De-rendering 3D Objects in the Wild by Felix Wimbauer et al
01-05-2022	All You Need In Sign Language Production by Razieh Rastgoo et al
01-06-2022	Consistent Style Transfer by Xuan Luo et al
01-06-2022	Self-Training Vision Language BERTs with a Unified Conditional Model by Xiaofeng Yang et al
01-07-2022	Detecting Twenty-thousand Classes using Image-level Supervision by Xingyi Zhou et al
01-04-2022	Corrupting Data to Remove Deceptive Perturbation: Using Preprocessing Method to Improve System Robustness by Hieu Le et al
01-06-2022	Compact Bidirectional Transformer for Image Captioning by Yuanen Zhou et al
01-05-2022	Incremental Object Grounding Using Scene Graphs by John Seon Keun Yi et al
01-06-2022	Bio-inspired Min-Nets Improve the Performance and Robustness of Deep Networks by Philipp Grüning et al
01-06-2022	ASL-Skeleton3D and ASL-Phono: Two Novel Datasets for the American Sign Language by Cleison Correia de Amorim et al
01-05-2022	DeepMLS: Geometry-Aware Control Point Deformation by Meitar Shechter et al
01-05-2022	Contrastive Neighborhood Alignment by Pengkai Zhu et al
01-05-2022	Quantum Capsule Networks by Zidu Liu et al
01-05-2022	Debiased Learning from Naturally Imbalanced Pseudo-Labels for Zero-Shot and Semi-Supervised Learning by Xudong Wang et al
01-05-2022	Formal Analysis of Art: Proxy Learning of Visual Concepts from Style Through Language Models by Diana Kim et al
01-06-2022	Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling by Yang Long et al
01-05-2022	Lawin Transformer: Improving Semantic Segmentation Transformer with Multi-Scale Representations via Large Window Attention by Haotian Yan et al
01-04-2022	Deep Learning-Based Sparse Whole-Slide Image Analysis for the Diagnosis of Gastric Intestinal Metaplasia by Jon Braatz et al
01-06-2022	Egocentric Deep Multi-Channel Audio-Visual Active Speaker Localization by Hao Jiang et al
01-05-2022	Tackling the Class Imbalance Problem of Deep Learning Based Head and Neck Organ Segmentation by Elias Tappeiner et al
01-05-2022	Towards realistic symmetry-based completion of previously unseen point clouds by Taras Rumezhak et al
01-04-2022	Self-supervised Learning from 100 Million Medical Images by Florin C. Ghesu et al
01-06-2022	An Abstraction-Refinement Approach to Verifying Convolutional Neural Networks by Matan Ostrovsky et al
01-06-2022	HyperionSolarNet: Solar Panel Detection from Aerial Images by Poonam Parhar et al
01-04-2022	Variational Stacked Local Attention Networks for Diverse Video Captioning by Tonmoay Deb et al
01-04-2022	Weakly-supervised continual learning for class-incremental segmentation by Gaston Lenczner et al
01-05-2022	Challenges of Artificial Intelligence -- From Machine Learning and Computer Vision to Emotional Intelligence by Matti Pietikäinen et al
01-05-2022	Exemplar-free Class Incremental Learning via Discriminative and Comparable One-class Classifiers by Wenju Sun et al
01-07-2022	Video Summarization Based on Video-text Representation by Li Haopeng et al
01-05-2022	Cross-SRN: Structure-Preserving Super-Resolution Network with Cross Convolution by Yuqing Liu et al
01-05-2022	Revisiting Deep Subspace Alignment for Unsupervised Domain Adaptation by Kowshik Thopalli et al
01-07-2022	Bayesian Neural Networks for Reversible Steganography by Ching-Chun Chang
01-05-2022	POCO: Point Convolution for Surface Reconstruction by Alexandre Boulch et al
01-07-2022	Negative Evidence Matters in Interpretable Histology Image Classification by Soufiane Belharbi et al
01-06-2022	Deep Learning Based Classification System For Recognizing Local Spinach by Mirajul Islam et al
01-06-2022	Multi-Label Classification on Remote-Sensing Images by Aditya Kumar Singh et al
01-06-2022	Decompose to Adapt: Cross-domain Object Detection via Feature Disentanglement by Dongnan Liu et al
01-04-2022	The cluster structure function by Andrew R. Cohen et al
01-04-2022	Aligning Domain-specific Distribution and Classifier for Cross-domain Classification from Multiple Sources by Yongchun Zhu et al
01-04-2022	Attention Mechanism Meets with Hybrid Dense Network for Hyperspectral Image Classification by Muhammad Ahmad et al
01-04-2022	Self-Supervised Approach to Addressing Zero-Shot Learning Problem by Ademola Okerinde et al
01-04-2022	MoCoPnet: Exploring Local Motion and Contrast Priors for Infrared Small Target Super-Resolution by Xinyi Ying et al
01-06-2022	Repurposing Existing Deep Networks for Caption and Aesthetic-Guided Image Cropping by Nora Horanyi et al
01-05-2022	Multiple Sclerosis Lesions Segmentation using Attention-Based CNNs in FLAIR Images by Mehdi SadeghiBakhi et al
01-04-2022	Eye Know You Too: A DenseNet Architecture for End-to-end Biometric Authentication via Eye Movements by Dillon Lohr et al
01-05-2022	Towards Uniform Point Distribution in Feature-preserving Point Cloud Filtering by Shuaijun Chen et al
01-06-2022	Enhancing Egocentric 3D Pose Estimation with Third Person Views by Ameya Dhamanaskar et al
01-06-2022	Multi-Domain Joint Training for Person Re-Identification by Lu Yang et al
01-07-2022	Sign Language Video Retrieval with Free-Form Textual Queries by Amanda Duarte et al
01-05-2022	Learning True Rate-Distortion-Optimization for End-To-End Image Compression by Fabian Brand et al
01-05-2022	Sign Language Recognition System using TensorFlow Object Detection API by Sharvani Srivastava et al
01-04-2022	Advancing 3D Medical Image Analysis with Variable Dimension Transform based Supervised 3D Pre-training by Shu Zhang et al
01-05-2022	Lumbar Bone Mineral Density Estimation from Chest X-ray Images: Anatomy-aware Attentive Multi-ROI Modeling by Fakai Wang et al
01-04-2022	Multi-Representation Adaptation Network for Cross-domain Image Classification by Yongchun Zhu et al
01-05-2022	The Effect of Model Compression on Fairness in Facial Expression Recognition by Samuil Stoychev et al
01-05-2022	On the Real-World Adversarial Robustness of Real-Time Semantic Segmentation Models for Autonomous Driving by Giulio Rossolini et al
01-04-2022	DIAL: Deep Interactive and Active Learning for Semantic Segmentation in Remote Sensing by Gaston Lenczner et al
01-06-2022	Budget-aware Few-shot Learning via Graph Convolutional Network by Shipeng Yan et al
01-05-2022	Deep Probabilistic Graph Matching by He Liu et al
01-05-2022	Rethinking Depth Estimation for Multi-View Stereo: A Unified Representation and Focal Loss by Rui Peng et al
01-04-2022	Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images by Ali Hatamizadeh et al
01-04-2022	Towards Understanding and Harnessing the Effect of Image Transformation in Adversarial Detection by Hui Liu et al
01-04-2022	Identifying the exterior image of buildings on a 3D map and extracting elevation information using deep learning and digital image processing by Donghwa Shon et al
01-05-2022	Memory-guided Image De-raining Using Time-Lapse Data by Jaehoon Cho et al
01-04-2022	Problem-dependent attention and effort in neural networks with an application to image resolution by Chris Rohlfs
01-04-2022	Latent Vector Expansion using Autoencoder for Anomaly Detection by UJu Gim et al
01-04-2022	Learning to Generate Novel Classes for Deep Metric Learning by Kyungmoon Lee et al
01-07-2022	A Review of Deep Learning Techniques for Markerless Human Motion on Synthetic Datasets by Doan Duy Vo et al
01-04-2022	Towards Unsupervised Open World Semantic Segmentation by Svenja Uhlemeyer et al
01-05-2022	Automated Scoring of Graphical Open-Ended Responses Using Artificial Neural Networks by Matthias von Davier et al
01-05-2022	Biphasic Face Photo-Sketch Synthesis via Semantic-Driven Generative Adversarial Network with Graph Representation Learning by Xingqun Qi et al
01-04-2022	Linear Variational State Space Filtering by Daniel Pfrommer et al
01-06-2022	A three-dimensional dual-domain deep network for high-pitch and sparse helical CT reconstruction by Wei Wang et al
01-06-2022	An unambiguous cloudiness index for nonwovens by Michael Godehardt et al
01-05-2022	An Investigation of Benford's Law Divergence and Machine Learning Techniques for Intra-Class Separability of Fingerprint Images by Aamo Iorliam et al
01-06-2022	TransVPR: Transformer-based place recognition with multi-level attention aggregation by Ruotong Wang et al
01-06-2022	SASA: Semantics-Augmented Set Abstraction for Point-based 3D Object Detection by Chen Chen et al
01-05-2022	Culture-to-Culture Image Translation with Generative Adversarial Networks by Giulia Zaino et al
01-07-2022	Deep Domain Adversarial Adaptation for Photon-efficient Imaging Based on Spatiotemporal Inception Network by Yiwei Chen et al
01-05-2022	Improving Object Detection, Multi-object Tracking, and Re-Identification for Disaster Response Drones by Chongkeun Paik et al
01-07-2022	Cross-Modality Deep Feature Learning for Brain Tumor Segmentation by Dingwen Zhang et al
01-05-2022	Multi-Robot Collaborative Perception with Graph Neural Networks by Yang Zhou et al
01-04-2022	Detailed Facial Geometry Recovery from Multi-view Images by Learning an Implicit Function by Yunze Xiao et al
01-05-2022	FAVER: Blind Quality Prediction of Variable Frame Rate Videos by Qi Zheng et al
01-05-2022	Evaluation of Thermal Imaging on Embedded GPU Platforms for Application in Vehicular Assistance Systems by Muhammad Ali Farooq et al
01-04-2022	Neural KEM: A Kernel Method with Deep Coefficient Prior for PET Image Reconstruction by Siqi Li et al
01-05-2022	Robust photon-efficient imaging using a pixel-wise residual shrinkage network by Gongxin Yao et al
01-04-2022	A Robust Visual Sampling Model Inspired by Receptive Field by Liwen Hu et al
01-04-2022	What Hinders Perceptual Quality of PSNR-oriented Methods? by Tianshuo Xu et al
01-05-2022	Learning Audio-Visual Speech Representation by Masked Multimodal Cluster Prediction by Bowen Shi et al
01-04-2022	3DVSR: 3D EPI Volume-based Approach for Angular and Spatial Light field Image Super-resolution by Trung-Hieu Tran et al
01-04-2022	A Transformer-Based Siamese Network for Change Detection by Wele Gedara Chaminda Bandara et al
01-07-2022	Effect of Prior-based Losses on Segmentation Performance: A Benchmark by Rosana El Jurdi et al
01-04-2022	Fusing Convolutional Neural Network and Geometric Constraint for Image-based Indoor Localization by Jingwei Song et al
01-04-2022	Learning Quality-aware Representation for Multi-person Pose Regression by Yabo Xiao et al
01-05-2022	Flow-Guided Sparse Transformer for Video Deblurring by Jing Lin et al
01-04-2022	Image Processing Methods for Coronal Hole Segmentation, Matching, and Map Classification by V. Jatla et al
01-04-2022	Synthesizing Tensor Transformations for Visual Self-attention by Xian Wei et al
01-04-2022	Attention-based Dual Supervised Decoder for RGBD Semantic Segmentation by Yang Zhang et al
01-06-2022	Balancing Generalization and Specialization in Zero-shot Learning by Yun Li et al
01-04-2022	Transfer Learning for Retinal Vascular Disease Detection: A Pilot Study with Diabetic Retinopathy and Retinopathy of Prematurity by Guan Wang et al
01-06-2022	Extending One-Stage Detection with Open-World Proposals by Sachin Konan et al
01-06-2022	RestoreDet: Degradation Equivariant Representation for Object Detection in Low Resolution Images by Ziteng Cui et al
01-05-2022	Probing TryOnGAN by Saurabh Kumar et al
01-05-2022	Surface-Aligned Neural Radiance Fields for Controllable 3D Human Synthesis by Tianhan Xu et al
01-04-2022	Data Augmentation for Depression Detection Using Skeleton-Based Gait Information by Jingjing Yang et al
01-06-2022	EM-driven unsupervised learning for efficient motion segmentation by Etienne Meunier et al
01-06-2022	Realistic Full-Body Anonymization with Surface-Guided GANs by Håkon Hukkelås et al
01-04-2022	Towards Transferable Unrestricted Adversarial Examples with Minimum Changes by Fangcheng Liu et al
01-04-2022	DeepFGS: Fine-Grained Scalable Coding for Learned Image Compression by Yi Ma et al
01-07-2022	Embodied Hands: Modeling and Capturing Hands and Bodies Together by Javier Romero et al
01-06-2022	A Unified Framework for Attention-Based Few-Shot Object Detection by Pierre Le Jeune et al
01-05-2022	TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets by Susie Xi Rao et al
01-06-2022	Persistent Homology for Breast Tumor Classification using Mammogram Scans by Aras Asaad et al
01-07-2022	A Novel Incremental Learning Driven Instance Segmentation Framework to Recognize Highly Cluttered Instances of the Contraband Items by Taimur Hassan et al
01-05-2022	GLAN: A Graph-based Linear Assignment Network by He Liu et al
01-07-2022	Deep Generative Framework for Interactive 3D Terrain Authoring and Manipulation by Shanthika Naik et al
01-07-2022	Auto-Weighted Layer Representation Based View Synthesis Distortion Estimation for 3-D Video Coding by Jian Jin et al
01-07-2022	Motion Prediction via Joint Dependency Modeling in Phase Space by Pengxiang Su et al
01-04-2022	Short Range Correlation Transformer for Occluded Person Re-Identification by Yunbin Zhao et al
01-07-2022	Multiresolution Fully Convolutional Networks to detect Clouds and Snow through Optical Satellite Images by Debvrat Varshney et al
01-07-2022	An Incremental Learning Approach to Automatically Recognize Pulmonary Diseases from the Multi-vendor Chest Radiographs by Mehreen Sirshar et al
01-04-2022	Online Multi-Object Tracking with Unsupervised Re-Identification Learning and Occlusion Estimation by Qiankun Liu et al
01-07-2022	Leveraging Scale-Invariance and Uncertainity with Self-Supervised Domain Adaptation for Semantic Segmentation of Foggy Scenes by Javed Iqbal et al
01-07-2022	Learning Target-aware Representation for Visual Tracking via Informative Interactions by Mingzhe Guo et al
01-05-2022	Multi-Grid Redundant Bounding Box Annotation for Accurate Object Detection by Solomon Negussie Tesema et al
01-07-2022	Detecting Human-to-Human-or-Object (H2O) Interactions with DIABOLO by Astrid Orcesi et al
01-04-2022	DenseTact: Optical Tactile Sensor for Dense Shape Reconstruction by Won Kyung Do et al
01-05-2022	Learning Semantic Ambiguities for Zero-Shot Learning by Celina Hanouti et al
01-06-2022	CitySurfaces: City-Scale Semantic Segmentation of Sidewalk Materials by Maryam Hosseini et al
01-07-2022	Uncertainty-Aware Cascaded Dilation Filtering for High-Efficiency Deraining by Qing Guo et al
01-07-2022	Amplitude SAR Imagery Splicing Localization by Edoardo Daniele Cannas et al
01-06-2022	A Keypoint Detection and Description Network Based on the Vessel Structure for Multi-Modal Retinal Image Registration by Aline Sindel et al
01-07-2022	Equalized Focal Loss for Dense Long-Tailed Object Detection by Bo Li et al
01-06-2022	ITSA: An Information-Theoretic Approach to Automatic Shortcut Avoidance and Domain Generalization in Stereo Matching Networks by WeiQin Chuah et al
01-05-2022	3D Intracranial Aneurysm Classification and Segmentation via Unsupervised Dual-branch Learning by Di Shao et al

Craig Smith, January 11, 2022