2022.2.14 Vision papers

02-10-2022	Block-NeRF: Scalable Large Scene Neural View Synthesis by Matthew Tancik et al
02-08-2022	MaskGIT: Masked Generative Image Transformer by Huiwen Chang et al
02-08-2022	Bingham Policy Parameterization for 3D Rotations in Reinforcement Learning by Stephen James et al
02-08-2022	The EMory BrEast imaging Dataset (EMBED): A Racially Diverse, Granular Dataset of 3.5M Screening and Diagnostic Mammograms by Jiwoong J. Jeong et al
02-09-2022	Conditional Motion In-betweening by Jihoon Kim et al
02-08-2022	DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generative Transformers by Jaemin Cho et al
02-11-2022	CLIPasso: Semantically-Aware Object Sketching by Yael Vinker et al
02-09-2022	The Abduction of Sherlock Holmes: A Dataset for Visual Abductive Reasoning by Jack Hessel et al
02-10-2022	N\UWA-LIP: Language Guided Image Inpainting with Defect-free VQGAN by Minheng Ni et al
02-08-2022	Causal Scene BERT: Improving object detection by searching for challenging groups of data by Cinjon Resnick et al
02-09-2022	Point-Level Region Contrast for Object Detection Pre-Training by Yutong Bai et al
02-10-2022	FILM: Frame Interpolation for Large Motion by Fitsum Reda et al
02-09-2022	PINs: Progressive Implicit Networks for Multi-Scale Neural Representations by Zoe Landgraf et al
02-08-2022	GiraffeDet: A Heavy-Neck Paradigm for Object Detection by Yiqi Jiang et al
02-10-2022	Image-to-Image Regression with Distribution-Free Uncertainty Quantification and Applications in Imaging by Anastasios N Angelopoulos et al
02-09-2022	Object-Guided Day-Night Visual Localization in Urban Scenes by Assia Benbihi et al
02-08-2022	Self-Conditioned Generative Adversarial Networks for Image Editing by Yunzhe Liu et al
02-08-2022	Results and findings of the 2021 Image Similarity Challenge by Zoë Papakipos et al
02-09-2022	Estimation of Clinical Workload and Patient Activity using Deep Learning and Optical Flow by Thanh Nguyen-Duc et al
02-09-2022	Can Open Domain Question Answering Systems Answer Visual Knowledge Questions? by Jiawen Zhang et al
02-10-2022	Equivariance Regularization for Image Reconstruction by Junqi Tang
02-08-2022	Whats Cracking? A Review and Analysis of Deep Learning Methods for Structural Crack Segmentation, Detection and Quantification by Jacob König et al
02-09-2022	Image Difference Captioning with Pre-training and Contrastive Learning by Linli Yao et al
02-11-2022	Towards Weakly-Supervised Text Spotting using a Multi-Task Transformer by Yair Kittenplon et al
02-10-2022	Monotonically Convergent Regularization by Denoising by Yuyang Hu et al
02-08-2022	Quality Metric Guided Portrait Line Drawing Generation from Unpaired Training Data by Ran Yi et al
02-08-2022	How to Understand Masked Autoencoders by Shuhao Cao et al
02-08-2022	Motion-Aware Transformer For Occluded Person Re-identification by Mi Zhou et al
02-10-2022	Visual Servoing for Pose Control of Soft Continuum Arm in a Structured Environment by Shivani Kamtikar et al
02-08-2022	Trained Model in Supervised Deep Learning is a Conditional Risk Minimizer by Yutong Xie et al
02-09-2022	Predicting the intended action using internal simulation of perception by Zahra Gharaee
02-10-2022	Motion Puzzle: Arbitrary Motion Style Transfer by Body Part by Deok-Kyeong Jang et al
02-09-2022	Can Humans Do Less-Than-One-Shot Learning? by Maya Malaviya et al
02-10-2022	OWL (Observe, Watch, Listen): Localizing Actions in Egocentric Video via Audiovisual Temporal Context by Merey Ramazanova et al
02-09-2022	NIMBLE: A Non-rigid Hand Model with Bones and Muscles by Yuwei Li et al
02-08-2022	Latent gaze information in highly dynamic decision-tasks by Benedikt Hosp
02-09-2022	Multi-modal unsupervised brain image registration using edge maps by Vasiliki Sideri-Lampretsa et al
02-09-2022	Anchor Graph Structure Fusion Hashing for Cross-Modal Similarity Search by Lu Wang et al
02-08-2022	Self-supervised Contrastive Learning for Cross-domain Hyperspectral Image Representation by Hyungtae Lee et al
02-09-2022	FCM-DNN: diagnosing coronary artery disease by deep accuracy Fuzzy C-Means clustering model by Javad Hassannataj Joloudari et al
02-10-2022	Improving performance of aircraft detection in satellite imagery while limiting the labelling effort: Hybrid active learning by Julie Imbert et al
02-08-2022	SCR: Smooth Contour Regression with Geometric Priors by Gaetan Bahl et al
02-10-2022	Including Facial Expressions in Contextual Embeddings for Sign Language Generation by Carla Viegas et al
02-08-2022	Hair Color Digitization through Imaging and Deep Inverse Graphics by Robin Kips et al
02-11-2022	ACORT: A Compact Object Relation Transformer for Parameter Efficient Image Captioning by Jia Huei Tan et al
02-10-2022	Deep Learning for Computational Cytology: A Survey by Hao Jiang et al
02-10-2022	F8Net: Fixed-Point 8-bit Only Multiplication for Network Quantization by Qing Jin et al
02-10-2022	Memory-based gaze prediction in deep imitation learning for robot manipulation by Heecheol Kim et al
02-10-2022	Towards the automated large-scale reconstruction of past road networks from historical maps by Johannes H. Uhl et al
02-08-2022	Equivariance versus Augmentation for Spherical Images by Jan E. Gerken et al
02-09-2022	Semantic Segmentation of Anaemic RBCs Using Multilevel Deep Convolutional Encoder-Decoder Network by Muhammad Shahzad et al
02-09-2022	Learning to Bootstrap for Combating Label Noise by Yuyin Zhou et al
02-10-2022	Adults as Augmentations for Children in Facial Emotion Recognition with Contrastive Learning by Marco Virgolin et al
02-10-2022	Domain Adversarial Training: A Game Perspective by David Acuna et al
02-10-2022	A Human-Centered Machine-Learning Approach for Muscle-Tendon Junction Tracking in Ultrasound Images by Christoph Leitner et al
02-09-2022	CRAT-Pred: Vehicle Trajectory Prediction with Crystal Graph Convolutional Neural Networks and Multi-Head Self-Attention by Julian Schmidt et al
02-09-2022	Decreasing Annotation Burden of Pairwise Comparisons with Human-in-the-Loop Sorting: Application in Medical Image Artifact Rating by Ikbeom Jang et al
02-10-2022	Give me a knee radiograph, I will tell you where the knee joint area is: a deep convolutional neural network adventure by Shi Yan et al
02-08-2022	A Survey of Breast Cancer Screening Techniques: Thermography and Electrical Impedance Tomography by Juan Zuluaga-Gomez et al
02-10-2022	Feature-level augmentation to improve robustness of deep neural networks to affine transformations by Adrian Sandru et al
02-08-2022	Uncertainty Modeling for Out-of-Distribution Generalization by Xiaotong Li et al
02-09-2022	Amplitude Spectrum Transformation for Open Compound Domain Adaptive Semantic Segmentation by Jogendra Nath Kundu et al
02-08-2022	CAD-RADS Scoring using Deep Learning and Task-Specific Centerline Labeling by Felix Denzinger et al
02-09-2022	End-to-End Blind Quality Assessment for Laparoscopic Videos using Neural Networks by Zohaib Amjad Khan et al
02-08-2022	Adversarial Detection without Model Information by Abhishek Moitra et al
02-08-2022	Class Density and Dataset Quality in High-Dimensional, Unstructured Data by Adam Byerly et al
02-08-2022	A Unified Multi-Task Learning Framework of Real-Time Drone Supervision for Crowd Counting by Siqi Gu et al
02-08-2022	Social-DualCVAE: Multimodal Trajectory Forecasting Based on Social Interactions Pattern Aware and Dual Conditional Variational Auto-Encoder by Jiashi Gao et al
02-11-2022	Unsupervised HDR Imaging: What Can Be Learned from a Single 8-bit Video? by Francesco Banterle et al
02-08-2022	A multiscale spatiotemporal approach for smallholder irrigation detection by Terence Conlon et al
02-08-2022	Real-Time Event-Based Tracking and Detection for Maritime Environments by Stephanie Aelmore et al
02-09-2022	Exploring Structural Sparsity in Neural Image Compression by Shanzhi Yin et al
02-09-2022	Discovering Concepts in Learned Representations using Statistical Inference and Interactive Visualization by Adrianna Janik et al
02-10-2022	Describing image focused in cognitive and visual details for visually impaired people: An approach to generating inclusive paragraphs by Daniel Louzada Fernandes et al
02-10-2022	Real-Time Siamese Multiple Object Tracker with Enhanced Proposals by Lorenzo Vaquero et al
02-08-2022	Consistency-Regularized Region-Growing Network for Semantic Segmentation of Urban Scenes with Point-Level Annotations by Yonghao Xu et al
02-08-2022	GLPU: A Geometric Approach For Lidar Pointcloud Upsampling by George Eskandar et al
02-09-2022	Bias-Eliminated Semantic Refinement for Any-Shot Learning by Liangjun Feng et al
02-10-2022	Spherical Transformer by Sungmin Cho et al
02-08-2022	Segmentation by Test-Time Optimization (TTO) for CBCT-based Adaptive Radiation Therapy by Xiao Liang et al
02-08-2022	Learning Robust Convolutional Neural Networks with Relevant Feature Focusing via Explanations by Kazuki Adachi et al
02-09-2022	Deep Feature Rotation for Multimodal Image Style Transfer by Son Truong Nguyen et al
02-08-2022	Self-Paced Imbalance Rectification for Class Incremental Learning by Zhiheng Liu et al
02-08-2022	Towards Compositional Adversarial Robustness: Generalizing Adversarial Training to Composite Semantic Perturbations by Yun-Yun Tsai et al
02-08-2022	A Novel Image Descriptor with Aggregated Semantic Skeleton Representation for Long-term Visual Place Recognition by Nie Jiwei et al
02-10-2022	PVSeRF: Joint Pixel-, Voxel- and Surface-Aligned Radiance Field for Single-Image Novel View Synthesis by Xianggang Yu et al
02-10-2022	Consistency and Diversity induced Human Motion Segmentation by Tao Zhou et al
02-08-2022	Exploring Inter-Channel Correlation for Diversity-preserved KnowledgeDistillation by Li Liu et al
02-10-2022	Towards Predicting Fine Finger Motions from Ultrasound Images via Kinematic Representation by Dean Zadok et al
02-08-2022	Distillation with Contrast is All You Need for Self-Supervised Point Cloud Representation Learning by Kexue Fu et al
02-11-2022	Multi-Modal Knowledge Graph Construction and Application: A Survey by Xiangru Zhu et al
02-09-2022	Multiclass histogram-based thresholding using kernel density estimation and scale-space representations by S. Korneev et al
02-09-2022	Adversarial Attack and Defense of YOLO Detectors in Autonomous Driving Scenarios by Jung Im Choi et al
02-11-2022	Meta-learning with GANs for anomaly detection, with deployment in high-speed rail inspection system by Haoyang Cao et al
02-10-2022	Towards Assessing and Characterizing the Semantic Robustness of Face Recognition by Juan C. Pérez et al
02-09-2022	Weakly-Supervised Semantic Segmentation with Visual Words Learning and Hybrid Pooling by Lixiang Ru et al
02-08-2022	STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation by Zhengkai Jiang et al
02-08-2022	Learning Optical Flow with Adaptive Graph Reasoning by Ao Luo et al
02-08-2022	Self-supervised Contrastive Learning for Volcanic Unrest Detection by Nikolaos Ioannis Bountos et al
02-08-2022	Edge-based fever screening system over private 5G by Murugan Sankaradas et al
02-08-2022	NEWSKVQA: Knowledge-Aware News Video Question Answering by Pranay Gupta et al
02-09-2022	Geometric Digital Twinning of Industrial Facilities: Retrieval of Industrial Shapes by Eva Agapaki et al
02-09-2022	Reducing Redundancy in the Bottleneck Representation of the Autoencoders by Firas Laakom et al
02-08-2022	Navigating to Objects in Unseen Environments by Distance Prediction by Minzhao Zhu et al
02-08-2022	If a Human Can See It, So Should Your System: Reliability Requirements for Machine Vision Components by Boyue Caroline Hu et al
02-08-2022	Residual Aligned: Gradient Optimization for Non-Negative Image Synthesis by Flora Yu Shen et al
02-08-2022	On the Pitfalls of Using the Residual Error as Anomaly Score by Felix Meissen et al
02-08-2022	Binary Neural Networks as a general-propose compute paradigm for on-device computer vision by Guhong Nie et al
02-08-2022	Ada-NETS: Face Clustering via Adaptive Neighbour Discovery in the Structure Space by Yaohua Wang et al
02-09-2022	Distance Estimation and Animal Tracking for Wildlife Camera Trapping by Peter Johanns et al
02-08-2022	TransformNet: Self-supervised representation learning through predicting geometric transformations by Sayed Hashim et al
02-10-2022	Exploiting Spatial Sparsity for Event Cameras with Visual Transformers by Zuowen Wang et al
02-08-2022	BIQ2021: A Large-Scale Blind Image Quality Assessment Database by Nisar Ahmed et al
02-08-2022	Network Comparison Study of Deep Activation Feature Discriminability with Novel Objects by Michael Karnes et al
02-08-2022	Cross-level Contrastive Learning and Consistency Constraint for Semi-supervised Medical Image Segmentation by Xinkai Zhao et al
02-11-2022	Towards Adversarially Robust Deepfake Detection: An Ensemble Approach by Ashish Hooda et al
02-11-2022	SuperCon: Supervised Contrastive Learning for Imbalanced Skin Lesion Classification by Keyu Chen et al
02-08-2022	Untrimmed Action Anticipation by Ivan Rodin et al
02-08-2022	A Novel Plug-in Module for Fine-Grained Visual Classification by Po-Yung Chou et al
02-11-2022	Vehicle and License Plate Recognition with Novel Dataset for Toll Collection by Muhammad Usama et al
02-11-2022	A Wasserstein GAN for Joint Learning of Inpainting and its Spatial Optimisation by Pascal Peter
02-11-2022	Artemis: Articulated Neural Pets with Appearance and Motion synthesis by Haimin Luo et al
02-08-2022	Disentangle Saliency Detection into Cascaded Detail Modeling and Body Filling by Yue Song et al
02-08-2022	Addressing Data Scarcity in Multimodal User State Recognition by Combining Semi-Supervised and Supervised Learning by Hendric Voß et al
02-11-2022	Assessing Privacy Risks from Feature Vector Reconstruction Attacks by Emily Wenger et al
02-08-2022	Federated Learning of Generative Image Priors for MRI Reconstruction by Gokberk Elmas et al
02-09-2022	A Joint Variational Multichannel Multiphase Segmentation Framework by Nadja Gruber et al
02-09-2022	Sampling Strategy for Fine-Tuning Segmentation Models to Crisis Area under Scarcity of Data by Adrianna Janik et al
02-11-2022	Entroformer: A Transformer-based Entropy Model for Learned Image Compression by Yichen Qian et al
02-11-2022	Exemplar-free Online Continual Learning by Jiangpeng He et al
02-11-2022	Deep soccer captioning with transformer: dataset, semantics-related losses, and multi-level evaluation by Ahmad Hammoudeh et al
02-08-2022	Face2PPG: An unsupervised pipeline for blood volume pulse extraction from faces by Constantino Álvarez Casado et al
02-10-2022	Learning the Pedestrian-Vehicle Interaction for Pedestrian Trajectory Prediction by Chi Zhang et al
02-09-2022	Graph Neural Network for Cell Tracking in Microscopy Videos by Tal Ben-Haim et al
02-11-2022	SafePicking: Learning Safe Object Extraction via Object-Level Mapping by Kentaro Wada et al
02-10-2022	Incremental Learning of Structured Memory via Closed-Loop Transcription by Shengbang Tong et al
02-10-2022	Characterizing and overcoming the greedy nature of learning in multi-modal deep neural networks by Nan Wu et al
02-11-2022	Tiny Object Tracking: A Large-scale Dataset and A Baseline by Yabin Zhu et al
02-11-2022	WAD-CMSN: Wasserstein Distance based Cross-Modal Semantic Network for Zero-Shot Sketch-Based Image Retrieval by Guanglong Xu et al
02-09-2022	On Real-time Image Reconstruction with Neural Networks for MRI-guided Radiotherapy by David E. J. Waddington et al
02-10-2022	Mining the manifolds of deep generative models for multiple data-consistent solutions of ill-posed tomographic imaging problems by Sayantan Bhadra et al
02-08-2022	Wireless Transmission of Images With The Assistance of Multi-level Semantic Information by Zhenguo Zhang et al
02-11-2022	Borrowing from yourself: Faster future video segmentation with partial channel update by Evann Courdier et al
02-10-2022	Dynamic Background Subtraction by Generative Neural Networks by Fateme Bahri et al
02-11-2022	Multi-Modal Fusion for Sensorimotor Coordination in Steering Angle Prediction by Farzeen Munir et al
02-10-2022	Face Beneath the Ink: Synthetic Data and Tattoo Removal with Application to Face Recognition by Mathias Ibsen et al
02-10-2022	Coded ResNeXt: a network for designing disentangled information paths by Apostolos Avranas et al
02-11-2022	Dilated convolutional neural network-based deep reference picture generation for video compression by Haoyue Tian et al
02-10-2022	Optimal Transport for Super Resolution Applied to Astronomy Imaging by Michael Rawson et al
02-10-2022	The MeLa BitChute Dataset by Milo Trujillo et al
02-09-2022	Class Distance Weighted Cross-Entropy Loss for Ulcerative Colitis Severity Estimation by Gorkem Polat et al
02-11-2022	Patch-NetVLAD+: Learned patch descriptor and weighted matching strategy for place recognition by Yingfeng Cai et al
02-11-2022	Video-driven Neural Physically-based Facial Asset for Production by Longwen Zhang et al
02-11-2022	Bench-Marking And Improving Arabic Automatic Image Captioning Through The Use Of Multi-Task Learning Paradigm by Muhy Eddin Za'ter et al
02-08-2022	Joint-bone Fusion Graph Convolutional Network for Semi-supervised Skeleton Action Recognition by Zhigang Tu et al
02-10-2022	Towards a Guideline for Evaluation Metrics in Medical Image Segmentation by Dominik Müller et al
02-10-2022	A Deep Learning Approach for Digital ColorReconstruction of Lenticular Films by Stefano D'Aronco et al
02-10-2022	A Plug-and-Play Approach to Multiparametric Quantitative MRI: Image Reconstruction using Pre-Trained Deep Denoisers by Ketan Fatania et al
02-10-2022	HNF-Netv2 for Brain Tumor Segmentation using multi-modal MR Imaging by Haozhe Jia et al
02-10-2022	A Field of Experts Prior for Adapting Neural Networks at Test Time by Neerav Karani et al

Craig SmithFebruary 14, 2022