2022.2.21 Vision papers

02-16-2022	Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision by Priya Goyal et al
02-16-2022	A Data-Augmentation Is Worth A Thousand Samples: Exact Quantification From Analytical Augmented Sample Moments by Randall Balestriero et al
02-16-2022	AKB-48: A Real-World Articulated Object Knowledge Base by Liu Liu et al
02-16-2022	Limitations of Neural Collapse for Understanding Generalization in Deep Learning by Like Hui et al
02-15-2022	General-purpose, long-context autoregressive modeling with Perceiver AR by Curtis Hawthorne et al
02-16-2022	Anomalib: A Deep Learning Library for Anomaly Detection by Samet Akcay et al
02-15-2022	Don't Lie to Me! Robust and Efficient Explainability with Verified Perturbation Analysis by Thomas Fel et al
02-16-2022	Ditto: Building Digital Twins of Articulated Objects from Interaction by Zhenyu Jiang et al
02-16-2022	Learning Smooth Neural Functions via Lipschitz Regularization by Hsueh-Ti Derek Liu et al
02-15-2022	ViNTER: Image Narrative Generation with Emotion-Arc-Aware Transformer by Kohei Uehara et al
02-15-2022	CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval by Licheng Yu et al
02-17-2022	Feels Bad Man: Dissecting Automated Hateful Meme Detection Through the Lens of Facebook's Challenge by Catherine Jennifer et al
02-15-2022	Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations by Youwei Liang et al
02-16-2022	Bias in Automated Image Colorization: Metrics and Error Types by Frank Stapel et al
02-17-2022	V2X-Sim: A Virtual Collaborative Perception Dataset for Autonomous Driving by Yiming Li et al
02-17-2022	Grammar-Based Grounded Lexicon Learning by Jiayuan Mao et al
02-15-2022	Fairness Indicators for Systematic Assessments of Visual Feature Extractors by Priya Goyal et al
02-16-2022	Generative modeling with projected entangled-pair states by Tom Vieijra et al
02-17-2022	Fourier PlenOctrees for Dynamic Radiance Field Rendering in Real-time by Liao Wang et al
02-17-2022	A study of deep perceptual metrics for image quality assessment by Rémi Kazmierczak et al
02-15-2022	Ab-initio Contrast Estimation and Denoising of Cryo-EM Images by Yunpeng Shi et al
02-17-2022	General Cyclical Training of Neural Networks by Leslie N. Smith
02-17-2022	OmniSyn: Synthesizing 360 Videos with Wide-baseline Panoramas by David Li et al
02-16-2022	Planckian jitter: enhancing the color quality of self-supervised visual representations by Simone Zini et al
02-16-2022	When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in Vlogs by Oana Ignat et al
02-16-2022	Evaluation and Analysis of Different Aggregation and Hyperparameter Selection Methods for Federated Brain Tumor Segmentation by Ece Isik-Polat et al
02-16-2022	OpenKBP-Opt: An international and reproducible evaluation of 76 knowledge-based planning pipelines by Aaron Babier et al
02-16-2022	Meta Knowledge Distillation by Jihao Liu et al
02-16-2022	Can Deep Learning be Applied to Model-Based Multi-Object Tracking? by Juliano Pinto et al
02-15-2022	Neural Architecture Search for Dense Prediction Tasks in Computer Vision by Thomas Elsken et al
02-18-2022	Autoencoding Low-Resolution MRI for Semantically Smooth Interpolation of Anisotropic MRI by Jörg Sander et al
02-17-2022	A hybrid 2-stage vision transformer for AI-assisted 5 class pathologic diagnosis of gastric endoscopic biopsies by Yujin Oh et al
02-17-2022	A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning by Hengshun Zhou et al
02-17-2022	CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving by Yinuo Zhao et al
02-18-2022	VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion by Disong Wang et al
02-17-2022	Two-Stage Architectural Fine-Tuning with Neural Architecture Search using Early-Stopping in Image Classification by Youngkee Kim et al
02-17-2022	CSCNet: Contextual Semantic Consistency Network for Trajectory Prediction in Crowded Spaces by Beihao Xia et al
02-17-2022	Dynamic Object Comprehension: A Framework For Evaluating Artificial Visual Perception by Scott Y. L. Chin et al
02-16-2022	CortexODE: Learning Cortical Surface Reconstruction by Neural ODEs by Qiang Ma et al
02-18-2022	VLP: A Survey on Vision-Language Pre-training by Feilong Chen et al
02-16-2022	Learning to Generalize across Domains on Single Test Samples by Zehao Xiao et al
02-16-2022	A multi-reconstruction study of breast density estimation using Deep Learning by Vikash Gupta et al
02-15-2022	DualConv: Dual Convolutional Kernels for Lightweight Deep Neural Networks by Jiachen Zhong et al
02-17-2022	Detecting and Learning the Unknown in Semantic Segmentation by Robin Chan et al
02-16-2022	IPD: An Incremental Prototype based DBSCAN for large-scale data with cluster representatives by Jayasree Saha et al
02-16-2022	Cross-Modal Common Representation Learning with Triplet Loss Functions by Felix Ott et al
02-16-2022	Diagnosing Batch Normalization in Class Incremental Learning by Minghao Zhou et al
02-15-2022	A Unified Framework for Masked and Mask-Free Face Recognition via Feature Rectification by Shaozhe Hao et al
02-17-2022	Point Cloud Generation with Continuous Conditioning by Larissa T. Triess et al
02-16-2022	PENCIL: Deep Learning with Noisy Labels by Kun Yi et al
02-17-2022	KINet: Keypoint Interaction Networks for Unsupervised Forward Modeling by Alireza Rezazadeh et al
02-17-2022	Survey on Self-supervised Representation Learning Using Image Transformations by Muhammad Ali et al
02-17-2022	CLS: Cross Labeling Supervision for Semi-Supervised Learning by Yao Yao et al
02-16-2022	ADAM Challenge: Detecting Age-related Macular Degeneration from Fundus Images by Huihui Fang et al
02-16-2022	Phase Aberration Robust Beamformer for Planewave US Using Self-Supervised Learning by Shujaat Khan et al
02-17-2022	TransCG: A Large-Scale Real-World Dataset for Transparent Object Depth Completion and Grasping by Hongjie Fang et al
02-18-2022	Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder by Xiaoyu Lin et al
02-17-2022	Synthetic data for unsupervised polyp segmentation by Enric Moreu et al
02-17-2022	EBHI: A New Enteroscope Biopsy Histopathological H&E Image Dataset for Image Classification Evaluation by Weiming Hu et al
02-18-2022	Generalizing Aggregation Functions in GNNs: High-Capacity GNNs via Nonlinear Neighborhood Aggregators by Beibei Wang et al
02-17-2022	An overview of deep learning in medical imaging by Imran Ul Haq
02-16-2022	Visual attention analysis of pathologists examining whole slide images of Prostate cancer by Souradeep Chakraborty et al
02-15-2022	Few-shot semantic segmentation via mask aggregation by Wei Ao et al
02-17-2022	End-to-end Neuron Instance Segmentation based on Weakly Supervised Efficient UNet and Morphological Post-processing by Huaqian Wu et al
02-15-2022	Reducing Overconfidence Predictions for Autonomous Driving Perception by Gledson Melotti et al
02-18-2022	Critical Checkpoints for Evaluating Defence Models Against Adversarial Attack and Robustness by Kanak Tekwani et al
02-17-2022	PCB Component Detection using Computer Vision for Hardware Assurance by Wenwei Zhao et al
02-17-2022	TraSeTR: Track-to-Segment Transformer with Contrastive Query for Instance-level Instrument Segmentation in Robotic Surgery by Zixu Zhao et al
02-15-2022	Lie Point Symmetry Data Augmentation for Neural PDE Solvers by Johannes Brandstetter et al
02-15-2022	Applying adversarial networks to increase the data efficiency and reliability of Self-Driving Cars by Aakash Kumar
02-17-2022	Domain Randomization for Object Counting by Enric Moreu et al
02-17-2022	Mirror-Yolo: An attention-based instance segmentation and detection model for mirrors by Fengze Li et al
02-15-2022	Beyond Natural Motion: Exploring Discontinuity for Video Frame Interpolation by Sangjin Lee et al
02-16-2022	ActionFormer: Localizing Moments of Actions with Transformers by Chenlin Zhang et al
02-17-2022	Adiabatic Quantum Computing for Multi Object Tracking by Jan-Nico Zaech et al
02-15-2022	Multimodal Driver Referencing: A Comparison of Pointing to Objects Inside and Outside the Vehicle by Abdul Rafey Aftab et al
02-17-2022	Visual Ground Truth Construction as Faceted Classification by Fausto Giunchiglia et al
02-15-2022	Exploring the Devil in Graph Spectral Domain for 3D Point Cloud Attacks by Qianjiang Hu et al
02-16-2022	FPIC: A Novel Semantic Dataset for Optical PCB Assurance by Nathan Jessurun et al
02-15-2022	Deep Constrained Least Squares for Blind Image Super-Resolution by Ziwei Luo et al
02-17-2022	3D-Aware Indoor Scene Synthesis with Depth Priors by Zifan Shi et al
02-18-2022	MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery by Ahmad Khaliq et al
02-16-2022	Image translation of Ultrasound to Pseudo Anatomical Display Using Artificial Intelligence by Lilach Barkat et al
02-15-2022	A Survey of Semen Quality Evaluation in Microscopic Videos Using Computer Assisted Sperm Analysis by Wenwei Zhao et al
02-15-2022	Segmentation and Risk Score Prediction of Head and Neck Cancers in PET/CT Volumes with 3D U-Net and Cox Proportional Hazard Neural Networks by Fereshteh Yousefirizi et al
02-15-2022	RNGDet: Road Network Graph Detection by Transformer in Aerial Images by Zhenhua Xu et al
02-15-2022	Spatial Transformer K-Means by Romain Cosentino et al
02-17-2022	R2-D2: Repetitive Reprediction Deep Decipher for Semi-Supervised Deep Learning by Guo-Hua Wang et al
02-17-2022	Point cloud completion on structured feature map with feedback network by Zejia Su et al
02-17-2022	Anatomically Parameterized Statistical Shape Model: Explaining Morphometry through Statistical Learning by Arnaud Boutillon et al
02-15-2022	Review of the Fingerprint Liveness Detection (LivDet) competition series: from 2009 to 2021 by Marco Micheletto et al
02-15-2022	Texture Aware Autoencoder Pre-training And Pairwise Learning Refinement For Improved Iris Recognition by Manashi Chakraborty et al
02-16-2022	How to Fill the Optimum Set? Population Gradient Descent with Harmless Diversity by Chengyue Gong et al
02-16-2022	360 Depth Estimation in the Wild -- The Depth360 Dataset and the SegFuse Network by Qi Feng et al
02-16-2022	Shift-Memory Network for Temporal Scene Segmentation by Guo Cheng et al
02-17-2022	Single UHD Image Dehazing via Interpretable Pyramid Network by Boxue Xiao et al
02-17-2022	Domain Adaptation for Underwater Image Enhancement via Content and Style Separation by Yu-Wei Chen et al
02-15-2022	Deeply-Supervised Knowledge Distillation by Shiya Luo et al

02-17-2022	How Well Do Self-Supervised Methods Perform in Cross-Domain Few-Shot Learning? by Yiyi Zhang et al
02-15-2022	A precortical module for robust CNNs to light variations by R. Fioresi et al
02-15-2022	Deep Learning-based Anomaly Detection on X-ray Images of Fuel Cell Electrodes by Simon B. Jensen et al
02-17-2022	Semantically Proportional Patchmix for Few-Shot Learning by Jingquan Wang et al
02-18-2022	Exploring Adversarially Robust Training for Unsupervised Domain Adaptation by Shao-Yuan Lo et al
02-17-2022	LG-LSQ: Learned Gradient Linear Symmetric Quantization by Shih-Ting Lin et al
02-17-2022	An Active and Contrastive Learning Framework for Fine-Grained Off-Road Semantic Segmentation by Biao Gao et al
02-17-2022	TAFNet: A Three-Stream Adaptive Fusion Network for RGB-T Crowd Counting by Haihan Tang et al
02-17-2022	REFUGE2 Challenge: Treasure for Multi-Domain Learning in Glaucoma Assessment by Huihui Fang et al
02-16-2022	Unified smoke and fire detection in an evolutionary framework with self-supervised progressive data augment by Hang Zhang et al
02-15-2022	Label fusion and training methods for reliable representation of inter-rater uncertainty by Andreanne Lemay et al
02-16-2022	Practical Network Acceleration with Tiny Sets by Guo-Hua Wang et al
02-18-2022	Iterative Learning for Instance Segmentation by Tuomas Sormunen et al
02-16-2022	Neural Marionette: Unsupervised Learning of Motion Skeleton and Latent Dynamics from Volumetric Video by Jinseok Bae et al
02-15-2022	Balancing Domain Experts for Long-Tailed Camera-Trap Recognition by Byeongjun Park et al
02-18-2022	Incorporating Texture Information into Dimensionality Reduction for High-Dimensional Images by Alexander Vieth et al
02-18-2022	A Machine Learning Paradigm for Studying Pictorial Realism: Are Constable's Clouds More Real than His Contemporaries? by Zhuomin Zhang et al
02-18-2022	Lightweight Multi-Drone Detection and 3D-Localization via YOLO by Aryan Sharma et al
02-16-2022	Contextualize differential privacy in image database: a lightweight image differential privacy approach based on principle component analysis inverse by Shiliang Zhang et al
02-15-2022	HAA4D: Few-Shot Human Atomic Action Recognition via 3D Spatio-Temporal Skeletal Alignment by Mu-Ruei Tseng et al
02-15-2022	MeshLeTemp: Leveraging the Learnable Vertex-Vertex Relationship to Generalize Human Pose and Mesh Reconstruction for In-the-Wild Scenes by Trung Q. Tran et al
02-17-2022	Classification of ADHD Patients by Kernel Hierarchical Extreme Learning Machine by Sartaj Ahmed Salman et al
02-15-2022	Using Social Media Images for Building Function Classification by Eike Jens Hoffmann et al
02-15-2022	Cross-view and Cross-domain Underwater Localization based on Optical Aerial and Acoustic Underwater Images by Matheus M. Dos Santos et al
02-17-2022	Level set based particle filter driven by optical flow: an application to track the salt boundary from X-ray CT time-series by Karim Makki et al
02-18-2022	Towards better understanding and better generalization of few-shot classification in histology images with contrastive learning by Jiawei Yang et al
02-18-2022	(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering by Anoop Cherian et al
02-18-2022	Task Specific Attention is one more thing you need for object detection by Sang Yon Lee
02-15-2022	Hyper-relationship Learning Network for Scene Graph Generation by Yibing Zhan et al
02-17-2022	Colonoscopy polyp detection with massive endoscopic images by Jialin Yu et al
02-16-2022	Learning to Detect People on the Fly: A Bio-inspired Event-based Visual System for Drones by Ali Safa et al
02-15-2022	Random Walks for Adversarial Meshes by Amir Belder et al
02-17-2022	A Wavelet-based Dual-stream Network for Underwater Image Enhancement by Ziyin Ma et al
02-15-2022	PCRP: Unsupervised Point Cloud Object Retrieval and Pose Estimation by Pranav Kadam et al
02-16-2022	Label Propagation for Annotation-Efficient Nuclei Segmentation from Pathology Images by Yi Lin et al
02-16-2022	Less is More: Surgical Phase Recognition from Timestamp Supervision by Zixun Wang et al
02-17-2022	Joint Learning of Frequency and Spatial Domains for Dense Predictions by Shaocheng Jia et al
02-17-2022	Realistic Blur Synthesis for Learning Image Deblurring by Jaesung Rim et al
02-16-2022	FUN-SIS: a Fully UNsupervised approach for Surgical Instrument Segmentation by Luca Sestini et al
02-16-2022	Learning to Adapt to Light by Kai-Fu Yang et al
02-15-2022	Improving the repeatability of deep learning models with Monte Carlo dropout by Andreanne Lemay et al
02-17-2022	A Comprehensive Survey with Quantitative Comparison of Image Analysis Methods for Microorganism Biovolume Measurements by Jiawei Zhang et al
02-17-2022	Graph Convolutional Networks for Multi-modality Medical Imaging: Methods, Architectures, and Clinical Applications by Kexin Ding et al
02-18-2022	Towards Simple and Accurate Human Pose Estimation with Stair Network by Chenru Jiang et al
02-15-2022	Energy-Efficient Parking Analytics System using Deep Reinforcement Learning by Yoones Rezaei et al
02-15-2022	SODAR: Segmenting Objects by Dynamically Aggregating Neighboring Mask Representations by Tao Wang et al
02-16-2022	A Developmentally-Inspired Examination of Shape versus Texture Bias in Machines by Alexa R. Tartaglini et al
02-15-2022	On Representation Learning with Feedback by Hao Li
02-15-2022	ScoreNet: Learning Non-Uniform Attention and Augmentation for Transformer-Based Histopathological Image Classification by Thomas Stegmüller et al
02-15-2022	Post-Training Quantization for Cross-Platform Learned Image Compression by Dailan He et al
02-17-2022	Prior image-based medical image reconstruction using a style-based generative adversarial network by Varun A. Kelkar et al
02-17-2022	When, Why, and Which Pretrained GANs Are Useful? by Timofey Grigoryev et al
02-17-2022	On Guiding Visual Attention with Language Specification by Suzanne Petryk et al
02-18-2022	Spatio-Temporal Outdoor Lighting Aggregation on Image Sequences using Transformer Networks by Haebom Lee et al
02-18-2022	Guide Local Feature Matching by Overlap Estimation by Ying Chen et al
02-15-2022	A Subjective Quality Study for Video Frame Interpolation by Duolikun Danier et al
02-16-2022	Flexible-Modal Face Anti-Spoofing: A Benchmark by Zitong Yu et al
02-15-2022	Beyond Deterministic Translation for Unsupervised Domain Adaptation by Eleni Chiou et al
02-15-2022	Normalized K-Means for Noise-Insensitive Multi-Dimensional Feature Learning by Nicholas Pellegrino et al
02-15-2022	Enhancing Deformable Convolution based Video Frame Interpolation with Coarse-to-fine 3D CNN by Duolikun Danier et al
02-15-2022	Misinformation Detection in Social Media Video Posts by Kehan Wang et al
02-17-2022	Machine learning models and facial regions videos for estimating heart rate: a review on Patents, Datasets and Literature by Tiago Palma Pagano et al
02-15-2022	Privacy Preserving Visual Question Answering by Cristian-Paul Bara et al
02-16-2022	Cyclical Focal Loss by Leslie N. Smith
02-15-2022	Deep Learning-Assisted Co-registration of Full-Spectral Autofluorescence Lifetime Microscopic Images with H&E-Stained Histology Images by Qiang Wang et al
02-17-2022	Deep Transfer Learning on Satellite Imagery Improves Air Quality Estimates in Developing Nations by Nishant Yadav et al
02-15-2022	Self-Supervised Class-Cognizant Few-Shot Classification by Ojas Kishore Shirekar et al
02-17-2022	Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study by Giovanni Cioffi et al
02-17-2022	Developing Imperceptible Adversarial Patches to Camouflage Military Assets From Computer Vision Enabled Technologies by Christopher Wise et al

Craig Smith, February 21, 2022