2022.1.24 Vision papers

01-20-2022	Stitch it in Time: GAN-Based Facial Editing of Real Videos by Rotem Tzaban et al
01-20-2022	Omnivore: A Single Model for Many Visual Modalities by Rohit Girdhar et al
01-18-2022	Online Deep Learning based on Auto-Encoder by Si-si Zhang et al
01-20-2022	Learning Pixel Trajectories with Multiscale Contrastive Random Walks by Zhangxing Bian et al
01-20-2022	SPAMs: Structured Implicit Parametric Models by Pablo Palafox et al
01-19-2022	Estimating Egocentric 3D Human Pose in the Wild with External Weak Supervision by Jian Wang et al
01-20-2022	MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition by Chao-Yuan Wu et al
01-20-2022	End-to-end Generative Pretraining for Multimodal Video Captioning by Paul Hongsuck Seo et al
01-21-2022	Point-NeRF: Point-based Neural Radiance Fields by Qiangeng Xu et al
01-19-2022	ConDor: Self-Supervised Canonicalization of 3D Pose for Partial Shapes by Rahul Sajnani et al
01-20-2022	The Elements of Temporal Sentence Grounding in Videos: A Survey and Future Directions by Hao Zhang et al
01-19-2022	Nonlinear Unknown Input Observability and Unknown Input Reconstruction: The General Analytical Solution by Agostino Martinelli
01-19-2022	CAST: Character labeling in Animation using Self-supervision by Tracking by Oron Nir et al
01-20-2022	Real-time Rendering for Integral Imaging Light Field Displays Based on a Voxel-Pixel Lookup Table by Quanzhen Wan
01-20-2022	AirPose: Multi-View Fusion Network for Aerial 3D Human Pose and Shape Estimation by Nitin Saini et al
01-19-2022	Towards a General Deep Feature Extractor for Facial Expression Recognition by Liam Schoneveld et al
01-20-2022	Deep Unsupervised Contrastive Hashing for Large-Scale Cross-Modal Text-Image Retrieval in Remote Sensing by Georgii Mikriukov et al
01-20-2022	Revisiting Weakly Supervised Pre-Training of Visual Perception Models by Mannat Singh et al
01-20-2022	DIVA-DAF: A Deep Learning Framework for Historical Document Image Analysis by Lars Vögtlin et al
01-19-2022	Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation by Rishabh Jangir et al
01-20-2022	A Joint Morphological Profiles and Patch Tensor Change Detection for Hyperspectral Imagery by Zengfu Hou et al
01-20-2022	TerViT: An Efficient Ternary Vision Transformer by Sheng Xu et al
01-19-2022	Virtual Coil Augmentation Technology for MRI via Deep Learning by Cailian Yang et al
01-19-2022	Experimental Large-Scale Jet Flames Geometrical Features Extraction for Risk Management Using Infrared Images and Deep Learning Segmentation Methods by Carmina Pérez-Guerrero et al
01-19-2022	Weakly Supervised Semantic Segmentation of Remote Sensing Images for Tree Species Classification Based on Explanation Methods by Steve Ahlswede et al
01-20-2022	Predicting Vegetation Stratum Occupancy from Airborne LiDAR Data with Deep Learning by Ekaterina Kalinicheva et al
01-20-2022	PRMI: A Dataset of Minirhizotron Images for Diverse Plant Root Study by Weihuang Xu et al
01-18-2022	STURE: Spatial-Temporal Mutual Representation Learning for Robust Data Association in Online Multi-Object Tracking by Haidong Wang et al
01-19-2022	Visualization and Analysis of Wearable Health Data From COVID-19 Patients by Susanne K. Suter et al
01-19-2022	Self-supervised Video Representation Learning with Cascade Positive Retrieval by Cheng-En Wu et al
01-20-2022	Physically Embodied Deep Image Optimisation by Daniela Mihai et al
01-20-2022	WPPNets: Unsupervised CNN Training with Wasserstein Patch Priors for Image Superresolution by Fabian Altekrüger et al
01-20-2022	Modeling and hexahedral meshing of arterial networks from centerlines by Méghane Decroocq et al
01-19-2022	CELESTIAL: Classification Enabled via Labelless Embeddings with Self-supervised Telescope Image Analysis Learning by Suhas Kotha et al
01-20-2022	Domain Generalization via Frequency-based Feature Disentanglement and Interaction by Jingye Wang et al
01-20-2022	What can we learn from misclassified ImageNet images? by Shixian Wen et al
01-20-2022	A Computational Model for Machine Thinking by Slimane Larabi
01-20-2022	GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry by Yunhan Zhao et al
01-19-2022	ASL Video Corpora & Sign Bank: Resources Available through the American Sign Language Linguistic Research Project (ASLLRP) by Carol Neidle et al
01-19-2022	Superpixel Pre-Segmentation of HER2 Slides for Efficient Annotation by Mathias Öttl et al
01-20-2022	Lightweight Salient Object Detection in Optical Remote Sensing Images via Feature Correlation by Gongyang Li et al
01-19-2022	GASCN: Graph Attention Shape Completion Network by Haojie Huang et al
01-19-2022	Learning-by-Novel-View-Synthesis for Full-Face Appearance-based 3D Gaze Estimation by Jiawei Qin et al
01-20-2022	CP-Net: Contour-Perturbed Reconstruction Network for Self-Supervised Point Cloud Learning by Mingye Xu et al
01-18-2022	KappaFace: Adaptive Additive Angular Margin Loss for Deep Face Recognition by Chingis Oinar et al
01-19-2022	TransFuse: A Unified Transformer-based Image Fusion Framework using Self-supervised Learning by Linhao Qu et al
01-19-2022	A pipeline for automated processing of Corona KH-4 (1962-1972) stereo imagery by Sajid Ghuffar et al
01-20-2022	HumanIBR: High Quality Image-based Rendering of Challenging Human Performers using Sparse Views by Tiansong Zhou et al
01-19-2022	Self-Supervised Deep Blind Video Super-Resolution by Haoran Bai et al
01-19-2022	A Survey on Training Challenges in Generative Adversarial Networks for Biomedical Image Analysis by Muhammad Muneeb Saad et al
01-19-2022	Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth by Doyeon Kim et al
01-18-2022	AI-based Carcinoma Detection and Classification Using Histopathological Images: A Systematic Review by Swathi Prabhua et al
01-19-2022	Enhanced Performance of Pre-Trained Networks by Matched Augmentation Distributions by Touqeer Ahmad et al
01-19-2022	DMF-Net: Dual-Branch Multi-Scale Feature Fusion Network for copy forgery identification of anti-counterfeiting QR code by Zhongyuan Guo et al
01-19-2022	A Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view Stereo by Wang Zhao et al
01-19-2022	Real-time Recognition of Yoga Poses using computer Vision for Smart Health Care by Abhishek Sharma et al
01-19-2022	Simpler is better: spectral regularization and up-sampling techniques for variational autoencoders by Sara Björk et al
01-18-2022	Attentional Feature Refinement and Alignment Network for Aircraft Detection in SAR Imagery by Yan Zhao et al
01-18-2022	RePre: Improving Self-Supervised Vision Transformer with Reconstructive Pre-training by Luya Wang et al
01-19-2022	WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking by Chunhui Zhang et al
01-19-2022	The Role of Facial Expressions and Emotion in ASL by Lee Kezar et al
01-18-2022	Deep Learning Based Framework for Iranian License Plate Detection and Recognition by Mojtaba Shahidi Zandi et al
01-21-2022	Dangerous Cloaking: Natural Trigger based Backdoor Attacks on Object Detectors in the Physical World by Hua Ma et al
01-18-2022	TriCoLo: Trimodal Contrastive Loss for Fine-grained Text to Shape Retrieval by Yue Ruan et al
01-18-2022	Pruning-aware Sparse Regularization for Network Pruning by Nanfei Jiang et al
01-18-2022	Weakly Supervised Contrastive Learning for Better Severity Scoring of Lung Ultrasound by Gautam Rajendrakumar Gare et al
01-19-2022	Variable Augmented Network for Invertible MR Coil Compression by Xianghao Liao et al
01-18-2022	Lung Swapping Autoencoder: Learning a Disentangled Structure-texture Representation of Chest Radiographs by Lei Zhou et al
01-19-2022	ROS georegistration: Aerial Multi-spectral Image Simulator for the Robot Operating System by Andrew R. Willis et al
01-19-2022	Object Detection in Autonomous Vehicles: Status and Open Challenges by Abhishek Balasubramaniam et al
01-20-2022	Watermarking Pre-trained Encoders in Contrastive Learning by Yutong Wu et al
01-20-2022	SoftDropConnect (SDC) -- Effective and Efficient Quantification of the Network Uncertainty in Deep MR Image Analysis by Qing Lyu et al
01-18-2022	Poseur: Direct Human Pose Regression with Transformers by Weian Mao et al
01-19-2022	Learned Cone-Beam CT Reconstruction Using Neural Ordinary Differential Equations by Mareike Thies et al
01-18-2022	Swin-Pose: Swin Transformer Based Human Pose Estimation by Zinan Xiong et al
01-19-2022	BLINC: Lightweight Bimodal Learning for Low-Complexity VVC Intra Coding by Farhad Pakdaman et al
01-19-2022	Using Self-Supervised Pretext Tasks for Active Learning by John Seon Keun Yi et al
01-21-2022	Distance-Ratio-Based Formulation for Metric Learning by Hyeongji Kim et al
01-20-2022	A Visual Analytics Approach to Building Logistic Regression Models and its Application to Health Records by Erasmo Artur et al
01-19-2022	Open Source Handwritten Text Recognition on Medieval Manuscripts using Mixed Models and Document-Specific Finetuning by Christian Reul et al
01-18-2022	When Facial Expression Recognition Meets Few-Shot Learning: A Joint and Alternate Learning Framework by Xinyi Zou et al
01-21-2022	Conceptor Learning for Class Activation Mapping by Guangwu Qian et al
01-18-2022	OSSID: Online Self-Supervised Instance Detection by (and for) Pose Estimation by Qiao Gu et al
01-18-2022	Resistance Training using Prior Bias: toward Unbiased Scene Graph Generation by Chao Chen et al
01-18-2022	Adaptive Weighted Guided Image Filtering for Depth Enhancement in Shape-From-Focus by Yuwen Li et al
01-21-2022	VIPriors 2: Visual Inductive Priors for Data-Efficient Deep Learning Challenges by Attila Lengyel et al
01-18-2022	Pistol: Pupil Invisible Supportive Tool to extract Pupil, Iris, Eye Opening, Eye Movements, Pupil and Iris Gaze Vector, and 2D as well as 3D Gaze by Wolfgang Fuhl et al
01-18-2022	The Role of Pleura and Adipose in Lung Ultrasound AI by Gautam Rajendrakumar Gare et al
01-18-2022	Deformable One-Dimensional Object Detection for Routing and Manipulation by Azarakhsh Keipour et al
01-21-2022	What Can Machine Vision Do for Lymphatic Histopathology Image Analysis: A Comprehensive Review by Xiaoqi Li et al
01-21-2022	SparseAlign: A Super-Resolution Algorithm for Automatic Marker Localization and Deformation Estimation in Cryo-Electron Tomography by Poulami Somanya Ganguly et al
01-19-2022	Q-ViT: Fully Differentiable Quantization for Vision Transformer by Zhexin Li et al
01-19-2022	Semi-automatic 3D Object Keypoint Annotation and Detection for the Masses by Kenneth Blomqvist et al
01-21-2022	Multi-view Monocular Depth and Uncertainty Prediction with Deep SfM in Dynamic Environments by Christian Homeyer et al
01-20-2022	An Empirical Investigation of Model-to-Model Distribution Shifts in Trained Convolutional Filters by Paul Gavrikov et al
01-20-2022	Vertical Federated Edge Learning with Distributed Integrated Sensing and Communication by Peixi Liu et al
01-19-2022	High-fidelity 3D Model Compression based on Key Spheres by Yuanzhan Li et al
01-21-2022	SegTransVAE: Hybrid CNN -- Transformer with Regularization for medical image segmentation by Quan-Dung Pham et al
01-21-2022	Enhancing Pseudo Label Quality for Semi-SupervisedDomain-Generalized Medical Image Segmentation by Huifeng Yao et al
01-20-2022	FaceOcc: A Diverse, High-quality Face Occlusion Dataset for Human Face Extraction by Xiangnan Yin et al
01-19-2022	Improving Specificity in Mammography Using Cross-correlation between Wavelet and Fourier Transform by Liuhua Zhang
01-21-2022	Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization by Can Wang et al
01-21-2022	Contrastive and Selective Hidden Embeddings for Medical Image Segmentation by Zhuowei Li et al
01-21-2022	Object Detection in Aerial Images: What Improves the Accuracy? by Hashmat Shadab Malik et al
01-21-2022	Classroom Slide Narration System by Jobin K. V. et al
01-19-2022	GroupGazer: A Tool to Compute the Gaze per Participant in Groups with integrated Calibration to Map the Gaze Online to a Screen or Beamer Projection by Wolfgang Fuhl
01-21-2022	A Comprehensive Study of Vision Transformers on Dense Prediction Tasks by Kishaan Jeeveswaran et al
01-21-2022	ERS: a novel comprehensive endoscopy image dataset for machine learning, compliant with the MST 3.0 specification by Jan Cychnerski et al
01-21-2022	Exploring Fusion Strategies for Accurate RGBT Visual Object Tracking by Zhangyong Tang et al
01-21-2022	AiTLAS: Artificial Intelligence Toolbox for Earth Observation by Ivica Dimitrovski et al
01-21-2022	Dynamic Deep Convolutional Candlestick Learner by Jun-Hao Chen et al
01-21-2022	Improving Across-Dataset Brain Tissue Segmentation Using Transformer by Vishwanatha M. Rao et al
01-21-2022	Reliable Detection of Doppelg\angers based on Deep Face Representations by Christian Rathgeb et al
01-18-2022	DDU-Net: Dual-Decoder-U-Net for Road Extraction Using High-Resolution Remote Sensing Images by Ying Wang et al
01-20-2022	Steerable Pyramid Transform Enables Robust Left Ventricle Quantification by Xiangyang Zhu et al
01-21-2022	Fast Differentiable Matrix Square Root by Yue Song et al

Craig SmithJanuary 24, 2022