2022.4.18 Vision papers

04-12-2022	GARF: Gaussian Activated Radiance Fields for High Fidelity Reconstruction and Pose Estimation by Shin-Fang Chng et al
04-14-2022	DeiT III: Revenge of the ViT by Hugo Touvron et al
04-14-2022	Neighborhood Attention Transformer by Ali Hassani et al
04-14-2022	Masked Siamese Networks for Label-Efficient Learning by Mahmoud Assran et al
04-14-2022	Sim-to-Real 6D Object Pose Estimation via Iterative Self-training for Robotic Bin-picking by Kai Chen et al
04-13-2022	COAP: Compositional Articulated Occupancy of People by Marko Mihajlovic et al
04-14-2022	Any-resolution Training for High-resolution Image Synthesis by Lucy Chai et al
04-13-2022	DL4SciVis: A State-of-the-Art Survey on Deep Learning for Scientific Visualization by Chaoli Wang et al
04-14-2022	Whats in your hands? 3D Reconstruction of Generic Objects in Hands by Yufei Ye et al
04-13-2022	Geometric Understanding of Sketches by Raghav Brahmadesam Venkataramaiyer
04-12-2022	Machine Learning Security against Data Poisoning: Are We There Yet? by Antonio Emanuele Cinà et al
04-14-2022	MiniViT: Compressing Vision Transformers with Weight Multiplexing by Jinnian Zhang et al
04-14-2022	GIFS: Neural Implicit Function for General Shape Representation by Jianglong Ye et al
04-12-2022	ReCLIP: A Strong Zero-Shot Baseline for Referring Expression Comprehension by Sanjay Subramanian et al
04-13-2022	Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis by Xuanmeng Zhang et al
04-14-2022	Ensuring accurate stain reproduction in deep generative networks for virtual immunohistochemistry by Christopher D. Walsh et al
04-13-2022	Wassmap: Wasserstein Isometric Mapping for Image Manifold Learning by Keaton Hamm et al
04-14-2022	BEHAVE: Dataset and Method for Tracking Human Object Interactions by Bharat Lal Bhatnagar et al
04-13-2022	Controllable Video Generation through Global and Local Motion Dynamics by Aram Davtyan et al
04-12-2022	Open-World Instance Segmentation: Exploiting Pseudo Ground Truth From Learned Pairwise Affinity by Weiyao Wang et al
04-13-2022	Towards Metrical Reconstruction of Human Faces by Wojciech Zielonka et al
04-14-2022	A Level Set Theory for Neural Implicit Evolution under Explicit Flows by Ishit Mehta et al
04-13-2022	Deep Learning-based Framework for Automatic Cranial Defect Reconstruction and Implant Modeling by Marek Wodzinski et al
04-12-2022	VisCUIT: Visual Auditor for Bias in CNN Image Classifier by Seongmin Lee et al
04-14-2022	Deformable Sprites for Unsupervised Video Decomposition by Vickie Ye et al
04-14-2022	Geometric Deep Learning to Identify the Critical 3D Structural Features of the Optic Nerve Head for Glaucoma Diagnosis by Fabian A. Braeu et al
04-13-2022	What Matters in Language Conditioned Robotic Imitation Learning by Oier Mees et al
04-13-2022	Reuse your features: unifying retrieval and feature-metric alignment by Javier Morlana et al
04-12-2022	Generative Negative Replay for Continual Learning by Gabriele Graffieti et al
04-14-2022	Interpretability of Machine Learning Methods Applied to Neuroimaging by Elina Thibeau-Sutre et al
04-13-2022	Deep Learning Model with GA based Feature Selection and Context Integration by Ranju Mandal et al
04-14-2022	From Environmental Sound Representation to Robustness of 2D CNN Models Against Adversarial Attacks by Mohammad Esmaeilpour et al
04-15-2022	MVSTER: Epipolar Transformer for Efficient Multi-View Stereo by Xiaofeng Wang et al
04-13-2022	TIB-VA at SemEval-2022 Task 5: A Multimodal Architecture for the Detection and Classification of Misogynous Memes by Sherzod Hakimov et al
04-14-2022	HyDe: The First Open-Source, Python-Based, GPU-Accelerated Hyperspectral Denoising Package by Daniel Coquelin et al
04-14-2022	Modeling Indirect Illumination for Inverse Rendering by Yuanqing Zhang et al
04-12-2022	X-DETR: A Versatile Architecture for Instance-wise Vision-Language Tasks by Zhaowei Cai et al
04-12-2022	Back to the Roots: Reconstructing Large and Complex Cranial Defects using an Image-based Statistical Shape Model by Jianning Li et al
04-13-2022	Active Diffusion and VCA-Assisted Image Segmentation of Hyperspectral Images by Sam L. Polk et al
04-14-2022	Medical Application of Geometric Deep Learning for the Diagnosis of Glaucoma by Alexandre H. Thiery et al
04-14-2022	Guided Co-Modulated GAN for 360{\deg} Field of View Extrapolation by Mohammad Reza Karimi Dastjerdi et al
04-14-2022	Unsupervised Deep Learning Meets Chan-Vese Model by Dihan Zheng et al
04-12-2022	Examining the Proximity of Adversarial Examples to Class Manifolds in Deep Networks by Štefan Pócoš et al
04-13-2022	Dynamic Neural Textures: Generating Talking-Face Videos with Continuously Controllable Expressions by Zipeng Ye et al
04-12-2022	Multi-View Breast Cancer Classification via Hypercomplex Neural Networks by Eleonora Lopez et al
04-12-2022	LifeLonger: A Benchmark for Continual Disease Classification by Mohammad Mahdi Derakhshani et al
04-12-2022	GORDA: Graph-based ORientation Distribution Analysis of SLI scatterometry Patterns of Nerve Fibres by Esteban Vaca et al
04-12-2022	Continual Predictive Learning from Videos by Geng Chen et al
04-12-2022	RL-CoSeg : A Novel Image Co-Segmentation Algorithm with Deep Reinforcement Learning by Xin Duan et al
04-14-2022	The multi-modal universe of fast-fashion: the Visuelle 2.0 benchmark by Geri Skenderi et al
04-12-2022	Unsupervised Anomaly and Change Detection with Multivariate Gaussianization by José A. Padrón-Hidalgo et al
04-13-2022	Estimating Structural Disparities for Face Models by Shervin Ardeshir et al
04-12-2022	Automatic detection of glaucoma via fundus imaging and artificial intelligence: A review by Lauren Coan et al
04-14-2022	Sketch guided and progressive growing GAN for realistic and editable ultrasound image synthesis by Jiamin Liang et al
04-14-2022	LEFM-Nets: Learnable Explicit Feature Map Deep Networks for Segmentation of Histopathological Images of Frozen Sections by Dario Sitnik et al
04-12-2022	Adaptive Cross-Attention-Driven Spatial-Spectral Graph Convolutional Network for Hyperspectral Image Classification by Jin-Yu Yang et al
04-13-2022	Context-based Deep Learning Architecture with Optimal Integration Layer for Image Parsing by Ranju Mandal et al
04-12-2022	Towards Open-Set Object Detection and Discovery by Jiyang Zheng et al
04-15-2022	Deep CardioSound: An Ensembled Deep Learning Model for Heart Sound MultiLabelling by Li Guo et al
04-14-2022	Learning Spatially Varying Pixel Exposures for Motion Deblurring by Cindy M. Nguyen et al
04-13-2022	Learning Convolutional Neural Networks in the Frequency Domain by Hengyue Pan et al
04-12-2022	Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search by Minbin Huang et al
04-13-2022	DMCNet: Diversified Model Combination Network for Understanding Engagement from Video Screengrabs by Sarthak Batra et al
04-14-2022	Q-TART: Quickly Training for Adversarial Robustness and in-Transferability by Madan Ravi Ganesh et al
04-12-2022	Compact Model Training by Low-Rank Projection with Energy Transfer by Kailing Guo et al
04-14-2022	Atmospheric Turbulence Removal with Complex-Valued Convolutional Neural Network by Nantheera Anantrasirichai
04-14-2022	Cross-Image Relational Knowledge Distillation for Semantic Segmentation by Chuanguang Yang et al
04-12-2022	TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation by Wenqiang Zhang et al
04-13-2022	ViViD++: Vision for Visibility Dataset by Alex Junho Lee et al
04-13-2022	Defensive Patches for Robust Recognition in the Physical World by Jiakai Wang et al
04-12-2022	Video Captioning: a comparative review of where we are and which could be the route by Daniela Moctezuma et al
04-12-2022	Probabilistic Compositional Embeddings for Multimodal Image Retrieval by Andrei Neculai et al
04-13-2022	Receding Neuron Importances for Structured Pruning by Mihai Suteu et al
04-13-2022	HASA: Hybrid Architecture Search with Aggregation Strategy for Echinococcosis Classification and Ovary Segmentation in Ultrasound Images by Jikuan Qian et al
04-14-2022	Detection of Degraded Acacia tree species using deep neural networks on uav drone imagery by Anne Achieng Osio et al
04-12-2022	NightLab: A Dual-level Architecture with Hardness Detection for Segmentation at Night by Xueqing Deng et al
04-13-2022	WSSS4LUAD: Grand Challenge on Weakly-supervised Tissue Semantic Segmentation for Lung Adenocarcinoma by Chu Han et al
04-15-2022	Vision-and-Language Pretrained Models: A Survey by Siqu Long et al
04-12-2022	SwinNet: Swin Transformer drives edge-aware RGB-D and RGB-T salient object detection by Zhengyi Liu et al
04-13-2022	Deep learning based automatic detection of offshore oil slicks using SAR data and contextual information by Emna Amri et al
04-12-2022	Hierarchical Text-Conditional Image Generation with CLIP Latents by Aditya Ramesh et al
04-12-2022	Regression or Classification? Reflection on BP prediction from PPG data using Deep Neural Networks in the scope of practical applications by Fabian Schrumpf et al
04-12-2022	On the Equity of Nuclear Norm Maximization in Unsupervised Domain Adaptation by Wenju Zhang et al
04-12-2022	HyperDet3D: Learning a Scene-conditioned 3D Object Detector by Yu Zheng et al
04-14-2022	High-performance Evolutionary Algorithms for Online Neuron Control by Binxu Wang et al
04-12-2022	Towards Reliable Image Outpainting: Learning Structure-Aware Multimodal Fusion with Depth Guidance by Lei Zhang et al
04-12-2022	Undoing the Damage of Label Shift for Cross-domain Semantic Segmentation by Yahao Liu et al
04-13-2022	5G Features and Standards for Vehicle Data Exploitation by Gorka Velez et al
04-14-2022	Semi-Supervised Training to Improve Player and Ball Detection in Soccer by Renaud Vandeghen et al
04-12-2022	Open-set Text Recognition via Character-Context Decoupling by Chang Liu et al
04-14-2022	Activation Regression for Continuous Domain Generalization with Applications to Crop Classification by Samar Khanna et al
04-12-2022	Exploring Event Camera-based Odometry for Planetary Robots by Florian Mahlknecht et al
04-12-2022	Content and Style Aware Generation of Text-line Images for Handwriting Recognition by Lei Kang et al
04-12-2022	Neural Texture Extraction and Distribution for Controllable Person Image Synthesis by Yurui Ren et al
04-13-2022	Calibrating Class Weights with Multi-Modal Information for Partial Video Domain Adaptation by Xiyu Wang et al
04-12-2022	DAIR-V2X: A Large-Scale Dataset for Vehicle-Infrastructure Cooperative 3D Object Detection by Haibao Yu et al
04-12-2022	Malceiver: Perceiver with Hierarchical and Multi-modal Features for Android Malware Detection by Niall McLaughlin
04-14-2022	YOLO-Pose: Enhancing YOLO for Multi Person Pose Estimation Using Object Keypoint Similarity Loss by Debapriya Maji et al
04-14-2022	Clothes-Changing Person Re-identification with RGB Modality Only by Xinqian Gu et al
04-14-2022	SemiMultiPose: A Semi-supervised Multi-animal Pose Estimation Framework by Ari Blau et al
04-13-2022	3D-SPS: Single-Stage 3D Visual Grounding via Referred Point Progressive Selection by Junyu Luo et al
04-12-2022	Super-Resolution for Selfie Biometrics: Introduction and Application to Face and Iris by Fernando Alonso-Fernandez et al
04-12-2022	3DeformRS: Certifying Spatial Deformations on Point Clouds by Gabriel Pérez S. et al
04-15-2022	COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval by Haoyu Lu et al
04-14-2022	Look Back and Forth: Video Super-Resolution with Explicit Temporal Difference Modeling by Takashi Isobe et al
04-13-2022	Rapid model transfer for medical image segmentation via iterative human-in-the-loop update: from labelled public to unlabelled clinical datasets for multi-organ segmentation in CT by Wenao Ma et al
04-13-2022	Transparent Shape from Single Polarization Images by Shao Mingqi et al
04-14-2022	Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning by Feilong Chen et al
04-14-2022	Explainable Analysis of Deep Learning Methods for SAR Image Classification by Shenghan Su et al
04-13-2022	Recognition of Freely Selected Keypoints on Human Limbs by Katja Ludwig et al
04-12-2022	EVOPS Benchmark: Evaluation of Plane Segmentation from RGBD and LiDAR Data by Anastasiia Kornilova et al
04-14-2022	SoccerNet-Tracking: Multiple Object Tracking Dataset and Benchmark in Soccer Videos by Anthony Cioppa et al
04-13-2022	MINSU (Mobile Inventory And Scanning Unit):Computer Vision and AI by Jihoon Ryoo et al
04-14-2022	Implicit Sample Extension for Unsupervised Person Re-Identification by Xinyu Zhang et al
04-12-2022	Unidirectional Video Denoising by Mimicking Backward Recurrent Modules with Look-ahead Forward Ones by Junyi Li et al
04-12-2022	Three-Stream Joint Network for Zero-Shot Sketch-Based Image Retrieval by Yu-Wei Zhan et al
04-12-2022	DistPro: Searching A Fast Knowledge Distillation Process via Meta Optimization by Xueqing Deng et al
04-14-2022	Invisible-to-Visible: Privacy-Aware Human Instance Segmentation using Airborne Ultrasound via Collaborative Learning Variational Autoencoder by Risako Tanigawa et al
04-12-2022	Semantic keypoint-based pose estimation from single RGB frames by Karl Schmeckpeper et al
04-13-2022	Mitigating Bias in Facial Analysis Systems by Incorporating Label Diversity by Camila Kolling et al
04-13-2022	A deep learning algorithm for reducing false positives in screening mammography by Stefano Pedemonte et al
04-15-2022	Synthesizing Informative Training Samples with GAN by Bo Zhao et al
04-12-2022	DCMS: Motion Forecasting with Dual Consistency and Multi-Pseudo-Target Supervision by Maosheng Ye et al
04-14-2022	Human Identity-Preserved Motion Retargeting in Video Synthesis by Feature Disentanglement by Jingzhe Ma et al
04-12-2022	SRMD: Sparse Random Mode Decomposition by Nicholas Richardson et al
04-14-2022	OmniPD: One-Step Person Detection in Top-View Omnidirectional Indoor Scenes by Jingrui Yu et al
04-12-2022	Unsupervised Anomaly Detection in 3D Brain MRI using Deep Learning with impured training data by Finn Behrendt et al
04-15-2022	SSR-HEF: Crowd Counting with Multi-Scale Semantic Refining and Hard Example Focusing by Jiwei Chen et al
04-15-2022	Towards PAC Multi-Object Detection and Tracking by Shuo Li et al
04-14-2022	Autonomous Satellite Detection and Tracking using Optical Flow by David Zuehlke et al
04-12-2022	Localization Distillation for Object Detection by Zhaohui Zheng et al
04-15-2022	Crowd counting with segmentation attention convolutional neural network by Jiwei Chen et al
04-13-2022	Out-of-distribution Detection with Deep Nearest Neighbors by Yiyou Sun et al
04-14-2022	CroCo: Cross-Modal Contrastive learning for localization of Earth Observation data by Wei-Hsin Tseng et al
04-15-2022	Crowd counting with crowd attention convolutional neural network by Jiwei Chen et al
04-14-2022	Joint Forecasting of Panoptic Segmentations with Difference Attention by Colin Graber et al
04-14-2022	Spatial Likelihood Voting with Self-Knowledge Distillation for Weakly Supervised Object Detection by Ze Chen et al
04-14-2022	ViTOL: Vision Transformer for Weakly Supervised Object Localization by Saurav Gupta et al

04-14-2022	End-to-end Learning for Joint Depth and Image Reconstruction from Diffracted Rotation by Mazen Mel et al
04-14-2022	Weakly Supervised Attended Object Detection Using Gaze Data as Annotations by Michele Mazzamuto et al
04-14-2022	Pyramidal Attention for Saliency Detection by Tanveer Hussain et al
04-14-2022	Residual Swin Transformer Channel Attention Network for Image Demosaicing by Wenzhu Xing et al
04-13-2022	Do You Really Mean That? Content Driven Audio-Visual Deepfake Dataset and Multimodal Method for Temporal Forgery Localization by Zhixi Cai et al
04-14-2022	Egocentric Human-Object Interaction Detection Exploiting Synthetic Data by Rosario Leonardi et al
04-15-2022	INSTA-BNN: Binary Neural Network with INSTAnce-aware Threshold by Changhun Lee et al
04-12-2022	Baseline Computation for Attribution Methods Based on Interpolated Inputs by Miguel Lerma et al
04-14-2022	Visual-Inertial Odometry with Online Calibration of Velocity-Control Based Kinematic Motion Models by Haolong Li et al
04-12-2022	How to Register a Live onto a Liver ? Partial Matching in the Space of Varifolds by Pierre-Louis Antonsanti et al
04-15-2022	Unconditional Image-Text Pair Generation with Multimodal Cross Quantizer by Hyungyung Lee et al
04-13-2022	Assessing cloudiness in nonwovens by Michael Godehardt et al
04-12-2022	Label Distribution Learning for Generalizable Multi-source Person Re-identification by Lei Qi et al
04-12-2022	Few-shot Forgery Detection via Guided Adversarial Interpolation by Haonan Qiu et al
04-15-2022	Transfer Learning for Instance Segmentation of Waste Bottles using Mask R-CNN Algorithm by Punitha Jaikumar et al
04-14-2022	RecurSeed and CertainMix for Weakly Supervised Semantic Segmentation by Sang Hyun Jo et al
04-14-2022	Deep Vehicle Detection in Satellite Video by Roman Pflugfelder et al
04-14-2022	3D Shuffle-Mixer: An Efficient Context-Aware Vision Learner of Transformer-MLP Paradigm for Dense Prediction in Medical Volume by Jianye Pang et al
04-14-2022	Panoptic Segmentation using Synthetic and Real Data by Camillo Quattrocchi et al
04-13-2022	Neural Vector Fields for Surface Representation and Inference by Edoardo Mello Rella et al
04-15-2022	Revisiting the Adversarial Robustness-Accuracy Tradeoff in Robot Learning by Mathias Lechner et al
04-13-2022	A9-Dataset: Multi-Sensor Infrastructure-Based Dataset for Mobility Research by Christian Creß et al
04-13-2022	Does depth estimation help object detection? by Bedrettin Cetinkaya et al
04-14-2022	Pushing the Limits of Simple Pipelines for Few-Shot Learning: External Data and Fine-Tuning Make a Difference by Shell Xu Hu et al
04-15-2022	2D Human Pose Estimation: A Survey by Haoming Chen et al
04-14-2022	MetaSets: Meta-Learning on Point Sets for Generalizable Representations by Chao Huang et al
04-15-2022	End-to-End Sensitivity-Based Filter Pruning by Zahra Babaiee et al
04-14-2022	Unsupervised Domain Adaptation with Implicit Pseudo Supervision for Semantic Segmentation by Wanyu Xu et al
04-14-2022	Interpretable Vertebral Fracture Quantification via Anchor-Free Landmarks Localization by Alexey Zakharov et al
04-13-2022	Character-focused Video Thumbnail Retrieval by Shervin Ardeshir et al
04-15-2022	ResT V2: Simpler, Faster and Stronger by Qing-Long Zhang et al
04-13-2022	SpoofGAN: Synthetic Fingerprint Spoof Images by Steven A. Grosz et al
04-15-2022	Image Captioning In the Transformer Age by Yang Xu et al
04-15-2022	Detecting Violence in Video Based on Deep Features Fusion Technique by Heyam M. Bin Jahlan et al
04-15-2022	Scalable and Real-time Multi-Camera Vehicle Detection, Re-Identification, and Tracking by Pirazh Khorramshahi et al
04-15-2022	Learning Multi-View Aggregation In the Wild for Large-Scale 3D Semantic Segmentation by Damien Robert et al
04-13-2022	A Novel Approach for Optimum-Path Forest Classification Using Fuzzy Logic by Renato W. R. de Souza et al
04-14-2022	Information fusion approach for biomass estimation in a plateau mountainous forest using a synergistic system comprising UAS-based digital camera and LiDAR by Rong Huang et al
04-12-2022	AGQA 2.0: An Updated Benchmark for Compositional Spatio-Temporal Reasoning by Madeleine Grunde-McLaughlin et al
04-15-2022	ORCNet: A context-based network to simultaneously segment the ocular region components by Diego Rafael Lucio et al
04-15-2022	Patch-wise Contrastive Style Learning for Instagram Filter Removal by Furkan Kınlı et al
04-15-2022	A Keypoint-based Global Association Network for Lane Detection by Jinsheng Wang et al
04-13-2022	OccAMs Laser: Occlusion-based Attribution Maps for 3D Object Detectors on LiDAR Data by David Schinagl et al
04-15-2022	Guiding Attention using Partial-Order Relationships for Image Captioning by Murad Popattia et al
04-14-2022	Model-agnostic Multi-Domain Learning with Domain-Specific Adapters for Action Recognition by Kazuki Omi et al
04-14-2022	Dense Learning based Semi-Supervised Object Detection by Binghui Chen et al
04-13-2022	Illumination-Invariant Active Camera Relocalization for Fine-Grained Change Detection in the Wild by Nan Li et al
04-15-2022	Sensitivity of sparse codes to image distortions by Kyle Luther et al
04-14-2022	Feature Compression for Rate Constrained Object Detection on the Edge by Zhongzheng Yuan et al
04-15-2022	Semi-supervised atmospheric component learning in low-light image problem by Masud An Nur Islam Fahim et al
04-15-2022	FasterVideo: Efficient Online Joint Object Detection And Tracking by Issa Mouawad et al
04-15-2022	SOTVerse: A User-defined Task Space of Single Object Tracking by Shiyu Hu et al
04-13-2022	Adaptive Memory Management for Video Object Segmentation by Ali Pourganjalikhan et al
04-15-2022	Condition-Invariant and Compact Visual Place Description by Convolutional Autoencoder by Hanjing Ye et al
04-15-2022	CAiD: Context-Aware Instance Discrimination for Self-supervised Learning in Medical Imaging by Mohammad Reza Hosseinzadeh Taher et al
04-14-2022	Interactive Object Segmentation in 3D Point Clouds by Theodora Kontogianni et al
04-14-2022	Early Myocardial Infarction Detection with One-Class Classification over Multi-view Echocardiography by Aysen Degerli et al
04-14-2022	Imposing Consistency for Optical Flow Estimation by Jisoo Jeong et al
04-13-2022	Semantic-Aware Pretraining for Dense Video Captioning by Teng Wang et al
04-13-2022	Deep Relation Learning for Regression and Its Application to Brain Age Estimation by Sheng He et al
04-14-2022	Measuring Compositional Consistency for Video Question Answering by Mona Gandhi et al
04-14-2022	Robotic and Generative Adversarial Attacks in Offline Writer-independent Signature Verification by Jordan J. Bird
04-14-2022	PLGAN: Generative Adversarial Networks for Power-Line Segmentation in Aerial Images by Rabab Abdelfattah et al

Craig SmithApril 18, 2022