2022.3.7 Vision papers

03-10-2022	Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time by Mitchell Wortsman et al
03-09-2022	On the surprising tradeoff between ImageNet accuracy and perceptual similarity by Manoj Kumar et al
03-08-2022	EdgeFormer: Improving Light-weight ConvNets by Learning from Vision Transformers by Haokui Zhang et al
03-10-2022	Cluttered Food Grasping with Adaptive Fingers and Synthetic-Data Trained Object Detection by Avinash Ummadisingu et al
03-11-2022	The Role of ImageNet Classes in Fr\echet Inception Distance by Tuomas Kynkäänniemi et al
03-10-2022	LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval by Jie Lei et al
03-09-2022	Leveling Down in Computer Vision: Pareto Inefficiencies in Fair Deep Classifiers by Dominik Zietlow et al
03-10-2022	Conditional Prompt Learning for Vision-Language Models by Kaiyang Zhou et al
03-08-2022	Dynamic Dual-Output Diffusion Models by Yaniv Benny et al
03-10-2022	BEAT: A Large-Scale Semantic and Emotional Multi-Modal Dataset for Conversational Gestures Synthesis by Haiyang Liu et al
03-10-2022	StyleBabel: Artistic Style Tagging and Captioning by Dan Ruta et al
03-08-2022	RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering by Di Chang et al
03-09-2022	NLX-GPT: A Model for Natural Language Explanations in Vision and Vision-Language Tasks by Fawaz Sammani et al
03-10-2022	MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes by Yang Jiao et al
03-08-2022	ART-Point: Improving Rotation Robustness of Point Cloud Classifiers via Adversarial Rotation by Robin Wang et al
03-10-2022	A Closer Look at Debiased Temporal Sentence Grounding in Videos: Dataset, Metric, and Approach by Xiaohan Lan et al
03-10-2022	Iterative Corresponding Geometry: Fusing Region and Depth for Highly Efficient 3D Tracking of Textureless Objects by Manuel Stoiber et al
03-08-2022	StyleHEAT: One-Shot High-Resolution Editable Talking Face Generation via Pretrained StyleGAN by Fei Yin et al
03-11-2022	ActiveMLP: An MLP-like Architecture with Active Token Mixer by Guoqiang Wei et al
03-10-2022	Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling by Tengpeng Li et al
03-11-2022	FLAG: Flow-based 3D Avatar Generation from Sparse Observations by Sadegh Aliakbarian et al
03-11-2022	Masked Visual Pre-training for Motor Control by Tete Xiao et al
03-08-2022	Multi-Modal Mixup for Robust Fine-tuning by Junhyuk So et al
03-08-2022	Tuning-free multi-coil compressed sensing MRI with Parallel Variable Density Approximate Message Passing (P-VDAMP) by Charles Millard et al
03-08-2022	Visual-Language Navigation Pretraining via Prompt-based Environmental Self-exploration by Xiwen Liang et al
03-09-2022	Pose Guided Multi-person Image Generation From Text by Soon Yau Cheong et al
03-09-2022	A Unified Transformer Framework for Group-based Segmentation: Co-Segmentation, Co-Saliency Detection and Video Salient Object Detection by Yukun Su et al
03-09-2022	FlexIT: Towards Flexible Semantic Image Translation by Guillaume Couairon et al
03-08-2022	Semantic Distillation Guided Salient Object Detection by Bo Xu et al
03-08-2022	Where Does the Performance Improvement Come From? - A Reproducibility Concern about Image-Text Retrieval by Jun Rao et al
03-09-2022	Mapping global dynamics of benchmark creation and saturation in artificial intelligence by Adriano Barbosa-Silva et al
03-10-2022	Hyperspectral Imaging for cherry tomato by Yun Xiang et al
03-08-2022	On Generalizing Beyond Domains in Cross-Domain Continual Learning by Christian Simon et al
03-08-2022	Analyzing General-Purpose Deep-Learning Detection and Segmentation Models with Images from a Lidar as a Camera Sensor by Yu Xianjia et al
03-09-2022	Model-Agnostic Multitask Fine-tuning for Few-shot Vision-Language Transfer Learning by Zhenhailong Wang et al
03-08-2022	Motron: Multimodal Probabilistic Human Motion Forecasting by Tim Salzmann et al
03-09-2022	Coarse-to-Fine Sparse Transformer for Hyperspectral Image Reconstruction by Jing Lin et al
03-11-2022	Democratizing Contrastive Language-Image Pre-training: A CLIP Benchmark of Data, Model, and Supervision by Yufeng Cui et al
03-08-2022	Source-free Domain Adaptation for Multi-site and Lifespan Brain Skull Stripping by Yunxiang Li et al
03-08-2022	Efficient and Accurate Hyperspectral Pansharpening Using 3D VolumeNet and 2.5D Texture Transfer by Yinao Li et al
03-10-2022	Synopses of Movie Narratives: a Video-Language Dataset for Story Understanding by Yidan Sun et al
03-08-2022	A New 27 Class Sign Language Dataset Collected from 173 Individuals by Arda Mavi et al
03-09-2022	Low-light Image and Video Enhancement via Selective Manipulation of Chromaticity by Sumit Shekhar et al
03-09-2022	Cross-modal Map Learning for Vision and Language Navigation by Georgios Georgakis et al
03-08-2022	Breast cancer detection using artificial intelligence techniques: A systematic literature review by Ali Bou Nassif et al
03-10-2022	Zero-Shot Action Recognition with Transformer-based Video Semantic Embedding by Keval Doshi et al
03-10-2022	Online Deep Metric Learning via Mutual Distillation by Gao-Dong Liu et al
03-09-2022	HDL: Hybrid Deep Learning for the Synthesis of Myocardial Velocity Maps in Digital Twins for Cardiac Analysis by Xiaodan Xing et al
03-10-2022	Autofocusing+: Noise-Resilient Motion Correction in Magnetic Resonance Imaging by Ekaterina Kuzmina et al
03-10-2022	An Audio-Visual Attention Based Multimodal Network for Fake Talking Face Videos Detection by Ganglai Wang et al
03-09-2022	The Transitive Information Theory and its Application to Deep Generative Models by Trung Ngo et al
03-10-2022	Toward Efficient Hyperspectral Image Processing inside Camera Pixels by Gourav Datta et al
03-10-2022	Back to Reality: Weakly-supervised 3D Object Detection with Shape-guided Label Enhancement by Xiuwei Xu et al
03-10-2022	ReF -- Rotation Equivariant Features for Local Feature Matching by Abhishek Peri et al
03-08-2022	Understanding person identification via gait by Simon Hanisch et al
03-10-2022	Representation Compensation Networks for Continual Semantic Segmentation by Chang-Bin Zhang et al
03-10-2022	AGCN: Augmented Graph Convolutional Network for Lifelong Multi-label Image Recognition by Kaile Du et al
03-09-2022	What Matters For Meta-Learning Vision Regression Tasks? by Ning Gao et al
03-10-2022	Towards Less Constrained Macro-Neural Architecture Search by Vasco Lopes et al
03-09-2022	Triangular Character Animation Sampling with Motion, Emotion, and Relation by Yizhou Zhao et al
03-10-2022	Learning-based Localizability Estimation for Robust LiDAR Localization by Julian Nubert et al
03-11-2022	Multi-modal Graph Learning for Disease Prediction by Shuai Zheng et al
03-11-2022	Graph Neural Networks for Relational Inductive Bias in Vision-based Deep Reinforcement Learning of Robot Control by Marco Oliva et al
03-09-2022	Adaptive Trajectory Prediction via Transferable GNN by Yi Xu et al
03-08-2022	End-to-end Multiple Instance Learning with Gradient Accumulation by Axel Andersson et al
03-08-2022	VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer by Juan F. Montesinos et al
03-09-2022	CEU-Net: Ensemble Semantic Segmentation of Hyperspectral Images Using Clustering by Nicholas Soucy et al
03-11-2022	Flexible Amortized Variational Inference in qBOLD MRI by Ivor J. A. Simpson et al
03-08-2022	Trustable Co-label Learning from Multiple Noisy Annotators by Shikun Li et al
03-09-2022	Ray Tracing-Guided Design of Plenoptic Cameras by Tim Michels et al
03-08-2022	Selective-Supervised Contrastive Learning with Noisy Labels by Shikun Li et al
03-11-2022	WLASL-LEX: a Dataset for Recognising Phonological Properties in American Sign Language by Federico Tavella et al
03-08-2022	Sharing Generative Models Instead of Private Data: A Simulation Study on Mammography Patch Classification by Zuzanna Szafranowska et al
03-10-2022	Membership Privacy Protection for Image Translation Models via Adversarial Knowledge Distillation by Saeed Ranjbar Alvar et al
03-08-2022	Easy Ensemble: Simple Deep Ensemble Learning for Sensor-Based Human Activity Recognition by Tatsuhito Hasegawa et al
03-09-2022	A Tree-Structured Multi-Task Model Recommender by Lijun Zhang et al
03-09-2022	Frequency-driven Imperceptible Adversarial Attack on Semantic Similarity by Cheng Luo et al
03-11-2022	ROOD-MRI: Benchmarking the robustness of deep learning segmentation models to out-of-distribution and corrupted data in MRI by Lyndon Boone et al
03-09-2022	A Neuro-vector-symbolic Architecture for Solving Ravens Progressive Matrices by Michael Hersche et al
03-10-2022	Domain Generalization via Shuffled Style Assembly for Face Anti-Spoofing by Zhuo Wang et al
03-10-2022	Suspected Object Matters: Rethinking Models Prediction for One-stage Visual Grounding by Yang Jiao et al
03-08-2022	The Flag Median and FlagIRLS by Nathan Mankovich et al
03-09-2022	Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice by Peihao Wang et al
03-10-2022	A Survey of Surface Defect Detection of Industrial Products Based on A Small Number of Labeled Data by Qifan Jin et al
03-10-2022	Prediction-Guided Distillation for Dense Object Detection by Chenhongyi Yang et al
03-10-2022	EyeLoveGAN: Exploiting domain-shifts to boost network learning with cycleGANs by Josefine Vilsbøll Sundgaard et al
03-08-2022	YouTube-GDD: A challenging gun detection dataset with rich contextual information by Yongxiang Gu et al
03-10-2022	Information-Theoretic Odometry Learning by Sen Zhang et al
03-08-2022	DeltaCNN: End-to-End CNN Inference of Sparse Frame Differences in Videos by Mathias Parger et al
03-10-2022	Domain Generalisation for Object Detection by Karthik Seemakurthy et al
03-09-2022	OpenTAL: Towards Open Set Temporal Action Localization by Wentao Bao et al
03-10-2022	TrueType Transformer: Character and Font Style Recognition in Outline Format by Yusuke Nagata et al
03-08-2022	ClearPose: Large-scale Transparent Object Dataset and Benchmark by Xiaotong Chen et al
03-11-2022	Detection of multiple retinal diseases in ultra-widefield fundus images using deep learning: data-driven identification of relevant regions by Justin Engelmann et al
03-09-2022	Practical Evaluation of Adversarial Robustness via Adaptive Auto Attack by Ye Liu et al
03-10-2022	Learning Distinctive Margin toward Active Domain Adaptation by Ming Xie et al
03-09-2022	Improving Neural ODEs via Knowledge Distillation by Haoyu Chu et al
03-09-2022	Align-Deform-Subtract: An Interventional Framework for Explaining Object Differences by Cian Eastwood et al
03-08-2022	A Gating Model for Bias Calibration in Generalized Zero-shot Learning by Gukyeong Kwon et al
03-09-2022	Simulation of Plenoptic Cameras by Tim Michels et al
03-09-2022	Inadequately Pre-trained Models are Better Feature Extractors by Andong Deng et al
03-09-2022	Manifold Modeling in Quotient Space: Learning An Invariant Mapping with Decodability of Image Patches by Tatsuya Yokota et al
03-10-2022	An Empirical Investigation of 3D Anomaly Detection and Segmentation by Eliahu Horwitz et al
03-11-2022	Deep AutoAugment by Yu Zheng et al
03-08-2022	Data augmentation with mixtures of max-entropy transformations for filling-level classification by Apostolos Modas et al
03-08-2022	Dynamic Group Transformer: A General Vision Transformer Backbone with Dynamic Group Attention by Kai Liu et al
03-10-2022	GrainSpace: A Large-scale Dataset for Fine-grained and Domain-adaptive Recognition of Cereal Grains by Lei Fan et al
03-08-2022	Robust Multi-Task Learning and Online Refinement for Spacecraft Pose Estimation across Domain Gap by Tae Ha Park et al
03-08-2022	Learning to Erase the Bayer-Filter to See in the Dark by Xingbo Dong et al
03-08-2022	MICDIR: Multi-scale Inverse-consistent Deformable Image Registration using UNetMSS with Self-Constructing Graph Latent by Soumick Chatterjee et al
03-11-2022	AI-enabled Automatic Multimodal Fusion of Cone-Beam CT and Intraoral Scans for Intelligent 3D Tooth-Bone Reconstruction and Clinical Applications by Jin Hao et al
03-09-2022	Intention-aware Feature Propagation Network for Interactive Segmentation by Chuyu Zhang et al
03-09-2022	Multiscale Convolutional Transformer with Center Mask Pretraining for Hyperspectral Image Classificationtion by Yifan Wang et al
03-10-2022	MVP: Multimodality-guided Visual Pre-training by Longhui Wei et al
03-08-2022	Evolutionary Neural Cascade Search across Supernetworks by Alexander Chebykin et al
03-10-2022	NeRFocus: Neural Radiance Field for 3D Synthetic Defocus by Yinhuai Wang et al
03-08-2022	Mutual Contrastive Learning to Disentangle Whole Slide Image Representations for Glioma Grading by Lipei Zhang et al

03-10-2022	QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quantization by Xiuying Wei et al
03-09-2022	Rethinking data-driven point spread function modeling with a differentiable optical model by Tobias Liaudat et al
03-11-2022	PseudoProp: Robust Pseudo-Label Generation for Semi-Supervised Object Detection in Autonomous Driving Systems by Shu Hu et al
03-11-2022	Deep Class Incremental Learning from Decentralized Data by Xiaohan Zhang et al
03-10-2022	Crowd Source Scene Change Detection and Local Map Update by Itzik Wilf et al
03-10-2022	Knowledge Distillation as Efficient Pre-training: Faster Convergence, Higher Data-efficiency, and Better Transferability by Ruifei He et al
03-08-2022	Universal Prototype Transport for Zero-Shot Action Recognition and Localization by Pascal Mettes
03-08-2022	Tune your Place Recognition: Self-Supervised Domain Calibration via Robust SLAM by Pierre-Yves Lajoie et al
03-08-2022	DuMLP-Pin: A Dual-MLP-dot-product Permutation-invariant Network for Set Feature Extraction by Jiajun Fei et al
03-11-2022	Active Phase-Encode Selection for Slice-Specific Fast MR Scanning Using a Transformer-Based Deep Reinforcement Learning Framework by Yiming Liu et al
03-11-2022	Federated Remote Physiological Measurement with Imperfect Data by Xin Liu et al
03-10-2022	A Screen-Shooting Resilient Document Image Watermarking Scheme using Deep Neural Network by Sulong Ge et al
03-09-2022	Efficient Image Representation Learning with Federated Sampled Softmax by Sagar M. Waghmare et al
03-08-2022	Generative Cooperative Learning for Unsupervised Video Anomaly Detection by Muhammad Zaigham Zaheer et al
03-11-2022	Embedding Earth: Self-supervised contrastive pre-training for dense land cover classification by Michail Tarasiou et al
03-10-2022	Evaluating U-net Brain Extraction for Multi-site and Longitudinal Preclinical Stroke Imaging by Erendiz Tarakci et al
03-09-2022	Object-Based Visual Camera Pose Estimation From Ellipsoidal Model and 3D-Aware Ellipse Prediction by Matthieu Zins et al
03-09-2022	Uni4Eye: Unified 2D and 3D Self-supervised Pre-training via Masked Image Modeling Transformer for Ophthalmic Image Classification by Zhiyuan Cai et al
03-11-2022	TAPE: Task-Agnostic Prior Embedding for Image Restoration by Lin Liu et al
03-09-2022	Domain Generalization using Pretrained Models without Fine-tuning by Ziyue Li et al
03-08-2022	A Simple Multi-Modality Transfer Learning Baseline for Sign Language Translation by Yutong Chen et al
03-08-2022	Quantification of Occlusion Handling Capability of a 3D Human Pose Estimation Framework by Mehwish Ghafoor et al
03-10-2022	Annotation Efficient Person Re-Identification with Diverse Cluster-Based Pair Selection by Lantian Xue et al
03-08-2022	End-to-End Semi-Supervised Learning for Video Action Detection by Akash Kumar et al
03-08-2022	Gait Recognition with Mask-based Regularization by Chuanfu Shen et al
03-10-2022	Spatial Commonsense Graph for Object Localisation in Partial Scenes by Francesco Giuliari et al
03-08-2022	Skating-Mixer: Multimodal MLP for Scoring Figure Skating by Jingfei Xia et al
03-10-2022	Non-generative Generalized Zero-shot Learning via Task-correlated Disentanglement and Controllable Samples Synthesis by Yaogong Feng et al
03-08-2022	NeReF: Neural Refractive Field for Fluid Surface Reconstruction and Implicit Representation by Ziyu Wang et al
03-10-2022	Towards Scale Consistent Monocular Visual Odometry by Learning from the Virtual World by Sen Zhang et al
03-10-2022	Image-based Stroke Assessment for Multi-site Preclinical Evaluation of Cerebroprotectants by Ryan P. Cabeen et al
03-08-2022	Attention-Based Lip Audio-Visual Synthesis for Talking Face Generation in the Wild by Ganglai Wang et al
03-08-2022	Globally-Optimal Event Camera Motion Estimation by Xin Peng et al
03-10-2022	Towards Bi-directional Skip Connections in Encoder-Decoder Architectures and Beyond by Tiange Xiang et al
03-09-2022	Joint Learning of Salient Object Detection, Depth Estimation and Contour Extraction by Xiaoqi Zhao et al
03-10-2022	Contrastive Boundary Learning for Point Cloud Segmentation by Liyao Tang et al
03-08-2022	Contrastive Enhancement Using Latent Prototype for Few-Shot Segmentation by Xiaoyu Zhao et al
03-09-2022	Attention-effective multiple instance learning on weakly stem cell colony segmentation by Novanto Yudistira et al
03-08-2022	Probabilistic Warp Consistency for Weakly-Supervised Semantic Correspondences by Prune Truong et al
03-10-2022	Geometric Synthesis: A Free lunch for Large-scale Palmprint Recognition Model Pretraining by Kai Zhao et al
03-08-2022	Part-level Action Parsing via a Pose-guided Coarse-to-Fine Framework by Xiaodong Chen et al
03-08-2022	Image Steganography based on Style Transfer by Donghui Hu et al
03-08-2022	An Efficient Polyp Segmentation Network by Tugberk Erol et al
03-09-2022	CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation with Transformers by Huayao Liu et al
03-08-2022	SimpleTrack: Rethinking and Improving the JDE Approach for Multi-Object Tracking by Jiaxin Li et al
03-10-2022	Real-time Scene Text Detection Based on Global Level and Word Level Features by Fuqiang Zhao et al
03-11-2022	Automatic Fine-grained Glomerular Lesion Recognition in Kidney Pathology by Yang Nan et al
03-09-2022	Normal and Visibility Estimation of Human Face from a Single Image by Fuzhi Zhong et al
03-09-2022	Semi-supervision semantic segmentation with uncertainty-guided self cross supervision by Yunyang Zhang et al
03-09-2022	MetAug: Contrastive Learning via Meta Feature Augmentation by Jiangmeng Li et al
03-08-2022	The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks by Xin Yu et al
03-10-2022	Deep Learning-Based Perceptual Stimulus Encoder for Bionic Vision by Lucas Relic et al
03-10-2022	SelfTune: Metrically Scaled Monocular Depth Estimation through Self-Supervised Learning by Jaehoon Choi et al
03-08-2022	Discriminability-Transferability Trade-Off: An Information-Theoretic Perspective by Quan Cui et al
03-09-2022	Learning the Degradation Distribution for Blind Image Super-Resolution by Zhengxiong Luo et al
03-09-2022	DEER: Detection-agnostic End-to-End Recognizer for Scene Text Spotting by Seonghyeon Kim et al
03-08-2022	E2EC: An End-to-End Contour-based Method for High-Quality High-Speed Instance Segmentation by Tao Zhang et al
03-08-2022	Part-Aware Self-Supervised Pre-Training for Person Re-Identification by Kuan Zhu et al
03-08-2022	MLSeg: Image and Video Segmentation as Multi-Label Classification and Selected-Label Pixel Classification by Haodi He et al
03-09-2022	Structure-Aware Flow Generation for Human Body Reshaping by Jianqiang Ren et al
03-11-2022	Multi-sensor large-scale dataset for multi-view 3D reconstruction by Oleg Voynov et al
03-10-2022	Backbone is All Your Need: A Simplified Architecture for Visual Object Tracking by Boyu Chen et al
03-08-2022	CIDER: Exploiting Hyperspherical Embeddings for Out-of-Distribution Detection by Yifei Ming et al
03-08-2022	Counting with Adaptive Auxiliary Learning by Yanda Meng et al
03-08-2022	Lightweight Monocular Depth Estimation through Guided Decoding by Michael Rudolph et al
03-08-2022	AssistQ: Affordance-centric Question-driven Task Completion for Egocentric Assistant by Benita Wong et al
03-10-2022	Transfer of Representations to Video Label Propagation: Implementation Factors Matter by Daniel McKee et al
03-09-2022	Fast Road Segmentation via Uncertainty-aware Symmetric Network by Yicong Chang et al
03-08-2022	Robust Local Preserving and Global Aligning Network for Adversarial Domain Adaptation by Wenwen Qiang et al
03-08-2022	BEVSegFormer: Birds Eye View Semantic Segmentation From Arbitrary Camera Rigs by Lang Peng et al
03-08-2022	UENAS: A Unified Evolution-based NAS Framework by Zimian Wei et al
03-08-2022	A Lightweight and Detector-free 3D Single Object Tracker on Point Clouds by Yan Xia et al
03-11-2022	PD-Flow: A Point Cloud Denoising Framework with Normalizing Flows by Aihua Mao et al
03-11-2022	Peng Cheng Object Detection Benchmark for Smart City by Yaowei Wang et al
03-08-2022	GaitStrip: Gait Recognition via Effective Strip-based Feature Representations and Multi-Level Framework by Ming Wang et al
03-10-2022	6-DoF Pose Estimation of Household Objects for Robotic Manipulation: An Accessible Dataset and Benchmark by Stephen Tyree et al
03-08-2022	Graph Attention Transformer Network for Multi-Label Image Classification by Jin Yuan et al
03-08-2022	Shape-invariant 3D Adversarial Point Clouds by Qidong Huang et al
03-08-2022	PyNET-QxQ: A Distilled PyNET for QxQ Bayer Pattern Demosaicing in CMOS Image Sensor by Minhyeok Cho et al
03-08-2022	Neural Face Identification in a 2D Wireframe Projection of a Manifold Object by Kehan Wang et al
03-08-2022	Towards Universal Texture Synthesis by Combining Texton Broadcasting with Noise Injection in StyleGAN-2 by Jue Lin et al
03-10-2022	Transferring Dual Stochastic Graph Convolutional Network for Facial Micro-expression Recognition by Hui Tang et al
03-09-2022	SkinningNet: Two-Stream Graph Convolutional Neural Network for Skinning Prediction of Synthetic Characters by Albert Mosella-Montoro et al
03-08-2022	Stage-Aware Feature Alignment Network for Real-Time Semantic Segmentation of Street Scenes by Xi Weng et al
03-09-2022	Neural Data-Dependent Transform for Learned Image Compression by Dezhao Wang et al
03-10-2022	Attack Analysis of Face Recognition Authentication Systems Using Fast Gradient Sign Method by Arbena Musa et al
03-09-2022	NeRF-Pose: A First-Reconstruct-Then-Regress Approach for Weakly-supervised 6D Object Pose Estimation by Fu Li et al
03-11-2022	Physics-informed Reinforcement Learning for Perception and Reasoning about Fluids by Beatriz Moya et al
03-10-2022	Dual-Domain Reconstruction Networks with V-Net and K-Net for fast MRI by Xiaohan Liu et al
03-10-2022	Towards Open-Set Text Recognition via Label-to-Prototype Learning by Chang Liu et al
03-10-2022	Temporal Context for Robust Maritime Obstacle Detection by Lojze Žust et al
03-10-2022	Adaptive Background Matting Using Background Matching by Jinlin Liu
03-10-2022	Two-stream Hierarchical Similarity Reasoning for Image-text Matching by Ran Chen et al
03-08-2022	GaitEdge: Beyond Plain End-to-end Gait Recognition for Better Practicality by Junhao Liang et al
03-11-2022	WiCV 2021: The Eighth Women In Computer Vision Workshop by Arushi Goel et al
03-09-2022	Creating Realistic Ground Truth Data for the Evaluation of Calibration Methods for Plenoptic and Conventional Cameras by Tim Michels et al
03-09-2022	Using Human Gaze For Surgical Activity Recognition by Abdishakour Awale et al
03-11-2022	LFW-Beautified: A Dataset of Face Images with Beautification and Augmented Reality Filters by Pontus Hedman et al
03-10-2022	PETR: Position Embedding Transformation for Multi-View 3D Object Detection by Yingfei Liu et al
03-09-2022	Resource-Efficient Invariant Networks: Exponential Gains by Unrolled Optimization by Sam Buchanan et al
03-08-2022	An Online Semantic Mapping System for Extending and Enhancing Visual SLAM by Thorsten Hempel et al
03-10-2022	The Overlooked Classifier in Human-Object Interaction Recognition by Ying Jin et al
03-09-2022	Practical No-box Adversarial Attacks with Training-free Hybrid Image Transformation by Qilong Zhang et al
03-11-2022	Hyperbolic Image Segmentation by Mina GhadimiAtigh et al
03-11-2022	DRTAM: Dual Rank-1 Tensor Attention Module by Hanxing Chi et al
03-11-2022	Spatial Consistency Loss for Training Multi-Label Classifiers from Single-Label Annotations by Thomas Verelst et al
03-08-2022	Lane Detection with Versatile AtrousFormer and Local Semantic Guidance by Jiaxing Yang et al

03-10-2022	Point Density-Aware Voxels for LiDAR 3D Object Detection by Jordan S. K. Hu et al
03-09-2022	Defending Black-box Skeleton-based Human Activity Classifiers by He Wang et al
03-09-2022	Controllable Evaluation and Generation of Physical Adversarial Patch on Face Recognition by Xiao Yang et al
03-08-2022	Language Matters: A Weakly Supervised Pre-training Approach for Scene Text Detection and Spotting by Chuhui Xue et al
03-09-2022	VGQ-CNN: Moving Beyond Fixed Cameras and Top-Grasps for Grasp Quality Prediction by A. Konrad et al
03-08-2022	3SD: Self-Supervised Saliency Detection With No Labels by Rajeev Yasarla et al
03-09-2022	Evaluating Proposed Fairness Models for Face Recognition Algorithms by John J. Howard et al
03-09-2022	SynWoodScape: Synthetic Surround-view Fisheye Camera Dataset for Autonomous Driving by Ahmed Rida Sekkat et al
03-08-2022	Weakly Supervised Semantic Segmentation using Out-of-Distribution Data by Jungbeom Lee et al
03-08-2022	Boosting Mask R-CNN Performance for Long, Thin Forensic Traces with Pre-Segmentation and IoU Region Merging by Moritz Zink et al
03-08-2022	Semi-Supervised Semantic Segmentation Using Unreliable Pseudo-Labels by Yuchao Wang et al
03-10-2022	High Definition, Inexpensive, Underwater Mapping by Bharat Joshi et al
03-09-2022	3D Dense Face Alignment with Fused Features by Aggregating CNNs and GCNs by Yanda Meng et al
03-08-2022	Autonomous Mosquito Habitat Detection Using Satellite Imagery and Convolutional Neural Networks for Disease Risk Mapping by Sriram Elango et al
03-10-2022	Gesture based Arabic Sign Language Recognition for Impaired People based on Convolution Neural Network by Rady El Rwelli et al
03-10-2022	Human Face Recognition from Part of a Facial Image based on Image Stitching by Osama R. Shahin et al
03-09-2022	Dynamic Instance Domain Adaptation by Zhongying Deng et al
03-08-2022	Update Compression for Deep Neural Networks on the Edge by Bo Chen et al
03-08-2022	Visual anomaly detection in video by variational autoencoder by Faraz Waseem et al
03-10-2022	City-wide Street-to-Satellite Image Geolocalization of a Mobile Ground Agent by Lena M. Downes et al
03-11-2022	Visualizing and Understanding Patch Interactions in Vision Transformer by Jie Ma et al
03-11-2022	TFCNet: Temporal Fully Connected Networks for Static Unbiased Temporal Reasoning by Shiwen Zhang
03-08-2022	Probabilistic Rotation Representation With an Efficiently Computable Bingham Loss Function and Its Application to Pose Estimation by Hiroya Sato et al
03-11-2022	aiWave: Volumetric Image Compression with 3-D Trained Affine Wavelet-like Transform by Dongmei Xue et al
03-08-2022	Self-Supervision, Remote Sensing and Abstraction: Representation Learning Across 3 Million Locations by Sachith Seneviratne et al
03-08-2022	Pointillism: Accurate 3D bounding box estimation with multi-radars by Kshitiz Bansal et al
03-09-2022	Monocular Depth Distribution Alignment with Low Computation by Fei Sheng et al
03-08-2022	Unrolled Primal-Dual Networks for Lensless Cameras by Oliver Kingshott et al
03-11-2022	REX: Reasoning-aware and Grounded Explanation by Shi Chen et al
03-08-2022	Diffusion Models for Medical Anomaly Detection by Julia Wolleb et al
03-08-2022	SuperPoint features in endoscopy by O. L. Barbed et al
03-11-2022	Neuromorphic Data Augmentation for Training Spiking Neural Networks by Yuhang Li et al
03-11-2022	Font Shape-to-Impression Translation by Masaya Ueda et al
03-09-2022	A high-precision underwater object detection based on joint self-supervised deblurring and improved spatial transformer network by Xiuyuan Li et al
03-08-2022	Multi-Scale Adaptive Network for Single Image Denoising by Yuanbiao Gou et al
03-10-2022	Self Pre-training with Masked Autoencoders for Medical Image Analysis by Lei Zhou et al
03-11-2022	Towards Self-Supervised Learning of Global and Object-Centric Representations by Federico Baldassarre et al
03-09-2022	Optical Flow Training under Limited Label Budget via Active Learning by Shuai Yuan et al
03-08-2022	Deep Multi-Branch Aggregation Network for Real-Time Semantic Segmentation in Street Scenes by Xi Weng et al
03-11-2022	Democracy Does Matter: Comprehensive Feature Mining for Co-Salient Object Detection by Siyue Yu et al
03-09-2022	How many Observations are Enough? Knowledge Distillation for Trajectory Forecasting by Alessio Monti et al
03-09-2022	Metastatic Cancer Outcome Prediction with Injective Multiple Instance Pooling by Jianan Chen et al
03-09-2022	UNeXt: MLP-based Rapid Medical Image Segmentation Network by Jeya Maria Jose Valanarasu et al
03-09-2022	MLNav: Learning to Safely Navigate on Martian Terrains by Shreyansh Daftry et al
03-09-2022	All You Need is LUV: Unsupervised Collection of Labeled Images using Invisible UV Fluorescent Indicators by Brijen Thananjeyan et al
03-11-2022	Improve Convolutional Neural Network Pruning by Maximizing Filter Variety by Nathan Hubens et al
03-08-2022	Live Laparoscopic Video Retrieval with Compressed Uncertainty by Tong Yu et al
03-09-2022	Artificial Intelligence Solution for Effective Treatment Planning for Glioblastoma Patients by Vikram Goddla
03-11-2022	Video Coding for Machines with Feature-Based Rate-Distortion Optimization by Kristian Fischer et al
03-09-2022	Evaluation of YOLO Models with Sliced Inference for Small Object Detection by Muhammed Can Keles et al
03-09-2022	CP-ViT: Cascade Vision Transformer Pruning via Progressive Sparsity Prediction by Zhuoran Song et al
03-10-2022	PC-SwinMorph: Patch Representation for Unsupervised Medical Image Registration and Segmentation by Lihao Liu et al
03-10-2022	Leveraging Labeling Representations in Uncertainty-based Semi-supervised Segmentation by Sukesh Adiga et al
03-10-2022	Deep Multimodal Guidance for Medical Image Classification by Mayur Mallya et al
03-10-2022	Deep Convolutional Neural Networks for Molecular Subtyping of Gliomas Using Magnetic Resonance Imaging by Dong Wei et al
03-08-2022	End-to-end system for object detection from sub-sampled radar data by Madhumitha Sakthi et al
03-09-2022	Multi-modal Brain Tumor Segmentation via Missing Modality Synthesis and Modality-level Attention Fusion by Ziqi Huang et al
03-08-2022	Predicting conversion of mild cognitive impairment to Alzheimers disease by Yiran Wei et al
03-09-2022	PHTrans: Parallelly Aggregating Global and Local Representations for Medical Image Segmentation by Wentao Liu et al
03-08-2022	An error correction scheme for improved air-tissue boundary in real-time MRI video for speech production by Anwesha Roy et al
03-11-2022	BabyNet: Reconstructing 3D faces of babies from uncalibrated photographs by Araceli Morales et al
03-11-2022	Saliency-Driven Versatile Video Coding for Neural Object Detection by Kristian Fischer et al
03-09-2022	ChiTransformer:Towards Reliable Stereo from Cues by Qing Su et al
03-09-2022	Learning Temporal Consistency for Source-Free Video Domain Adaptation by Yuecong Xu et al
03-09-2022	A high-precision self-supervised monocular visual odometry in foggy weather based on robust cycled generative adversarial networks and multi-task learning aided depth estimation by Xiuyuan Li et al
03-09-2022	Text-DIAE: Degradation Invariant Autoencoders for Text Recognition and Document Enhancement by Mohamed Ali Souibgui et al
03-09-2022	Recovering medical images from CT film photos by Quan Quan et al
03-09-2022	Active Self-Semi-Supervised Learning for Few Labeled Samples Fast Training by Ziting Wen et al
03-09-2022	Region-Aware Face Swapping by Chao Xu et al
03-11-2022	Human Silhouette and Skeleton Video Synthesis through Wi-Fi signals by Danilo Avola et al
03-10-2022	On-the-Fly Test-time Adaptation for Medical Image Segmentation by Jeya Maria Jose Valanarasu et al
03-10-2022	Unfolded Deep Kernel Estimation for Blind Image Super-resolution by Hongyi Zheng et al
03-10-2022	Multi-Channel Convolutional Analysis Operator Learning for Dual-Energy CT Reconstruction by Alessandro Perelli et al
03-10-2022	Label-efficient Hybrid-supervised Learning for Medical Image Segmentation by Junwen Pan et al
03-09-2022	LiftReg: Limited Angle 2D/3D Deformable Registration by Lin Tian et al

Craig SmithMarch 14, 2022