2022.3.7 Vision papers

03-01-2022	Generative Adversarial Networks by Gilad Cohen et al
03-03-2022	Understanding Failure Modes of Self-Supervised Learning by Neha Mukund Kalibhat et al
03-03-2022	Efficient Video Instance Segmentation via Tracklet Query and Proposal by Jialian Wu et al
03-03-2022	BatchFormer: Learning to Explore Sample Relationships for Robust Representation Learning by Zhi Hou et al
03-03-2022	NeRF-Supervision: Learning Dense Object Descriptors from Neural Radiance Fields by Lin Yen-Chen et al
03-02-2022	TableFormer: Table Structure Understanding with Transformers by Ahmed Nassar et al
03-03-2022	Recovering 3D Human Mesh from Monocular Images: A Survey by Yating Tian et al
03-01-2022	Variational Autoencoders Without the Variation by Gregory A. Daly et al
03-01-2022	CLIP-GEN: Language-Free Training of a Text-to-Image Generator with CLIP by Zihao Wang et al
03-03-2022	Playable Environments: Video Manipulation in Space and Time by Willi Menapace et al
03-04-2022	Freeform Body Motion Generation from Speech by Jing Xu et al
03-01-2022	D^2ETR: Decoder-Only DETR with Computationally Efficient Cross-Scale Attention by Junyu Lin et al
03-03-2022	Vision-Language Intelligence: Tasks, Representation Learning, and Large Models by Feng Li et al
03-04-2022	DiT: Self-supervised Pre-training for Document Image Transformer by Junlong Li et al
03-02-2022	HighMMT: Towards Modality and Task Generalization for High-Modality Representation Learning by Paul Pu Liang et al
03-03-2022	Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning by Weixin Liang et al
03-01-2022	Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology by Richard J. Chen et al
03-01-2022	Affordance Learning from Play for Sample-Efficient Policy Learning by Jessica Borja-Diaz et al
03-01-2022	Recent, rapid advancement in visual question answering architecture by Venkat Kodali et al
03-03-2022	PINA: Learning a Personalized Implicit Neural Avatar from a Single RGB-D Video Sequence by Zijian Dong et al
03-01-2022	Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment by Mingyang Zhou et al
03-03-2022	A Deep Neural Framework for Image Caption Generation Using GRU-Based Attention Mechanism by Rashid Khan et al
03-01-2022	InCloud: Incremental Learning for Point Cloud Place Recognition by Joshua Knights et al
03-01-2022	Styleverse: Towards Identity Stylization across Heterogeneous Domains by Jia Li et al
03-01-2022	Benchmarking Robustness of Deep Learning Classifiers Using Two-Factor Perturbation by Wei Dai et al
03-03-2022	Autoregressive Image Generation using Residual Quantization by Doyup Lee et al
03-01-2022	Towards Creativity Characterization of Generative Models via Group-based Subset Scanning by Celia Cintas et al
03-01-2022	CrossPoint: Self-Supervised Cross-Modal Contrastive Learning for 3D Point Cloud Understanding by Mohamed Afham et al
03-02-2022	Hyperspectral Pixel Unmixing with Latent Dirichlet Variational Autoencoder by Kiran Mantripragada et al
03-03-2022	Random Quantum Neural Networks (RQNN) for Noisy Image Recognition by Debanjan Konar et al
03-03-2022	ROCT-Net: A new ensemble deep convolutional model with improved spatial resolution learning for detecting common diseases from retinal OCT images by Mohammad Rahimzadeh et al
03-02-2022	DisARM: Displacement Aware Relation Module for 3D Detection by Yao Duan et al
03-03-2022	Investigating the limited performance of a deep-learning-based SPECT denoising approach: An observer-study-based characterization by Zitong Yu et al
03-03-2022	Interactive Image Synthesis with Panoptic Layout Generation by Bo Wang et al
03-02-2022	MetaDT: Meta Decision Tree for Interpretable Few-Shot Learning by Baoquan Zhang et al
03-02-2022	PetsGAN: Rethinking Priors for Single Image Generation by Zicheng Zhang et al
03-01-2022	Multi-Task Multi-Scale Learning For Outcome Prediction in 3D PET Images by Amine Amyar et al
03-03-2022	Capturing Shape Information with Multi-Scale Topological Loss Terms for 3D Reconstruction by Dominik J. E. Waibel et al
03-03-2022	Selective Residual M-Net for Real Image Denoising by Chi-Mao Fan et al
03-01-2022	How certain are your uncertainties? by Luke Whitbread et al
03-03-2022	NeuroFluid: Fluid Dynamics Grounding with Particle-Driven Neural Radiance Fields by Shanyan Guan et al
03-01-2022	Colon Nuclei Instance Segmentation using a Probabilistic Two-Stage Detector by Pedro Costa et al
03-01-2022	Compliance Challenges in Forensic Image Analysis Under the Artificial Intelligence Act by Benedikt Lorch et al
03-02-2022	Differentiable IFS Fractals by Cory Braker Scott
03-02-2022	Enhancing Adversarial Robustness for Deep Metric Learning by Mo Zhou et al
03-03-2022	Recent Advances in Vision Transformer: A Survey and Outlook of Recent Work by Khawar Islam
03-03-2022	Detecting High-Quality GAN-Generated Face Images using Neural Networks by Ehsan Nowroozi et al
03-03-2022	DIME: Fine-grained Interpretations of Multimodal Models via Disentangled Local Explanations by Yiwei Lyu et al
03-01-2022	Towards IID representation learning and its application on biomedical data by Jiqing Wu et al
03-03-2022	Label-Only Model Inversion Attacks via Boundary Repulsion by Mostafa Kahla et al
03-03-2022	On Learning Contrastive Representations for Learning with Noisy Labels by Li Yi et al
03-02-2022	Protecting Celebrities with Identity Consistency Transformer by Xiaoyi Dong et al
03-01-2022	Generalizable Person Re-Identification via Self-Supervised Batch Norm Test-Time Adaption by Ke Han et al
03-03-2022	Ensembles of Vision Transformers as a New Paradigm for Automated Classification in Ecology by S. Kyathanahally et al
03-03-2022	TCTrack: Temporal Contexts for Aerial Tracking by Ziang Cao et al
03-01-2022	Can No-reference features help in Full-reference image quality estimation? by Saikat Dutta et al
03-01-2022	Separable-HoverNet and Instance-YOLO for Colon Nuclei Identification and Counting by Chunhui Lin et al
03-03-2022	Instance Segmentation for Autonomous Log Grasping in Forestry Operations by Jean-Michel Fortin et al
03-03-2022	Cross-Modality Earth Movers Distance for Visible Thermal Person Re-Identification by Yongguo Ling et al
03-03-2022	DenseUNets with feedback non-local attention for the segmentation of specular microscopy images of the corneal endothelium with Fuchs dystrophy by Juan P. Vigueras-Guillén et al
03-03-2022	Rethinking the role of normalization and residual blocks for spiking neural networks by Shin-ichi Ikegawa et al
03-03-2022	Self-supervised Transparent Liquid Segmentation for Robotic Pouring by Gautham Narayan Narasimhan et al
03-01-2022	A unified 3D framework for Organs at Risk Localization and Segmentation for Radiation Therapy Planning by Fernando Navarro et al
03-02-2022	ADVISE: ADaptive Feature Relevance and VISual Explanations for Convolutional Neural Networks by Mohammad Mahdi Dehshibi et al
03-03-2022	LatentFormer: Multi-Agent Transformer-Based Interaction Modeling and Trajectory Prediction by Elmira Amirloo et al
03-02-2022	VAE-iForest: Auto-encoding Reconstruction and Isolation-based Anomalies Detecting Fallen Objects on Road Surface by Takato Yasuno et al
03-03-2022	LGT-Net: Indoor Panoramic Room Layout Estimation with Geometry-Aware Transformer Network by Zhigang Jiang et al
03-03-2022	Adaptive Path Planning for UAVs for Multi-Resolution Semantic Segmentation by Felix Stache et al
03-03-2022	Modality-Adaptive Mixup and Invariant Decomposition for RGB-Infrared Person Re-Identification by Zhipeng Huang et al
03-01-2022	SwitchHit: A Probabilistic, Complementarity-Based Switching System for Improved Visual Place Recognition in Changing Environments by Maria Waheed et al
03-03-2022	Adaptive Local-Global Relational Network for Facial Action Units Recognition and Facial Paralysis Estimation by Xuri Ge et al
03-01-2022	Towards deep learning-powered IVF: A large public benchmark for morphokinetic parameter prediction by Tristan Gomez et al
03-02-2022	H4D: Human 4D Modeling by Learning Neural Compositional Representation by Boyan Jiang et al
03-02-2022	LILE: Look In-Depth before Looking Elsewhere -- A Dual Attention Network using Transformers for Cross-Modal Information Retrieval in Histopathology Archives by Danial Maleki et al
03-03-2022	Translational Lung Imaging Analysis Through Disentangled Representations by Pedro M. Gordaliza et al
03-01-2022	Image analysis for automatic measurement of crustose lichens by Pedro Guedes et al
03-03-2022	NUQ: A Noise Metric for Diffusion MRI via Uncertainty Discrepancy Quantification by Shreyas Fadnavis et al
03-03-2022	CenterSnap: Single-Shot Multi-Object 3D Shape Reconstruction and Categorical 6D Pose and Size Estimation by Muhammad Zubair Irshad et al
03-01-2022	When A Conventional Filter Meets Deep Learning: Basis Composition Learning on Image Filters by Fu Lee Wang et al
03-01-2022	Omni-frequency Channel-selection Representations for Unsupervised Anomaly Detection by Yufei Liang et al
03-03-2022	Revisiting Click-based Interactive Video Object Segmentation by Stephane Vujasinovic et al
03-02-2022	OVE6D: Object Viewpoint Encoding for Depth-based 6D Object Pose Estimation by Dingding Cai et al
03-01-2022	Stable, accurate and efficient deep neural networks for inverse problems with analysis-sparse models by Maksym Neyra-Nesterenko et al
03-03-2022	An Efficient Subpopulation-based Membership Inference Attack by Shahbaz Rezaei et al
03-04-2022	Carbon Footprint of Selecting and Training Deep Learning Models for Medical Image Analysis by Raghavendra Selvan et al
03-03-2022	Ad2Attack: Adaptive Adversarial Attack on Real-Time UAV Tracking by Changhong Fu et al
03-04-2022	Uncertainty Estimation for Heatmap-based Landmark Localization by Lawrence Schobs et al
03-03-2022	A study on the distribution of social biases in self-supervised learning visual models by Kirill Sirotkin et al
03-01-2022	Towards a unified view of unsupervised non-local methods for image denoising: the NL-Ridge approach by Sébastien Herbreteau et al
03-02-2022	A Simple and Universal Rotation Equivariant Point-cloud Network by Ben Finkelshtein et al
03-01-2022	MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video by Jinlu Zhang et al
03-01-2022	Bridge the Gap between Supervised and Unsupervised Learning for Fine-Grained Classification by Jiabao Wang et al
03-02-2022	Video Question Answering: Datasets, Algorithms and Challenges by Yaoyao Zhong et al
03-03-2022	Why adversarial training can hurt robust accuracy by Jacob Clarysse et al
03-02-2022	Shape constrained CNN for segmentation guided prediction of myocardial shape and pose parameters in cardiac MRI by Sofie Tilborghs et al
03-03-2022	CAFE: Learning to Condense Dataset by Aligning Features by Kai Wang et al
03-03-2022	Exploring Patch-wise Semantic Relation for Contrastive Learning in Image-to-Image Translation Tasks by Chanyong Jung et al
03-01-2022	X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning by Zhihao Yuan et al
03-03-2022	Curriculum-style Local-to-global Adaptation for Cross-domain Remote Sensing Image Segmentation by Bo Zhang et al
03-02-2022	Improving Lidar-Based Semantic Segmentation of Top-View Grid Maps by Learning Features in Complementary Representations by Frank Bieder et al
03-02-2022	Hybrid Tracker with Pixel and Instance for Video Panoptic Segmentation by Weicai Ye et al
03-03-2022	Weakly Supervised Object Localization as Domain Adaption by Lei Zhu et al
03-03-2022	Region-of-Interest Based Neural Video Compression by Yura Perugachi-Diaz et al
03-02-2022	Asynchronous Optimisation for Event-based Visual Odometry by Daqi Liu et al
03-01-2022	Boundary Corrected Multi-scale Fusion Network for Real-time Semantic Segmentation by Tianjiao Jiang et al
03-02-2022	A Generalized Approach for Cancellable Template and Its Realization for Minutia Cylinder-Code by Xingbo Dong et al
03-01-2022	Robust Seatbelt Detection and Usage Recognition for Driver Monitoring Systems by Feng Hu
03-02-2022	Detecting Adversarial Perturbations in Multi-Task Perception by Marvin Klingner et al

03-02-2022	GRASP EARTH: Intuitive Software for Discovering Changes on the Planet by Waku Hatakeyama et al
03-02-2022	Learning Moving-Object Tracking with FMCW LiDAR by Yi Gu et al
03-01-2022	ProgressLabeller: Visual Data Stream Annotation for Training Object-Centric 3D Perception by Xiaotong Chen et al
03-03-2022	SegTAD: Precise Temporal Action Detection via Semantic Segmentation by Chen Zhao et al
03-01-2022	Full RGB Just Noticeable Difference (JND) Modelling by Jian Jin et al
03-02-2022	Conditional Reconstruction for Open-set Semantic Segmentation by Ian Nunes et al
03-01-2022	Exploring Wilderness Using Explainable Machine Learning in Satellite Imagery by Timo T. Stomberg et al
03-01-2022	Descriptellation: Deep Learned Constellation Descriptors for SLAM by Chunwei Xing et al
03-02-2022	Self-Supervised Scene Flow Estimation with 4D Automotive Radar by Fangqiang Ding et al
03-04-2022	Safety-aware metrics for object detectors in autonomous driving by Andrea Ceccarelli et al
03-01-2022	Beam-Shape Effects and Noise Removal from THz Time-Domain Images in Reflection Geometry in the 0.25-6 THz Range by Marina Ljubenovic et al
03-03-2022	Beyond 3D Siamese Tracking: A Motion-Centric Paradigm for 3D Single Object Tracking in Point Clouds by Chaoda Zheng et al
03-03-2022	Computer Vision Aided Blockage Prediction in Real-World Millimeter Wave Deployments by Gouranga Charan et al
03-01-2022	OmniFusion: 360 Monocular Depth Estimation via Geometry-Aware Fusion by Yuyan Li et al
03-01-2022	GSC Loss: A Gaussian Score Calibrating Loss for Deep Learning by Qingsong Zhao et al
03-01-2022	Instance-aware multi-object self-supervision for monocular depth prediction by Houssem eddine Boulahbal et al
03-02-2022	Exploring Hierarchical Graph Representation for Large-Scale Zero-Shot Image Classification by Kai Yi et al
03-01-2022	Tempera: Spatial Transformer Feature Pyramid Network for Cardiac MRI Segmentation by Christoforos Galazis et al
03-01-2022	3DCTN: 3D Convolution-Transformer Network for Point Cloud Classification by Dening Lu et al
03-04-2022	Do Explanations Explain? Model Knows Best by Ashkan Khakzar et al
03-02-2022	Self-Supervised Learning for Real-World Super-Resolution from Dual Zoomed Observations by Zhilu Zhang et al
03-01-2022	Clean-Annotation Backdoor Attack against Lane Detection Systems in the Wild by Xingshuo Han et al
03-03-2022	Intensity Image-based LiDAR Fiducial Marker System by Yibo Liu et al
03-02-2022	A Unified Query-based Paradigm for Point Cloud Understanding by Zetong Yang et al
03-02-2022	Continual BatchNorm Adaptation (CBNA) for Semantic Segmentation by Marvin Klingner et al
03-03-2022	Constrained unsupervised anomaly segmentation by Julio Silva-Rodríguez et al
03-02-2022	CycleMix: A Holistic Strategy for Medical Image Segmentation from Scribble Supervision by Ke Zhang et al
03-03-2022	Robustness and Adaptation to Hidden Factors of Variation by William Paul et al
03-02-2022	Self-supervised Transformer for Deepfake Detection by Hanqing Zhao et al
03-02-2022	Image-based material analysis of ancient historical documents by Thomas Reynolds et al
03-03-2022	Bridging the Source-to-target Gap for Cross-domain Person Re-Identification with Intermediate Domains by Yongxing Dai et al
03-03-2022	Relative distance matters for one-shot landmark detection by Qingsong Yao et al
03-02-2022	3D Common Corruptions and Data Augmentation by Oğuzhan Fatih Kar et al
03-03-2022	Correlation-Aware Deep Tracking by Fei Xie et al
03-03-2022	WPNAS: Neural Architecture Search by jointly using Weight Sharing and Predictor by Ke Lin et al
03-01-2022	Adversarial samples for deep monocular 6D object pose estimation by Jinlai Zhang et al
03-01-2022	Comprehensive Analysis of the Object Detection Pipeline on UAVs by Leon Amadeus Varga et al
03-02-2022	Unsupervised Anomaly Detection from Time-of-Flight Depth Images by Pascal Schneider et al
03-01-2022	Dense Voxel Fusion for 3D Object Detection by Anas Mahmoud et al
03-02-2022	A Split Semantic Detection Algorithm for Psychological Sandplay Image by Xiaokun Feng et al
03-02-2022	Visual Feature Encoding for GNNs on Road Networks by Oliver Stromann et al
03-02-2022	Fast and Robust Ground Surface Estimation from LIDAR Measurements using Uniform B-Splines by Sascha Wirges et al
03-01-2022	Temporal Perceiver: A General Architecture for Arbitrary Boundary Detection by Jing Tan et al
03-01-2022	JOINED : Prior Guided Multi-task Learning for Joint Optic Disc/Cup Segmentation and Fovea Detection by Huaqing He et al
03-02-2022	Container Localisation and Mass Estimation with an RGB-D Camera by Tommaso Apicella et al
03-01-2022	Incremental Transformer Structure Enhanced Image Inpainting with Masking Positional Encoding by Qiaole Dong et al
03-01-2022	SEA: Bridging the Gap Between One- and Two-stage Detector Distillation via SEmantic-aware Alignment by Yixin Chen et al
03-01-2022	Hybrid Optimized Deep Convolution Neural Network based Learning Model for Object Detection by Venkata Beri
03-02-2022	Improving Generalization of Deep Networks for Estimating Physical Properties of Containers and Fillings by Hengyi Wang et al
03-02-2022	Towards Bidirectional Arbitrary Image Rescaling: Joint Optimization and Cycle Idempotence by Zhihong Pan et al
03-02-2022	A Principled Design of Image Representation: Towards Forensic Tasks by Shuren Qi et al
03-02-2022	PUFA-GAN: A Frequency-Aware Generative Adversarial Network for 3D Point Cloud Upsampling by Hao Liu et al
03-03-2022	FairPrune: Achieving Fairness Through Pruning for Dermatological Disease Diagnosis by Yawen Wu et al
03-03-2022	STUN: Self-Teaching Uncertainty Estimation for Place Recognition by Kaiwen Cai et al
03-01-2022	FP-Loc: Lightweight and Drift-free Floor Plan-assisted LiDAR Localization by Ling Gao et al
03-01-2022	Efficient Globally-Optimal Correspondence-Less Visual Odometry for Planar Ground Vehicles by Ling Gao et al
03-02-2022	NeW CRFs: Neural Window Fully-connected CRFs for Monocular Depth Estimation by Weihao Yuan et al
03-02-2022	SoftGroup for 3D Instance Segmentation on Point Clouds by Thang Vu et al
03-02-2022	Translation Invariant Global Estimation of Heading Angle Using Sinogram of LiDAR Point Cloud by Xiaqing Ding et al
03-04-2022	MF-Hovernet: An Extension of Hovernet for Colon Nuclei Identification and Counting (CoNiC) Challenge by Vi Thi-Tuong Vo et al
03-03-2022	Debiased Batch Normalization via Gaussian Process for Generalizable Person Re-Identification by Jiawei Liu et al
03-04-2022	Rethinking Efficient Lane Detection via Curve Modeling by Zhengyang Feng et al
03-02-2022	Vision-based Large-scale 3D Semantic Mapping for Autonomous Driving Applications by Qing Cheng et al
03-04-2022	Rethinking Reconstruction Autoencoder-Based Out-of-Distribution Detection by Yibo Zhou
03-03-2022	Addressing the Shape-Radiance Ambiguity in View-Dependent Radiance Fields by Sverker Rasmuson et al
03-01-2022	Unified Physical Threat Monitoring System Aided by Virtual Building Simulation by Zenjie Li et al
03-02-2022	CD-GAN: a robust fusion-based generative adversarial network for unsupervised change detection between heterogeneous images by Jin-Ju Wang et al
03-04-2022	HDNet: High-resolution Dual-domain Learning for Spectral Compressive Imaging by Xiaowan Hu et al
03-01-2022	Runtime Detection of Executional Errors in Robot-Assisted Surgery by Zongyu Li et al
03-03-2022	Self-Supervised Ego-Motion Estimation Based on Multi-Layer Fusion of RGB and Inferred Depth by Zijie Jiang et al
03-02-2022	iMVS: Improving MVS Networks by Learning Depth Discontinuities by Nail Ibrahimli et al
03-02-2022	Sketched RT3D: How to reconstruct billions of photons per second by Julián Tachella et al
03-04-2022	Mobile authentication of copy detection patterns by Olga Taran et al
03-03-2022	Syntax-Aware Network for Handwritten Mathematical Expression Recognition by Ye Yuan et al
03-01-2022	Motion-aware Dynamic Graph Neural Network for Video Compressive Sensing by Ruiying Lu et al
03-04-2022	Transformations in Learned Image Compression from a Communication Perspective by Youneng Bao et al
03-02-2022	Aggregated Pyramid Vision Transformer: Split-transform-merge Strategy for Image Recognition without Convolutions by Rui-Yang Ju et al
03-02-2022	Class Re-Activation Maps for Weakly-Supervised Semantic Segmentation by Zhaozheng Chen et al
03-04-2022	Contextformer: A Transformer with Spatio-Channel Attention for Context Modeling in Learned Image Compression by A. Burakhan Koyuncu et al
03-03-2022	Audio-Visual Object Classification for Human-Robot Collaboration by A. Xompero et al
03-03-2022	Occlusion-Aware Cost Constructor for Light Field Depth Estimation by Yingqian Wang et al
03-02-2022	Improving Point Cloud Based Place Recognition with Ranking-based Loss and Large Batch Training by Jacek Komorowski
03-03-2022	3D Human Motion Prediction: A Survey by Kedi Lyu et al
03-04-2022	Pedestrian Stop and Go Forecasting with Hybrid Feature Fusion by Dongxu Guo et al
03-04-2022	The Familiarity Hypothesis: Explaining the Behavior of Deep Open Set Methods by Thomas G. Dietterich et al
03-03-2022	ViTransPAD: Video Transformer using convolution and self-attention for Face Presentation Attack Detection by Zuheng Ming et al
03-03-2022	3D endoscopic depth estimation using 3D surface-aware constraints by Shang Zhao et al
03-01-2022	Tricks and Plugins to GBM on Images and Sequences by Biyi Fang et al
03-02-2022	Half Wavelet Attention on M-Net+ for Low-Light Image Enhancement by Chi-Mao Fan et al
03-03-2022	Towards Universal Backward-Compatible Representation Learning by Binjie Zhang et al
03-03-2022	Multi-Tailed Vision Transformer for Efficient Inference by Yunke Wang et al
03-03-2022	Learning Incrementally to Segment Multiple Organs in a CT Image by Pengbo Liu et al
03-04-2022	Behavioural Curves Analysis Using Near-Infrared-Iris Image Sequences by L. Causa et al
03-01-2022	Low-Cost On-device Partial Domain Adaptation (LoCO-PDA): Enabling efficient CNN retraining on edge devices by Aditya Rajagopal et al
03-03-2022	Learning Category-Level Generalizable Object Manipulation Policy via Generative Adversarial Self-Imitation Learning from Demonstrations by Hao Shen et al
03-04-2022	Mixed Reality Depth Contour Occlusion Using Binocular Similarity Matching and Three-dimensional Contour Optimisation by Naye Ji et al
03-04-2022	Quantum Levenberg--Marquardt Algorithm for optimization in Bundle Adjustment by Luca Bernecker et al
03-02-2022	Spatial-Temporal Gating-Adjacency GCN for Human Motion Prediction by Chongyang Zhong et al
03-02-2022	ParaPose: Parameter and Domain Randomization Optimization for Pose Estimation using Synthetic Data by Frederik Hagelskjaer et al
03-04-2022	Differentiable Control Barrier Functions for Vision-based End-to-End Autonomous Driving by Wei Xiao et al
03-04-2022	Detecting GAN-generated Images by Orthogonal Training of Multiple CNNs by Sara Mandelli et al
03-02-2022	DN-DETR: Accelerate DETR Training by Introducing Query DeNoising by Feng Li et al
03-04-2022	AutoMO-Mixer: An automated multi-objective Mixer model for balanced, safe and robust prediction in medicine by Xi Chen et al
03-02-2022	3D object reconstruction and 6D-pose estimation from 2D shape for robotic grasping of objects by Marcell Wolnitza et al
03-03-2022	Polarity Sampling: Quality and Diversity Control of Pre-Trained Generative Networks via Singular Values by Ahmed Imtiaz Humayun et al
03-03-2022	Counting Molecules: Python based scheme for automated enumeration and categorization of molecules in scanning tunneling microscopy images by Jack Hellerstedt et al
03-02-2022	Exploring Smoothness and Class-Separation for Semi-supervised Medical Image Segmentation by Yicheng Wu et al

03-03-2022	HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction by Yunze Liu et al
03-02-2022	Colar: Effective and Efficient Online Action Detection by Consulting Exemplars by Le Yang et al
03-02-2022	MUAD: Multiple Uncertainties for Autonomous Driving benchmark for multiple uncertainty types and tasks by Gianni Franchi et al
03-02-2022	Parameterized Image Quality Score Distribution Prediction by Yixuan Gao et al
03-02-2022	TransDARC: Transformer-based Driver Activity Recognition with Latent Space Feature Calibration by Kunyu Peng et al
03-04-2022	ACVNet: Attention Concatenation Volume for Accurate and Efficient Stereo Matching by Gangwei Xu et al
03-01-2022	Knock, knock. Whos there? -- Identifying football player jersey numbers with synthetic data by Divya Bhargavi et al
03-01-2022	3D Skeleton-based Human Motion Prediction with Manifold-Aware GAN by Baptiste Chopin et al
03-02-2022	Bending Reality: Distortion-aware Transformers for Adapting to Panoramic Semantic Segmentation by Jiaming Zhang et al
03-04-2022	Voice-Face Homogeneity Tells Deepfake by Harry Cheng et al
03-04-2022	Time-to-Label: Temporal Consistency for Self-Supervised Monocular 3D Object Detection by Issa Mouawad et al
03-04-2022	Semi-parametric Makeup Transfer via Semantic-aware Correspondence by Mingrui Zhu et al
03-04-2022	Semantic-Aware Representation Blending for Multi-Label Image Recognition with Partial Labels by Tao Pu et al
03-04-2022	HyperTransformer: A Textural and Spectral Feature Fusion Transformer for Pansharpening by Wele Gedara Chaminda Bandara et al
03-01-2022	Effect of Timing Error: A Case Study of Navigation Camera by Sandeep S. Kulkarni et al
03-03-2022	Universal Segmentation of 33 Anatomies by Pengbo Liu et al
03-04-2022	Simultaneous Alignment and Surface Regression Using Hybrid 2D-3D Networks for 3D Coherent Layer Segmentation of Retina OCT Images by Hong Liu et al
03-04-2022	DetFlowTrack: 3D Multi-object Tracking based on Simultaneous Optimization of Object Detection and Scene Flow Estimation by Yueling Shen et al
03-04-2022	PatchMVSNet: Patch-wise Unsupervised Multi-View Stereo for Weakly-Textured Surface Reconstruction by Haonan Dong et al
03-04-2022	OPAL: Occlusion Pattern Aware Loss for Unsupervised Light Field Disparity Estimation by Peng Li et al
03-03-2022	Towards Rich, Portable, and Large-Scale Pedestrian Data Collection by Allan Wang et al
03-04-2022	Didnt see that coming: a survey on non-verbal social human behavior forecasting by German Barquero et al
03-03-2022	A Versatile Multi-View Framework for LiDAR-based 3D Object Detection with Guidance from Panoptic Segmentation by Hamidreza Fazlali et al
03-04-2022	Partial Wasserstein Adversarial Network for Non-rigid Point Set Registration by Zi-Ming Wang et al
03-03-2022	Robust Segmentation of Brain MRI in the Wild with Hierarchical CNNs and no Retraining by Benjamin Billot et al
03-04-2022	Real-Time Hybrid Mapping of Populated Indoor Scenes using a Low-Cost Monocular UAV by Stuart Golodetz et al
03-02-2022	What Makes Transfer Learning Work For Medical Images: Feature Reuse & Other Factors by Christos Matsoukas et al
03-03-2022	FS-COCO: Towards Understanding of Freehand Sketches of Common Objects in Context by Pinaki Nath Chowdhury et al
03-03-2022	Pseudo-Stereo for Monocular 3D Object Detection in Autonomous Driving by Yi-Nan Chen et al
03-03-2022	Color Space-based HoVer-Net for Nuclei Instance Segmentation and Classification by Hussam Azzuni et al
03-04-2022	Computer-Aided Road Inspection: Systems and Algorithms by Rui Fan et al
03-04-2022	Feature Transformation for Cross-domain Few-shot Remote Sensing Scene Classification by Qiaoling Chen et al
03-02-2022	Quality or Quantity: Toward a Unified Approach for Multi-organ Segmentation in Body CT by Fakrul Islam Tushar et al
03-01-2022	There is a Time and Place for Reasoning Beyond the Image by Xingyu Fu et al
03-03-2022	Semantic-guided Image Virtual Attribute Learning for Noisy Multi-label Chest X-ray Classification by Yuanhong Chen et al
03-04-2022	Patch Similarity Aware Data-Free Quantization for Vision Transformers by Zhikai Li et al
03-03-2022	Scribble-Supervised Medical Image Segmentation via Dual-Branch Network and Dynamically Mixed Pseudo Labels Supervision by Xiangde Luo et al
03-02-2022	Object Pose Estimation using Mid-level Visual Representations by Negar Nejatishahidin et al
03-04-2022	F2DNet: Fast Focal Detection Network for Pedestrian Detection by Abdul Hannan Khan et al
03-04-2022	Class-Aware Contrastive Semi-Supervised Learning by Fan Yang et al
03-04-2022	Characterizing Renal Structures with 3D Block Aggregate Transformers by Xin Yu et al
03-03-2022	Fast Neural Architecture Search for Lightweight Dense Prediction Networks by Lam Huynh et al
03-03-2022	Towards Benchmarking and Evaluating Deepfake Detection by Chenhao Lin et al
03-03-2022	MixCL: Pixel label matters to contrastive learning by Jun Li et al
03-04-2022	SFPN: Synthetic FPN for Object Detection by Yu-Ming Zhang et al
03-04-2022	ViT-P: Rethinking Data-efficient Vision Transformers from Locality by Bin Chen et al
03-04-2022	Convolutional Analysis Operator Learning by End-To-End Training of Iterative Neural Networks by Andreas Kofler et al
03-03-2022	A Comprehensive Review of Computer Vision in Sports: Open Issues, Future Trends and Research Directions by Banoth Thulasya Naik et al
03-02-2022	Nuclei segmentation and classification in histopathology images with StarDist for the CoNIC Challenge 2022 by Martin Weigert et al
03-03-2022	A multi-stream convolutional neural network for classification of progressive MCI in Alzheimers disease using structural MRI images by Mona Ashtari-Majlan et al
03-02-2022	Temporal Context Matters: Enhancing Single Image Prediction with Disease Progression Representations by Aishik Konwer et al
03-02-2022	Contextual Attention Network: Transformer Meets U-Net by Azad Reza et al
03-03-2022	Anomaly Detection-Inspired Few-Shot Medical Image Segmentation Through Self-Supervision With Supervoxels by Stine Hansen et al
03-02-2022	E-CIR: Event-Enhanced Continuous Intensity Recovery by Chen Song et al
03-03-2022	Sim2Real Instance-Level Style Transfer for 6D Pose Estimation by Takuya Ikeda et al
03-01-2022	Deep Temporal Interpolation of Radar-based Precipitation by Michiaki Tatsubori et al

Craig SmithMarch 7, 2022