2021.8.16 Vision papers

08-10-2021	MotionInput v2.0 supporting DirectX: A modular library of open-source gesture-based machine learning and computer vision methods for interacting and controlling existing software with a webcam by Ashild Kummen et al
08-12-2021	COVINS: Visual-Inertial SLAM for Centralized Collaboration by Patrik Schmuck et al
08-11-2021	SIDER: Single-Image Neural Optimization for Facial Geometric Detail Recovery by Aggelina Chatziagapi et al
08-11-2021	A Real-Time Online Learning Framework for Joint 3D Reconstruction and Semantic Segmentation of Indoor Scenes by Davide Menini et al
08-11-2021	Fog Simulation on Real LiDAR Point Clouds for 3D Object Detection in Adverse Weather by Martin Hahner et al
08-12-2021	Deep Amended Gradient Descent for Efficient Spectral Reconstruction from Single RGB Images by Zhiyu Zhu et al
08-10-2021	Optimal MRI Undersampling Patterns for Ultimate Benefit of Medical Vision Tasks by Artem Razumov et al
08-10-2021	FLAME-in-NeRF : Neural control of Radiance Fields for Free View Face Animation by ShahRukh Athar et al
08-13-2021	An Interpretable Algorithm for Uveal Melanoma Subtyping from Whole Slide Cytology Images by Haomin Chen et al
08-11-2021	Semi-Supervised Domain Generalizable Person Re-Identification by Lingxiao He et al
08-11-2021	Learning to Rearrange Voxels in Binary Segmentation Masks for Smooth Manifold Triangulation by Jianning Li et al
08-12-2021	Trash to Treasure: Harvesting OOD Data with Cross-Modal Matching for Open-Set Semi-Supervised Learning by Junkai Huang et al
08-12-2021	Robotic Testbed for Rendezvous and Optical Navigation: Multi-Source Calibration and Machine Learning Use Cases by Tae Ha Park et al
08-11-2021	Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning by Abdullah Abuolaim et al
08-11-2021	Deep Learning Classification of Lake Zooplankton by S. P. Kyathanahally et al
08-10-2021	First Order Locally Orderless Registration by Sune Darkner et al
08-10-2021	Embodied BERT: A Transformer Model for Embodied, Language-guided Visual Task Completion by Alessandro Suglia et al
08-10-2021	Method Towards CVPR 2021 Image Matching Challenge by Xiaopeng Bi et al
08-12-2021	Deep Microlocal Reconstruction for Limited-Angle Tomography by Héctor Andrade-Loarca et al
08-10-2021	Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition by Ziwei Xu et al
08-13-2021	FedPara: Low-rank Hadamard Product Parameterization for Efficient Federated Learning by Nam Hyeon-Woo et al
08-11-2021	FakeAVCeleb: A Novel Audio-Video Multimodal Deepfake Dataset by Hasam Khalid et al
08-12-2021	DARTS for Inverse Problems: a Study on Hyperparameter Sensitivity by Jonas Geiping et al
08-12-2021	Resetting the baseline: CT-based COVID-19 diagnosis with Deep Transfer Learning is not as accurate as widely thought by Fouzia Altaf et al
08-11-2021	Weakly Supervised Medical Image Segmentation by Pedro H. T. Gama et al
08-11-2021	NI-UDA: Graph Adversarial Domain Adaptation from Non-shared-and-Imbalanced Big Data to Small Imbalanced Applications by Guangyi Xiao et al
08-11-2021	Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report Generation With Alternate Learning by Guangyi Liu et al
08-12-2021	Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation) by Yunzhong Hou et al
08-13-2021	Modal-Adaptive Gated Recoding Network for RGB-D Salient Object Detection by Feng Dong et al
08-10-2021	BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis by Masoud Monajatipoor et al
08-12-2021	Unconditional Scene Graph Generation by Sarthak Garg et al
08-12-2021	Mobile-Former: Bridging MobileNet and Transformer by Yinpeng Chen et al
08-12-2021	MicroNet: Improving Image Recognition with Extremely Low FLOPs by Yunsheng Li et al
08-12-2021	Semantic Concentration for Domain Adaptation by Shuang Li et al
08-12-2021	MT-ORL: Multi-Task Occlusion Relationship Learning by Panhe Feng et al
08-11-2021	Representation Learning for Remote Sensing: An Unsupervised Sensor Fusion Approach by Aidan M. Swope et al
08-11-2021	Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation by Xiaoqi Zhao et al
08-11-2021	Voxel-level Importance Maps for Interpretable Brain Age Estimation by Kyriaki-Margarita Bintsi et al
08-11-2021	The Pitfalls of Sample Selection: A Case Study on Lung Nodule Classification by Vasileios Baltatzis et al
08-11-2021	Automatic Gaze Analysis: A Survey of DeepLearning based Approaches by Shreya Ghosh et al
08-12-2021	Learning Visual Affordance Grounding from Demonstration Videos by Hongchen Luo et al
08-13-2021	Progressive Representative Labeling for Deep Semi-Supervised Learning by Xiaopeng Yan et al
08-13-2021	Coupling Model-Driven and Data-Driven Methods for Remote Sensing Image Restoration and Fusion by Huanfeng Shen et al
08-11-2021	Rethinking Coarse-to-Fine Approach in Single Image Deblurring by Sung-Jin Cho et al
08-12-2021	m-RevNet: Deep Reversible Neural Networks with Momentum by Duo Li et al
08-12-2021	Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision by Xiaoshi Wu et al
08-11-2021	Learning Bias-Invariant Representation by Cross-Sample Mutual Information Minimization by Wei Zhu et al
08-10-2021	On the Effect of Pruning on Adversarial Robustness by Artur Jordao et al
08-12-2021	PixelSynth: Generating a 3D-Consistent Experience from a Single Image by Chris Rockwell et al
08-10-2021	How Self-Supervised Learning Can be Used for Fine-Grained Head Pose Estimation? by Mahdi Pourmirzaei et al
08-10-2021	Interpreting Generative Adversarial Networks for Interactive Image Generation by Bolei Zhou
08-10-2021	Learning Canonical 3D Object Representation for Fine-Grained Recognition by Sunghun Joung et al
08-12-2021	Alzheimers Disease Diagnosis via Deep Factorization Machine Models by Raphael Ronge et al
08-12-2021	Distributional Depth-Based Estimation of Object Articulation Models by Ajinkya Jain et al
08-10-2021	U-Net-and-a-half: Convolutional network for biomedical image segmentation using multiple expert-driven annotations by Yichi Zhang et al
08-13-2021	Robustness testing of AI systems: A case study for traffic sign recognition by Christian Berghoff et al
08-10-2021	SP-GAN: Sphere-Guided 3D Shape Generation and Manipulation by Ruihui Li et al
08-10-2021	Scalable Reverse Image Search Engine for NASAWorldview by Abhigya Sodani et al
08-10-2021	TBNet:Two-Stream Boundary-aware Network for Generic Image Manipulation Localization by Zan Gao et al
08-11-2021	Statistical Dependency Guided Contrastive Learning for Multiple Labeling in Prenatal Ultrasound by Shuangchi He et al
08-10-2021	Exploiting Features with Split-and-Share Module by Jaemin Lee et al
08-11-2021	Towards Top-Down Just Noticeable Difference Estimation of Natural Images by Qiuping Jiang et al
08-11-2021	An Approach to Partial Observability in Games: Learning to Both Act and Observe by Elizabeth Gilmour et al
08-11-2021	Few-Shot Segmentation with Global and Local Contrastive Learning by Weide Liu et al
08-12-2021	AGKD-BML: Defense Against Adversarial Attack by Attention Guided Knowledge Distillation and Bi-directional Metric Learning by Hong Wang et al
08-12-2021	UniFaceGAN: A Unified Framework for Temporally Consistent Facial Video Editing by Meng Cao et al
08-12-2021	Conditional Temporal Variational AutoEncoder for Action Video Prediction by Xiaogang Xu et al
08-13-2021	Pruning vs XNOR-Net: A Comprehensive Study on Deep Learning for Audio Classification in Microcontrollers by Md Mohaimenuzzaman et al
08-13-2021	Learning Transferable Parameters for Unsupervised Domain Adaptation by Zhongyi Han et al
08-10-2021	BIDCD - Bosch Industrial Depth Completion Dataset by Adam Botach et al
08-13-2021	Generative Zero-Shot Learning for Semantic Segmentation of 3D Point Cloud by Björn Michele et al
08-10-2021	FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling Network by Qiang Hou et al
08-10-2021	Method Towards CVPR 2021 SimLocMatch Challenge by Xiaopeng Bi et al
08-12-2021	Deep Motion Prior for Weakly-Supervised Temporal Action Localization by Meng Cao et al
08-12-2021	Multi-Modal MRI Reconstruction with Spatial Alignment Network by Kai Xuan et al
08-11-2021	Learning Oculomotor Behaviors from Scanpath by Beibin Li et al
08-11-2021	One-Sided Box Filter for Edge Preserving Image Smoothing by Yuanhao Gong
08-11-2021	Boosting the Generalization Capability in Cross-Domain Few-shot Learning via Noise-enhanced Supervised Autoencoder by Hanwen Liang et al
08-11-2021	Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization by Pilhyeon Lee et al
08-10-2021	Differentiable Surface Rendering via Non-Differentiable Sampling by Forrester Cole et al
08-12-2021	LabOR: Labeling Only if Required for Domain Adaptive Semantic Segmentation by Inkyu Shin et al
08-12-2021	Non-imaging real-time detection and tracking of fast-moving objects by Fengming Zhou et al
08-10-2021	Reference-based Defect Detection Network by Zhaoyang Zeng et al
08-10-2021	Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds by Chaoda Zheng et al
08-10-2021	MetaPose: Fast 3D Pose from Multiple Views without 3D Supervision by Ben Usman et al
08-10-2021	AuraSense: Robot Collision Avoidance by Full Surface Proximity Detection by Xiaoran Fan et al
08-13-2021	SimCVD: Simple Contrastive Voxel-Wise Representation Distillation for Semi-Supervised Medical Image Segmentation by Chenyu You et al
08-11-2021	Instance-weighted Central Similarity for Multi-label Image Retrieval by Zhiwei Zhang et al
08-12-2021	DIODE: Dilatable Incremental Object Detection by Can Peng et al
08-12-2021	A Systematic Benchmarking Analysis of Transfer Learning for Medical Image Analysis by Mohammad Reza Hosseinzadeh Taher et al
08-12-2021	Continual Neural Mapping: Learning An Implicit Scene Representation from Sequential Observations by Zike Yan et al
08-10-2021	White blood cell subtype detection and classification by Nalla Praveen et al
08-10-2021	Multi-Camera Trajectory Forecasting with Trajectory Tensors by Olly Styles et al
08-12-2021	perf4sight: A toolflow to model CNN training performance on Edge GPUs by Aditya Rajagopal et al
08-12-2021	DnD: Dense Depth Estimation in Crowded Dynamic Indoor Scenes by Dongki Jung et al
08-12-2021	Cascade Bagging for Accuracy Prediction with Few Training Samples by Ruyi Zhang et al
08-10-2021	Hand Pose Classification Based on Neural Networks by Rashmi Bakshi
08-10-2021	Multigranular Visual-Semantic Embedding for Cloth-Changing Person Re-identification by Zan Gao et al

08-10-2021	Multi-domain Collaborative Feature Representation for Robust Visual Object Tracking by Jiqing Zhang et al
08-10-2021	CPNet: Cross-Parallel Network for Efficient Anomaly Detection by Youngsaeng Jin et al
08-13-2021	Point-Voxel Transformer: An Efficient Approach To 3D Deep Learning by Cheng Zhang et al
08-12-2021	HandFoldingNet: A 3D Hand Pose Estimation Network Using Multiscale-Feature Guided Folding of a 2D Hand Skeleton by Wencan Cheng et al
08-10-2021	Learning Fair Face Representation With Progressive Cross Transformer by Yong Li et al
08-13-2021	Bi-Temporal Semantic Reasoning for the Semantic Change Detection of HR Remote Sensing Images by Lei Ding et al
08-12-2021	Silhouette based View embeddings for Gait Recognition under Multiple Views by Tianrui Chai et al
08-10-2021	SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution with Skip-Transformer by Peng Xiang et al
08-10-2021	Domain-Aware Universal Style Transfer by Kibeom Hong et al
08-12-2021	TF-Blender: Temporal Feature Blender for Video Object Detection by Yiming Cui et al
08-10-2021	Self-supervised Consensus Representation Learning for Attributed Graph by Changshu Liu et al
08-12-2021	Billion-Scale Pretraining with Vision Transformers for Multi-Task Visual Representations by Josh Beal et al
08-10-2021	ASMR: Learning Attribute-Based Person Search with Adaptive Semantic Margin Regularizer by Boseung Jeong et al
08-10-2021	Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition by Tailin Chen et al
08-11-2021	Self-supervised Contrastive Learning for Irrigation Detection in Satellite Imagery by Chitra Agastya et al
08-11-2021	Zero-Shot Domain Adaptation with a Physics Prior by Attila Lengyel et al
08-11-2021	Towards Interpretable Deep Networks for Monocular Depth Estimation by Zunzhi You et al
08-10-2021	Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion by Yikai Wang et al
08-12-2021	CODEs: Chamfer Out-of-Distribution Examples against Overconfidence Issue by Keke Tang et al
08-10-2021	A Transformer-based Math Language Model for Handwritten Math Expression Recognition by Huy Quang Ung et al
08-13-2021	Effective semantic segmentation in Cataract Surgery: What matters most? by Theodoros Pissas et al
08-13-2021	UMFA: A photorealistic style transfer method based on U-Net and multi-layer feature aggregation by D. Y. Rao et al
08-12-2021	Track without Appearance: Learn Box and Tracklet Embedding with Local and Global Motion Patterns for Vehicle Tracking by Gaoang Wang et al
08-10-2021	Semantics-STGCNN: A Semantics-guided Spatial-Temporal Graph Convolutional Network for Multi-class Trajectory Prediction by Ben A. Rainbow et al
08-12-2021	Memory-based Semantic Segmentation for Off-road Unstructured Natural Environments by Youngsaeng Jin et al
08-12-2021	Spatio-Temporal Human Action Recognition Modelwith Flexible-interval Sampling and Normalization by Yuke et al
08-12-2021	3D-SiamRPN: An End-to-End Learning Method for Real-Time 3D Single Object Tracking Using Raw Point Cloud by Zheng Fang et al
08-10-2021	Instance-wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation by Weilun Wang et al
08-10-2021	TrUMAn: Trope Understanding in Movies and Animations by Hung-Ting Su et al
08-11-2021	Attention-driven Graph Clustering Network by Zhihao Peng et al
08-11-2021	Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution by Jingyun Liang et al
08-11-2021	Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling by Jingyun Liang et al
08-11-2021	Video Transformer for Deepfake Detection with Incremental Learning by Sohail A. Khan et al
08-11-2021	ConvNets vs. Transformers: Whose Visual Representations are More Transferable? by Hong-Yu Zhou et al
08-11-2021	A Better Loss for Visual-Textual Grounding by Davide Rigoni et al
08-10-2021	VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows by Xiao Wang et al
08-10-2021	Prototype Completion for Few-Shot Learning by Baoquan Zhang et al
08-10-2021	Iterative Self-consistent Parallel Magnetic Resonance Imaging Reconstruction based on Nonlocal Low-Rank Regularization by Ting Pan et al
08-13-2021	IFR: Iterative Fusion Based Recognizer For Low Quality Scene Text Recognition by Zhiwei Jia et al
08-13-2021	Detection and Captioning with Unseen Object Classes by Berkan Demirel et al
08-13-2021	3D point cloud segmentation using GIS by Chao-Jung Liu et al
08-11-2021	Two is a crowd: tracking relations in videos by Artem Moskalev et al
08-10-2021	Understanding Character Recognition using Visual Explanations Derived from the Human Visual System and Deep Networks by Chetan Ralekar et al
08-12-2021	Oriented R-CNN for Object Detection by Xingxing Xie et al
08-11-2021	Mounting Video Metadata on Transformer-based Language Model for Open-ended Video Question Answering by Donggeon Lee et al
08-12-2021	Progressive Coordinate Transforms for Monocular 3D Object Detection by Li Wang et al
08-12-2021	iButter: Neural Interactive Bullet Time Generator for Human Free-viewpoint Rendering by Liao Wang et al
08-13-2021	Evaluating the Robustness of Semantic Segmentation for Autonomous Driving against Real-World Adversarial Patch Attacks by Federico Nesti et al
08-12-2021	Deep Camera Obscura: An Image Restoration Pipeline for Lensless Pinhole Photography by Joshua D. Rego et al
08-12-2021	Improving Ranking Correlation of Supernet with Candidates Enhancement and Progressive Training by Ziwei Yang et al
08-10-2021	An Image-based Generator Architecture for Synthetic Image Refinement by Alex Nasser
08-13-2021	Towards Efficient Point Cloud Graph Neural Networks Through Architectural Simplification by Shyam A. Tailor et al
08-12-2021	Presenting an extensive lab- and field-image dataset of crops and weeds for computer vision tasks in agriculture by Michael A. Beck et al
08-12-2021	Patchwork: Concentric Zone-based Region-wise Ground Segmentation with Ground Likelihood Estimation Using a 3D LiDAR Sensor by Hyungtae Lim et al
08-12-2021	Vision-Language Transformer and Query Generation for Referring Segmentation by Henghui Ding et al
08-10-2021	Deep Metric Learning for Open World Semantic Segmentation by Jun Cen et al
08-13-2021	Full-resolution quality assessment for pansharpening by Giuseppe Scarpa et al
08-10-2021	R4Dyn: Exploring Radar for Self-Supervised Monocular Depth Estimation of Dynamic Scenes by Stefano Gasperini et al
08-10-2021	The Effect of the Loss on Generalization: Empirical Study on Synthetic Lung Nodule Data by Vasileios Baltatzis et al
08-13-2021	SVC-onGoing: Signature Verification Competition by Ruben Tolosana et al
08-12-2021	Logit Attenuating Weight Normalization by Aman Gupta et al
08-12-2021	AdaFit: Rethinking Learning-based Normal Estimation on Point Clouds by Runsong Zhu et al
08-10-2021	SUNet: Symmetric Undistortion Network for Rolling Shutter Correction by Bin Fan et al
08-11-2021	ProAI: An Efficient Embedded AI Hardware for Automotive Applications - a Benchmark Study by Sven Mantowsky et al
08-12-2021	DiagViB-6: A Diagnostic Benchmark Suite for Vision Models in the Presence of Shortcut and Generalization Opportunities by Elias Eulig et al
08-11-2021	Deep PET/CT fusion with Dempster-Shafer theory for lymphoma segmentation by Ling Huang et al
08-13-2021	EEEA-Net: An Early Exit Evolutionary Neural Architecture Search by Chakkrit Termritthikun et al
08-13-2021	Conditional DETR for Fast Training Convergence by Depu Meng et al
08-10-2021	Meta-repository of screening mammography classifiers by Benjamin Stadnick et al
08-13-2021	A Generative Adversarial Framework for Optimizing Image Matting and Harmonization Simultaneously by Xuqian Ren et al
08-12-2021	Correlate-and-Excite: Real-Time Stereo Matching via Guided Cost Volume Excitation by Antyanta Bangunharcana et al
08-13-2021	Dual Path Learning for Domain Adaptation of Semantic Segmentation by Yiting Cheng et al
08-10-2021	Simple black-box universal adversarial attacks on medical image classification based on deep neural networks by Kazuki Koga et al
08-10-2021	Elastic Tactile Simulation Towards Tactile-Visual Perception by Yikai Wang et al
08-12-2021	Towards Interpretable Deep Metric Learning with Structural Matching by Wenliang Zhao et al
08-12-2021	MUSIQ: Multi-scale Image Quality Transformer by Junjie Ke et al
08-10-2021	Joint Multi-Object Detection and Tracking with Camera-LiDAR Fusion for Autonomous Driving by Kemiao Huang et al
08-12-2021	MISS GAN: A Multi-IlluStrator Style Generative Adversarial Network for image to illustration translation by Noa Barzilay et al
08-11-2021	Preventing Catastrophic Forgetting and Distribution Mismatch in Knowledge Distillation via Synthetic Data by Kuluhan Binici et al
08-10-2021	Known Operator Learning and Hybrid Machine Learning in Medical Imaging --- A Review of the Past, the Present, and the Future by Andreas Maier et al
08-11-2021	Efficient Surfel Fusion Using Normalised Information Distance by Louis Gallagher et al
08-11-2021	Discriminative Distillation to Reduce Class Confusion in Continual Learning by Changhong Zhong et al
08-11-2021	Distilling Holistic Knowledge with Graph Neural Networks by Sheng Zhou et al
08-10-2021	UniNet: A Unified Scene Understanding Network and Exploring Multi-Task Relationships through the Lens of Adversarial Attacks by NareshKumar Gurulingan et al
08-11-2021	Person Re-identification via Attention Pyramid by Guangyi Chen et al
08-11-2021	Cervical Optical Coherence Tomography Image Classification Based on Contrastive Self-Supervised Texture Learning by Kaiyi Chen et al
08-11-2021	Automatic Polyp Segmentation via Multi-scale Subtraction Network by Xiaoqi Zhao et al
08-12-2021	DexMV: Imitation Learning for Dexterous Manipulation from Human Videos by Yuzhe Qin et al
08-11-2021	Mining the Benefits of Two-stage and One-stage HOI Detection by Aixi Zhang et al
08-11-2021	M3D-VTON: A Monocular-to-3D Virtual Try-On Network by Fuwei Zhao et al
08-10-2021	Depth Infused Binaural Audio Generation using Hierarchical Cross-Modal Attention by Kranti Kumar Parida et al
08-13-2021	CNN-based Two-Stage Parking Slot Detection Using Region-Specific Multi-Scale Feature Extraction by Quang Huy Bui et al
08-13-2021	SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environments by Jiafei Duan et al
08-13-2021	Detecting socially interacting groups using f-formation: A survey of taxonomy, methods, datasets, applications, challenges, and future research directions by Hrishav Bakul Barua et al
08-12-2021	TVT: Transferable Vision Transformer for Unsupervised Domain Adaptation by Jinyu Yang et al
08-11-2021	MultiTask-CenterNet (MCN): Efficient and Diverse Multitask Learning using an Anchor Free Approach by Falk Heuer et al

Craig SmithAugust 17, 2021