2021.5.10 Vision papers

05-04-2021	A Fast Partial Video Copy Detection Using KNN and Global Feature Database by Weijun Tan et al
05-04-2021	The Pursuit of Knowledge: Discovering and Localizing Novel Categories using Dual Memory by Sai Saketh Rambhatla et al
05-06-2021	Generalizable Representation Learning for Mixture Domain Face Anti-Spoofing by Zhihong Chen et al
05-07-2021	LINN: Lifting Inspired Invertible Neural Network for Image Denoising by Jun-Jie Huang et al
05-06-2021	BasisNet: Two-stage Model Synthesis for Efficient Inference by Mingda Zhang et al
05-04-2021	Poisoning the Unlabeled Dataset of Semi-Supervised Learning by Nicholas Carlini
05-04-2021	An Empirical Review of Deep Learning Frameworks for Change Detection: Model Design, Experimental Frameworks, Challenges and Research Needs by Murari Mandal et al
05-07-2021	Energy-Based Anomaly Detection and Localization by Ergin Utku Genc et al
05-06-2021	Q-Match: Iterative Shape Matching via Quantum Annealing by Marcel Seelbach Benkner et al
05-06-2021	Deep Polarization Imaging for 3D shape and SVBRDF Acquisition by Valentin Deschaintre et al
05-06-2021	Aligning Subtitles in Sign Language Videos by Hannah Bull et al
05-04-2021	Hallucination Improves Few-Shot Object Detection by Weilin Zhang et al
05-04-2021	LAFFNet: A Lightweight Adaptive Feature Fusion Network for Underwater Image Enhancement by Hao-Hsiang Yang et al
05-04-2021	Uncertainty-aware INVASE: Enhanced Breast Cancer Diagnosis Feature Selection by Jia-Xing Zhong et al
05-05-2021	PD-GAN: Probabilistic Diverse GAN for Image Inpainting by Hongyu Liu et al
05-04-2021	Technical Report for Valence-Arousal Estimation on Affwild2 Dataset by I-Hsuan Li
05-05-2021	DeepSMOTE: Fusing Deep Learning and SMOTE for Imbalanced Data by Damien Dablain et al
05-04-2021	One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment by Qigong Sun et al
05-07-2021	Towards Real-World Category-level Articulation Pose Estimation by Liu Liu et al
05-05-2021	Impact of individual rater style on deep learning uncertainty in medical imaging segmentation by Olivier Vincent et al
05-05-2021	Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention by Wei Suo et al
05-05-2021	Self-Supervised Multi-Frame Monocular Scene Flow by Junhwa Hur et al
05-04-2021	Multipath Graph Convolutional Neural Networks by Rangan Das et al
05-04-2021	Where and When: Space-Time Attention for Audio-Visual Explanations by Yanbei Chen et al
05-04-2021	Remote Pathological Gait Classification System by Pedro Albuquerque et al
05-06-2021	Inverting Generative Adversarial Renderer for Face Reconstruction by Jingtan Piao et al
05-06-2021	Towards Novel Target Discovery Through Open-Set Domain Adaptation by Taotao Jing et al
05-06-2021	Weakly Supervised Action Selection Learning in Video by Junwei Ma et al
05-04-2021	Dual-Cross Central Difference Network for Face Anti-Spoofing by Zitong Yu et al
05-06-2021	Do You Even Need Attention? A Stack of Feed-Forward Layers Does Surprisingly Well on ImageNet by Luke Melas-Kyriazi
05-06-2021	Sparse convolutional context-aware multiple instance learning for whole slide image classification by Marvin Lerousseau et al
05-07-2021	NTIRE 2021 Challenge on Perceptual Image Quality Assessment by Jinjin Gu et al
05-05-2021	Bayesian Logistic Shape Model Inference: application to cochlea image segmentation by Wang Zihao et al
05-06-2021	Federated Face Recognition by Fan Bai et al
05-04-2021	Motion-Augmented Self-Training for Video Recognition at Smaller Scale by Kirill Gavrilyuk et al
05-06-2021	Online Preconditioning of Experimental Inkjet Hardware by Bayesian Optimization in Loop by Alexander E. Siemenn et al
05-05-2021	Rethinking Ultrasound Augmentation: A Physics-Inspired Approach by Maria Tirindelli et al
05-05-2021	PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond by Enze Xie et al
05-05-2021	Addressing Annotation Imprecision for Tree Crown Delineation Using the RandCrowns Index by Dylan Stewart et al
05-05-2021	Content4All Open Research Sign Language Translation Datasets by Necati Cihan Camgoz et al
05-05-2021	Novelty Detection and Analysis of Traffic Scenario Infrastructures in the Latent Space of a Vision Transformer-Based Triplet Autoencoder by Jonas Wurst et al
05-06-2021	Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling? by Yue Song et al
05-05-2021	Spatio-Temporal Matching for Siamese Visual Tracking by Jinpu Zhang et al
05-05-2021	In the Danger Zone: U-Net Driven Quantile Regression can Predict High-risk SARS-CoV-2 Regions via Pollutant Particulate Matter and Satellite Imagery by Jacquelyn Shelton et al
05-04-2021	3D Vehicle Detection Using Camera and Low-Resolution LiDAR by Lin Bai et al
05-04-2021	Moving Towards Centers: Re-ranking with Attention and Memory for Re-identification by Yunhao Zhou et al
05-07-2021	ResMLP: Feedforward networks for image classification with data-efficient training by Hugo Touvron et al
05-04-2021	Representation Learning for Clustering via Building Consensus by Aniket Anand Deshmukh et al
05-04-2021	Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis by Tiange Xiang et al
05-06-2021	Deep Weighted Consensus: Dense correspondence confidence maps for 3D shape registration by Dvir Ginzburg et al
05-07-2021	Self-Adaptive Transfer Learning for Multicenter Glaucoma Classification in Fundus Retina Images by Yiming Bao et al
05-07-2021	Contrastive Learning for Unsupervised Image-to-Image Translation by Hanbit Lee et al
05-06-2021	VideoLT: Large-scale Long-tailed Video Recognition by Xing Zhang et al
05-04-2021	Self-Improving Semantic Perception on a Construction Robot by Hermann Blum et al
05-04-2021	CUAB: Convolutional Uncertainty Attention Block Enhanced the Chest X-ray Image Analysis by Chi-Shiang Wang et al
05-05-2021	VoxelContext-Net: An Octree based Framework for Point Cloud Compression by Zizheng Que et al
05-05-2021	Pairwise Point Cloud Registration using Graph Matching and Rotation-invariant Features by Rong Huang et al
05-06-2021	Animatable Neural Radiance Fields for Human Body Modeling by Sida Peng et al
05-05-2021	Contrastive Learning and Self-Training for Unsupervised Domain Adaptation in Semantic Segmentation by Robert A. Marsden et al
05-04-2021	Texture for Colors: Natural Representations of Colors Using Variable Bit-Depth Textures by Shumeet Baluja
05-07-2021	Human Object Interaction Detection using Two-Direction Spatial Enhancement and Exclusive Object Prior by Lu Liu et al
05-06-2021	Salient Objects in Clutter by Deng-Ping Fan et al
05-06-2021	Few-Shot Learning for Image Classification of Common Flora by Joshua Ball
05-07-2021	Self-paced Resistance Learning against Overfitting on Noisy Labels by Xiaoshuang Shi et al
05-04-2021	Weak Multi-View Supervision for Surface Mapping Estimation by Nishant Rai et al
05-07-2021	Neural 3D Scene Compression via Model Compression by Berivan Isik
05-05-2021	Prototype Memory for Large-scale Face Representation Learning by Evgeny Smirnov et al
05-05-2021	Perceptual Gradient Networks by Dmitry Nikulin et al
05-04-2021	Leveraging Third-Order Features in Skeleton-Based Action Recognition by Zhenyue Qin et al
05-06-2021	Adaptive Domain-Specific Normalization for Generalizable Person Re-Identification by Jiawei Liu et al
05-04-2021	MLP-Mixer: An all-MLP Architecture for Vision by Ilya Tolstikhin et al
05-05-2021	Exploring Explicit and Implicit Visual Relationships for Image Captioning by Zeliang Song et al
05-06-2021	Saliency-Guided Deep Learning Network for Automatic Tumor Bed Volume Delineation in Post-operative Breast Irradiation by Mahdieh Kazemimoghadam et al
05-07-2021	Probabilistic Ranking-Aware Ensembles for Enhanced Object Detections by Mingyuan Mao et al
05-07-2021	Interpretable Social Anchors for Human Trajectory Forecasting in Crowds by Parth Kothari et al
05-05-2021	SIPSA-Net: Shift-Invariant Pan Sharpening with Moving Object Alignment for Satellite Imagery by Jaehyup Lee et al
05-05-2021	Learning Feature Aggregation for Deep 3D Morphable Models by Zhixiang Chen et al
05-04-2021	Combining Supervised and Un-supervised Learning for Automatic Citrus Segmentation by Heqing Huang et al
05-06-2021	Unsupervised Visual Representation Learning by Tracking Patches in Video by Guangting Wang et al
05-06-2021	A Novel Falling-Ball Algorithm for Image Segmentation by Asra Aslam et al
05-06-2021	Understanding Catastrophic Overfitting in Adversarial Training by Peilin Kang et al
05-07-2021	A State-of-the-art Survey of Object Detection Techniques in Microorganism Image Analysis: from Traditional Image Processing and Classical Machine Learning to Current Deep Convolutional Neural Networks and Potential Visual Transformers by Chen Li et al
05-07-2021	An Intelligent Passive Food Intake Assessment System with Egocentric Cameras by Frank Po Wen Lo et al
05-06-2021	Faster and Simpler Siamese Network for Single Object Tracking by Shaokui Jiang et al
05-06-2021	Quantification of pulmonary involvement in COVID-19 pneumonia by means of a cascade of two U-nets: training and assessment on multiple datasets using different annotation criteria by Francesca Lizzi et al
05-04-2021	PingAn-VCGroups Solution for ICDAR 2021 Competition on Scientific Table Image Recognition to Latex by Yelin He et al
05-05-2021	Physically Inspired Dense Fusion Networks for Relighting by Amirsaeed Yazdani et al
05-05-2021	Continual Learning on the Edge with TensorFlow Lite by Giorgos Demosthenous et al
05-04-2021	Curvatures of Stiefel manifolds with deformation metrics by Du Nguyen
05-06-2021	A novel method of predictive collision risk area estimation for proactive pedestrian accident prevention system in urban surveillance infrastructure by Byeongjoon Noh et al
05-04-2021	COVID-19 Detection from Chest X-ray Images using Imprinted Weights Approach by Jianxing Zhang et al
05-04-2021	PingAn-VCGroups Solution for ICDAR 2021 Competition on Scientific Literature Parsing Task B: Table Recognition to HTML by Jiaquan Ye et al
05-05-2021	Attention for Image Registration (AiR): an unsupervised Transformer approach by Zihao Wang et al
05-04-2021	Real-time Face Mask Detection in Video Data by Yuchen Ding et al
05-06-2021	LASR: Learning Articulated Shape Reconstruction from a Monocular Video by Gengshan Yang et al
05-05-2021	MiCE: Mixture of Contrastive Experts for Unsupervised Image Clustering by Tsung Wei Tsai et al
05-07-2021	More Separable and Easier to Segment: A Cluster Alignment Method for Cross-Domain Semantic Segmentation by Shuang Wang et al
05-07-2021	Toward Interactive Modulation for Photo-Realistic Image Restoration by Haoming Cai et al
05-05-2021	Multi-scale Image Decomposition using a Local Statistical Edge Model by Kin-Ming Wong
05-05-2021	Visual Composite Set Detection Using Part-and-Sum Transformers by Qi Dong et al
05-04-2021	TransHash: Transformer-based Hamming Hashing for Efficient Image Retrieval by Yongbiao Chen et al
05-04-2021	Lesion Segmentation and RECIST Diameter Prediction via Click-driven Attention and Dual-path Connection by Youbao Tang et al
05-06-2021	Computer-Aided Design as Language by Yaroslav Ganin et al
05-05-2021	Split and Connect: A Universal Tracklet Booster for Multi-Object Tracking by Gaoang Wang et al
05-06-2021	MAFER: a Multi-resolution Approach to Facial Expression Recognition by Fabio Valerio Massoli et al
05-05-2021	R2U3D: Recurrent Residual 3D U-Net for Lung Segmentation by Dhaval D. Kadia et al
05-05-2021	A Step Toward More Inclusive People Annotations for Fairness by Candice Schumann et al
05-04-2021	Generative Adversarial Networks (GAN) Powered Fast Magnetic Resonance Imaging -- Mini Review, Comparison and Perspectives by Guang Yang et al
05-04-2021	Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation by Guang Feng et al

05-05-2021	4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface by Yang Li et al
05-06-2021	Pose-Guided Sign Language Video GAN with Dynamic Lambda by Christopher Kissel et al
05-06-2021	Vision based Pedestrian Potential Risk Analysis based on Automated Behavior Feature Extraction for Smart and Safe City by Byeongjoon Noh et al
05-06-2021	Estimating Presentation Competence using Multimodal Nonverbal Behavioral Cues by Ömer Sümer et al
05-07-2021	Adv-Makeup: A New Imperceptible and Transferable Attack on Face Recognition by Bangjie Yin et al
05-05-2021	This Looks Like That... Does it? Shortcomings of Latent Space Prototype Explainability in Deep Networks by Adrian Hoffmann et al
05-05-2021	Image Embedding and Model Ensembling for Automated Chest X-Ray Interpretation by Edoardo Giacomello et al
05-05-2021	QueryInst: Parallelly Supervised Mask Query for Instance Segmentation by Yuxin Fang et al
05-04-2021	Effectively Leveraging Attributes for Visual Similarity by Samarth Mishra et al
05-05-2021	SeaDronesSee: A Maritime Benchmark for Detecting Humans in Open Water by Leon Amadeus Varga et al
05-07-2021	A^2-FPN: Attention Aggregation based Feature Pyramid Network for Instance Segmentation by Miao Hu et al
05-05-2021	Conditional Invertible Neural Networks for Diverse Image-to-Image Translation by Lynton Ardizzone et al
05-04-2021	Robustness Enhancement of Object Detection in Advanced Driver Assistance Systems (ADAS) by Le-Anh Tran et al
05-05-2021	Function4D: Real-time Human Volumetric Capture from Very Sparse Consumer RGBD Sensors by Tao Yu et al
05-05-2021	Towards an efficient framework for Data Extraction from Chart Images by Weihong Ma et al
05-04-2021	COVID-Net CT-S: 3D Convolutional Neural Network Architectures for COVID-19 Severity Assessment using Chest CT Images by Hossein Aboutalebi et al
05-04-2021	Computer vision for liquid samples in hospitals and medical labs using hierarchical image segmentation and relations prediction by Sagi Eppel et al
05-05-2021	MODS -- A USV-oriented object detection and obstacle segmentation benchmark by Borja Bovcon et al
05-05-2021	Instance segmentation of fallen trees in aerial color infrared imagery using active multi-contour evolution with fully convolutional network-based intensity priors by Przemyslaw Polewski et al
05-07-2021	Autoencoder Based Inter-Vehicle Generalization for In-Cabin Occupant Classification by Steve Dias Da Cruz et al
05-05-2021	AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss by Yangyang Guo et al
05-06-2021	Learning Skeletal Articulations with Neural Blend Shapes by Peizhuo Li et al
05-06-2021	Cascade Image Matting with Deformable Graph Refinement by Zijian Yu et al
05-06-2021	Two4Two: Evaluating Interpretable Machine Learning - A Synthetic Dataset For Controlled Experiments by Martin Schuessler et al
05-04-2021	Attention-based Stylisation for Exemplar Image Colourisation by Marc Gorriz Blanch et al
05-05-2021	Towards Self-Supervision for Video Identification of Individual Holstein-Friesian Cattle: The Cows2021 Dataset by Jing Gao et al
05-05-2021	FLEX: Parameter-free Multi-view 3D Human Motion Reconstruction by Brian Gordon et al
05-04-2021	Intensity Harmonization for Airborne LiDAR by David Jones et al
05-05-2021	Moving SLAM: Fully Unsupervised Deep Learning in Non-Rigid Scenes by Dan Xu et al
05-04-2021	Joint Registration and Segmentation via Multi-Task Learning for Adaptive Radiotherapy of Prostate Cancer by Mohamed S. Elmahdy et al
05-05-2021	Real-time Multi-Adaptive-Resolution-Surfel 6D LiDAR Odometry using Continuous-time Trajectory Optimization by Jan Quenzel et al
05-06-2021	Learning Neighborhood Representation from Multi-Modal Multi-Graph: Image, Text, Mobility Graph and Beyond by Tianyuan Huang et al
05-07-2021	Exploring Instance Relations for Unsupervised Feature Embedding by Yifei Zhang et al
05-07-2021	Foreground-guided Facial Inpainting with Fidelity Preservation by Jireh Jam et al
05-06-2021	Deep Learning based Multi-modal Computing with Feature Disentanglement for MRI Image Synthesis by Yuchen Fei et al
05-04-2021	Soft-Attention Improves Skin Cancer Classification Performance by Soumyya Kanti Datta et al
05-06-2021	Local Relation Learning for Face Forgery Detection by Shen Chen et al
05-05-2021	MOS: Towards Scaling Out-of-distribution Detection for Large Semantic Space by Rui Huang et al
05-06-2021	A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking by Zhenbang Li et al
05-06-2021	Real-Time Video Super-Resolution by Joint Local Inference and Global Parameter Estimation by Noam Elron et al
05-04-2021	Height Estimation of Children under Five Years using Depth Images by Anusua Trivedi et al
05-06-2021	Object-centric Video Prediction without Annotation by Karl Schmeckpeper et al
05-04-2021	DeepRT: A Soft Real Time Scheduler for Computer Vision Applications on the Edge by Zhe Yang et al
05-06-2021	Relative stability toward diffeomorphisms in deep nets indicates performance by Leonardo Petrini et al
05-06-2021	Body Meshes as Points by Jianfeng Zhang et al
05-06-2021	Structured dataset documentation: a datasheet for CheXpert by Christian Garbin et al
05-06-2021	Multi-Perspective LSTM for Joint Visual Representation Learning by Alireza Sepas-Moghaddam et al
05-06-2021	Dynamic Defense Approach for Adversarial Robustness in Deep Neural Networks via Stochastic Ensemble Smoothed Model by Ruoxi Qin et al
05-05-2021	Weakly Supervised Pseudo-Label assisted Learning for ALS Point Cloud Semantic Segmentation by Puzuo Wang et al
05-04-2021	Orienting Point Clouds with Dipole Propagation by Gal Metzer et al
05-04-2021	Surveilling Surveillance: Estimating the Prevalence of Surveillance Cameras with Street View Data by Hao Sheng et al
05-05-2021	Magnifying Subtle Facial Motions for Effective 4D Expression Recognition by Qingkai Zhen et al
05-05-2021	Person Retrieval in Surveillance Using Textual Query: A Review by Hiren Galiyawala et al
05-05-2021	Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer by Wenqi Zhao et al
05-05-2021	Iterative Human and Automated Identification of Wildlife Images by Zhongqi Miao et al
05-05-2021	Deep Spherical Manifold Gaussian Kernel for Unsupervised Domain Adaptation by Youshan Zhang et al
05-06-2021	A 2.5D Vehicle Odometry Estimation for Vision Applications by Paul Moran et al
05-06-2021	SS-CADA: A Semi-Supervised Cross-Anatomy Domain Adaptation for Coronary Artery Segmentation by Jingyang Zhang et al
05-07-2021	Probabilistic Visual Place Recognition for Hierarchical Localization by Ming Xu et al
05-04-2021	GANs for Urban Design by Stanislava Fedorova
05-06-2021	SkyCam: A Dataset of Sky Images and their Irradiance values by Evangelos Ntavelis et al
05-06-2021	ACORN: Adaptive Coordinate Networks for Neural Scene Representation by Julien N. P. Martel et al
05-05-2021	Explainable Artificial Intelligence for Human Decision-Support System in Medical Domain by Samanta Knapič et al
05-06-2021	Development of a Fast and Robust Gaze Tracking System for Game Applications by Manh Duong Phung et al
05-06-2021	PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation by Kehong Gong et al
05-05-2021	Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks by Meng-Hao Guo et al
05-05-2021	Cuboids Revisited: Learning Robust 3D Shape Fitting to Single RGB Images by Florian Kluger et al
05-04-2021	Canonical Saliency Maps: Decoding Deep Face Models by Thrupthi Ann John et al
05-06-2021	(ASNA) An Attention-based Siamese-Difference Neural Network with Surrogate Ranking Loss function for Perceptual Image Quality Assessment by Seyed Mehdi Ayyoubzadeh et al
05-05-2021	DeepPlastic: A Novel Approach to Detecting Epipelagic Bound Plastic Using Deep Visual Models by Gautam Tata et al
05-04-2021	Real-time Deep Dynamic Characters by Marc Habermann et al
05-06-2021	Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark by Longyin Wen et al
05-06-2021	Efficient Masked Face Recognition Method during the COVID-19 Pandemic by Walid Hariri
05-07-2021	Adaptive Focus for Efficient Video Recognition by Yulin Wang et al
05-07-2021	MOTR: End-to-End Multiple-Object Tracking with TRansformer by Fangao Zeng et al
05-05-2021	RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition by Xiaohan Ding et al
05-05-2021	MCGNet: Partial Multi-view Few-shot Learning via Meta-alignment and Context Gated-aggregation by Yuan Zhou et al

Craig Smith, May 11, 2021