2021.11.22 Vision papers

11-18-2021	TransMix: Attend to Mix for Vision Transformers by Jie-Neng Chen et al
11-19-2021	FastDOG: Fast Discrete Optimization on GPU by Ahmed Abbas et al
11-19-2021	Enhanced countering adversarial attacks via input denoising and feature restoring by Yanni Li et al
11-19-2021	Evaluating Self and Semi-Supervised Methods for Remote Sensing Segmentation Tasks by Chaitanya Patel et al
11-19-2021	A 3D 2D convolutional Neural Network Model for Hyperspectral Image Classification by Jiaxin Cao et al
11-19-2021	Probabilistic Regression with Huber Distributions by David Mohlin et al
11-17-2021	Lidar with Velocity: Motion Distortion Correction of Point Clouds from Oscillating Scanning Lidars by Wen Yang et al
11-17-2021	Reference-based Magnetic Resonance Image Reconstruction Using Texture Transforme by Pengfei Guo et al
11-17-2021	Quality Measures in Biometric Systems by Fernando Alonso-Fernandez et al
11-16-2021	Automated Atlas-based Segmentation of Single Coronal Mouse Brain Slices using Linear 2D-2D Registration by Sébastien Piluso et al
11-17-2021	The Multiscenario Multienvironment BioSecure Multimodal Database (BMDB) by Javier Ortega-Garcia et al
11-17-2021	Benchmarking Quality-Dependent and Cost-Sensitive Score-Level Multimodal Biometric Fusion Algorithms by Norman Poh et al
11-16-2021	Exploring dual-attention mechanism with multi-scale feature extraction scheme for skin lesion segmentation by G Jignesh Chowdary et al
11-17-2021	Dynamically pruning segformer for efficient semantic segmentation by Haoli Bai et al
11-17-2021	Low Precision Decentralized Distributed Training with Heterogeneous Data by Sai Aparna Aketi et al
11-17-2021	Learning to Align Sequential Actions in the Wild by Weizhe Liu et al
11-17-2021	Segmentation of Lung Tumor from CT Images using Deep Supervision by Farhanaz Farheen et al
11-16-2021	Fight Detection from Still Images in the Wild by Şeymanur Aktı et al
11-17-2021	DiverGAN: An Efficient and Effective Single-Stage Framework for Diverse Text-to-Image Generation by Zhenxing Zhang et al
11-17-2021	TraSw: Tracklet-Switch Adversarial Attacks against Multi-Object Tracking by Delv Lin et al
11-17-2021	Discriminative Dictionary Learning based on Statistical Methods by G. Madhuri et al
11-16-2021	2.5D Vehicle Odometry Estimation by Ciaran Eising et al
11-19-2021	Deep Domain Adaptation for Pavement Crack Detection by Huijun Liu et al
11-18-2021	Edge-preserving Domain Adaptation for semantic segmentation of Medical Images by Thong Vo et al
11-17-2021	Augmentation of base classifier performance via HMMs on a handwritten character data set by Hélder Campos et al
11-16-2021	GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving by Raphael Chekroun et al
11-16-2021	Tracking Blobs in the Turbulent Edge Plasma of Tokamak Fusion Reactors by Woonghee Han et al
11-17-2021	Towards Open Vocabulary Object Detection without Human-provided Bounding Boxes by Mingfei Gao et al
11-19-2021	Positional Encoder Graph Neural Networks for Geographic Data by Konstantin Klemmer et al
11-16-2021	Online Meta Adaptation for Variable-Rate Learned Image Compression by Wei Jiang et al
11-18-2021	Evaluating Transformers for Lightweight Action Recognition by Raivo Koot et al
11-18-2021	IMFNet: Interpretable Multimodal Fusion for Point Cloud Registration by Xiaoshui Huang et al
11-16-2021	Image-specific Convolutional Kernel Modulation for Single Image Super-resolution by Yuanfei Huang et al
11-16-2021	Two-step adversarial debiasing with partial learning -- medical image case-studies by Ramon Correa et al
11-16-2021	Automatic Semantic Segmentation of the Lumbar Spine. Clinical Applicability in a Multi-parametric and Multi-centre MRI study by Jhon Jairo Saenz-Gamboa et al
11-18-2021	Deep neural networks-based denoising models for CT imaging and their efficacy by Prabhat KC et al
11-17-2021	See Eye to Eye: A Lidar-Agnostic 3D Detection Framework for Unsupervised Multi-Target Domain Adaptation by Darren Tsai et al
11-17-2021	Protection of SVM Model with Secret Key from Unauthorized Access by Ryota Iijima et al
11-17-2021	Using Convolutional Neural Networks to Detect Compression Algorithms by Shubham Bharadwaj
11-19-2021	Meta Adversarial Perturbations by Chia-Hung Yuan et al
11-19-2021	More than Words: In-the-Wild Visually-Driven Prosody for Text-to-Speech by Michael Hassid et al
11-19-2021	Medical Visual Question Answering: A Survey by Zhihong Lin et al
11-16-2021	Choose Settings Carefully: Comparing Action Unit detection at Different Settings Using a Large-Scale Dataset by Mina Bishay et al
11-17-2021	Developing a Machine Learning Algorithm-Based Classification Models for the Detection of High-Energy Gamma Particles by Emmanuel Dadzie et al
11-18-2021	A Trainable Spectral-Spatial Sparse Coding Model for Hyperspectral Image Restoration by Théo Bodrito et al
11-16-2021	Pose Recognition in the Wild: Animal pose estimation using Agglomerative Clustering and Contrastive Learning by Samayan Bhattacharya et al
11-16-2021	SequentialPointNet: A strong parallelized point cloud sequence network for 3D action recognition by Xing Li et al
11-18-2021	Neural Network Kalman filtering for 3D object tracking from linear array ultrasound data by Arttu Arjas et al
11-18-2021	Recurrent Variational Network: A Deep Learning Inverse Problem Solver applied to the task of Accelerated MRI Reconstruction by George Yiasemis et al
11-18-2021	Learning Modified Indicator Functions for Surface Reconstruction by Dong Xiao et al
11-17-2021	Efficient deep learning models for land cover image classification by Ioannis Papoutsis et al
11-16-2021	Deep Neural Networks for Rank-Consistent Ordinal Regression Based On Conditional Probabilities by Xintong Shi et al
11-18-2021	Restormer: Efficient Transformer for High-Resolution Image Restoration by Syed Waqas Zamir et al
11-18-2021	Robust Person Re-identification with Multi-Modal Joint Defence by Yunpeng Gong et al
11-19-2021	Neural Image Beauty Predictor Based on Bradley-Terry Model by Shiyu Li et al
11-18-2021	UFO: A UniFied TransfOrmer for Vision-Language Representation Learning by Jianfeng Wang et al
11-16-2021	Rethinking Keypoint Representations: Modeling Keypoints and Poses as Objects for Multi-Person Human Pose Estimation by William McNally et al
11-19-2021	DSPoint: Dual-scale Point Cloud Recognition with High-frequency Fusion by Renrui Zhang et al
11-19-2021	Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions by Hongwei Xue et al
11-19-2021	Bi-Mix: Bidirectional Mixing for Domain Adaptive Nighttime Semantic Segmentation by Guanglei Yang et al
11-18-2021	ClipCap: CLIP Prefix for Image Captioning by Ron Mokady et al
11-16-2021	Consistent Semantic Attacks on Optical Flow by Tom Koren et al
11-16-2021	Synthesis-Guided Feature Learning for Cross-Spectral Periocular Recognition by Domenick Poster et al
11-17-2021	RAANet: Range-Aware Attention Network for LiDAR-based 3D Object Detection with Auxiliary Density Level Estimation by Yantao Lu et al
11-16-2021	Keypoint Message Passing for Video-based Person Re-Identification by Di Chen et al
11-16-2021	A Benchmark for Modeling Violation-of-Expectation in Physical Reasoning Across Event Categories by Arijit Dasgupta et al
11-17-2021	Large-scale Building Height Retrieval from Single SAR Imagery based on Bounding Box Regression Networks by Yao Sun et al
11-17-2021	Self-Attending Task Generative Adversarial Network for Realistic Satellite Image Creation by Nathan Toner et al
11-17-2021	Temporally Consistent Online Depth Estimation in Dynamic Scenes by Zhaoshuo Li et al
11-17-2021	Learning to Compose Visual Relations by Nan Liu et al
11-16-2021	HARA: A Hierarchical Approach for Robust Rotation Averaging by Seong Hun Lee et al
11-19-2021	Global and Local Alignment Networks for Unpaired Image-to-Image Translation by Guanglei Yang et al
11-17-2021	Self-Supervised Predictive Convolutional Attentive Block for Anomaly Detection by Nicolae-Catalin Ristea et al
11-18-2021	Meta Clustering Learning for Large-scale Unsupervised Person Re-identification by Xin Jin et al
11-18-2021	DeMFI: Deep Joint Deblurring and Multi-Frame Interpolation with Flow-Guided Attentive Correlation and Recursive Boosting by Jihyong Oh et al
11-19-2021	Toward Compact Parameter Representations for Architecture-Agnostic Neural Network Compression by Yuezhou Sun et al
11-19-2021	Factorisation-based Image Labelling by Yu Yan et al
11-17-2021	Improving Person Re-Identification with Temporal Constraints by Julia Dietlmeier et al
11-18-2021	LOLNeRF: Learn from One Look by Daniel Rebain et al
11-18-2021	Improving Transferability of Representations via Augmentation-Aware Self-Supervision by Hankook Lee et al
11-18-2021	TnT Attacks! Universal Naturalistic Adversarial Patches Against Deep Neural Network Systems by Bao Gia Doan et al
11-17-2021	Motion Detection using CSI from Raspberry Pi 4 by Glenn Forbes et al
11-19-2021	DVCFlow: Modeling Information Flow Towards Human-like Video Captioning by Xu Yan et al
11-17-2021	DeepCurrents: Learning Implicit Representations of Shapes with Boundaries by David Palmer et al
11-17-2021	EMScore: Evaluating Video Captioning via Coarse-Grained and Fine-Grained Embedding Matching by Yaya Shi et al
11-17-2021	Local Texture Estimator for Implicit Representation Function by Jaewon Lee et al
11-17-2021	Long-Tailed Multi-Label Retinal Diseases Recognition Using Hierarchical Information and Hybrid Knowledge Distillation by Lie Ju et al
11-18-2021	FBNetV5: Neural Architecture Search for Multiple Tasks in One Run by Bichen Wu et al
11-18-2021	Perceiving and Modeling Density is All You Need for Image Dehazing by Tian Ye et al
11-18-2021	Swin Transformer V2: Scaling Up Capacity and Resolution by Ze Liu et al
11-18-2021	SimMIM: A Simple Framework for Masked Image Modeling by Zhenda Xie et al
11-18-2021	PyTorchVideo: A Deep Learning Library for Video Understanding by Haoqi Fan et al
11-18-2021	Simple but Effective: CLIP Embeddings for Embodied AI by Apoorv Khandelwal et al
11-16-2021	Learning Intrinsic Images for Clothing by Kuo Jiang et al
11-16-2021	Robustness of Bayesian Neural Networks to White-Box Adversarial Attacks by Adaku Uchendu et al
11-17-2021	IntraQ: Learning Synthetic Images with Intra-Class Heterogeneity for Zero-Shot Network Quantization by Yunshan Zhong et al
11-17-2021	Rethinking Drone-Based Search and Rescue with Aerial Person Detection by Pasi Pyrrö et al
11-17-2021	Fine-Grained Vehicle Classification in Urban Traffic Scenes using Deep Learning by Syeda Aneeba Najeeb et al
11-16-2021	Bengali Handwritten Grapheme Classification: Deep Learning Approach by Tarun Roy et al
11-18-2021	CoCAtt: A Cognitive-Conditioned Driver Attention Dataset by Yuan Shen et al
11-17-2021	STEEX: Steering Counterfactual Explanations with Semantics by Paul Jacob et al
11-19-2021	Instance-Adaptive Video Compression: Improving Neural Codecs by Training on the Test Set by Ties van Rozendaal et al
11-16-2021	Multi-Grained Vision Language Pre-Training: Aligning Texts with Visual Concepts by Yan Zeng et al
11-17-2021	Do Not Trust Prediction Scores for Membership Inference Attacks by Dominik Hintersdorf et al
11-17-2021	Facial Information Analysis Technology for Gender and Age Estimation by Gilheum Park et al
11-16-2021	A Latent Encoder Coupled Generative Adversarial Network (LE-GAN) for Efficient Hyperspectral Image Super-resolution by Yue Shi et al
11-16-2021	Advancement of Deep Learning in Pneumonia and Covid-19 Classification and Localization: A Qualitative and Quantitative Analysis by Aakash Shah et al
11-16-2021	CAR -- Cityscapes Attributes Recognition A Multi-category Attributes Dataset for Autonomous Vehicles by Kareem Metwaly et al
11-18-2021	Towards Intelligibility-Oriented Audio-Visual Speech Enhancement by Tassadaq Hussain et al
11-16-2021	Detecting AutoAttack Perturbations in the Frequency Domain by Peter Lorenz et al
11-18-2021	Boosting Supervised Learning Performance with Co-training by Xinnan Du et al
11-18-2021	Unsupervised Online Learning for Robotic Interestingness with Visual Memory by Chen Wang et al
11-18-2021	LiDAR Cluster First and Camera Inference Later: A New Perspective Towards Autonomous Driving by Jiyang Chen et al
11-18-2021	SimpleTrack: Understanding and Rethinking 3D Multi-object Tracking by Ziqi Pang et al
11-16-2021	Data Augmentation using Random Image Cropping for High-resolution Virtual Try-On (VITON-CROP) by Taewon Kang et al
11-18-2021	Rethinking Query, Key, and Value Embedding in Vision Transformer under Tiny Model Constraints by Jaesin Ahn et al
11-16-2021	TorchGeo: deep learning with geospatial data by Adam J. Stewart et al
11-16-2021	SAPNet: Segmentation-Aware Progressive Network for Perceptual Contrastive Deraining by Shen Zheng et al
11-16-2021	Achieving Human Parity on Visual Question Answering by Ming Yan et al
11-16-2021	ARKitScenes -- A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data by Gilad Baruch et al
11-16-2021	DRINet++: Efficient Voxel-as-point Point Cloud Segmentation by Maosheng Ye et al
11-16-2021	Pansharpening by convolutional neural networks in the full resolution framework by Matteo Ciotola et al
11-17-2021	Trustworthy Long-Tailed Classification by Bolian Li et al
11-16-2021	Diversified Multi-prototype Representation for Semi-supervised Segmentation by Jizong Peng et al
11-17-2021	Blind VQA on 360{\deg} Video via Progressively Learning from Pixels, Frames and Video by Li Yang et al
11-16-2021	NENet: Monocular Depth Estimation via Neural Ensembles by Shuwei Shao et al
11-17-2021	SeCGAN: Parallel Conditional Generative Adversarial Networks for Face Editing via Semantic Consistency by Jiaze Sun et al
11-16-2021	Enhanced Correlation Matching based Video Frame Interpolation by Sungho Lee et al
11-18-2021	Rethink Dilated Convolution for Real-time Semantic Segmentation by Roland Gao
11-17-2021	Image Super-Resolution Using T-Tetromino Pixels by Simon Grosche et al
11-17-2021	Two-Face: Adversarial Audit of Commercial Face Recognition Systems by Siddharth D Jaiswal et al
11-19-2021	Semi-Supervised Domain Generalization in Real World:New Benchmark and Strong Baseline by Luojun Lin et al
11-16-2021	DeltaConv: Anisotropic Point Cloud Learning with Exterior Calculus by Ruben Wiersma et al
11-16-2021	UBnormal: New Benchmark for Supervised Open-Set Video Anomaly Detection by Andra Acsintoae et al
11-16-2021	CNN Filter Learning from Drawn Markers for the Detection of Suggestive Signs of COVID-19 in CT Images by Azael M. Sousa et al
11-16-2021	Enabling equivariance for arbitrary Lie groups by Lachlan E. MacDonald et al
11-18-2021	COVID-19 Detection on Chest X-Ray Images: A comparison of CNN architectures and ensembles by Fabricio Breve
11-16-2021	Which CNNs and Training Settings to Choose for Action Unit Detection? A Study Based on a Large-Scale Dataset by Mina Bishay et al
11-18-2021	Wiggling Weights to Improve the Robustness of Classifiers by Sadaf Gulshad et al
11-16-2021	Identifying the Factors that Influence Urban Public Transit Demand by Armstrong Aboah et al
11-17-2021	Induce, Edit, Retrieve:Language Grounded Multimodal Schema for Instructional Video Retrieval by Yue Yang et al
11-16-2021	Delta-GAN-Encoder: Encoding Semantic Changes for Explicit Image Editing, using Few Synthetic Samples by Nir Diamant et al
11-16-2021	Improved Robustness of Vision Transformer via PreLayerNorm in Patch Embedding by Bum Jun Kim et al
11-19-2021	Xp-GAN: Unsupervised Multi-object Controllable Video Generation by Bahman Rouhani et al
11-16-2021	TYolov5: A Temporal Yolov5 Detector Based on Quasi-Recurrent Neural Networks for Real-Time Handgun Detection in Video by Mario Alberto Duran-Vega et al
11-16-2021	TRIG: Transformer-Based Text Recognizer with Initial Embedding Guidance by Yue Tao et al
11-16-2021	Grounding Psychological Shape Space in Convolutional Neural Networks by Lucas Bechberger et al
11-18-2021	SUB-Depth: Self-distillation and Uncertainty Boosting Self-supervised Monocular Depth Estimation by Hang Zhou et al
11-17-2021	Tiny Obstacle Discovery by Occlusion-Aware Multilayer Regression by Feng Xue et al
11-17-2021	Cryo-shift: Reducing domain shift in cryo-electron subtomograms with unsupervised domain adaptation and randomization by Hmrishav Bandyopadhyay et al
11-18-2021	One-Shot Generative Domain Adaptation by Ceyuan Yang et al
11-18-2021	M2A: Motion Aware Attention for Accurate Video Action Recognition by Brennan Gebotys et al
11-18-2021	Exploring the Limits of Epistemic Uncertainty Quantification in Low-Shot Settings by Matias Valdenegro-Toro
11-16-2021	Weakly-supervised fire segmentation by visualizing intermediate CNN layers by Milad Niknejad et al
11-19-2021	Ubi-SleepNet: Advanced Multimodal Fusion Techniques for Three-stage Sleep Classification Using Ubiquitous Sensing by Bing Zhai et al
11-16-2021	Language bias in Visual Question Answering: A Survey and Taxonomy by Desen Yuan
11-16-2021	Computer Vision for Supporting Image Search by Alan F. Smeaton
11-16-2021	A Data-Driven Approach for Linear and Nonlinear Damage Detection Using Variational Mode Decomposition and GARCH Model by Vahid Reza Gharehbaghi et al
11-16-2021	INTERN: A New Learning Paradigm Towards General Vision by Jing Shao et al
11-17-2021	MPF6D: Masked Pyramid Fusion 6D Pose Estimation by Nuno Pereira et al
11-19-2021	Learning to Detect Instance-level Salient Objects Using Complementary Image Labels by Xin Tian et al
11-17-2021	Single-pass Object-adaptive Data Undersampling and Reconstruction for MRI by Zhishen Huang et al
11-19-2021	Fooling Adversarial Training with Inducing Noise by Zhirui Wang et al
11-17-2021	Fast and Light-Weight Network for Single Frame Structured Illumination Microscopy Super-Resolution by Xi Cheng et al
11-17-2021	Compositional Transformers for Scene Generation by Drew A. Hudson et al
11-18-2021	Successor Feature Landmarks for Long-Horizon Goal-Conditioned Reinforcement Learning by Christopher Hoang et al
11-16-2021	Code-free development and deployment of deep segmentation models for digital pathology by Henrik Sahlin Pettersen et al
11-16-2021	Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion by Anirud Thyagharajan et al
11-19-2021	An Analysis of the Influence of Transfer Learning When Measuring the Tortuosity of Blood Vessels by Matheus V. da Silva et al
11-19-2021	Panoptic Segmentation: A Review by Omar Elharrouss et al
11-19-2021	Combined Scaling for Zero-shot Transfer Learning by Hieu Pham et al
11-18-2021	Interactive segmentation using U-Net with weight map and dynamic user interactions by Ragavie Pirabaharan et al
11-18-2021	The Way to my Heart is through Contrastive Learning: Remote Photoplethysmography from Unlabelled Video by John Gideon et al
11-16-2021	IKEA Object State Dataset: A 6DoF object pose estimation dataset and benchmark for multi-state assembly objects by Yongzhi Su et al
11-16-2021	Point detection through multi-instance deep heatmap regression for sutures in endoscopy by Lalith Sharan et al
11-18-2021	Adaptive Shrink-Mask for Text Detection by Chuang Yang et al
11-17-2021	End-to-end optimized image compression with competition of prior distributions by Benoit Brummer et al
11-17-2021	Automated Approach for Computer Vision-based Vehicle Movement Classification at Traffic Intersections by Udita Jana et al
11-17-2021	Nonlinear Intensity Sonar Image Matching based on Deep Convolution Features by Xiaoteng Zhou et al
11-16-2021	Self-supervised High-fidelity and Re-renderable 3D Facial Reconstruction from a Single Image by Mingxin Yang et al
11-17-2021	Generating Unrestricted 3D Adversarial Point Clouds by Xuelong Dai et al
11-17-2021	Pedestrian Detection by Exemplar-Guided Contrastive Learning by Zebin Lin et al
11-17-2021	Probabilistic Spatial Distribution Prior Based Attentional Keypoints Matching Network by Xiaoming Zhao et al
11-16-2021	An Overview of Backdoor Attacks Against Deep Neural Networks and Possible Defences by Wei Guo et al
11-19-2021	ClevrTex: A Texture-Rich Benchmark for Unsupervised Multi-Object Segmentation by Laurynas Karazija et al
11-18-2021	Correcting Face Distortion in Wide-Angle Videos by Wei-Sheng Lai et al
11-16-2021	Single Image Object Counting and Localizing using Active-Learning by Inbar Huberman-Spiegelglas et al
11-17-2021	3D Lip Event Detection via Interframe Motion Divergence at Multiple Temporal Resolutions by Jie Zhang et al
11-16-2021	Film Trailer Generation via Task Decomposition by Pinelopi Papalampidi et al
11-19-2021	Grounded Situation Recognition with Transformers by Junhyeong Cho et al
11-17-2021	Its About Time: Analog Clock Reading in the Wild by Charig Yang et al
11-16-2021	SEnSeI: A Deep Learning Module for Creating Sensor Independent Cloud Masks by Alistair Francis et al
11-17-2021	Transparent Human Evaluation for Image Captioning by Jungo Kasai et al
11-18-2021	Automatic Neural Network Pruning that Efficiently Preserves the Model Accuracy by Thibault Castells et al
11-16-2021	Learning Scene Dynamics from Point Cloud Sequences by Pan He et al

Craig SmithNovember 22, 2021