2021.1.4 Vision papers

12-31-2020	TransTrack: Multiple-Object Tracking with Transformer by Peize Sun et al
12-31-2020	NeuralMagicEye: Learning to See and Understand the Scene Behind an Autostereogram by Zhengxia Zou et al
12-31-2020	Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans by Sida Peng et al
12-30-2020	OSTeC: One-Shot Texture Completion by Baris Gecer et al
12-31-2020	Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers by Sixiao Zheng et al
12-29-2020	TrustMAE: A Noise-Resilient Defect Classification Framework using Memory-Augmented Auto-Encoders with Trust Regions by Daniel Stanley Tan et al
12-29-2020	Deep Hashing for Secure Multimodal Biometrics by Veeru Talreja et al
12-29-2020	Detecting Hate Speech in Multi-modal Memes by Abhishek Das et al
12-30-2020	3D Human motion anticipation and classification by Emad Barsoum et al
12-30-2020	SkiNet: A Deep Learning Solution for Skin Lesion Diagnosis with Uncertainty Estimation and Explainability by Rajeev Kumar Singh et al
12-30-2020	Accurate Word Representations with Universal Visual Guidance by Zhuosheng Zhang et al
12-30-2020	Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning by Tasfia Shermin et al
12-29-2020	Towards Unsupervised Deep Image Enhancement with Generative Adversarial Network by Zhangkai Ni et al
12-29-2020	AILearn: An Adaptive Incremental Learning Model for Spoof Fingerprint Detection by Shivang Agarwal et al
12-29-2020	Visual-Thermal Camera Dataset Release and Multi-Modal Alignment without Calibration Information by Frank Mascarich et al
12-29-2020	MS-GWNN:multi-scale graph wavelet neural network for breast cancer diagnosis by Mo Zhang et al
12-29-2020	DeepSphere: a graph-based spherical CNN by Michaël Defferrard et al
12-29-2020	Tips and Tricks for Webly-Supervised Fine-Grained Recognition: Learning from the WebFG 2020 Challenge by Xiu-Shen Wei et al
12-31-2020	Audio-Visual Floorplan Reconstruction by Senthil Purushwalkam et al
12-30-2020	Automatic Polyp Segmentation using U-Net-ResNet50 by Saruar Alam et al
12-29-2020	Graph-based non-linear least squares optimization for visual place recognition in changing environments by Stefan Schubert et al
12-29-2020	Object sorting using faster R-CNN by Pengchang Chen et al
12-30-2020	Provident Vehicle Detection at Night: The PVDN Dataset by Lars Ohnemus et al
12-30-2020	Temporally-Transferable Perturbations: Efficient, One-Shot Adversarial Attacks for Online Visual Object Trackers by Krishna Kanth Nakka et al
12-29-2020	Parzen Window Approximation on Riemannian Manifold by Abhishek et al
12-29-2020	Learning a Dynamic Map of Visual Appearance by Tawfiq Salem et al
12-30-2020	Some Algorithms on Exact, Approximate and Error-Tolerant Graph Matching by Shri Prakash Dwivedi
12-29-2020	Towards Reducing Severe Defocus Spread Effects for Multi-Focus Image Fusion via an Optimization Based Strategy by Shuang Xu et al
12-29-2020	Damaged Fingerprint Recognition by Convolutional Long Short-Term Memory Networks for Forensic Purposes by Jaouhar Fattahi et al
12-29-2020	The VIP Gallery for Video Processing Education by Todd Goodall et al
12-29-2020	Chasing the Tail in Monocular 3D Human Reconstruction with Prototype Memory by Yu Rong et al
12-30-2020	FREA-Unet: Frequency-aware U-net for Modality Transfer by Hajar Emami et al
12-29-2020	FPCC-Net: Fast Point Cloud Clustering for Instance Segmentation by Yajun Xu et al
12-30-2020	Exploring Large Context for Cerebral Aneurysm Segmentation by Jun Ma et al
12-30-2020	Model-Based Visual Planning with Self-Supervised Functional Distances by Stephen Tian et al
12-29-2020	COIN: Contrastive Identifier Network for Breast Mass Diagnosis in Mammography by Heyi Li et al
12-30-2020	Beating Attackers At Their Own Games: Adversarial Example Detection Using Adversarial Gradient Directions by Yuhang Wu et al
12-29-2020	Image-to-Image Retrieval by Learning Similarity between Scene Graphs by Sangwoong Yoon et al
12-31-2020	Incremental Embedding Learning via Zero-Shot Translation by Kun Wei et al
12-29-2020	NBNet: Noise Basis Learning for Image Denoising with Subspace Projection by Shen Cheng et al
12-30-2020	Unpaired Image Enhancement with Quality-Attention Generative Adversarial Network by Zhangkai Ni et al
12-30-2020	RTS3D: Real-time Stereo 3D Detection from 4D Feature-Consistency Embedding Space for Autonomous Driving by Peixuan Li et al
12-30-2020	Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation by Zhengxiong Luo et al
12-29-2020	2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition by Hengduo Li et al
12-30-2020	MM-FSOD: Meta and metric integrated few-shot object detection by Yuewen Li et al
12-31-2020	Language-Mediated, Object-Centric Representation Learning by Ruocheng Wang et al
12-30-2020	Active Annotation of Informative Overlapping Frames in Video Mosaicking Applications by Loic Peter et al
12-30-2020	DUT-LFSaliency: Versatile Dataset and Light Field-to-RGB Saliency Detection by Yongri Piao et al
12-29-2020	Semi-supervised Cardiac Image Segmentation via Label Propagation and Style Transfer by Yao Zhang et al
12-29-2020	SALA: Soft Assignment Local Aggregation for 3D Semantic Segmentation by Hani Itani et al
12-30-2020	MRI brain tumor segmentation and uncertainty estimation using 3D-UNet architectures by Laura Mora Ballestar et al
12-30-2020	SID: Incremental Learning for Anchor-Free Object Detection via Selective and Inter-Related Distillation by Can Peng et al
12-30-2020	Fast Hyperspectral Image Recovery via Non-iterative Fusion of Dual-Camera Compressive Hyperspectral Imaging by Wei He et al
12-31-2020	Text-Free Image-to-Speech Synthesis Using Learned Segmental Units by Wei-Ning Hsu et al
12-30-2020	DDANet: Dual Decoder Attention Network for Automatic Polyp Segmentation by Nikhil Kumar Tomar et al
12-30-2020	Medico Multimedia Task at MediaEval 2020: Automatic Polyp Segmentation by Debesh Jha et al
12-31-2020	iGOS++: Integrated Gradient Optimized Saliency by Bilateral Perturbations by Saeed Khorram et al
12-31-2020	Learned Multi-Resolution Variable-Rate Image Compression with Octave-based Residual Blocks by Mohammad Akbari et al
12-31-2020	A CNN Approach to Simultaneously Count Plants and Detect Plantation-Rows from UAV Imagery by Lucas Prado Osco et al
12-30-2020	New Bag of Deep Visual Words based features to classify chest x-ray images for COVID-19 diagnosis by Chiranjibi Sitaula et al
12-30-2020	Survey of the Detection and Classification of Pulmonary Lesions via CT and X-Ray by Yixuan Sun et al
12-31-2020	CorrNet3D: Unsupervised End-to-end Learning of Dense Correspondence for 3D Point Clouds by Yiming Zeng et al
12-31-2020	Exploiting Shared Knowledge from Non-COVID Lesions for Annotation-Efficient COVID-19 CT Lung Infection Segmentation by Yichi Zhang et al
12-31-2020	Estimating Uncertainty in Neural Networks for Cardiac MRI Segmentation: A Benchmark Study by Matthew Ng et al
12-31-2020	Overview of MediaEval 2020 Predicting Media Memorability Task: What Makes a Video Memorable? by Alba García Seco De Herrera et al
12-31-2020	Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection by Jiajun Deng et al
12-31-2020	Real-time Webcam Heart-Rate and Variability Estimation with Clean Ground Truth for Evaluation by Amogh Gudi et al
12-31-2020	CNN-based Single Image Crowd Counting: Network Design, Loss Function and Supervisory Signal by Haoyue Bai et al
12-31-2020	Unsupervised Monocular Depth Reconstruction of Non-Rigid Scenes by Ayça Takmaz et al
12-31-2020	Colonoscopy Polyp Detection: Domain Adaptation From Medical Report Images to Real-time Videos by Zhi-Qin Zhan et al
12-31-2020	Investigating Memorability of Dynamic Media by Phuc H. Le-Khac et al
12-31-2020	Leveraging Audio Gestalt to Predict Media Memorability by Lorin Sweeney et al
12-31-2020	Searching a Raw Video Database using Natural Language Queries by Sriram Krishna et al
12-31-2020	A Deep Retinal Image Quality Assessment Network with Salient Structure Priors by Ziwen Xu et al
12-30-2020	SharpGAN: Receptive Field Block Net for Dynamic Scene Deblurring by Hui Feng et al
12-29-2020	Advances in deep learning methods for pavement surface crack detection and identification with visible light visual images by Kailiang Lu
12-31-2020	Illumination Estimation Challenge: experience of past two years by Egor Ershov et al
12-31-2020	Patch-wise++ Perturbation for Adversarial Targeted Attacks by Lianli Gao et al
12-30-2020	Knowledge Distillation with Adaptive Asymmetric Label Sharpening for Semi-supervised Fracture Detection in Chest X-rays by Yirui Wang et al
12-29-2020	A Review of Machine Learning Techniques for Applied Eye Fundus and Tongue Digital Image Processing with Diabetes Management System by Wei Xiang Lim et al
12-30-2020	H2NF-Net for Brain Tumor Segmentation using Multimodal MR Imaging: 2nd Place Solution to BraTS Challenge 2020 Segmentation Task by Haozhe Jia et al

Craig SmithJanuary 4, 2021