2021.15.2 Vision papers

02-11-2021	High-Performance Large-Scale Image Recognition Without Normalization by Andrew Brock et al
02-11-2021	Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision by Chao Jia et al
02-10-2021	Training Vision Transformers for Image Retrieval by Alaaeldin El-Nouby et al
02-09-2021	Is Space-Time Attention All You Need for Video Understanding? by Gedas Bertasius et al
02-11-2021	A-NeRF: Surface-free Human 3D Pose Refinement via Neural Rendering by Shih-Yang Su et al
02-11-2021	Neural Re-rendering for Full-frame Video Stabilization by Yu-Lun Liu et al
02-11-2021	Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals by Wouter Van Gansbeke et al
02-11-2021	Shelf-Supervised Mesh Prediction in the Wild by Yufei Ye et al
02-11-2021	Deep Photo Scan: Semi-supervised learning for dealing with the real-world degradation in smartphone photo scanning by Man M. Ho et al
02-11-2021	Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling by Jie Lei et al
02-11-2021	The Barrier of meaning in archaeological data science by Luca Casini et al
02-11-2021	SWAGAN: A Style-based Wavelet-driven Generative Model by Rinon Gal et al
02-11-2021	K-Hairstyle: A Large-scale Korean hairstyle dataset for virtual hair editing and hairstyle classification by Taewoo Kim et al
02-11-2021	Neural BRDF Representation and Importance Sampling by Alejandro Sztrajman et al
02-10-2021	AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition by Yue Meng et al
02-11-2021	A Survey on Synchronous Augmented, Virtual and Mixed Reality Remote Collaboration Systems by Alexander Schäfer et al
02-11-2021	Sample Efficient Learning of Image-Based Diagnostic Classifiers Using Probabilistic Labels by Roberto Vega et al
02-11-2021	The Deepfake Detection Dilemma: A Multistakeholder Exploration of Adversarial Dynamics in Synthetic Media by Claire Leibowicz et al
02-10-2021	H3D: Benchmark on Semantic Segmentation of High-Resolution 3D Point Clouds and textured Meshes from UAV LiDAR and Multi-View-Stereo by Michael Kölle et al
02-09-2021	VINS: Visual Search for Mobile User Interface Design by Sara Bunian et al
02-09-2021	More Is More -- Narrowing the Generalization Gap by Adding Classification Heads by Roee Cates et al
02-11-2021	HyperPocket: Generative Point Cloud Completion by Przemysław Spurek et al
02-10-2021	Driving Style Representation in Convolutional Recurrent Neural Network Model of Driver Identification by Sobhan Moosavi et al
02-10-2021	Two Novel Performance Improvements for Evolving CNN Topologies by Yaron Strauch et al
02-10-2021	UAV Localization Using Autoencoded Satellite Images by Mollie Bianchi et al
02-09-2021	DetCo: Unsupervised Contrastive Learning for Object Detection by Enze Xie et al
02-11-2021	Searching for Pneumothorax in X-Ray Images Using Autoencoded Deep Features by Antonio Sze-To et al
02-11-2021	A fully automated method for 3D individual tooth identification and segmentation in dental CBCT by Tae Jun Jang et al
02-11-2021	Corner Cases for Visual Perception in Automated Driving: Some Guidance on Detection Approaches by Jasmin Breitenstein et al
02-11-2021	Adversarially robust deepfake media detection using fused convolutional neural network predictions by Sohail Ahmed Khan et al
02-10-2021	Hyperbolic Generative Adversarial Network by Diego Lazcano et al
02-09-2021	End-to-End Deep Learning of Lane Detection and Path Prediction for Real-Time Autonomous Driving by Der-Hau Lee et al
02-12-2021	Improving Object Detection in Art Images Using Only Style Transfer by David Kadish et al
02-11-2021	Adversarial Segmentation Loss for Sketch Colorization by Samet Hicsonmez et al
02-11-2021	ABOShips -- An Inshore and Offshore Maritime Vessel Detection Dataset with Precise Annotations by Bogdan Iancu et al
02-10-2021	ZeroScatter: Domain Transfer for Long Distance Imaging and Vision through Scattering Media by Zheng Shi et al
02-10-2021	Sparse-Push: Communication- & Energy-Efficient Decentralized Distributed Learning over Directed & Time-Varying Graphs with non-IID Datasets by Sai Aparna Aketi et al
02-11-2021	Explainability in CNN Models By Means of Z-Scores by David Malmgren-Hansen et al
02-10-2021	A Topological Approach for Motion Track Discrimination by Tegan Emerson et al
02-11-2021	Modeling 3D Surface Manifolds with a Locally Conditioned Atlas by Przemysław Spurek et al
02-10-2021	Frame Difference-Based Temporal Loss for Video Stylization by Jianjin Xu et al
02-12-2021	Efficient Conditional GAN Transfer with Knowledge Propagation across Classes by Mohamad Shahbazi et al
02-11-2021	L-SNet: from Region Localization to Scale Invariant Medical Image Segmentation by Jiahao Xie et al
02-10-2021	Classification of Long Noncoding RNA Elements Using Deep Convolutional Neural Networks and Siamese Networks by Brian McClannahan et al
02-09-2021	Telling the What while Pointing the Where: Fine-grained Mouse Trace and Language Supervision for Improved Image Retrieval by Soravit Changpinyo et al
02-09-2021	Policy Augmentation: An Exploration Strategy for Faster Convergence of Deep Reinforcement Learning Algorithms by Arash Mahyari
02-10-2021	Scale Normalized Image Pyramids with AutoFocus for Object Detection by Bharat Singh et al
02-10-2021	Audiovisual Highlight Detection in Videos by Karel Mundnich et al
02-09-2021	FLOP: Federated Learning on Medical Datasets using Partial Networks by Qian Yang et al
02-09-2021	Where is my hand? Deep hand segmentation for visual self-recognition in humanoid robots by Alexandre Almeida et al
02-09-2021	Generative Models as Distributions of Functions by Emilien Dupont et al
02-10-2021	Partial transfusion: on the expressive influence of trainable batch norm parameters for transfer learning by Fahdi Kanavati et al
02-10-2021	Enhancing Real-World Adversarial Patches with 3D Modeling Techniques by Yael Mathov et al
02-09-2021	Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning by Yu Liu et al
02-09-2021	Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding by Marc Górriz et al
02-12-2021	Semantically-Conditioned Negative Samples for Efficient Contrastive Learning by James O' Neill et al
02-12-2021	End-to-end Audio-visual Speech Recognition with Conformers by Pingchuan Ma et al
02-09-2021	CorrDetector: A Framework for Structural Corrosion Detection from Drone Images using Ensemble Deep Learning by Abdur Rahim Mohammad Forkan et al
02-09-2021	Dynamic Neural Networks: A Survey by Yizeng Han et al
02-09-2021	Facial Expression Recognition on a Quantum Computer by Riccardo Mengoni et al
02-09-2021	Improving Visual Reasoning by Exploiting The Knowledge in Texts by Sahand Sharifzadeh et al
02-10-2021	Reference-based Texture transfer for Single Image Super-resolution of Magnetic Resonance images by Madhu Mithra K K et al
02-10-2021	Application of Yolo on Mask Detection Task by Ren Liu et al
02-09-2021	Input Similarity from the Neural Network Perspective by Guillaume Charpiat et al
02-09-2021	An underwater binocular stereo matching algorithm based on the best search domain by Yimin Peng et al
02-10-2021	Learning to Enhance Visual Quality via Hyperspectral Domain Mapping by Harsh Sinha et al
02-10-2021	RoBIC: A benchmark suite for assessing classifiers robustness by Thibault Maho et al
02-09-2021	Distribution Adaptive INT8 Quantization for Training CNNs by Kang Zhao et al
02-10-2021	Dysplasia grading of colorectal polyps through CNN analysis of WSI by Daniele Perlo et al
02-09-2021	Sequential vessel segmentation via deep channel attention network by Dongdong Hao et al
02-10-2021	Doctor Imitator: A Graph-based Bone Age Assessment Framework Using Hand Radiographs by Jintai Chen et al
02-12-2021	Depthwise Separable Convolutions Allow for Fast and Memory-Efficient Spectral Normalization by Christina Runkel et al
02-09-2021	Deep learning architectural designs for super-resolution of noisy images by Angel Villar-Corrales et al
02-12-2021	Min-Max-Plus Neural Networks by Ye Luo et al
02-09-2021	SG2Caps: Revisiting Scene Graphs for Image Captioning by Subarna Tripathi et al
02-09-2021	An application of a pseudo-parabolic modeling to texture image recognition by Joao B. Florindo et al
02-10-2021	Automated Video Labelling: Identifying Faces by Corroborative Evidence by Andrew Brown et al
02-09-2021	Visual Search at Alibaba by Yanhao Zhang et al
02-12-2021	A Too-Good-to-be-True Prior to Reduce Shortcut Reliance by Nikolay Dagaev et al
02-12-2021	Universal Adversarial Perturbations Through the Lens of Deep Steganography: Towards A Fourier Perspective by Chaoning Zhang et al
02-09-2021	Robust Motion In-betweening by Félix G. Harvey et al
02-10-2021	Robustness in Compressed Neural Networks for Object Detection by Sebastian Cygert et al
02-10-2021	Enhancing efficiency of object recognition in different categorization levels by reinforcement learning in modular spiking neural networks by Fatemeh Sharifizadeh et al
02-12-2021	Confounding Tradeoffs for Neural Network Quantization by Sahaj Garg et al
02-09-2021	Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search by Peidong Liu et al
02-09-2021	Driver2vec: Driver Identification from Automotive Data by Jingbo Yang et al
02-10-2021	Searching for Alignment in Face Recognition by Xiaqing Xu et al
02-09-2021	Negative Data Augmentation by Abhishek Sinha et al
02-09-2021	Polarimetric Monocular Dense Mapping Using Relative Deep Depth Prior by Moein Shakeri et al
02-09-2021	Whats in the box?!: Deflecting Adversarial Attacks by Randomly Deploying Adversarially-Disjoint Models by Sahar Abdelnabi et al
02-10-2021	A Generic Object Re-identification System for Short Videos by Tairu Qiu et al
02-09-2021	Mars Image Content Classification: Three Years of NASA Deployment and Recent Advances by Kiri Wagstaff et al
02-12-2021	Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning by Yifan Zhang et al
02-09-2021	Residue Density Segmentation for Monitoring and Optimizing Tillage Practices by Jennifer Hobbs et al
02-09-2021	A Real-World Demonstration of Machine Learning Generalizability: Intracranial Hemorrhage Detection on Head CT by Hojjat Salehinejad et al
02-09-2021	Classification of Handwritten Names of Cities and Handwritten Text Recognition using Various Deep Learning Models by Daniyar Nurseitov et al
02-09-2021	Learning Multi-Modal Volumetric Prostate Registration with Weak Inter-Subject Spatial Correspondence by Oleksii Bashkanov et al
02-09-2021	Large-Scale Visual Search with Binary Distributed Graph at Alibaba by Kang Zhao et al
02-09-2021	Fashion Focus: Multi-modal Retrieval System for Video Commodity Localization in E-commerce by Yanhao Zhang et al
02-09-2021	Learning Unsupervised Cross-domain Image-to-Image Translation Using a Shared Discriminator by Rajiv Kumar et al
02-10-2021	Searching for Fast Model Families on Datacenter Accelerators by Sheng Li et al
02-12-2021	Rethinking Eye-blink: Assessing Task Difficulty through Physiological Representation of Spontaneous Blinking by Youngjun Cho
02-12-2021	Bayesian Uncertainty Estimation of Learned Variational MRI Reconstruction by Dominik Narnhofer et al
02-12-2021	Annotation Cleaning for the MSR-Video to Text Dataset by Haoran Chen et al
02-09-2021	Locally Free Weight Sharing for Network Width Search by Xiu Su et al
02-10-2021	BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction by Yuhang Li et al
02-10-2021	Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition by Benjia Zhou et al
02-09-2021	Ensembling object detectors for image and video data analysis by Kateryna Chumachenko et al
02-09-2021	DARE-SLAM: Degeneracy-Aware and Resilient Loop Closing in Perceptually-Degraded Environments by Kamak Ebadi et al
02-12-2021	A Generative Model for Hallucinating Diverse Versions of Super Resolution Images by Mohamed Abderrahmen Abid et al
02-09-2021	D2A U-Net: Automatic Segmentation of COVID-19 Lesions from CT Slices with Dilated Convolution and Dual Attention Mechanism by Xiangyu Zhao et al
02-12-2021	Outdoor inverse rendering from a single image using multiview self-supervision by Ye Yu et al
02-10-2021	Improving Aerial Instance Segmentation in the Dark with Self-Supervised Low Light Enhancement by Prateek Garg et al
02-11-2021	COVID-19 detection from scarce chest x-ray image data using deep learning by Shruti Jadon
02-09-2021	Detecting Localized Adversarial Examples: A Generic Approach using Critical Region Analysis by Fengting Li et al
02-12-2021	Multi-source Pseudo-label Learning of Semantic Segmentation for the Scene Recognition of Agricultural Mobile Robots by Shigemichi Matsuzaki et al
02-12-2021	Robust White Matter Hyperintensity Segmentation on Unseen Domain by Xingchen Zhao et al
02-09-2021	Large Scale Long-tailed Product Recognition System at Alibaba by Xiangzeng Zhou et al
02-12-2021	Predicting and Attending to Damaging Collisions for Placing Everyday Objects in Photo-Realistic Simulations by Aly Magassouba et al
02-10-2021	Exploiting Depth Information for Wildlife Monitoring by Timm Haucke et al
02-12-2021	Adversarial Branch Architecture Search for Unsupervised Domain Adaptation by Luca Robbiano et al
02-12-2021	A Parameterised Quantum Circuit Approach to Point Set Matching by Mohammadreza Noormandipour et al
02-12-2021	Uncertainty-Aware Semi-supervised Method using Large Unlabelled and Limited Labeled COVID-19 Data by Roohallah Alizadehsani et al
02-12-2021	Destination similarity based on implicit user interest by Hongliu Cao et al
02-09-2021	RODNet: A Real-Time Radar Object Detection Network Cross-Supervised by Camera-Radar Fused Object 3D Localization by Yizhou Wang et al
02-09-2021	Referring Segmentation in Images and Videos with Cross-Modal Self-Attention Network by Linwei Ye et al
02-09-2021	Diverse Single Image Generation with Controllable Global Structure though Self-Attention by Sutharsan Mahendren et al
02-09-2021	Deep Multilabel CNN for Forensic Footwear Impression Descriptor Identification by Marcin Budka et al
02-09-2021	The Role of the Input in Natural Language Video Description by Silvia Cascianelli et al
02-09-2021	LIFT-CAM: Towards Better Explanations for Class Activation Mapping by Hyungsik Jung et al
02-09-2021	Culture-inspired Multi-modal Color Palette Generation and Colorization: A Chinese Youth Subculture Case by Yufan Li et al
02-11-2021	COVID-19 identification from volumetric chest CT scans using a progressively resized 3D-CNN incorporating segmentation, augmentation, and class-rebalancing by Md. Kamrul Hasan et al
02-12-2021	Reviving Iterative Training with Mask Guidance for Interactive Segmentation by Konstantin Sofiiuk et al
02-12-2021	Densely Deformable Efficient Salient Object Detection Network by Tanveer Hussain et al
02-09-2021	How Unique Is a Face: An Investigative Study by Michal Balazia et al
02-12-2021	Analysis of Interpolation based Image In-painting Approaches by Mustafa Zor et al
02-09-2021	On the Robustness of Multi-View Rotation Averaging by Xinyi Li et al
02-09-2021	Transfer learning based few-shot classification using optimal transport mapping from preprocessed latent space of backbone neural network by Tomáš Chobola et al
02-09-2021	Virtual ID Discovery from E-commerce Media at Alibaba: Exploiting Richness of User Click Behavior for Visual Search Relevance by Yanhao Zhang et al
02-11-2021	Mediastinal lymph nodes segmentation using 3D convolutional neural network ensembles and anatomical priors guiding by David Bouget et al
02-11-2021	ReRankMatch: Semi-Supervised Learning with Semantics-Oriented Similarity Representation by Trung Quang Tran et al
02-11-2021	What does LIME really see in images? by Damien Garreau et al
02-11-2021	Segmentation-Renormalized Deep Feature Modulation for Unpaired Image Harmonization by Mengwei Ren et al
02-11-2021	Learning Depth via Leveraging Semantics: Self-supervised Monocular Depth Estimation with Both Implicit and Explicit Semantic Guidance by Rui Li et al
02-11-2021	Towards DeepSentinel: An extensible corpus of labelled Sentinel-1 and -2 imagery and a general-purpose sensor-fusion semantic embedding model by Lucas Kruitwagen

Craig SmithFebruary 15, 2021