2021.15.2 Vision papers

 

02-11-2021

High-Performance Large-Scale Image Recognition Without Normalization
by Andrew Brock et al

02-11-2021

Scaling Up Visual and Vision-Language Representation Learning With Noisy Text Supervision
by Chao Jia et al

02-10-2021

Training Vision Transformers for Image Retrieval
by Alaaeldin El-Nouby et al

02-09-2021

Is Space-Time Attention All You Need for Video Understanding?
by Gedas Bertasius et al

02-11-2021

A-NeRF: Surface-free Human 3D Pose Refinement via Neural Rendering
by Shih-Yang Su et al

02-11-2021

Neural Re-rendering for Full-frame Video Stabilization
by Yu-Lun Liu et al

02-11-2021

Unsupervised Semantic Segmentation by Contrasting Object Mask Proposals
by Wouter Van Gansbeke et al

02-11-2021

Shelf-Supervised Mesh Prediction in the Wild
by Yufei Ye et al

02-11-2021

Deep Photo Scan: Semi-supervised learning for dealing with the real-world degradation in smartphone photo scanning
by Man M. Ho et al

02-11-2021

Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
by Jie Lei et al

02-11-2021

The Barrier of meaning in archaeological data science
by Luca Casini et al

02-11-2021

SWAGAN: A Style-based Wavelet-driven Generative Model
by Rinon Gal et al

02-11-2021

K-Hairstyle: A Large-scale Korean hairstyle dataset for virtual hair editing and hairstyle classification
by Taewoo Kim et al

02-11-2021

Neural BRDF Representation and Importance Sampling
by Alejandro Sztrajman et al

02-10-2021

AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition
by Yue Meng et al

02-11-2021

A Survey on Synchronous Augmented, Virtual and Mixed Reality Remote Collaboration Systems
by Alexander Schäfer et al

02-11-2021

Sample Efficient Learning of Image-Based Diagnostic Classifiers Using Probabilistic Labels
by Roberto Vega et al

02-11-2021

The Deepfake Detection Dilemma: A Multistakeholder Exploration of Adversarial Dynamics in Synthetic Media
by Claire Leibowicz et al

02-10-2021

H3D: Benchmark on Semantic Segmentation of High-Resolution 3D Point Clouds and textured Meshes from UAV LiDAR and Multi-View-Stereo
by Michael Kölle et al

02-09-2021

VINS: Visual Search for Mobile User Interface Design
by Sara Bunian et al

02-09-2021

More Is More -- Narrowing the Generalization Gap by Adding Classification Heads
by Roee Cates et al

02-11-2021

HyperPocket: Generative Point Cloud Completion
by Przemysław Spurek et al

02-10-2021

Driving Style Representation in Convolutional Recurrent Neural Network Model of Driver Identification
by Sobhan Moosavi et al

02-10-2021

Two Novel Performance Improvements for Evolving CNN Topologies
by Yaron Strauch et al

02-10-2021

UAV Localization Using Autoencoded Satellite Images
by Mollie Bianchi et al

02-09-2021

DetCo: Unsupervised Contrastive Learning for Object Detection
by Enze Xie et al

02-11-2021

Searching for Pneumothorax in X-Ray Images Using Autoencoded Deep Features
by Antonio Sze-To et al

02-11-2021

A fully automated method for 3D individual tooth identification and segmentation in dental CBCT
by Tae Jun Jang et al

02-11-2021

Corner Cases for Visual Perception in Automated Driving: Some Guidance on Detection Approaches
by Jasmin Breitenstein et al

02-11-2021

Adversarially robust deepfake media detection using fused convolutional neural network predictions
by Sohail Ahmed Khan et al

02-10-2021

Hyperbolic Generative Adversarial Network
by Diego Lazcano et al

02-09-2021

End-to-End Deep Learning of Lane Detection and Path Prediction for Real-Time Autonomous Driving
by Der-Hau Lee et al

02-12-2021

Improving Object Detection in Art Images Using Only Style Transfer
by David Kadish et al

02-11-2021

Adversarial Segmentation Loss for Sketch Colorization
by Samet Hicsonmez et al

02-11-2021

ABOShips -- An Inshore and Offshore Maritime Vessel Detection Dataset with Precise Annotations
by Bogdan Iancu et al

02-10-2021

ZeroScatter: Domain Transfer for Long Distance Imaging and Vision through Scattering Media
by Zheng Shi et al

02-10-2021

Sparse-Push: Communication- & Energy-Efficient Decentralized Distributed Learning over Directed & Time-Varying Graphs with non-IID Datasets
by Sai Aparna Aketi et al

02-11-2021

Explainability in CNN Models By Means of Z-Scores
by David Malmgren-Hansen et al

02-10-2021

A Topological Approach for Motion Track Discrimination
by Tegan Emerson et al

02-11-2021

Modeling 3D Surface Manifolds with a Locally Conditioned Atlas
by Przemysław Spurek et al

02-10-2021

Frame Difference-Based Temporal Loss for Video Stylization
by Jianjin Xu et al

02-12-2021

Efficient Conditional GAN Transfer with Knowledge Propagation across Classes
by Mohamad Shahbazi et al

02-11-2021

L-SNet: from Region Localization to Scale Invariant Medical Image Segmentation
by Jiahao Xie et al

02-10-2021

Classification of Long Noncoding RNA Elements Using Deep Convolutional Neural Networks and Siamese Networks
by Brian McClannahan et al

02-09-2021

Telling the What while Pointing the Where: Fine-grained Mouse Trace and Language Supervision for Improved Image Retrieval
by Soravit Changpinyo et al

02-09-2021

Policy Augmentation: An Exploration Strategy for Faster Convergence of Deep Reinforcement Learning Algorithms
by Arash Mahyari

02-10-2021

Scale Normalized Image Pyramids with AutoFocus for Object Detection
by Bharat Singh et al

02-10-2021

Audiovisual Highlight Detection in Videos
by Karel Mundnich et al

02-09-2021

FLOP: Federated Learning on Medical Datasets using Partial Networks
by Qian Yang et al

02-09-2021

Where is my hand? Deep hand segmentation for visual self-recognition in humanoid robots
by Alexandre Almeida et al

02-09-2021

Generative Models as Distributions of Functions
by Emilien Dupont et al

02-10-2021

Partial transfusion: on the expressive influence of trainable batch norm parameters for transfer learning
by Fahdi Kanavati et al

02-10-2021

Enhancing Real-World Adversarial Patches with 3D Modeling Techniques
by Yael Mathov et al

02-09-2021

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning
by Yu Liu et al

02-09-2021

Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding
by Marc Górriz et al

02-12-2021

Semantically-Conditioned Negative Samples for Efficient Contrastive Learning
by James O' Neill et al

02-12-2021

End-to-end Audio-visual Speech Recognition with Conformers
by Pingchuan Ma et al

02-09-2021

CorrDetector: A Framework for Structural Corrosion Detection from Drone Images using Ensemble Deep Learning
by Abdur Rahim Mohammad Forkan et al

02-09-2021

Dynamic Neural Networks: A Survey
by Yizeng Han et al

02-09-2021

Facial Expression Recognition on a Quantum Computer
by Riccardo Mengoni et al

02-09-2021

Improving Visual Reasoning by Exploiting The Knowledge in Texts
by Sahand Sharifzadeh et al

02-10-2021

Reference-based Texture transfer for Single Image Super-resolution of Magnetic Resonance images
by Madhu Mithra K K et al

02-10-2021

Application of Yolo on Mask Detection Task
by Ren Liu et al

02-09-2021

Input Similarity from the Neural Network Perspective
by Guillaume Charpiat et al

02-09-2021

An underwater binocular stereo matching algorithm based on the best search domain
by Yimin Peng et al

02-10-2021

Learning to Enhance Visual Quality via Hyperspectral Domain Mapping
by Harsh Sinha et al

02-10-2021

RoBIC: A benchmark suite for assessing classifiers robustness
by Thibault Maho et al

02-09-2021

Distribution Adaptive INT8 Quantization for Training CNNs
by Kang Zhao et al

02-10-2021

Dysplasia grading of colorectal polyps through CNN analysis of WSI
by Daniele Perlo et al

02-09-2021

Sequential vessel segmentation via deep channel attention network
by Dongdong Hao et al

02-10-2021

Doctor Imitator: A Graph-based Bone Age Assessment Framework Using Hand Radiographs
by Jintai Chen et al

02-12-2021

Depthwise Separable Convolutions Allow for Fast and Memory-Efficient Spectral Normalization
by Christina Runkel et al

02-09-2021

Deep learning architectural designs for super-resolution of noisy images
by Angel Villar-Corrales et al

02-12-2021

Min-Max-Plus Neural Networks
by Ye Luo et al

02-09-2021

SG2Caps: Revisiting Scene Graphs for Image Captioning
by Subarna Tripathi et al

02-09-2021

An application of a pseudo-parabolic modeling to texture image recognition
by Joao B. Florindo et al

02-10-2021

Automated Video Labelling: Identifying Faces by Corroborative Evidence
by Andrew Brown et al

02-09-2021

Visual Search at Alibaba
by Yanhao Zhang et al

02-12-2021

A Too-Good-to-be-True Prior to Reduce Shortcut Reliance
by Nikolay Dagaev et al

02-12-2021

Universal Adversarial Perturbations Through the Lens of Deep Steganography: Towards A Fourier Perspective
by Chaoning Zhang et al

02-09-2021

Robust Motion In-betweening
by Félix G. Harvey et al

02-10-2021

Robustness in Compressed Neural Networks for Object Detection
by Sebastian Cygert et al

02-10-2021

Enhancing efficiency of object recognition in different categorization levels by reinforcement learning in modular spiking neural networks
by Fatemeh Sharifizadeh et al

02-12-2021

Confounding Tradeoffs for Neural Network Quantization
by Sahaj Garg et al

02-09-2021

Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search
by Peidong Liu et al

02-09-2021

Driver2vec: Driver Identification from Automotive Data
by Jingbo Yang et al

02-10-2021

Searching for Alignment in Face Recognition
by Xiaqing Xu et al

02-09-2021

Negative Data Augmentation
by Abhishek Sinha et al

02-09-2021

Polarimetric Monocular Dense Mapping Using Relative Deep Depth Prior
by Moein Shakeri et al

02-09-2021

Whats in the box?!: Deflecting Adversarial Attacks by Randomly Deploying Adversarially-Disjoint Models
by Sahar Abdelnabi et al

02-10-2021

A Generic Object Re-identification System for Short Videos
by Tairu Qiu et al

02-09-2021

Mars Image Content Classification: Three Years of NASA Deployment and Recent Advances
by Kiri Wagstaff et al

02-12-2021

Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning
by Yifan Zhang et al

02-09-2021

Residue Density Segmentation for Monitoring and Optimizing Tillage Practices
by Jennifer Hobbs et al

02-09-2021

A Real-World Demonstration of Machine Learning Generalizability: Intracranial Hemorrhage Detection on Head CT
by Hojjat Salehinejad et al

02-09-2021

Classification of Handwritten Names of Cities and Handwritten Text Recognition using Various Deep Learning Models
by Daniyar Nurseitov et al

02-09-2021

Learning Multi-Modal Volumetric Prostate Registration with Weak Inter-Subject Spatial Correspondence
by Oleksii Bashkanov et al

02-09-2021

Large-Scale Visual Search with Binary Distributed Graph at Alibaba
by Kang Zhao et al

02-09-2021

Fashion Focus: Multi-modal Retrieval System for Video Commodity Localization in E-commerce
by Yanhao Zhang et al

02-09-2021

Learning Unsupervised Cross-domain Image-to-Image Translation Using a Shared Discriminator
by Rajiv Kumar et al

02-10-2021

Searching for Fast Model Families on Datacenter Accelerators
by Sheng Li et al

02-12-2021

Rethinking Eye-blink: Assessing Task Difficulty through Physiological Representation of Spontaneous Blinking
by Youngjun Cho

02-12-2021

Bayesian Uncertainty Estimation of Learned Variational MRI Reconstruction
by Dominik Narnhofer et al

02-12-2021

Annotation Cleaning for the MSR-Video to Text Dataset
by Haoran Chen et al

02-09-2021

Locally Free Weight Sharing for Network Width Search
by Xiu Su et al

02-10-2021

BRECQ: Pushing the Limit of Post-Training Quantization by Block Reconstruction
by Yuhang Li et al

02-10-2021

Regional Attention with Architecture-Rebuilt 3D Network for RGB-D Gesture Recognition
by Benjia Zhou et al

02-09-2021

Ensembling object detectors for image and video data analysis
by Kateryna Chumachenko et al

02-09-2021

DARE-SLAM: Degeneracy-Aware and Resilient Loop Closing in Perceptually-Degraded Environments
by Kamak Ebadi et al

02-12-2021

A Generative Model for Hallucinating Diverse Versions of Super Resolution Images
by Mohamed Abderrahmen Abid et al

02-09-2021

D2A U-Net: Automatic Segmentation of COVID-19 Lesions from CT Slices with Dilated Convolution and Dual Attention Mechanism
by Xiangyu Zhao et al

02-12-2021

Outdoor inverse rendering from a single image using multiview self-supervision
by Ye Yu et al

02-10-2021

Improving Aerial Instance Segmentation in the Dark with Self-Supervised Low Light Enhancement
by Prateek Garg et al

02-11-2021

COVID-19 detection from scarce chest x-ray image data using deep learning
by Shruti Jadon

02-09-2021

Detecting Localized Adversarial Examples: A Generic Approach using Critical Region Analysis
by Fengting Li et al

02-12-2021

Multi-source Pseudo-label Learning of Semantic Segmentation for the Scene Recognition of Agricultural Mobile Robots
by Shigemichi Matsuzaki et al

02-12-2021

Robust White Matter Hyperintensity Segmentation on Unseen Domain
by Xingchen Zhao et al

02-09-2021

Large Scale Long-tailed Product Recognition System at Alibaba
by Xiangzeng Zhou et al

02-12-2021

Predicting and Attending to Damaging Collisions for Placing Everyday Objects in Photo-Realistic Simulations
by Aly Magassouba et al

02-10-2021

Exploiting Depth Information for Wildlife Monitoring
by Timm Haucke et al

02-12-2021

Adversarial Branch Architecture Search for Unsupervised Domain Adaptation
by Luca Robbiano et al

02-12-2021

A Parameterised Quantum Circuit Approach to Point Set Matching
by Mohammadreza Noormandipour et al

02-12-2021

Uncertainty-Aware Semi-supervised Method using Large Unlabelled and Limited Labeled COVID-19 Data
by Roohallah Alizadehsani et al

02-12-2021

Destination similarity based on implicit user interest
by Hongliu Cao et al

02-09-2021

RODNet: A Real-Time Radar Object Detection Network Cross-Supervised by Camera-Radar Fused Object 3D Localization
by Yizhou Wang et al

02-09-2021

Referring Segmentation in Images and Videos with Cross-Modal Self-Attention Network
by Linwei Ye et al

02-09-2021

Diverse Single Image Generation with Controllable Global Structure though Self-Attention
by Sutharsan Mahendren et al

02-09-2021

Deep Multilabel CNN for Forensic Footwear Impression Descriptor Identification
by Marcin Budka et al

02-09-2021

The Role of the Input in Natural Language Video Description
by Silvia Cascianelli et al

02-09-2021

LIFT-CAM: Towards Better Explanations for Class Activation Mapping
by Hyungsik Jung et al

02-09-2021

Culture-inspired Multi-modal Color Palette Generation and Colorization: A Chinese Youth Subculture Case
by Yufan Li et al

02-11-2021

COVID-19 identification from volumetric chest CT scans using a progressively resized 3D-CNN incorporating segmentation, augmentation, and class-rebalancing
by Md. Kamrul Hasan et al

02-12-2021

Reviving Iterative Training with Mask Guidance for Interactive Segmentation
by Konstantin Sofiiuk et al

02-12-2021

Densely Deformable Efficient Salient Object Detection Network
by Tanveer Hussain et al

02-09-2021

How Unique Is a Face: An Investigative Study
by Michal Balazia et al

02-12-2021

Analysis of Interpolation based Image In-painting Approaches
by Mustafa Zor et al

02-09-2021

On the Robustness of Multi-View Rotation Averaging
by Xinyi Li et al

02-09-2021

Transfer learning based few-shot classification using optimal transport mapping from preprocessed latent space of backbone neural network
by Tomáš Chobola et al

02-09-2021

Virtual ID Discovery from E-commerce Media at Alibaba: Exploiting Richness of User Click Behavior for Visual Search Relevance
by Yanhao Zhang et al

02-11-2021

Mediastinal lymph nodes segmentation using 3D convolutional neural network ensembles and anatomical priors guiding
by David Bouget et al

02-11-2021

ReRankMatch: Semi-Supervised Learning with Semantics-Oriented Similarity Representation
by Trung Quang Tran et al

02-11-2021

What does LIME really see in images?
by Damien Garreau et al

02-11-2021

Segmentation-Renormalized Deep Feature Modulation for Unpaired Image Harmonization
by Mengwei Ren et al

02-11-2021

Learning Depth via Leveraging Semantics: Self-supervised Monocular Depth Estimation with Both Implicit and Explicit Semantic Guidance
by Rui Li et al

02-11-2021

Towards DeepSentinel: An extensible corpus of labelled Sentinel-1 and -2 imagery and a general-purpose sensor-fusion semantic embedding model
by Lucas Kruitwagen

 
Craig Smith