2021.5.10 Vision papers

 

05-04-2021

A Fast Partial Video Copy Detection Using KNN and Global Feature Database
by Weijun Tan et al

05-04-2021

The Pursuit of Knowledge: Discovering and Localizing Novel Categories using Dual Memory
by Sai Saketh Rambhatla et al

05-06-2021

Generalizable Representation Learning for Mixture Domain Face Anti-Spoofing
by Zhihong Chen et al

05-07-2021

LINN: Lifting Inspired Invertible Neural Network for Image Denoising
by Jun-Jie Huang et al

05-06-2021

BasisNet: Two-stage Model Synthesis for Efficient Inference
by Mingda Zhang et al

05-04-2021

Poisoning the Unlabeled Dataset of Semi-Supervised Learning
by Nicholas Carlini

05-04-2021

An Empirical Review of Deep Learning Frameworks for Change Detection: Model Design, Experimental Frameworks, Challenges and Research Needs
by Murari Mandal et al

05-07-2021

Energy-Based Anomaly Detection and Localization
by Ergin Utku Genc et al

05-06-2021

Q-Match: Iterative Shape Matching via Quantum Annealing
by Marcel Seelbach Benkner et al

05-06-2021

Deep Polarization Imaging for 3D shape and SVBRDF Acquisition
by Valentin Deschaintre et al

05-06-2021

Aligning Subtitles in Sign Language Videos
by Hannah Bull et al

05-04-2021

Hallucination Improves Few-Shot Object Detection
by Weilin Zhang et al

05-04-2021

LAFFNet: A Lightweight Adaptive Feature Fusion Network for Underwater Image Enhancement
by Hao-Hsiang Yang et al

05-04-2021

Uncertainty-aware INVASE: Enhanced Breast Cancer Diagnosis Feature Selection
by Jia-Xing Zhong et al

05-05-2021

PD-GAN: Probabilistic Diverse GAN for Image Inpainting
by Hongyu Liu et al

05-04-2021

Technical Report for Valence-Arousal Estimation on Affwild2 Dataset
by I-Hsuan Li

05-05-2021

DeepSMOTE: Fusing Deep Learning and SMOTE for Imbalanced Data
by Damien Dablain et al

05-04-2021

One Model for All Quantization: A Quantized Network Supporting Hot-Swap Bit-Width Adjustment
by Qigong Sun et al

05-07-2021

Towards Real-World Category-level Articulation Pose Estimation
by Liu Liu et al

05-05-2021

Impact of individual rater style on deep learning uncertainty in medical imaging segmentation
by Olivier Vincent et al

05-05-2021

Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention
by Wei Suo et al

05-05-2021

Self-Supervised Multi-Frame Monocular Scene Flow
by Junhwa Hur et al

05-04-2021

Multipath Graph Convolutional Neural Networks
by Rangan Das et al

05-04-2021

Where and When: Space-Time Attention for Audio-Visual Explanations
by Yanbei Chen et al

05-04-2021

Remote Pathological Gait Classification System
by Pedro Albuquerque et al

05-06-2021

Inverting Generative Adversarial Renderer for Face Reconstruction
by Jingtan Piao et al

05-06-2021

Towards Novel Target Discovery Through Open-Set Domain Adaptation
by Taotao Jing et al

05-06-2021

Weakly Supervised Action Selection Learning in Video
by Junwei Ma et al

05-04-2021

Dual-Cross Central Difference Network for Face Anti-Spoofing
by Zitong Yu et al

05-06-2021

Do You Even Need Attention? A Stack of Feed-Forward Layers Does Surprisingly Well on ImageNet
by Luke Melas-Kyriazi

05-06-2021

Sparse convolutional context-aware multiple instance learning for whole slide image classification
by Marvin Lerousseau et al

05-07-2021

NTIRE 2021 Challenge on Perceptual Image Quality Assessment
by Jinjin Gu et al

05-05-2021

Bayesian Logistic Shape Model Inference: application to cochlea image segmentation
by Wang Zihao et al

05-06-2021

Federated Face Recognition
by Fan Bai et al

05-04-2021

Motion-Augmented Self-Training for Video Recognition at Smaller Scale
by Kirill Gavrilyuk et al

05-06-2021

Online Preconditioning of Experimental Inkjet Hardware by Bayesian Optimization in Loop
by Alexander E. Siemenn et al

05-05-2021

Rethinking Ultrasound Augmentation: A Physics-Inspired Approach
by Maria Tirindelli et al

05-05-2021

PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond
by Enze Xie et al

05-05-2021

Addressing Annotation Imprecision for Tree Crown Delineation Using the RandCrowns Index
by Dylan Stewart et al

05-05-2021

Content4All Open Research Sign Language Translation Datasets
by Necati Cihan Camgoz et al

05-05-2021

Novelty Detection and Analysis of Traffic Scenario Infrastructures in the Latent Space of a Vision Transformer-Based Triplet Autoencoder
by Jonas Wurst et al

05-06-2021

Why Approximate Matrix Square Root Outperforms Accurate SVD in Global Covariance Pooling?
by Yue Song et al

05-05-2021

Spatio-Temporal Matching for Siamese Visual Tracking
by Jinpu Zhang et al

05-05-2021

In the Danger Zone: U-Net Driven Quantile Regression can Predict High-risk SARS-CoV-2 Regions via Pollutant Particulate Matter and Satellite Imagery
by Jacquelyn Shelton et al

05-04-2021

3D Vehicle Detection Using Camera and Low-Resolution LiDAR
by Lin Bai et al

05-04-2021

Moving Towards Centers: Re-ranking with Attention and Memory for Re-identification
by Yunhao Zhou et al

05-07-2021

ResMLP: Feedforward networks for image classification with data-efficient training
by Hugo Touvron et al

05-04-2021

Representation Learning for Clustering via Building Consensus
by Aniket Anand Deshmukh et al

05-04-2021

Walk in the Cloud: Learning Curves for Point Clouds Shape Analysis
by Tiange Xiang et al

05-06-2021

Deep Weighted Consensus: Dense correspondence confidence maps for 3D shape registration
by Dvir Ginzburg et al

05-07-2021

Self-Adaptive Transfer Learning for Multicenter Glaucoma Classification in Fundus Retina Images
by Yiming Bao et al

05-07-2021

Contrastive Learning for Unsupervised Image-to-Image Translation
by Hanbit Lee et al

05-06-2021

VideoLT: Large-scale Long-tailed Video Recognition
by Xing Zhang et al

05-04-2021

Self-Improving Semantic Perception on a Construction Robot
by Hermann Blum et al

05-04-2021

CUAB: Convolutional Uncertainty Attention Block Enhanced the Chest X-ray Image Analysis
by Chi-Shiang Wang et al

05-05-2021

VoxelContext-Net: An Octree based Framework for Point Cloud Compression
by Zizheng Que et al

05-05-2021

Pairwise Point Cloud Registration using Graph Matching and Rotation-invariant Features
by Rong Huang et al

05-06-2021

Animatable Neural Radiance Fields for Human Body Modeling
by Sida Peng et al

05-05-2021

Contrastive Learning and Self-Training for Unsupervised Domain Adaptation in Semantic Segmentation
by Robert A. Marsden et al

05-04-2021

Texture for Colors: Natural Representations of Colors Using Variable Bit-Depth Textures
by Shumeet Baluja

05-07-2021

Human Object Interaction Detection using Two-Direction Spatial Enhancement and Exclusive Object Prior
by Lu Liu et al

05-06-2021

Salient Objects in Clutter
by Deng-Ping Fan et al

05-06-2021

Few-Shot Learning for Image Classification of Common Flora
by Joshua Ball

05-07-2021

Self-paced Resistance Learning against Overfitting on Noisy Labels
by Xiaoshuang Shi et al

05-04-2021

Weak Multi-View Supervision for Surface Mapping Estimation
by Nishant Rai et al

05-07-2021

Neural 3D Scene Compression via Model Compression
by Berivan Isik

05-05-2021

Prototype Memory for Large-scale Face Representation Learning
by Evgeny Smirnov et al

05-05-2021

Perceptual Gradient Networks
by Dmitry Nikulin et al

05-04-2021

Leveraging Third-Order Features in Skeleton-Based Action Recognition
by Zhenyue Qin et al

05-06-2021

Adaptive Domain-Specific Normalization for Generalizable Person Re-Identification
by Jiawei Liu et al

05-04-2021

MLP-Mixer: An all-MLP Architecture for Vision
by Ilya Tolstikhin et al

05-05-2021

Exploring Explicit and Implicit Visual Relationships for Image Captioning
by Zeliang Song et al

05-06-2021

Saliency-Guided Deep Learning Network for Automatic Tumor Bed Volume Delineation in Post-operative Breast Irradiation
by Mahdieh Kazemimoghadam et al

05-07-2021

Probabilistic Ranking-Aware Ensembles for Enhanced Object Detections
by Mingyuan Mao et al

05-07-2021

Interpretable Social Anchors for Human Trajectory Forecasting in Crowds
by Parth Kothari et al

05-05-2021

SIPSA-Net: Shift-Invariant Pan Sharpening with Moving Object Alignment for Satellite Imagery
by Jaehyup Lee et al

05-05-2021

Learning Feature Aggregation for Deep 3D Morphable Models
by Zhixiang Chen et al

05-04-2021

Combining Supervised and Un-supervised Learning for Automatic Citrus Segmentation
by Heqing Huang et al

05-06-2021

Unsupervised Visual Representation Learning by Tracking Patches in Video
by Guangting Wang et al

05-06-2021

A Novel Falling-Ball Algorithm for Image Segmentation
by Asra Aslam et al

05-06-2021

Understanding Catastrophic Overfitting in Adversarial Training
by Peilin Kang et al

05-07-2021

A State-of-the-art Survey of Object Detection Techniques in Microorganism Image Analysis: from Traditional Image Processing and Classical Machine Learning to Current Deep Convolutional Neural Networks and Potential Visual Transformers
by Chen Li et al

05-07-2021

An Intelligent Passive Food Intake Assessment System with Egocentric Cameras
by Frank Po Wen Lo et al

05-06-2021

Faster and Simpler Siamese Network for Single Object Tracking
by Shaokui Jiang et al

05-06-2021

Quantification of pulmonary involvement in COVID-19 pneumonia by means of a cascade oftwo U-nets: training and assessment on multipledatasets using different annotation criteria
by Francesca Lizzi et al

05-04-2021

PingAn-VCGroups Solution for ICDAR 2021 Competition on Scientific Table Image Recognition to Latex
by Yelin He et al

05-05-2021

Physically Inspired Dense Fusion Networks for Relighting
by Amirsaeed Yazdani et al

05-05-2021

Continual Learning on the Edge with TensorFlow Lite
by Giorgos Demosthenous et al

05-04-2021

Curvatures of Stiefel manifolds with deformation metrics
by Du Nguyen

05-06-2021

A novel method of predictive collision risk area estimation for proactive pedestrian accident prevention system in urban surveillance infrastructure
by Byeongjoon Noh et al

05-04-2021

COVID-19 Detection from Chest X-ray Images using Imprinted Weights Approach
by Jianxing Zhang et al

05-04-2021

PingAn-VCGroups Solution for ICDAR 2021 Competition on Scientific Literature Parsing Task B: Table Recognition to HTML
by Jiaquan Ye et al

05-05-2021

Attention for Image Registration (AiR): an unsupervised Transformer approach
by Zihao Wang et al

05-04-2021

Real-time Face Mask Detection in Video Data
by Yuchen Ding et al

05-06-2021

LASR: Learning Articulated Shape Reconstruction from a Monocular Video
by Gengshan Yang et al

05-05-2021

MiCE: Mixture of Contrastive Experts for Unsupervised Image Clustering
by Tsung Wei Tsai et al

05-07-2021

More Separable and Easier to Segment: A Cluster Alignment Method for Cross-Domain Semantic Segmentation
by Shuang Wang et al

05-07-2021

Toward Interactive Modulation for Photo-Realistic Image Restoration
by Haoming Cai et al

05-05-2021

Multi-scale Image Decomposition using a Local Statistical Edge Model
by Kin-Ming Wong

05-05-2021

Visual Composite Set Detection Using Part-and-Sum Transformers
by Qi Dong et al

05-04-2021

TransHash: Transformer-based Hamming Hashing for Efficient Image Retrieval
by Yongbiao Chen et al

05-04-2021

Lesion Segmentation and RECIST Diameter Prediction via Click-driven Attention and Dual-path Connection
by Youbao Tang et al

05-06-2021

Computer-Aided Design as Language
by Yaroslav Ganin et al

05-05-2021

Split and Connect: A Universal Tracklet Booster for Multi-Object Tracking
by Gaoang Wang et al

05-06-2021

MAFER: a Multi-resolution Approach to Facial Expression Recognition
by Fabio Valerio Massoli et al

05-05-2021

R2U3D: Recurrent Residual 3D U-Net for Lung Segmentation
by Dhaval D. Kadia et al

05-05-2021

A Step Toward More Inclusive People Annotations for Fairness
by Candice Schumann et al

05-04-2021

Generative Adversarial Networks (GAN) Powered Fast Magnetic Resonance Imaging -- Mini Review, Comparison and Perspectives
by Guang Yang et al

05-04-2021

Encoder Fusion Network with Co-Attention Embedding for Referring Image Segmentation
by Guang Feng et al

05-05-2021

4DComplete: Non-Rigid Motion Estimation Beyond the Observable Surface
by Yang Li et al

05-06-2021

Pose-Guided Sign Language Video GAN with Dynamic Lambda
by Christopher Kissel et al

05-06-2021

Vision based Pedestrian Potential Risk Analysis based on Automated Behavior Feature Extraction for Smart and Safe City
by Byeongjoon Noh et al

05-06-2021

Estimating Presentation Competence using Multimodal Nonverbal Behavioral Cues
by Ömer Sümer et al

05-07-2021

Adv-Makeup: A New Imperceptible and Transferable Attack on Face Recognition
by Bangjie Yin et al

05-05-2021

This Looks Like That... Does it? Shortcomings of Latent Space Prototype Explainability in Deep Networks
by Adrian Hoffmann et al

05-05-2021

Image Embedding and Model Ensembling for Automated Chest X-Ray Interpretation
by Edoardo Giacomello et al

05-05-2021

QueryInst: Parallelly Supervised Mask Query for Instance Segmentation
by Yuxin Fang et al

05-04-2021

Effectively Leveraging Attributes for Visual Similarity
by Samarth Mishra et al

05-05-2021

SeaDronesSee: A Maritime Benchmark for Detecting Humans in Open Water
by Leon Amadeus Varga et al

05-07-2021

A^2-FPN: Attention Aggregation based Feature Pyramid Network for Instance Segmentation
by Miao Hu et al

05-05-2021

Conditional Invertible Neural Networks for Diverse Image-to-Image Translation
by Lynton Ardizzone et al

05-04-2021

Robustness Enhancement of Object Detection in Advanced Driver Assistance Systems (ADAS)
by Le-Anh Tran et al

05-05-2021

Function4D: Real-time Human Volumetric Capture from Very Sparse Consumer RGBD Sensors
by Tao Yu et al

05-05-2021

Towards an efficient framework for Data Extraction from Chart Images
by Weihong Ma et al

05-04-2021

COVID-Net CT-S: 3D Convolutional Neural Network Architectures for COVID-19 Severity Assessment using Chest CT Images
by Hossein Aboutalebi et al

05-04-2021

Computer vision for liquid samples in hospitals and medical labs using hierarchical image segmentation and relations prediction
by Sagi Eppel et al

05-05-2021

MODS -- A USV-oriented object detection and obstacle segmentation benchmark
by Borja Bovcon et al

05-05-2021

Instance segmentation of fallen trees in aerial color infrared imagery using active multi-contour evolution with fully convolutional network-based intensity priors
by Przemyslaw Polewski et al

05-07-2021

Autoencoder Based Inter-Vehicle Generalization for In-Cabin Occupant Classification
by Steve Dias Da Cruz et al

05-05-2021

AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss
by Yangyang Guo et al

05-06-2021

Learning Skeletal Articulations with Neural Blend Shapes
by Peizhuo Li et al

05-06-2021

Cascade Image Matting with Deformable Graph Refinement
by Zijian Yu et al

05-06-2021

Two4Two: Evaluating Interpretable Machine Learning - A Synthetic Dataset For Controlled Experiments
by Martin Schuessler et al

05-04-2021

Attention-based Stylisation for Exemplar Image Colourisation
by Marc Gorriz Blanch et al

05-05-2021

Towards Self-Supervision for Video Identification of Individual Holstein-Friesian Cattle: The Cows2021 Dataset
by Jing Gao et al

05-05-2021

FLEX: Parameter-free Multi-view 3D Human Motion Reconstruction
by Brian Gordon et al

05-04-2021

Intensity Harmonization for Airborne LiDAR
by David Jones et al

05-05-2021

Moving SLAM: Fully Unsupervised Deep Learning in Non-Rigid Scenes
by Dan Xu et al

05-04-2021

Joint Registration and Segmentation via Multi-Task Learning for Adaptive Radiotherapy of Prostate Cancer
by Mohamed S. Elmahdy et al

05-05-2021

Real-time Multi-Adaptive-Resolution-Surfel 6D LiDAR Odometry using Continuous-time Trajectory Optimization
by Jan Quenzel et al

05-06-2021

Learning Neighborhood Representation from Multi-Modal Multi-Graph: Image, Text, Mobility Graph and Beyond
by Tianyuan Huang et al

05-07-2021

Exploring Instance Relations for Unsupervised Feature Embedding
by Yifei Zhang et al

05-07-2021

Foreground-guided Facial Inpainting with Fidelity Preservation
by Jireh Jam et al

05-06-2021

Deep Learning based Multi-modal Computing with Feature Disentanglement for MRI Image Synthesis
by Yuchen Fei et al

05-04-2021

Soft-Attention Improves Skin Cancer Classification Performance
by Soumyya Kanti Datta et al

05-06-2021

Local Relation Learning for Face Forgery Detection
by Shen Chen et al

05-05-2021

MOS: Towards Scaling Out-of-distribution Detection for Large Semantic Space
by Rui Huang et al

05-06-2021

A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking
by Zhenbang Li et al

05-06-2021

Real-Time Video Super-Resolution by Joint Local Inference and Global Parameter Estimation
by Noam Elron et al

05-04-2021

Height Estimation of Children under Five Years using Depth Images
by Anusua Trivedi et al

05-06-2021

Object-centric Video Prediction without Annotation
by Karl Schmeckpeper et al

05-04-2021

DeepRT: A Soft Real Time Scheduler for Computer Vision Applications on the Edge
by Zhe Yang et al

05-06-2021

Relative stability toward diffeomorphisms in deep nets indicates performance
by Leonardo Petrini et al

05-06-2021

Body Meshes as Points
by Jianfeng Zhang et al

05-06-2021

Structured dataset documentation: a datasheet for CheXpert
by Christian Garbin et al

05-06-2021

Multi-Perspective LSTM for Joint Visual Representation Learning
by Alireza Sepas-Moghaddam et al

05-06-2021

Dynamic Defense Approach for Adversarial Robustness in Deep Neural Networks via Stochastic Ensemble Smoothed Model
by Ruoxi Qin et al

05-05-2021

Weakly Supervised Pseudo-Label assisted Learning for ALS Point Cloud Semantic Segmentation
by Puzuo Wang et al

05-04-2021

Orienting Point Clouds with Dipole Propagation
by Gal Metzer et al

05-04-2021

Surveilling Surveillance: Estimating the Prevalence of Surveillance Cameras with Street View Data
by Hao Sheng et al

05-05-2021

Magnifying Subtle Facial Motions for Effective 4D Expression Recognition
by Qingkai Zhen et al

05-05-2021

Person Retrieval in Surveillance Using Textual Query: A Review
by Hiren Galiyawala et al

05-05-2021

Handwritten Mathematical Expression Recognition with Bidirectionally Trained Transformer
by Wenqi Zhao et al

05-05-2021

Iterative Human and Automated Identification of Wildlife Images
by Zhongqi Miao et al

05-05-2021

Deep Spherical Manifold Gaussian Kernel for Unsupervised Domain Adaptation
by Youshan Zhang et al

05-06-2021

A 2.5D Vehicle Odometry Estimation for Vision Applications
by Paul Moran et al

05-06-2021

SS-CADA: A Semi-Supervised Cross-Anatomy Domain Adaptation for Coronary Artery Segmentation
by Jingyang Zhang et al

05-07-2021

Probabilistic Visual Place Recognition for Hierarchical Localization
by Ming Xu et al

05-04-2021

GANs for Urban Design
by Stanislava Fedorova

05-06-2021

SkyCam: A Dataset of Sky Images and their Irradiance values
by Evangelos Ntavelis et al

05-06-2021

ACORN: Adaptive Coordinate Networks for Neural Scene Representation
by Julien N. P. Martel et al

05-05-2021

Explainable Artificial Intelligence for Human Decision-Support System in Medical Domain
by Samanta Knapič et al

05-06-2021

Development of a Fast and Robust Gaze Tracking System for Game Applications
by Manh Duong Phung et al

05-06-2021

PoseAug: A Differentiable Pose Augmentation Framework for 3D Human Pose Estimation
by Kehong Gong et al

05-05-2021

Beyond Self-attention: External Attention using Two Linear Layers for Visual Tasks
by Meng-Hao Guo et al

05-05-2021

Cuboids Revisited: Learning Robust 3D Shape Fitting to Single RGB Images
by Florian Kluger et al

05-04-2021

Canonical Saliency Maps: Decoding Deep Face Models
by Thrupthi Ann John et al

05-06-2021

(ASNA) An Attention-based Siamese-Difference Neural Network with Surrogate Ranking Loss function for Perceptual Image Quality Assessment
by Seyed Mehdi Ayyoubzadeh et al

05-05-2021

DeepPlastic: A Novel Approach to Detecting Epipelagic Bound Plastic Using Deep Visual Models
by Gautam Tata et al

05-04-2021

Real-time Deep Dynamic Characters
by Marc Habermann et al

05-06-2021

Detection, Tracking, and Counting Meets Drones in Crowds: A Benchmark
by Longyin Wen et al

05-06-2021

Efficient Masked Face Recognition Method during the COVID-19 Pandemic
by Walid Hariri

05-07-2021

Adaptive Focus for Efficient Video Recognition
by Yulin Wang et al

05-07-2021

MOTR: End-to-End Multiple-Object Tracking with TRansformer
by Fangao Zeng et al

05-05-2021

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition
by Xiaohan Ding et al

05-05-2021

MCGNet: Partial Multi-view Few-shot Learning via Meta-alignment and Context Gated-aggregation
by Yuan Zhou et al

 
Craig Smith