2021.6.28 Vision papers

 

06-24-2021

Federated Noisy Client Learning
by Li Li et al

06-25-2021

Building Intelligent Autonomous Navigation Agents
by Devendra Singh Chaplot

06-25-2021

Image-to-image Transformation with Auxiliary Condition
by Robert Leer et al

06-23-2021

What makes visual place recognition easy or hard?
by Stefan Schubert et al

06-23-2021

Conditional Deformable Image Registration with Convolutional Neural Network
by Tony C. W. Mok et al

06-24-2021

ChaLearn Looking at People: Inpainting and Denoising challenges
by Sergio Escalera et al

06-24-2021

VOLO: Vision Outlooker for Visual Recognition
by Li Yuan et al

06-23-2021

Transformer Meets Convolution: A Bilateral Awareness Network for Semantic Segmentation of Very Fine Resolution Urban Scene Images
by Libo Wang et al

06-24-2021

When Differential Privacy Meets Interpretability: A Case Study
by Rakshit Naidu et al

06-24-2021

Driver-centric Risk Object Identification
by Chengxi Li et al

06-23-2021

3D human tongue reconstruction from single in-the-wild images
by Stylianos Ploumpis et al

06-24-2021

Advancing biological super-resolution microscopy through deep learning: a brief review
by Tianjie Yang et al

06-25-2021

Semantic annotation for computational pathology: Multidisciplinary experience and best practice recommendations
by Noorul Wahab et al

06-23-2021

Multi-modal and frequency-weighted tensor nuclear norm for hyperspectral image denoising
by Sheng Liu et al

06-24-2021

DnS: Distill-and-Select for Efficient and Accurate Video Indexing and Retrieval
by Giorgos Kordopatis-Zilos et al

06-23-2021

A Review of Assistive Technologies for Activities of Daily Living of Elderly
by Nirmalya Thakur et al

06-24-2021

Towards Automatic Speech to Sign Language Generation
by Parul Kapoor et al

06-24-2021

FaDIV-Syn: Fast Depth-Independent View Synthesis
by Andre Rochow et al

06-23-2021

Alias-Free Generative Adversarial Networks
by Tero Karras et al

06-22-2021

Euro-PVI: Pedestrian Vehicle Interactions in Dense Urban Centers
by Apratim Bhattacharyya et al

06-23-2021

How Well do Feature Visualizations Support Causal Understanding of CNN Activations?
by Roland S. Zimmermann et al

06-24-2021

Generalized One-Class Learning Using Pairs of Complementary Classifiers
by Anoop Cherian et al

06-25-2021

Partially fake it till you make it: mixing real and fake thermal images for improved object detection
by Francesco Bongini et al

06-23-2021

Real-time Instance Segmentation with Discriminative Orientation Maps
by Wentao Du et al

06-24-2021

Physics perception in sloshing scenes with guaranteed thermodynamic consistency
by Beatriz Moya et al

06-22-2021

Give Me Your Trained Model: Domain Adaptive Semantic Segmentation without Source Data
by Yuxi Wang et al

06-22-2021

A Survey on Human-aware Robot Navigation
by Ronja Möller et al

06-22-2021

Universal Domain Adaptation in Ordinal Regression
by Boris Chidlovskii et al

06-22-2021

Automatic Head Overcoat Thickness Measure with NASNet-Large-Decoder Net
by Youshan Zhang et al

06-22-2021

Deep3DPose: Realtime Reconstruction of Arbitrarily Posed Human Bodies from Single RGB Images
by Liguo Jiang et al

06-25-2021

Interactive Multi-level Stroke Control for Neural Style Transfer
by Max Reimann et al

06-24-2021

Fast Monte Carlo Rendering via Multi-Resolution Sampling
by Qiqi Hou et al

06-22-2021

Data Augmentation for Opcode Sequence Based Malware Detection
by Niall McLaughlin et al

06-22-2021

Evaluation of a Region Proposal Architecture for Multi-task Document Layout Analysis
by Lorenzo Quirós et al

06-23-2021

Sentinel-1 and Sentinel-2 Spatio-Temporal Data Fusion for Clouds Removal
by Alessandro Sebastianelli et al

06-24-2021

Generalized Unsupervised Clustering of Hyperspectral Images of Geological Targets in the Near Infrared
by Angela F. Gao et al

06-22-2021

MEAL: Manifold Embedding-based Active Learning
by Deepthi Sreenivasaiah et al

06-22-2021

Confidence-Aware Learning for Camouflaged Object Detection
by Jiawei Liu et al

06-22-2021

PatentNet: A Large-Scale Incomplete Multiview, Multimodal, Multilabel Industrial Goods Image Database
by Fangyuan Lei et al

06-22-2021

Self-Supervised Iterative Contextual Smoothing for Efficient Adversarial Defense against Gray- and Black-Box Attack
by Sungmin Cha et al

06-23-2021

Continuous-Time Deep Glioma Growth Models
by Jens Petersen et al

06-24-2021

Handling Data Heterogeneity with Generative Replay in Collaborative Learning for Medical Imaging
by Liangqiong Qu et al

06-24-2021

CausalCity: Complex Simulations with Agency for Causal Discovery and Reasoning
by Daniel McDuff et al

06-22-2021

Towards Reducing Labeling Cost in Deep Object Detection
by Ismail Elezi et al

06-22-2021

nuPlan: A closed-loop ML-based planning benchmark for autonomous vehicles
by Holger Caesar et al

06-23-2021

Deformed2Self: Self-Supervised Denoising for Dynamic Medical Imaging
by Junshen Xu et al

06-24-2021

Domain-guided Machine Learning for Remotely Sensed In-Season Crop Growth Estimation
by George Worrall et al

06-24-2021

FOVQA: Blind Foveated Video Quality Assessment
by Yize Jin et al

06-23-2021

Behavior Mimics Distribution: Combining Individual and Group Behaviors for Federated Learning
by Hua Huang et al

06-23-2021

Estimating the Robustness of Classification Models by the Structure of the Learned Feature-Space
by Kalun Ho et al

06-22-2021

Exploiting Negative Learning for Implicit Pseudo Label Rectification in Source-Free Domain Adaptive Semantic Segmentation
by Xin Luo et al

06-25-2021

Efficient Document Image Classification Using Region-Based Graph Neural Network
by Jaya Krishna Mandivarapu et al

06-24-2021

Attention Toward Neighbors: A Context Aware Framework for High Resolution Image Segmentation
by Fahim Faisal Niloy et al

06-22-2021

Long-term Cross Adversarial Training: A Robust Meta-learning Method for Few-shot Classification Tasks
by Fan Liu et al

06-25-2021

Animatable Neural Radiance Fields from Monocular RGB Video
by Jianchuan Chen et al

06-22-2021

Learning-Based Practical Light Field Image Compression Using A Disparity-Aware Model
by Mohana Singh et al

06-23-2021

APNN-TC: Accelerating Arbitrary Precision Neural Networks on Ampere GPU Tensor Cores
by Boyuan Feng et al

06-23-2021

Open Images V5 Text Annotation and Yet Another Mask Text Spotter
by Ilya Krylov et al

06-24-2021

Video Super-Resolution with Long-Term Self-Exemplars
by Guotao Meng et al

06-25-2021

Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training
by Hongwei Xue et al

06-22-2021

Transfer Learning of Deep Spatiotemporal Networks to Model Arbitrarily Long Videos of Seizures
by Fernando Pérez-García et al

06-24-2021

Generative Modeling for Multi-task Visual Learning
by Zhipeng Bao et al

06-22-2021

A Comparison for Patch-level Classification of Deep Learning Methods on Transparent Images: from Convolutional Neural Networks to Visual Transformers
by Hechen Yang et al

06-25-2021

PVTv2: Improved Baselines with Pyramid Vision Transformer
by Wenhai Wang et al

06-22-2021

Part-Aware Measurement for Robust Multi-View Multi-Human 3D Pose Estimation and Tracking
by Hau Chu et al

06-24-2021

A Systematic Collection of Medical Image Datasets for Deep Learning
by Johann Li et al

06-23-2021

IA-RED²: Interpretability-Aware Redundancy Reduction for Vision Transformers
by Bowen Pan et al

06-23-2021

Co-advise: Cross Inductive Bias Distillation
by Sucheng Ren et al

06-22-2021

PALMAR: Towards Adaptive Multi-inhabitant Activity Recognition in Point-Cloud Technology
by Mohammad Arif Ul Alam et al

06-23-2021

Image-to-Image Translation of Synthetic Samples for Rare Classes
by Edoardo Lanzini et al

06-24-2021

Semi-supervised Meta-learning with Disentanglement for Domain-generalised Medical Image Segmentation
by Xiao Liu et al

06-24-2021

Q-space Conditioned Translation Networks for Directional Synthesis of Diffusion Weighted Images from Multi-modal Structural MRI
by Mengwei Ren et al

06-24-2021

Continual Novelty Detection
by Rahaf Aljundi et al

06-24-2021

Class agnostic moving target detection by color and location prediction of moving area
by Zhuang He et al

06-24-2021

VinDr-SpineXR: A deep learning framework for spinal lesions detection and classification from radiographs
by Hieu T. Nguyen et al

06-25-2021

Hierarchical Object-oriented Spatio-Temporal Reasoning for Video Question Answering
by Long Hoang Dang et al

06-22-2021

Multi-layered Semantic Representation Network for Multi-label Image Classification
by Xiwen Qu et al

06-23-2021

A Global Appearance and Local Coding Distortion based Fusion Framework for CNN based Filtering in Video Coding
by Jian Yue et al

06-23-2021

Adapting Off-the-Shelf Source Segmenter for Target Medical Image Segmentation
by Xiaofeng Liu et al

06-22-2021

LegoFormer: Transformers for Block-by-Block Multi-view 3D Reconstruction
by Farid Yagubbayli et al

06-23-2021

Gradient-Based Interpretability Methods and Binarized Neural Networks
by Amy Widdicombe et al

06-23-2021

Feature Alignment for Approximated Reversibility in Neural Networks
by Tiago de Souza Farias et al

06-24-2021

FitVid: Overfitting in Pixel-Level Video Prediction
by Mohammad Babaeizadeh et al

06-25-2021

Video Moment Retrieval with Text Query Considering Many-to-Many Correspondence Using Potentially Relevant Pair
by Sho Maeoki et al

06-24-2021

Free-viewpoint Indoor Neural Relighting from Multi-view Stereo
by Julien Philip et al

06-23-2021

FoldIt: Haustral Folds Detection and Segmentation in Colonoscopy Videos
by Shawn Mathew et al

06-25-2021

A Picture May Be Worth a Hundred Words for Visual Question Answering
by Yusuke Hirota et al

06-22-2021

MIMIR: Deep Regression for Automated Analysis of UK Biobank Body MRI
by Taro Langner et al

06-24-2021

Self-Supervised Monocular Depth Estimation of Untextured Indoor Rotated Scenes
by Benjamin Keltjens et al

06-24-2021

Rate Distortion Characteristic Modeling for Neural Image Compression
by Chuanmin Jia et al

06-24-2021

Regularisation for PCA- and SVD-type matrix factorisations
by Abdolrahman Khoshrou et al

06-22-2021

Unsupervised Object-Level Representation Learning from Scene Images
by Jiahao Xie et al

06-24-2021

AVHYAS: A Free and Open Source QGIS Plugin for Advanced Hyperspectral Image Analysis
by Rosly Boy Lyngdoh et al

06-22-2021

RGB2Hands: Real-Time Tracking of 3D Hand Interactions from Monocular RGB Video
by Jiayi Wang et al

06-22-2021

HybVIO: Pushing the Limits of Real-time Visual-inertial Odometry
by Otto Seiskari et al

06-22-2021

G-VAE, a Geometric Convolutional VAE for Protein Structure Generation
by Hao Huang et al

06-24-2021

RSN: Range Sparse Net for Efficient, Accurate LiDAR 3D Object Detection
by Pei Sun et al

06-24-2021

Towards Fully Interpretable Deep Neural Networks: Are We There Yet?
by Sandareka Wickramanayake et al

06-23-2021

Region-Aware Network: Model Human's Top-Down Visual Perception Mechanism for Crowd Counting
by Yuehai Chen et al

06-22-2021

P2T: Pyramid Pooling Transformer for Scene Understanding
by Yu-Huan Wu et al

06-22-2021

On Matrix Factorizations in Subspace Clustering
by Reeshad Arian et al

06-24-2021

Energy-Based Generative Cooperative Saliency Prediction
by Jing Zhang et al

06-24-2021

To the Point: Efficient 3D Object Detection in the Range Image with Graph Convolution Kernels
by Yuning Chai et al

06-24-2021

Bayesian Eye Tracking
by Qiang Ji et al

06-24-2021

Detection of Deepfake Videos Using Long Distance Attention
by Wei Lu et al

06-24-2021

MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction
by Guozhi Tang et al

06-23-2021

Generative Self-training for Cross-domain Unsupervised Tagged-to-Cine MRI Synthesis
by Xiaofeng Liu et al

06-25-2021

Vision Transformer Architecture Search
by Xiu Su et al

06-22-2021

Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval
by Zhipeng Wang et al

06-23-2021

Fairness in Cardiac MR Image Analysis: An Investigation of Bias Due to Data Imbalance in Deep Learning Based Segmentation
by Esther Puyol-Anton et al

06-22-2021

Diabetic Retinopathy Detection using Ensemble Machine Learning
by Israa Odeh et al

06-24-2021

Exploring Stronger Feature for Temporal Action Localization
by Zhiwu Qing et al

06-22-2021

On the importance of cross-task features for class-incremental learning
by Albin Soutif--Cormerais et al

06-24-2021

Differential Morph Face Detection using Discriminative Wavelet Sub-bands
by Baaria Chaudhary et al

06-23-2021

Neural Fashion Image Captioning: Accounting for Data Diversity
by Gilles Hacheme et al

06-22-2021

The Neurally-Guided Shape Parser: A Monte Carlo Method for Hierarchical Labeling of Over-segmented 3D Shapes
by R. Kenny Jones et al

06-22-2021

Team PyKale (xy9) Submission to the EPIC-Kitchens 2021 Unsupervised Domain Adaptation Challenge for Action Recognition
by Xianyuan Liu et al

06-24-2021

HAN: An Efficient Hierarchical Self-Attention Network for Skeleton-Based Gesture Recognition
by Jianbo Liu et al

06-24-2021

Interpreting Depression From Question-wise Long-term Video Recording of SDS Evaluation
by Wanqing Xie et al

06-24-2021

Countering Adversarial Examples: Combining Input Transformation and Noisy Training
by Cheng Zhang et al

06-22-2021

Kernel Clustering with Sigmoid-based Regularization for Efficient Segmentation of Sequential Data
by Tung Doan et al

06-22-2021

Winning the CVPR2021 Kinetics-GEBD Challenge: Contrastive Learning Approach
by Hyolim Kang et al

06-25-2021

Multiview Video Compression Using Advanced HEVC Screen Content Coding
by Jarosław Samelak et al

06-23-2021

Florida Wildlife Camera Trap Dataset
by Crystal Gagne et al

06-23-2021

STRESS: Super-Resolution for Dynamic Fetal MRI using Self-Supervised Learning
by Junshen Xu et al

06-23-2021

Coarse-to-Fine Q-attention: Efficient Learning for Visual Robotic Manipulation via Discretisation
by Stephen James et al

06-23-2021

Human Activity Recognition using Continuous Wavelet Transform and Convolutional Neural Networks
by Anna Nedorubova et al

06-22-2021

MetaAvatar: Learning Animatable Clothed Human Models from Few Depth Images
by Shaofei Wang et al

06-22-2021

RootPainter3D: Interactive-machine-learning enables rapid and accurate contouring for radiotherapy
by Abraham George Smith et al

06-22-2021

Fine-Tuning StyleGAN2 For Cartoon Face Generation
by Jihye Back

06-22-2021

A Latent Transformer for Disentangled and Identity-Preserving Face Editing
by Xu Yao et al

06-24-2021

Unsupervised Deep Image Stitching: Reconstructing Stitched Features to Images
by Lang Nie et al

06-22-2021

Hand-Drawn Electrical Circuit Recognition using Object Detection and Node Recognition
by Rachala Rohith Reddy et al

06-23-2021

Bootstrap Representation Learning for Segmentation on Medical Volumes and Sequences
by Zejian Chen et al

06-25-2021

Connecting Sphere Manifolds Hierarchically for Regularization
by Damien Scieur et al

06-23-2021

CxSE: Chest X-ray Slow Encoding CNN for COVID-19 Diagnosis
by Thangarajah Akilan

06-23-2021

Mutual-Information Based Few-Shot Classification
by Malik Boudiaf et al

06-23-2021

Topological Semantic Mapping by Consolidation of Deep Visual Features
by Ygor C. N. Sousa et al

06-24-2021

Evaluation of deep lift pose models for 3D rodent pose estimation based on geometrically triangulated data
by Indrani Sarkar et al

06-25-2021

On the Robustness of Pretraining and Self-Supervision for a Deep Learning-based Analysis of Diabetic Retinopathy
by Vignesh Srinivasan et al

06-25-2021

Diversifying Semantic Image Synthesis and Editing via Class- and Layer-wise VAEs
by Yuki Endo et al

06-24-2021

A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021
by Ke-Han Lu et al

06-25-2021

Zero Shot Point Cloud Upsampling
by Kaiyue Zhou et al

06-22-2021

Prototypical Cross-Attention Networks for Multiple Object Tracking and Segmentation
by Lei Ke et al

06-24-2021

High-resolution Image Registration of Consecutive and Re-stained Sections in Histopathology
by Johannes Lotz et al

06-24-2021

Learning by Planning: Language-Guided Global Image Editing
by Jing Shi et al

06-23-2021

A Circular-Structured Representation for Visual Emotion Distribution Learning
by Jingyuan Yang et al

06-23-2021

Deep unsupervised 3D human body reconstruction from a sparse set of landmarks
by Meysam Madadi et al

06-23-2021

A Label Management Mechanism for Retinal Fundus Image Classification of Diabetic Retinopathy
by Mengdi Gao et al

06-25-2021

Single Image Texture Translation for Data Augmentation
by Boyi Li et al

06-22-2021

Zero-Shot Chinese Character Recognition with Stroke-Level Decomposition
by Jingye Chen et al

06-23-2021

Vision Permutator: A Permutable MLP-Like Architecture for Visual Recognition
by Qibin Hou et al

06-22-2021

SSUL: Semantic Segmentation with Unknown Label for Exemplar-based Class-Incremental Learning
by Sungmin Cha, Beomyoung Kim et al

06-22-2021

Creating A New Color Space utilizing PSO and FCM to Perform Skin Detection by using Neural Network and ANFIS
by Kobra Nazaria et al

06-23-2021

A new Video Synopsis Based Approach Using Stereo Camera
by Talha Dilber et al

06-24-2021

SGTBN: Generating Dense Depth Maps from Single-Line LiDAR
by Hengjie Lu et al

06-24-2021

Relationship between pulmonary nodule malignancy and surrounding pleurae, airways and vessels: a quantitative study using the public LIDC-IDRI dataset
by Yulei Qin et al

06-25-2021

SRPN: similarity-based region proposal networks for nuclei and cells detection in histology images
by Yibao Sun et al

06-25-2021

Graph Pattern Loss based Diversified Attention Network for Cross-Modal Retrieval
by Xueying Chen et al

06-25-2021

Circumpapillary OCT-Focused Hybrid Learning for Glaucoma Grading Using Tailored Prototypical Neural Networks
by Gabriel García et al

06-25-2021

A Novel Self-Learning Framework for Bladder Cancer Grading Using Histopathological Images
by Gabriel García et al

06-23-2021

Feature Completion for Occluded Person Re-Identification
by Ruibing Hou et al

06-23-2021

Multi-Modal 3D Object Detection in Autonomous Driving: a Survey
by Yingjie Wang et al

06-23-2021

Frequency Domain Convolutional Neural Network: Accelerated CNN for Large Diabetic Retinopathy Image Classification
by Ee Fey Goh et al

06-23-2021

Planetary UAV localization based on Multi-modal Registration with Pre-existing Digital Terrain Model
by Xue Wan et al

06-25-2021

Re-parameterizing VAEs for stability
by David Dehaene et al

06-25-2021

Projection-wise Disentangling for Fair and Interpretable Representation Learning: Application to 3D Facial Shape Analysis
by Xianjing Liu et al

06-23-2021

High-Throughput Precision Phenotyping of Left Ventricular Hypertrophy with Cardiovascular Deep Learning
by Grant Duffy et al

06-24-2021

Symmetric Wasserstein Autoencoders
by Sun Sun et al

06-23-2021

Multi-Class Classification of Blood Cells -- End to End Computer Vision based diagnosis case study
by Sai Sukruth Bezugam

06-22-2021

DocFormer: End-to-End Transformer for Document Understanding
by Srikar Appalaraju et al

06-22-2021

Residual Networks as Flows of Velocity Fields for Diffeomorphic Time Series Alignment
by Hao Huang et al

06-22-2021

Enhanced Separable Disentanglement for Unsupervised Domain Adaptation
by Youshan Zhang et al

06-25-2021

NP-DRAW: A Non-Parametric Structured Latent Variable Model for Image Generation
by Xiaohui Zeng et al

06-24-2021

Video Swin Transformer
by Ze Liu et al

06-24-2021

Depth Confidence-aware Camouflaged Object Detection
by Jing Zhang et al

06-22-2021

Volume Rendering of Neural Implicit Surfaces
by Lior Yariv et al

06-23-2021

Deep Fake Detection: Survey of Facial Manipulation Detection Solutions
by Samay Pashine et al

06-23-2021

Vision-based Behavioral Recognition of Novelty Preference in Pigs
by Aniket Shirke et al

06-23-2021

Collaborative Visual Inertial SLAM for Multiple Smart Phones
by Jialing Liu et al

06-23-2021

All You Need is a Second Look: Towards Arbitrary-Shaped Text Detection
by Meng Cao et al

06-24-2021

DCoM: A Deep Column Mapper for Semantic Data Type Detection
by Subhadip Maji et al

06-22-2021

Differentiable Architecture Search Without Training Nor Labels: A Pruning Perspective
by Miao Zhang et al

06-23-2021

Handwritten Digit Recognition using Machine and Deep Learning Algorithms
by Samay Pashine et al

06-23-2021

ATP-Net: An Attention-based Ternary Projection Network For Compressed Sensing
by Guanxiong Nie et al

06-24-2021

AutoAdapt: Automated Segmentation Network Search for Unsupervised Domain Adaptation
by Xueqing Deng et al

06-24-2021

HyperNeRF: A Higher-Dimensional Representation for Topologically Varying Neural Radiance Fields
by Keunhong Park et al

06-24-2021

A Simple and Strong Baseline: Progressively Region-based Scene Text Removal Networks
by Yuxin Wang et al

06-23-2021

Instance-based Vision Transformer for Subtyping of Papillary Renal Cell Carcinoma in Histopathological Image
by Zeyu Gao et al

06-25-2021

Shape registration in the time of transformers
by Giovanni Trappolini et al

06-22-2021

Tracking Instances as Queries
by Shusheng Yang et al

06-24-2021

Exploring Corruption Robustness: Inductive Biases in Vision Transformers and MLP-Mixers
by Katelyn Morrison et al

06-24-2021

Unsupervised Learning of Depth and Depth-of-Field Effect from Natural Images with Aperture Rendering Generative Adversarial Networks
by Takuhiro Kaneko

06-24-2021

AudioCLIP: Extending CLIP to Image, Text and Audio
by Andrey Guzhov et al

06-22-2021

Reachability Analysis of Convolutional Neural Networks
by Xiaodong Yang et al

06-22-2021

The Hitchhiker's Guide to Prior-Shift Adaptation
by Tomas Sipka et al

06-24-2021

GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes
by Youssef A. Mejjati et al

06-23-2021

Learning from Pseudo Lesion: A Self-supervised Framework for COVID-19 Diagnosis
by Zhongliang Li et al

06-23-2021

FusionPainting: Multimodal Fusion with Adaptive Attention for 3D Object Detection
by Shaoqing Xu et al

06-22-2021

Towards Consistent Predictive Confidence through Fitted Ensembles
by Navid Kardan et al

06-24-2021

Sparse Needlets for Lighting Estimation with Spherical Transport Loss
by Fangneng Zhan et al

 
Craig Smith