2021.5.31 Vision papers

 

05-26-2021

CogView: Mastering Text-to-Image Generation via Transformers
by Ming Ding et al

05-25-2021

Adversarial Attack Driven Data Augmentation for Accurate And Robust Medical Image Segmentation
by Mst. Tasnim Pervin et al

05-25-2021

The Nonlinearity Coefficient -- A Practical Guide to Neural Architecture Design
by George Philipp

05-28-2021

NViSII: A Scriptable Tool for Photorealistic Image Generation
by Nathan Morrical et al

05-27-2021

Blind Motion Deblurring Super-Resolution: When Dynamic Spatio-Temporal Learning Meets Static Image Understanding
by Wenjia Niu et al

05-26-2021

Self-Ensembling Contrastive Learning for Semi-Supervised Medical Image Segmentation
by Jinxi Xiang et al

05-26-2021

Robust Navigation for Racing Drones based on Imitation Learning and Modularization
by Tianqi Wang et al

05-25-2021

Self-Organized Variational Autoencoders (Self-VAE) for Learned Image Compression
by M. Akın Yılmaz et al

05-28-2021

AutoSampling: Search for Effective Data Sampling Schedules
by Ming Sun et al

05-28-2021

ResT: An Efficient Transformer for Visual Recognition
by Qinglong Zhang et al

05-28-2021

What Is Considered Complete for Visual Recognition?
by Lingxi Xie et al

05-27-2021

Unsupervised Domain Adaption of Object Detectors: A Survey
by Poojan Oza et al

05-27-2021

Using Early-Learning Regularization to Classify Real-World Noisy Data
by Alessio Galatolo et al

05-27-2021

SSAN: Separable Self-Attention Network for Video Representation Learning
by Xudong Guo et al

05-25-2021

Calibration and Uncertainty Quantification of Bayesian Convolutional Neural Networks for Geophysical Applications
by Lukas Mosser et al

05-27-2021

Passing Multi-Channel Material Textures to a 3-Channel Loss
by Thomas Chambon et al

05-26-2021

Multiple Domain Experts Collaborative Learning: Multi-Source Domain Generalization For Person Re-Identification
by Shijie Yu et al

05-28-2021

EDEN: Deep Feature Distribution Pooling for Saimaa Ringed Seals Pattern Matching
by Ilja Chelak et al

05-26-2021

Towards Transparent Application of Machine Learning in Video Processing
by Luka Murn et al

05-25-2021

Bridging the Gap Between Explainable AI and Uncertainty Quantification to Enhance Trustability
by Dominik Seuß

05-27-2021

HDRUNet: Single Image HDR Reconstruction with Denoising and Dequantization
by Xiangyu Chen et al

05-27-2021

Unsupervised Adaptive Semantic Segmentation with Local Lipschitz Constraint
by Guanyu Cai et al

05-28-2021

Learning Relation Alignment for Calibrated Cross-modal Retrieval
by Shuhuai Ren et al

05-27-2021

GuideMe: A Mobile Application based on Global Positioning System and Object Recognition Towards a Smart Tourist Guide
by Wadii Boulila et al

05-27-2021

Dynamic Network selection for the Object Detection task: why it matters and what we (didnt) achieve
by Emanuele Vitali et al

05-25-2021

Small and large scale critical infrastructures detection based on deep learning using high resolution orthogonal images
by Pérez-Hernández Francisco et al

05-25-2021

Matching Targets Across Domains with RADON, the Re-Identification Across Domain Network
by Cassandra Burgess et al

05-27-2021

Learning to Stylize Novel Views
by Hsin-Ping Huang et al

05-26-2021

On the Advantages of Multiple Stereo Vision Camera Designs for Autonomous Drone Navigation
by Rui Pimentel de Figueiredo et al

05-25-2021

Optimal ANN-SNN Conversion for Fast and Accurate Inference in Deep Spiking Neural Networks
by Jianhao Ding et al

05-25-2021

Dynamic Dual Sampling Module for Fine-Grained Semantic Segmentation
by Chen Shi et al

05-25-2021

FINNger -- Applying artificial intelligence to ease math learning for children
by Rafael Baldasso Audibert et al

05-26-2021

Benchmarking Scientific Image Forgery Detectors
by João P. Cardenuto et al

05-25-2021

Graph Self Supervised Learning: the BT, the HSIC, and the VICReg
by Sayan Nag

05-28-2021

PTNet: A High-Resolution Infant MRI Synthesizer Based on Transformer
by Xuzhe Zhang et al

05-25-2021

Temporal Action Proposal Generation with Transformers
by Lining Wang et al

05-26-2021

Dynamic Probabilistic Pruning: A general framework for hardware-constrained pruning at different granularities
by Lizeth Gonzalez-Carabarin et al

05-25-2021

Estimates of maize plant density from UAV RGB images using Faster-RCNN detection model: impact of the spatial resolution
by Kaaviya Velumani et al

05-26-2021

Blurs Make Results Clearer: Spatial Smoothings to Improve Accuracy, Uncertainty, and Robustness
by Namuk Park et al

05-25-2021

BoundarySqueeze: Image Segmentation as Boundary Squeezing
by Hao He et al

05-25-2021

Bridging Few-Shot Learning and Adaptation: New Challenges of Support-Query Shift
by Etienne Bennequin et al

05-27-2021

An Efficient Style Virtual Try on Network
by Shanchen Pang et al

05-25-2021

SB-GCN: Structured BREP Graph Convolutional Network for Automatic Mating of CAD Assemblies
by Benjamin Jones et al

05-27-2021

Graph-Based Deep Learning for Medical Diagnosis and Analysis: Past, Present and Future
by David Ahmedt-Aristizabal et al

05-26-2021

An Online Learning System for Wireless Charging Alignment using Surround-view Fisheye Cameras
by Ashok Dahal et al

05-26-2021

Multi-Modal Semantic Inconsistency Detection in Social Media News Posts
by Scott McCrae et al

05-26-2021

Low Resolution Information Also Matters: Learning Multi-Resolution Representations for Person Re-Identification
by Guoqing Zhang et al

05-28-2021

The Wits Intelligent Teaching System: Detecting Student Engagement During Lectures Using Convolutional Neural Networks
by Richard Klein et al

05-28-2021

A systematic review of transfer learning based approaches for diabetic retinopathy detection
by Burcu Oltu et al

05-25-2021

GAN for Vision, KG for Relation: a Two-stage Deep Network for Zero-shot Action Recognition
by Bin Sun et al

05-25-2021

ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents
by Weihong Lin et al

05-26-2021

RSCA: Real-time Segmentation-based Context-Aware Scene Text Detection
by Jiachen Li et al

05-25-2021

Style Similarity as Feedback for Product Design
by Mathew Schwartz et al

05-25-2021

Security in Next Generation Mobile Payment Systems: A Comprehensive Survey
by Waqas Ahmed et al

05-25-2021

Review on Indoor RGB-D Semantic Segmentation with Deep Convolutional Neural Networks
by Sami Barchid et al

05-27-2021

A Dataset for Provident Vehicle Detection at Night
by Sascha Saralajew et al

05-26-2021

PSGAN++: Robust Detail-Preserving Makeup Transfer and Removal
by Si Liu et al

05-26-2021

Context-aware Cross-level Fusion Network for Camouflaged Object Detection
by Yujia Sun et al

05-28-2021

Using Convolutional Neural Networks for Relative Pose Estimation of a Non-Cooperative Spacecraft with Thermal Infrared Imagery
by Maxwell Hogan et al

05-26-2021

CBANet: Towards Complexity and Bitrate Adaptive Deep Image Compression using a Single Network
by Jinyang Guo et al

05-25-2021

Deep learning-based bias transfer for overcoming laboratory differences of microscopic images
by Ann-Katrin Thebille et al

05-26-2021

Detecting Biological Locomotion in Video: A Computational Approach
by Soo Min Kang et al

05-26-2021

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search
by Yuxuan Han et al

05-26-2021

Edge Detection for Satellite Images without Deep Networks
by Joshua Abraham et al

05-26-2021

KLIEP-based Density Ratio Estimation for Semantically Consistent Synthetic to Real Images Adaptation in Urban Traffic Scenes
by Artem Savkin et al

05-25-2021

Towards Compact Single Image Super-Resolution via Contrastive Self-distillation
by Yanbo Wang et al

05-26-2021

DFPN: Deformable Frame Prediction Network
by M. Akın Yılmaz et al

05-25-2021

Towards Unpaired Depth Enhancement and Super-Resolution in the Wild
by Aleksandr Safin et al

05-25-2021

DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning
by Wenhao Wu et al

05-25-2021

PAS-MEF: Multi-exposure image fusion based on principal component analysis, adaptive well-exposedness and saliency map
by Diclehan Karakaya et al

05-27-2021

Pose2Drone: A Skeleton-Pose-based Framework for Human-Drone Interaction
by Zdravko Marinov et al

05-26-2021

What data do we need for training an AV motion planner?
by Long Chen et al

05-26-2021

Weighing Features of Lung and Heart Regions for Thoracic Disease Classification
by Jiansheng Fang et al

05-28-2021

Deception Detection in Videos using the Facial Action Coding System
by Hammad Ud Din Ahmed et al

05-26-2021

i3dLoc: Image-to-range Cross-domain Localization Robust to Inconsistent Environmental Conditions
by Peng Yin et al

05-26-2021

3D Segmentation Learning from Sparse Annotations and Hierarchical Descriptors
by Peng Yin et al

05-25-2021

Performance Analysis of a Foreground Segmentation Neural Network Model
by Joel Tomás Morais et al

05-26-2021

Permutation invariance and uncertainty in multitemporal image super-resolution
by Diego Valsesia et al

05-26-2021

Sli2Vol: Annotate a 3D Volume from a Single Slice with Self-Supervised Learning
by Pak-Hei Yeung et al

05-28-2021

Focus on Local: Detecting Lane Marker from Bottom Up via Key Point
by Zhan Qu et al

05-26-2021

Unsupervised Part Segmentation through Disentangling Appearance and Shape
by Shilong Liu et al

05-26-2021

Recent Standard Development Activities on Video Coding for Machines
by Wen Gao et al

05-27-2021

Embedded Vision for Self-Driving on Forest Roads
by Sorin Grigorescu et al

05-28-2021

The Herbarium 2021 Half-Earth Challenge Dataset
by Riccardo de Lutio et al

05-25-2021

A Geometry-Informed Deep Learning Framework for Ultra-Sparse 3D Tomographic Image Reconstruction
by Liyue Shen et al

05-25-2021

FNAS: Uncertainty-Aware Fast Neural Architecture Search
by Jihao Liu et al

05-28-2021

Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging
by S. Mahdi H. Miangoleh et al

05-25-2021

Tab.IAIS: Flexible Table Recognition and Semantic Interpretation System
by Marcin Namysl et al

05-25-2021

Few-Shot Learning with Part Discovery and Augmentation from Unlabeled Images
by Wentao Chen et al

05-25-2021

TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search
by Yawen Duan et al

05-28-2021

Learning Uncertainty For Safety-Oriented Semantic Segmentation In Autonomous Driving
by Victor Besnier et al

05-28-2021

Geometric Deep Learning and Equivariant Neural Networks
by Jan E. Gerken et al

05-26-2021

Towards an IMU-based Pen Online Handwriting Recognizer
by Mohamad Wehbi et al

05-26-2021

Adversarial robustness against multiple lplp-threat models at the price of one and how to quickly fine-tune robust models to another threat model
by Francesco Croce et al

05-25-2021

CoRSAI: A System for Robust Interpretation of CT Scans of COVID-19 Patients Using Deep Learning
by Manvel Avetisian et al

05-27-2021

Self-supervised Detransformation Autoencoder for Representation Learning in Open Set Recognition
by Jingyun Jia et al

05-27-2021

Unsupervised Activity Segmentation by Joint Representation Learning and Online Clustering
by Sateesh Kumar et al

05-27-2021

Tracking Without Re-recognition in Humans and Machines
by Drew Linsley et al

05-25-2021

ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos
by Meng-Jiun Chiou et al

05-28-2021

FReTAL: Generalizing Deepfake Detection using Knowledge Distillation and Representation Learning
by Minha Kim et al

05-26-2021

Using the Overlapping Score to Improve Corruption Benchmarks
by Alfred Laugros et al

05-28-2021

New Image Captioning Encoder via Semantic Visual Feature Matching for Heavy Rain Images
by Chang-Hwan Son et al

05-25-2021

DTNN: Energy-efficient Inference with Dendrite Tree Inspired Neural Networks for Edge Vision Applications
by Tao Luo et al

05-25-2021

GCNBoost: Artwork Classification by Label Propagation through a Knowledge Graph
by Cheikh Brahim El Vaigh et al

05-25-2021

Fast and Accurate Scene Parsing via Bi-direction Alignment Networks
by Yanran Wu et al

05-27-2021

Drawing Multiple Augmentation Samples Per Image During Training Efficiently Decreases Test Error
by Stanislav Fort et al

05-28-2021

MODISSA: a multipurpose platform for the prototypical realization of vehicle-related applications using optical sensors
by Björn Borgmann et al

05-25-2021

Deep High-Resolution Representation Learning for Cross-Resolution Person Re-identification
by Guoqing Zhang et al

05-26-2021

How to Calibrate Your Event Camera
by Manasi Muglikar et al

05-25-2021

Occlusion Aware Kernel Correlation Filter Tracker using RGB-D
by Srishti Yadav

05-27-2021

Stylizing 3D Scene via Implicit Representation and HyperNetwork
by Pei-Ze Chiang et al

05-28-2021

Demotivate adversarial defense in remote sensing
by Adrien Chan-Hon-Tong et al

05-28-2021

Training of SSD(Single Shot Detector) for Facial Detection using Nvidia Jetson Nano
by Saif Ur Rehman et al

05-26-2021

Predict then Interpolate: A Simple Algorithm to Learn Stable Classifiers
by Yujia Bao et al

05-28-2021

Iris Liveness Detection using a Cascade of Dedicated Deep Learning Networks
by Juan Tapia et al

05-26-2021

Computer Vision and Conflicting Values: Describing People with Automated Alt Text
by Margot Hanley et al

05-27-2021

2nd Place Solution for IJCAI-PRICAI 2020 3D AI Challenge: 3D Object Reconstruction from A Single Image
by Yichen Cao et al

05-28-2021

Recursive Contour Saliency Blending Network for Accurate Salient Object Detection
by Yi Ke Yun et al

05-27-2021

When Liebigs Barrel Meets Facial Landmark Detection: A Practical Model
by Haibo Jin et al

05-27-2021

Classification and Uncertainty Quantification of Corrupted Data using Semi-Supervised Autoencoders
by Philipp Joppich et al

05-27-2021

PSRR-MaxpoolNMS: Pyramid Shifted MaxpoolNMS with Relationship Recovery
by Tianyi Zhang et al

05-27-2021

Cardiac Segmentation on CT Images through Shape-Aware Contour Attentions
by Sanguk Park et al

05-25-2021

SBEVNet: End-to-End Deep Stereo Layout Estimation
by Divam Gupta et al

05-26-2021

cofga: A Dataset for Fine Grained Classification of Objects from Aerial Imagery
by Eran Dahan et al

05-26-2021

Predicting invasive ductal carcinoma using a Reinforcement Sample Learning Strategy using Deep Learning
by Rushabh Patel

05-27-2021

Recent advances and clinical applications of deep learning in medical image analysis
by Xuxin Chen et al

05-27-2021

Type III solar radio burst detection and classification: A deep learning approach
by Jeremiah Scully et al

05-26-2021

Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey
by Feifei Shao et al

05-27-2021

Feature Reuse and Fusion for Real-time Semantic segmentation
by Tan Sixiang

05-25-2021

AutoReCon: Neural Architecture Search-based Reconstruction for Data-free Compression
by Baozhou Zhu et al

05-28-2021

Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation
by Taosha Fan et al

05-26-2021

ViPTT-Net: Video pretraining of spatio-temporal model for tuberculosis type classification from chest CT scans
by Hasib Zunair et al

05-26-2021

Pattern Detection in the Activation Space for Identifying Synthesized Content
by Celia Cintas et al

05-26-2021

DSLR: Dynamic to Static LiDAR Scan Reconstruction Using Adversarially Trained Autoencoder
by Prashant Kumar et al

05-27-2021

One-shot Learning with Absolute Generalization
by Hao Su

05-28-2021

Chromatic and spatial analysis of one-pixel attacks against an image classifier
by Janne Alatalo et al

05-26-2021

Improving Sign Language Translation with Monolingual Data by Sign Back-Translation
by Hao Zhou et al

05-25-2021

Emotion Recognition in Horses with Convolutional Neural Networks
by Luis A. Corujo et al

05-25-2021

Learning Generative Prior with Latent Space Sparsity Constraints
by Vinayak Killedar et al

05-26-2021

SimNet: Learning Reactive Self-driving Simulations from Real-world Observations
by Luca Bergamini et al

05-27-2021

Joint-DetNAS: Upgrade Your Detector with NAS, Pruning and Dynamic Distillation
by Lewei Yao et al

05-25-2021

Hyperspectral Image Denoising with Log-Based Robust PCA
by Yang Liu et al

05-25-2021

Real-time Monocular Depth Estimation with Sparse Supervision on Mobile
by Mehmet Kerim Yucel et al

05-26-2021

Issues in Object Detection in Videos using Common Single-Image CNNs
by Spencer Ploeger et al

05-26-2021

Unsupervised Video Summarization via Multi-source Features
by Hussain Kanafani et al

05-28-2021

Semi-supervised Anatomical Landmark Detection via Shape-regulated Self-training
by Runnan Chen et al

05-26-2021

Social-IWSTCNN: A Social Interaction-Weighted Spatio-Temporal Convolutional Neural Network for Pedestrian Trajectory Prediction in Urban Traffic Scenarios
by Chi Zhang et al

05-28-2021

DeepTag: A General Framework for Fiducial Marker Design and Detection
by Zhuming Zhang et al

05-26-2021

Calibrated prediction in and out-of-domain for state-of-the-art saliency modeling
by Akis Linardos et al

05-25-2021

Improving Few-shot Learning with Weakly-supervised Object Localization
by Inyong Koo et al

05-26-2021

Image-Based Plant Wilting Estimation
by Changye Yang et al

05-25-2021

Understanding Mobile GUI: from Pixel-Words to Screen-Sentences
by Jingwen Fu et al

05-26-2021

Learning to Detect Fortified Areas
by Allan Grønlund et al

05-27-2021

Learning Dynamic Graph Representation of Brain Connectome with Spatio-Temporal Attention
by Byung-Hoon Kim et al

05-25-2021

High-Frequency aware Perceptual Image Enhancement
by Hyungmin Roh et al

05-28-2021

On Hamilton-Jacobi PDEs and image denoising models with certain non-additive noise
by Jérôme Darbon et al

05-26-2021

Enhance to Read Better: An Improved Generative Adversarial Network for Handwritten Document Image Enhancement
by Sana Khamekhem Jemni et al

05-26-2021

Spatio-Contextual Deep Network Based Multimodal Pedestrian Detection For Autonomous Driving
by Kinjal Dasgupta et al

05-27-2021

ECG Heart-beat Classification Using Multimodal Image Fusion
by Zeeshan Ahmad et al

05-27-2021

Inertial Sensor Data To Image Encoding For Human Action Recognition
by Zeeshan Ahmad et al

05-27-2021

Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task
by Darwin Saire et al

05-27-2021

FastRIFE: Optimization of Real-Time Intermediate Flow Estimation for Video Frame Interpolation
by Malwina Kubas et al

05-25-2021

Self-Guided Instance-Aware Network for Depth Completion and Enhancement
by Zhongzhen Luo et al

05-25-2021

Learning a Model-Driven Variational Network for Deformable Image Registration
by Xi Jia et al

05-27-2021

The Imaginative Generative Adversarial Network: Automatic Data Augmentation for Dynamic Skeleton-Based Hand Gesture and Human Action Recognition
by Junxiao Shen et al

05-27-2021

Efficient High-Resolution Image-to-Image Translation using Multi-Scale Gradient U-Net
by Kumarapu Laxman et al

05-27-2021

How saccadic vision might help with theinterpretability of deep networks
by Iana Sereda et al

05-27-2021

ICDAR 2021 Competition on Historical Map Segmentation
by Joseph Chazalon et al

05-26-2021

YOLO5Face: Why Reinventing a Face Detector
by Delong Qi et al

05-27-2021

Training With Data Dependent Dynamic Learning Rates
by Shreyas Saxena et al

05-28-2021

Improving Facial Attribute Recognition by Group and Graph Learning
by Zhenghao Chen et al

05-28-2021

Linguistic Structures as Weak Supervision for Visual Scene Graph Generation
by Keren Ye et al

05-25-2021

Dense Regression Activation Maps For Lesion Segmentation in CT scans of COVID-19 patients
by Weiyi Xie et al

05-26-2021

Aggregating Nested Transformers
by Zizhao Zhang et al

05-26-2021

Smile Like You Mean It: Driving Animatronic Robotic Face with Learned Models
by Boyuan Chen et al

05-26-2021

Anticipating human actions by correlating past with the future with Jaccard similarity measures
by Basura Fernando et al

 
Craig Smith