2021.5.31 Vision papers

05-26-2021	CogView: Mastering Text-to-Image Generation via Transformers by Ming Ding et al
05-25-2021	Adversarial Attack Driven Data Augmentation for Accurate And Robust Medical Image Segmentation by Mst. Tasnim Pervin et al
05-25-2021	The Nonlinearity Coefficient -- A Practical Guide to Neural Architecture Design by George Philipp
05-28-2021	NViSII: A Scriptable Tool for Photorealistic Image Generation by Nathan Morrical et al
05-27-2021	Blind Motion Deblurring Super-Resolution: When Dynamic Spatio-Temporal Learning Meets Static Image Understanding by Wenjia Niu et al
05-26-2021	Self-Ensembling Contrastive Learning for Semi-Supervised Medical Image Segmentation by Jinxi Xiang et al
05-26-2021	Robust Navigation for Racing Drones based on Imitation Learning and Modularization by Tianqi Wang et al
05-25-2021	Self-Organized Variational Autoencoders (Self-VAE) for Learned Image Compression by M. Akın Yılmaz et al
05-28-2021	AutoSampling: Search for Effective Data Sampling Schedules by Ming Sun et al
05-28-2021	ResT: An Efficient Transformer for Visual Recognition by Qinglong Zhang et al
05-28-2021	What Is Considered Complete for Visual Recognition? by Lingxi Xie et al
05-27-2021	Unsupervised Domain Adaption of Object Detectors: A Survey by Poojan Oza et al
05-27-2021	Using Early-Learning Regularization to Classify Real-World Noisy Data by Alessio Galatolo et al
05-27-2021	SSAN: Separable Self-Attention Network for Video Representation Learning by Xudong Guo et al
05-25-2021	Calibration and Uncertainty Quantification of Bayesian Convolutional Neural Networks for Geophysical Applications by Lukas Mosser et al
05-27-2021	Passing Multi-Channel Material Textures to a 3-Channel Loss by Thomas Chambon et al
05-26-2021	Multiple Domain Experts Collaborative Learning: Multi-Source Domain Generalization For Person Re-Identification by Shijie Yu et al
05-28-2021	EDEN: Deep Feature Distribution Pooling for Saimaa Ringed Seals Pattern Matching by Ilja Chelak et al
05-26-2021	Towards Transparent Application of Machine Learning in Video Processing by Luka Murn et al
05-25-2021	Bridging the Gap Between Explainable AI and Uncertainty Quantification to Enhance Trustability by Dominik Seuß
05-27-2021	HDRUNet: Single Image HDR Reconstruction with Denoising and Dequantization by Xiangyu Chen et al
05-27-2021	Unsupervised Adaptive Semantic Segmentation with Local Lipschitz Constraint by Guanyu Cai et al
05-28-2021	Learning Relation Alignment for Calibrated Cross-modal Retrieval by Shuhuai Ren et al
05-27-2021	GuideMe: A Mobile Application based on Global Positioning System and Object Recognition Towards a Smart Tourist Guide by Wadii Boulila et al
05-27-2021	Dynamic Network selection for the Object Detection task: why it matters and what we (didnt) achieve by Emanuele Vitali et al
05-25-2021	Small and large scale critical infrastructures detection based on deep learning using high resolution orthogonal images by Pérez-Hernández Francisco et al
05-25-2021	Matching Targets Across Domains with RADON, the Re-Identification Across Domain Network by Cassandra Burgess et al
05-27-2021	Learning to Stylize Novel Views by Hsin-Ping Huang et al
05-26-2021	On the Advantages of Multiple Stereo Vision Camera Designs for Autonomous Drone Navigation by Rui Pimentel de Figueiredo et al
05-25-2021	Optimal ANN-SNN Conversion for Fast and Accurate Inference in Deep Spiking Neural Networks by Jianhao Ding et al
05-25-2021	Dynamic Dual Sampling Module for Fine-Grained Semantic Segmentation by Chen Shi et al
05-25-2021	FINNger -- Applying artificial intelligence to ease math learning for children by Rafael Baldasso Audibert et al
05-26-2021	Benchmarking Scientific Image Forgery Detectors by João P. Cardenuto et al
05-25-2021	Graph Self Supervised Learning: the BT, the HSIC, and the VICReg by Sayan Nag
05-28-2021	PTNet: A High-Resolution Infant MRI Synthesizer Based on Transformer by Xuzhe Zhang et al
05-25-2021	Temporal Action Proposal Generation with Transformers by Lining Wang et al
05-26-2021	Dynamic Probabilistic Pruning: A general framework for hardware-constrained pruning at different granularities by Lizeth Gonzalez-Carabarin et al
05-25-2021	Estimates of maize plant density from UAV RGB images using Faster-RCNN detection model: impact of the spatial resolution by Kaaviya Velumani et al
05-26-2021	Blurs Make Results Clearer: Spatial Smoothings to Improve Accuracy, Uncertainty, and Robustness by Namuk Park et al
05-25-2021	BoundarySqueeze: Image Segmentation as Boundary Squeezing by Hao He et al
05-25-2021	Bridging Few-Shot Learning and Adaptation: New Challenges of Support-Query Shift by Etienne Bennequin et al
05-27-2021	An Efficient Style Virtual Try on Network by Shanchen Pang et al
05-25-2021	SB-GCN: Structured BREP Graph Convolutional Network for Automatic Mating of CAD Assemblies by Benjamin Jones et al
05-27-2021	Graph-Based Deep Learning for Medical Diagnosis and Analysis: Past, Present and Future by David Ahmedt-Aristizabal et al
05-26-2021	An Online Learning System for Wireless Charging Alignment using Surround-view Fisheye Cameras by Ashok Dahal et al
05-26-2021	Multi-Modal Semantic Inconsistency Detection in Social Media News Posts by Scott McCrae et al
05-26-2021	Low Resolution Information Also Matters: Learning Multi-Resolution Representations for Person Re-Identification by Guoqing Zhang et al
05-28-2021	The Wits Intelligent Teaching System: Detecting Student Engagement During Lectures Using Convolutional Neural Networks by Richard Klein et al
05-28-2021	A systematic review of transfer learning based approaches for diabetic retinopathy detection by Burcu Oltu et al
05-25-2021	GAN for Vision, KG for Relation: a Two-stage Deep Network for Zero-shot Action Recognition by Bin Sun et al
05-25-2021	ViBERTgrid: A Jointly Trained Multi-Modal 2D Document Representation for Key Information Extraction from Documents by Weihong Lin et al
05-26-2021	RSCA: Real-time Segmentation-based Context-Aware Scene Text Detection by Jiachen Li et al
05-25-2021	Style Similarity as Feedback for Product Design by Mathew Schwartz et al
05-25-2021	Security in Next Generation Mobile Payment Systems: A Comprehensive Survey by Waqas Ahmed et al
05-25-2021	Review on Indoor RGB-D Semantic Segmentation with Deep Convolutional Neural Networks by Sami Barchid et al
05-27-2021	A Dataset for Provident Vehicle Detection at Night by Sascha Saralajew et al
05-26-2021	PSGAN++: Robust Detail-Preserving Makeup Transfer and Removal by Si Liu et al
05-26-2021	Context-aware Cross-level Fusion Network for Camouflaged Object Detection by Yujia Sun et al
05-28-2021	Using Convolutional Neural Networks for Relative Pose Estimation of a Non-Cooperative Spacecraft with Thermal Infrared Imagery by Maxwell Hogan et al
05-26-2021	CBANet: Towards Complexity and Bitrate Adaptive Deep Image Compression using a Single Network by Jinyang Guo et al
05-25-2021	Deep learning-based bias transfer for overcoming laboratory differences of microscopic images by Ann-Katrin Thebille et al
05-26-2021	Detecting Biological Locomotion in Video: A Computational Approach by Soo Min Kang et al
05-26-2021	Disentangled Face Attribute Editing via Instance-Aware Latent Space Search by Yuxuan Han et al
05-26-2021	Edge Detection for Satellite Images without Deep Networks by Joshua Abraham et al
05-26-2021	KLIEP-based Density Ratio Estimation for Semantically Consistent Synthetic to Real Images Adaptation in Urban Traffic Scenes by Artem Savkin et al
05-25-2021	Towards Compact Single Image Super-Resolution via Contrastive Self-distillation by Yanbo Wang et al
05-26-2021	DFPN: Deformable Frame Prediction Network by M. Akın Yılmaz et al
05-25-2021	Towards Unpaired Depth Enhancement and Super-Resolution in the Wild by Aleksandr Safin et al
05-25-2021	DSANet: Dynamic Segment Aggregation Network for Video-Level Representation Learning by Wenhao Wu et al
05-25-2021	PAS-MEF: Multi-exposure image fusion based on principal component analysis, adaptive well-exposedness and saliency map by Diclehan Karakaya et al
05-27-2021	Pose2Drone: A Skeleton-Pose-based Framework for Human-Drone Interaction by Zdravko Marinov et al
05-26-2021	What data do we need for training an AV motion planner? by Long Chen et al
05-26-2021	Weighing Features of Lung and Heart Regions for Thoracic Disease Classification by Jiansheng Fang et al
05-28-2021	Deception Detection in Videos using the Facial Action Coding System by Hammad Ud Din Ahmed et al
05-26-2021	i3dLoc: Image-to-range Cross-domain Localization Robust to Inconsistent Environmental Conditions by Peng Yin et al
05-26-2021	3D Segmentation Learning from Sparse Annotations and Hierarchical Descriptors by Peng Yin et al
05-25-2021	Performance Analysis of a Foreground Segmentation Neural Network Model by Joel Tomás Morais et al
05-26-2021	Permutation invariance and uncertainty in multitemporal image super-resolution by Diego Valsesia et al
05-26-2021	Sli2Vol: Annotate a 3D Volume from a Single Slice with Self-Supervised Learning by Pak-Hei Yeung et al
05-28-2021	Focus on Local: Detecting Lane Marker from Bottom Up via Key Point by Zhan Qu et al
05-26-2021	Unsupervised Part Segmentation through Disentangling Appearance and Shape by Shilong Liu et al
05-26-2021	Recent Standard Development Activities on Video Coding for Machines by Wen Gao et al
05-27-2021	Embedded Vision for Self-Driving on Forest Roads by Sorin Grigorescu et al
05-28-2021	The Herbarium 2021 Half-Earth Challenge Dataset by Riccardo de Lutio et al
05-25-2021	A Geometry-Informed Deep Learning Framework for Ultra-Sparse 3D Tomographic Image Reconstruction by Liyue Shen et al
05-25-2021	FNAS: Uncertainty-Aware Fast Neural Architecture Search by Jihao Liu et al
05-28-2021	Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging by S. Mahdi H. Miangoleh et al
05-25-2021	Tab.IAIS: Flexible Table Recognition and Semantic Interpretation System by Marcin Namysl et al
05-25-2021	Few-Shot Learning with Part Discovery and Augmentation from Unlabeled Images by Wentao Chen et al
05-25-2021	TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search by Yawen Duan et al
05-28-2021	Learning Uncertainty For Safety-Oriented Semantic Segmentation In Autonomous Driving by Victor Besnier et al
05-28-2021	Geometric Deep Learning and Equivariant Neural Networks by Jan E. Gerken et al
05-26-2021	Towards an IMU-based Pen Online Handwriting Recognizer by Mohamad Wehbi et al
05-26-2021	Adversarial robustness against multiple lplp-threat models at the price of one and how to quickly fine-tune robust models to another threat model by Francesco Croce et al
05-25-2021	CoRSAI: A System for Robust Interpretation of CT Scans of COVID-19 Patients Using Deep Learning by Manvel Avetisian et al
05-27-2021	Self-supervised Detransformation Autoencoder for Representation Learning in Open Set Recognition by Jingyun Jia et al
05-27-2021	Unsupervised Activity Segmentation by Joint Representation Learning and Online Clustering by Sateesh Kumar et al
05-27-2021	Tracking Without Re-recognition in Humans and Machines by Drew Linsley et al
05-25-2021	ST-HOI: A Spatial-Temporal Baseline for Human-Object Interaction Detection in Videos by Meng-Jiun Chiou et al
05-28-2021	FReTAL: Generalizing Deepfake Detection using Knowledge Distillation and Representation Learning by Minha Kim et al
05-26-2021	Using the Overlapping Score to Improve Corruption Benchmarks by Alfred Laugros et al
05-28-2021	New Image Captioning Encoder via Semantic Visual Feature Matching for Heavy Rain Images by Chang-Hwan Son et al
05-25-2021	DTNN: Energy-efficient Inference with Dendrite Tree Inspired Neural Networks for Edge Vision Applications by Tao Luo et al
05-25-2021	GCNBoost: Artwork Classification by Label Propagation through a Knowledge Graph by Cheikh Brahim El Vaigh et al
05-25-2021	Fast and Accurate Scene Parsing via Bi-direction Alignment Networks by Yanran Wu et al
05-27-2021	Drawing Multiple Augmentation Samples Per Image During Training Efficiently Decreases Test Error by Stanislav Fort et al
05-28-2021	MODISSA: a multipurpose platform for the prototypical realization of vehicle-related applications using optical sensors by Björn Borgmann et al
05-25-2021	Deep High-Resolution Representation Learning for Cross-Resolution Person Re-identification by Guoqing Zhang et al
05-26-2021	How to Calibrate Your Event Camera by Manasi Muglikar et al
05-25-2021	Occlusion Aware Kernel Correlation Filter Tracker using RGB-D by Srishti Yadav
05-27-2021	Stylizing 3D Scene via Implicit Representation and HyperNetwork by Pei-Ze Chiang et al
05-28-2021	Demotivate adversarial defense in remote sensing by Adrien Chan-Hon-Tong et al
05-28-2021	Training of SSD(Single Shot Detector) for Facial Detection using Nvidia Jetson Nano by Saif Ur Rehman et al
05-26-2021	Predict then Interpolate: A Simple Algorithm to Learn Stable Classifiers by Yujia Bao et al
05-28-2021	Iris Liveness Detection using a Cascade of Dedicated Deep Learning Networks by Juan Tapia et al
05-26-2021	Computer Vision and Conflicting Values: Describing People with Automated Alt Text by Margot Hanley et al
05-27-2021	2nd Place Solution for IJCAI-PRICAI 2020 3D AI Challenge: 3D Object Reconstruction from A Single Image by Yichen Cao et al
05-28-2021	Recursive Contour Saliency Blending Network for Accurate Salient Object Detection by Yi Ke Yun et al
05-27-2021	When Liebigs Barrel Meets Facial Landmark Detection: A Practical Model by Haibo Jin et al
05-27-2021	Classification and Uncertainty Quantification of Corrupted Data using Semi-Supervised Autoencoders by Philipp Joppich et al
05-27-2021	PSRR-MaxpoolNMS: Pyramid Shifted MaxpoolNMS with Relationship Recovery by Tianyi Zhang et al
05-27-2021	Cardiac Segmentation on CT Images through Shape-Aware Contour Attentions by Sanguk Park et al
05-25-2021	SBEVNet: End-to-End Deep Stereo Layout Estimation by Divam Gupta et al
05-26-2021	cofga: A Dataset for Fine Grained Classification of Objects from Aerial Imagery by Eran Dahan et al
05-26-2021	Predicting invasive ductal carcinoma using a Reinforcement Sample Learning Strategy using Deep Learning by Rushabh Patel
05-27-2021	Recent advances and clinical applications of deep learning in medical image analysis by Xuxin Chen et al
05-27-2021	Type III solar radio burst detection and classification: A deep learning approach by Jeremiah Scully et al
05-26-2021	Deep Learning for Weakly-Supervised Object Detection and Object Localization: A Survey by Feifei Shao et al
05-27-2021	Feature Reuse and Fusion for Real-time Semantic segmentation by Tan Sixiang
05-25-2021	AutoReCon: Neural Architecture Search-based Reconstruction for Data-free Compression by Baozhou Zhu et al
05-28-2021	Revitalizing Optimization for 3D Human Pose and Shape Estimation: A Sparse Constrained Formulation by Taosha Fan et al
05-26-2021	ViPTT-Net: Video pretraining of spatio-temporal model for tuberculosis type classification from chest CT scans by Hasib Zunair et al
05-26-2021	Pattern Detection in the Activation Space for Identifying Synthesized Content by Celia Cintas et al
05-26-2021	DSLR: Dynamic to Static LiDAR Scan Reconstruction Using Adversarially Trained Autoencoder by Prashant Kumar et al
05-27-2021	One-shot Learning with Absolute Generalization by Hao Su
05-28-2021	Chromatic and spatial analysis of one-pixel attacks against an image classifier by Janne Alatalo et al
05-26-2021	Improving Sign Language Translation with Monolingual Data by Sign Back-Translation by Hao Zhou et al
05-25-2021	Emotion Recognition in Horses with Convolutional Neural Networks by Luis A. Corujo et al
05-25-2021	Learning Generative Prior with Latent Space Sparsity Constraints by Vinayak Killedar et al
05-26-2021	SimNet: Learning Reactive Self-driving Simulations from Real-world Observations by Luca Bergamini et al
05-27-2021	Joint-DetNAS: Upgrade Your Detector with NAS, Pruning and Dynamic Distillation by Lewei Yao et al
05-25-2021	Hyperspectral Image Denoising with Log-Based Robust PCA by Yang Liu et al
05-25-2021	Real-time Monocular Depth Estimation with Sparse Supervision on Mobile by Mehmet Kerim Yucel et al
05-26-2021	Issues in Object Detection in Videos using Common Single-Image CNNs by Spencer Ploeger et al
05-26-2021	Unsupervised Video Summarization via Multi-source Features by Hussain Kanafani et al
05-28-2021	Semi-supervised Anatomical Landmark Detection via Shape-regulated Self-training by Runnan Chen et al
05-26-2021	Social-IWSTCNN: A Social Interaction-Weighted Spatio-Temporal Convolutional Neural Network for Pedestrian Trajectory Prediction in Urban Traffic Scenarios by Chi Zhang et al
05-28-2021	DeepTag: A General Framework for Fiducial Marker Design and Detection by Zhuming Zhang et al
05-26-2021	Calibrated prediction in and out-of-domain for state-of-the-art saliency modeling by Akis Linardos et al
05-25-2021	Improving Few-shot Learning with Weakly-supervised Object Localization by Inyong Koo et al
05-26-2021	Image-Based Plant Wilting Estimation by Changye Yang et al
05-25-2021	Understanding Mobile GUI: from Pixel-Words to Screen-Sentences by Jingwen Fu et al
05-26-2021	Learning to Detect Fortified Areas by Allan Grønlund et al
05-27-2021	Learning Dynamic Graph Representation of Brain Connectome with Spatio-Temporal Attention by Byung-Hoon Kim et al
05-25-2021	High-Frequency aware Perceptual Image Enhancement by Hyungmin Roh et al
05-28-2021	On Hamilton-Jacobi PDEs and image denoising models with certain non-additive noise by Jérôme Darbon et al
05-26-2021	Enhance to Read Better: An Improved Generative Adversarial Network for Handwritten Document Image Enhancement by Sana Khamekhem Jemni et al
05-26-2021	Spatio-Contextual Deep Network Based Multimodal Pedestrian Detection For Autonomous Driving by Kinjal Dasgupta et al
05-27-2021	ECG Heart-beat Classification Using Multimodal Image Fusion by Zeeshan Ahmad et al
05-27-2021	Inertial Sensor Data To Image Encoding For Human Action Recognition by Zeeshan Ahmad et al
05-27-2021	Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task by Darwin Saire et al
05-27-2021	FastRIFE: Optimization of Real-Time Intermediate Flow Estimation for Video Frame Interpolation by Malwina Kubas et al
05-25-2021	Self-Guided Instance-Aware Network for Depth Completion and Enhancement by Zhongzhen Luo et al
05-25-2021	Learning a Model-Driven Variational Network for Deformable Image Registration by Xi Jia et al
05-27-2021	The Imaginative Generative Adversarial Network: Automatic Data Augmentation for Dynamic Skeleton-Based Hand Gesture and Human Action Recognition by Junxiao Shen et al
05-27-2021	Efficient High-Resolution Image-to-Image Translation using Multi-Scale Gradient U-Net by Kumarapu Laxman et al
05-27-2021	How saccadic vision might help with theinterpretability of deep networks by Iana Sereda et al
05-27-2021	ICDAR 2021 Competition on Historical Map Segmentation by Joseph Chazalon et al
05-26-2021	YOLO5Face: Why Reinventing a Face Detector by Delong Qi et al
05-27-2021	Training With Data Dependent Dynamic Learning Rates by Shreyas Saxena et al
05-28-2021	Improving Facial Attribute Recognition by Group and Graph Learning by Zhenghao Chen et al
05-28-2021	Linguistic Structures as Weak Supervision for Visual Scene Graph Generation by Keren Ye et al
05-25-2021	Dense Regression Activation Maps For Lesion Segmentation in CT scans of COVID-19 patients by Weiyi Xie et al
05-26-2021	Aggregating Nested Transformers by Zizhao Zhang et al
05-26-2021	Smile Like You Mean It: Driving Animatronic Robotic Face with Learned Models by Boyuan Chen et al
05-26-2021	Anticipating human actions by correlating past with the future with Jaccard similarity measures by Basura Fernando et al

Craig SmithMay 31, 2021