2021.1.2 Vision papers

01-27-2021	Bottleneck Transformers for Visual Recognition by Aravind Srinivas et al
01-26-2021	Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes by Towaki Takikawa et al
01-28-2021	Playable Video Generation by Willi Menapace et al
01-27-2021	Automated femur segmentation from computed tomography images using a deep neural network by P. A. Bjornsson et al
01-27-2021	DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents by Tsu-Jui Fu et al
01-28-2021	Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet by Li Yuan et al
01-28-2021	The Role of Syntactic Planning in Compositional Image Captioning by Emanuele Bugliarello et al
01-27-2021	VisualMRC: Machine Reading Comprehension on Document Images by Ryota Tanaka et al
01-26-2021	Automatic Comic Generation with Stylistic Multi-page Layouts and Emotion-driven Text Balloon Generation by Xin Yang et al
01-26-2021	CPTR: Full Transformer Network for Image Captioning by Wei Liu et al
01-28-2021	Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs by Xudong Lin et al
01-27-2021	Object Detection Made Simpler by Eliminating Heuristic NMS by Qiang Zhou et al
01-27-2021	CNN with large memory layers by Rasul Karimov et al
01-29-2021	Efficient-CapsNet: Capsule Network with Self-Attention Routing by Vittorio Mazzia et al
01-27-2021	Multi-Modal Aesthetic Assessment for MObile Gaming Image by Zhenyu Lei et al
01-27-2021	Assessing the applicability of Deep Learning-based visible-infrared fusion methods for fire imagery by J. F. Ciprián-Sánchez et al
01-28-2021	Exploring Cross-Image Pixel Contrast for Semantic Segmentation by Wenguan Wang et al
01-28-2021	PIG-Net: Inception based Deep Learning Architecture for 3D Point Cloud Segmentation by Sindhu Hegde et al
01-27-2021	Puzzle-CAM: Improved localization via matching partial and full features by Sanghyun Jo et al
01-27-2021	Deep Image Retrieval: A Survey by Wei Chen et al
01-28-2021	Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss by Xue Yang et al
01-27-2021	Augmenting Proposals by the Detector Itself by Xiaopei Wan et al
01-28-2021	Domain Adaptation by Topology Regularization by Deborah Weeks et al
01-26-2021	Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation by Pan Zhang et al
01-27-2021	NTU60-X: Towards Skeleton-based Recognition of Subtle Human Actions by Anirudh Thatipelli et al
01-27-2021	Chronological age estimation of lateral cephalometric radiographs with deep learning by Ningtao Liu
01-27-2021	Generative Multi-Label Zero-Shot Learning by Akshita Gupta et al
01-27-2021	SwingBot: Learning Physical Features from In-hand Tactile Exploration for Dynamic Swing-up Manipulation by Chen Wang et al
01-27-2021	Learning task-agnostic representation via toddler-inspired learning by Kwanyoung Park et al
01-28-2021	The Hidden Tasks of Generative Adversarial Networks: An Alternative Perspective on GAN Training by Romann M. Weber
01-26-2021	Deep Burst Super-Resolution by Goutam Bhat et al
01-28-2021	An Explainable AI System for Automated COVID-19 Assessment and Lesion Categorization from CT-scans by Matteo Pennisi et al
01-28-2021	Self-Attention Meta-Learner for Continual Learning by Ghada Sokar et al
01-27-2021	Multi-Hypothesis Pose Networks: Rethinking Top-Down Pose Estimation by Rawal Khirodkar et al
01-28-2021	Self-supervised Cross-silo Federated Neural Architecture Search by Xinle Liang et al
01-26-2021	Introducing and assessing the explainable AI (XAI)method: SIDU by Satya M. Muddamsetty et al
01-28-2021	Reducing ReLU Count for Privacy-Preserving CNN Speedup by Inbar Helbitz et al
01-27-2021	Convolutional Neural Network-Based Age Estimation Using B-Mode Ultrasound Tongue Image by Kele Xu et al
01-28-2021	COMPAS: Representation Learning with Compositional Part Sharing for Few-Shot Classification by Ju He et al
01-27-2021	Learning Non-linear Wavelet Transformation via Normalizing Flow by Shuo-Hui Li
01-27-2021	Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network by Yehao Li et al
01-28-2021	NeMo: Neural Mesh Models of Contrastive Features for Robust 3D Pose Estimation by Angtian Wang et al
01-27-2021	Im2Mesh GAN: Accurate 3D Hand Mesh Recovery from a Single RGB Image by Akila Pemasiri et al
01-28-2021	Generalising via Meta-Examples for Continual Learning in the Wild by Alessia Bertugli et al
01-27-2021	Efficient Video Summarization Framework using EEG and Eye-tracking Signals by Sai Sukruth Bezugam et al
01-27-2021	Meta Adversarial Training by Jan Hendrik Metzen et al
01-26-2021	Boosting Segmentation Performance across datasets using histogram specification with application to pelvic bone segmentation by Prabhakara Subramanya Jois et al
01-27-2021	Automatic Detection of Occulted Hard X-ray Flares Using Deep-Learning Methods by Shin-nosuke Ishikawa et al
01-28-2021	Neural Particle Image Velocimetry by Nikolay Stulov et al
01-26-2021	On the Importance of Capturing a Sufficient Diversity of Perspective for the Classification of micro-PCBs by Adam Byerly et al
01-26-2021	Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning by Sangho Lee et al
01-27-2021	Reciprocal Landmark Detection and Tracking with Extremely Few Annotations by Jianzhe Lin et al
01-27-2021	A Multi-Scale Conditional Deep Model for Tumor Cell Ratio Counting by Eric Cosatto et al
01-27-2021	Self-Calibrating Active Binocular Vision via Active Efficient Coding with Deep Autoencoders by Charles Wilmot et al
01-27-2021	TorchPRISM: Principal Image Sections Mapping, a novel method for Convolutional Neural Network features visualization by Tomasz Szandala
01-26-2021	Learning Spatial and Spatio-Temporal Pixel Aggregations for Image and Video Denoising by Xiangyu Xu et al
01-28-2021	Neural Architecture Search with Random Labels by Xuanyang Zhang et al
01-27-2021	Syntactically Guided Generative Embeddings for Zero-Shot Skeleton Action Recognition by Pranay Gupta et al
01-26-2021	ResLT: Residual Learning for Long-tailed Recognition by Jiequan Cui et al
01-27-2021	Utilizing Uncertainty Estimation in Deep Learning Segmentation of Fluorescence Microscopy Images with Missing Markers by Alvaro Gomariz et al
01-27-2021	Easy-GT: Open-Source Software to Facilitate Making the Ground Truth for White Blood Cells Nucleus by Seyedeh-Zahra Mousavi Kouzehkanan et al
01-26-2021	Developing emotion recognition for video conference software to support people with autism by Marc Franzen et al
01-26-2021	Malware Detection Using Frequency Domain-Based Image Visualization and Deep Learning by Tajuddin Manhar Mohammed et al
01-26-2021	LIGHTS: LIGHT Specularity Dataset for specular detection in Multi-view by Mohamed Dahy Elkhouly et al
01-28-2021	VAE^2: Preventing Posterior Collapse of Variational Video Predictions in the Wild by Yizhou Zhou et al
01-26-2021	Uncertainty aware and explainable diagnosis of retinal disease by Amitojdeep Singh et al
01-27-2021	e-ACJ: Accurate Junction Extraction For Event Cameras by Zhihao Liu et al
01-27-2021	Detecting Adversarial Examples by Input Transformations, Defense Perturbations, and Voting by Federico Nesti et al
01-26-2021	Blind Image Denoising and Inpainting Using Robust Hadamard Autoencoders by Rasika Karkare et al
01-27-2021	HDIB1M -- Handwritten Document Image Binarization 1 Million Dataset by Kaustubh Sadekar et al
01-27-2021	Towards Improving the Consistency, Efficiency, and Flexibility of Differentiable Neural Architecture Search by Yibo Yang et al
01-28-2021	Fusion Moves for Graph Matching by Lisa Hutschenreiter et al
01-28-2021	Discriminative Appearance Modeling with Multi-track Pooling for Real-time Multi-object Tracking by Chanho Kim et al
01-26-2021	Semi-synthesis: A fast way to produce effective datasets for stereo matching by Ju He et al
01-26-2021	CoMo: A novel co-moving 3D camera system by Andrea Cavagna et al
01-27-2021	Bayesian Nested Neural Networks for Uncertainty Calibration and Adaptive Compression by Yufei Cui et al
01-26-2021	SkeletonVis: Interactive Visualization for Understanding Adversarial Attacks on Human Action Recognition Models by Haekyu Park et al
01-27-2021	Effects of Image Size on Deep Learning by Olivier Rukundo
01-26-2021	Arbitrary-Oriented Ship Detection through Center-Head Point Extraction by Feng Zhang et al
01-26-2021	DeepOIS: Gyroscope-Guided Deep Optical Image Stabilizer Compensation by Haipeng Li et al
01-27-2021	Edge-Labeling based Directed Gated Graph Network for Few-shot Learning by Peixiao Zheng et al
01-26-2021	Revisiting Contrastive Learning for Few-Shot Classification by Orchid Majumder et al
01-27-2021	An Interpretation of Regularization by Denoising and its Application with the Back-Projected Fidelity Term by Einav Yogev-Ofer et al
01-26-2021	Leveraging 3D Information in Unsupervised Brain MRI Segmentation by Benjamin Lambert et al
01-26-2021	Deep Video Inpainting Detection by Peng Zhou et al
01-27-2021	Automated Crop Field Surveillance using Computer Vision by Tejas Atul Khare et al
01-27-2021	Investigating Bi-Level Optimization for Learning and Vision from a Unified Perspective: A Survey and Beyond by Risheng Liu et al
01-27-2021	GaitGraph: Graph Convolutional Network for Skeleton-Based Gait Recognition by Torben Teepe et al
01-27-2021	Automatic image annotation base on Naive Bayes and Decision Tree classifiers using MPEG-7 by Jafar Majidpour et al
01-26-2021	A Survey and Analysis on Automated Glioma Brain Tumor Segmentation and Overall Patient Survival Prediction by Rupal Agravat et al
01-26-2021	The Effect of Class Definitions on the Transferability of Adversarial Attacks Against Forensic CNNs by Xinwei Zhao et al
01-26-2021	Defenses Against Multi-Sticker Physical Domain Attacks on Classifiers by Xinwei Zhao et al
01-27-2021	Spatial-Channel Transformer Network for Trajectory Prediction on the Traffic Scenes by Jingwen Zhao et al
01-27-2021	Controlling by Showing: i-Mimic: A Video-based Method to Control Robotic Arms by Debarati B. Chakraborty et al
01-26-2021	Ensembling complex network perspectives for mild cognitive impairment detection with artificial neural networks by Eufemia Lella et al
01-27-2021	Shape or Texture: Understanding Discriminative Features in CNNs by Md Amirul Islam et al
01-27-2021	The Work of Art in an Age of Mechanical Generation by Steven J. Frank
01-26-2021	Revisiting Locally Supervised Learning: an Alternative to End-to-end Training by Yulin Wang et al
01-27-2021	Detecting Deepfake Videos Using Euler Video Magnification by Rashmiranjan Das et al
01-26-2021	Towards Universal Physical Attacks On Cascaded Camera-Lidar 3D Object Detection Models by Mazen Abdelfattah et al
01-26-2021	Probability Trajectory: One New Movement Description for Trajectory Prediction by Pei Lv et al
01-26-2021	Online Body Schema Adaptation through Cost-Sensitive Active Learning by Gonçalo Cunha et al
01-27-2021	Automatic Segmentation of Gross Target Volume of Nasopharynx Cancer using Ensemble of Multiscale Deep Neural Networks with Spatial Attention by Haochen Mei et al
01-26-2021	AINet: Association Implantation for Superpixel Segmentation by Yaxiong Wang et al
01-26-2021	RAPIQUE: Rapid and Accurate Video Quality Prediction of User Generated Content by Zhengzhong Tu et al
01-26-2021	Evaluating Input Perturbation Methods for Interpreting CNNs and Saliency Map Comparison by Lukas Brunke et al
01-26-2021	ImageCHD: A 3D Computed Tomography Image Dataset for Classification of Congenital Heart Disease by Xiaowei Xu et al
01-26-2021	Glioblastoma Multiforme Patient Survival Prediction by Snehal Rajput et al
01-26-2021	Nondiscriminatory Treatment: a straightforward framework for multi-human parsing by Min Yan et al
01-26-2021	Consistent Mesh Colors for Multi-View Reconstructed 3D Scenes by Mohamed Dahy Elkhouly et al
01-28-2021	A Petri Dish for Histopathology Image Analysis by Jerry Wei et al
01-26-2021	Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer by Liang Lin et al
01-29-2021	Layer-Peeled Model: Toward Understanding Well-Trained Deep Neural Networks by Cong Fang et al
01-26-2021	New Algorithms for Computing Field of Vision over 2D Grids by Evan R. M. Debenham et al
01-26-2021	Efficient Multi-objective Evolutionary 3D Neural Architecture Search for COVID-19 Detection with Chest CT Scans by Xin He et al
01-26-2021	Synthetic Generation of Three-Dimensional Cancer Cell Models from Histopathological Images by Yoav Alon et al
01-26-2021	Investigating the significance of adversarial attacks and their relation to interpretability for radar-based human activity recognition systems by Utku Ozbulak et al
01-26-2021	Global-Local Propagation Network for RGB-D Semantic Segmentation by Sihan Chen et al
01-26-2021	Lightweight Multi-Branch Network for Person Re-Identification by Fabian Herzog et al
01-26-2021	Joint Forecasting of Features and Feature Motion for Dense Semantic Future Prediction by Josip Šarić et al
01-26-2021	Learning-Based Patch-Wise Metal Segmentation with Consistency Check by Tristan M. Gottschalk et al
01-28-2021	Reliable COVID-19 Detection Using Chest X-ray Images by Aysen Degerli et al
01-26-2021	EPIC-Survival: End-to-end Part Inferred Clustering for Survival Analysis, Featuring Prognostic Stratification Boosting by Hassan Muhammad et al
01-28-2021	Deep Triplet Hashing Network for Case-based Medical Image Retrieval by Jiansheng Fang et al
01-29-2021	Automated Deep Learning Analysis of Angiography Video Sequences for Coronary Artery Disease by Chengyang Zhou et al
01-28-2021	Multi-Threshold Attention U-Net (MTAU) based Model for Multimodal Brain Tumor Segmentation in MRI scans by Navchetan Awasthi et al
01-29-2021	Few-Shot Learning for Road Object Detection by Anay Majee et al
01-29-2021	Open World Compositional Zero-Shot Learning by Massimiliano Mancini et al
01-29-2021	Robust Representation Learning with Feedback for Single Image Deraining by Chenghao Chen et al
01-28-2021	Re Learning Memory Guided Normality for Anomaly Detection by Kevin Stephen et al
01-29-2021	Self-Supervised Representation Learning for RGB-D Salient Object Detection by Xiaoqi Zhao et al
01-29-2021	Surprisingly Simple Semi-Supervised Domain Adaptation with Pretraining and Consistency by Samarth Mishra et al
01-29-2021	General-Purpose OCR Paragraph Identification by Graph Convolution Networks by Renshen Wang et al
01-29-2021	Gaining Scale Invariance in UAV Birds Eye View Object Detection by Adaptive Resizing by Martin Messmer et al
01-29-2021	Towards Generalising Neural Implicit Representations by Theo W. Costain et al
01-29-2021	Leveraging domain labels for object detection from UAVs by Benjamin Kiefer et al
01-29-2021	Spatiotemporal Dilated Convolution with Uncertain Matching for Video-based Crowd Estimation by Yu-Jen Ma et al
01-29-2021	Complementary Pseudo Labels For Unsupervised Domain Adaptation On Person Re-identification by Hao Feng et al
01-29-2021	The Minds Eye: Visualizing Class-Agnostic Features of CNNs by Alexandros Stergiou
01-29-2021	Polynomial Trajectory Predictions for Improved Learning Performance by Ido Freeman et al
01-28-2021	D3DLO: Deep 3D LiDAR Odometry by Philipp Adis et al
01-28-2021	Position, Padding and Predictions: A Deeper Look at Position Information in CNNs by Md Amirul Islam et al
01-29-2021	Neural networks for semantic segmentation of historical city maps: Cross-cultural performance and the impact of figurative diversity by Rémi Petitpierre

Craig SmithFebruary 1, 2021