2021.1.2 Vision papers

 

01-27-2021

Bottleneck Transformers for Visual Recognition
by Aravind Srinivas et al

01-26-2021

Neural Geometric Level of Detail: Real-time Rendering with Implicit 3D Shapes
by Towaki Takikawa et al

01-28-2021

Playable Video Generation
by Willi Menapace et al

01-27-2021

Automated femur segmentation from computed tomography images using a deep neural network
by P. A. Bjornsson et al

01-27-2021

DOC2PPT: Automatic Presentation Slides Generation from Scientific Documents
by Tsu-Jui Fu et al

01-28-2021

Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet
by Li Yuan et al

01-28-2021

The Role of Syntactic Planning in Compositional Image Captioning
by Emanuele Bugliarello et al

01-27-2021

VisualMRC: Machine Reading Comprehension on Document Images
by Ryota Tanaka et al

01-26-2021

Automatic Comic Generation with Stylistic Multi-page Layouts and Emotion-driven Text Balloon Generation
by Xin Yang et al

01-26-2021

CPTR: Full Transformer Network for Image Captioning
by Wei Liu et al

01-28-2021

Vx2Text: End-to-End Learning of Video-Based Text Generation From Multimodal Inputs
by Xudong Lin et al

01-27-2021

Object Detection Made Simpler by Eliminating Heuristic NMS
by Qiang Zhou et al

01-27-2021

CNN with large memory layers
by Rasul Karimov et al

01-29-2021

Efficient-CapsNet: Capsule Network with Self-Attention Routing
by Vittorio Mazzia et al

01-27-2021

Multi-Modal Aesthetic Assessment for MObile Gaming Image
by Zhenyu Lei et al

01-27-2021

Assessing the applicability of Deep Learning-based visible-infrared fusion methods for fire imagery
by J. F. Ciprián-Sánchez et al

01-28-2021

Exploring Cross-Image Pixel Contrast for Semantic Segmentation
by Wenguan Wang et al

01-28-2021

PIG-Net: Inception based Deep Learning Architecture for 3D Point Cloud Segmentation
by Sindhu Hegde et al

01-27-2021

Puzzle-CAM: Improved localization via matching partial and full features
by Sanghyun Jo et al

01-27-2021

Deep Image Retrieval: A Survey
by Wei Chen et al

01-28-2021

Rethinking Rotated Object Detection with Gaussian Wasserstein Distance Loss
by Xue Yang et al

01-27-2021

Augmenting Proposals by the Detector Itself
by Xiaopei Wan et al

01-28-2021

Domain Adaptation by Topology Regularization
by Deborah Weeks et al

01-26-2021

Prototypical Pseudo Label Denoising and Target Structure Learning for Domain Adaptive Semantic Segmentation
by Pan Zhang et al

01-27-2021

NTU60-X: Towards Skeleton-based Recognition of Subtle Human Actions
by Anirudh Thatipelli et al

01-27-2021

Chronological age estimation of lateral cephalometric radiographs with deep learning
by Ningtao Liu

01-27-2021

Generative Multi-Label Zero-Shot Learning
by Akshita Gupta et al

01-27-2021

SwingBot: Learning Physical Features from In-hand Tactile Exploration for Dynamic Swing-up Manipulation
by Chen Wang et al

01-27-2021

Learning task-agnostic representation via toddler-inspired learning
by Kwanyoung Park et al

01-28-2021

The Hidden Tasks of Generative Adversarial Networks: An Alternative Perspective on GAN Training
by Romann M. Weber

01-26-2021

Deep Burst Super-Resolution
by Goutam Bhat et al

01-28-2021

An Explainable AI System for Automated COVID-19 Assessment and Lesion Categorization from CT-scans
by Matteo Pennisi et al

01-28-2021

Self-Attention Meta-Learner for Continual Learning
by Ghada Sokar et al

01-27-2021

Multi-Hypothesis Pose Networks: Rethinking Top-Down Pose Estimation
by Rawal Khirodkar et al

01-28-2021

Self-supervised Cross-silo Federated Neural Architecture Search
by Xinle Liang et al

01-26-2021

Introducing and assessing the explainable AI (XAI)method: SIDU
by Satya M. Muddamsetty et al

01-28-2021

Reducing ReLU Count for Privacy-Preserving CNN Speedup
by Inbar Helbitz et al

01-27-2021

Convolutional Neural Network-Based Age Estimation Using B-Mode Ultrasound Tongue Image
by Kele Xu et al

01-28-2021

COMPAS: Representation Learning with Compositional Part Sharing for Few-Shot Classification
by Ju He et al

01-27-2021

Learning Non-linear Wavelet Transformation via Normalizing Flow
by Shuo-Hui Li

01-27-2021

Scheduled Sampling in Vision-Language Pretraining with Decoupled Encoder-Decoder Network
by Yehao Li et al

01-28-2021

NeMo: Neural Mesh Models of Contrastive Features for Robust 3D Pose Estimation
by Angtian Wang et al

01-27-2021

Im2Mesh GAN: Accurate 3D Hand Mesh Recovery from a Single RGB Image
by Akila Pemasiri et al

01-28-2021

Generalising via Meta-Examples for Continual Learning in the Wild
by Alessia Bertugli et al

01-27-2021

Efficient Video Summarization Framework using EEG and Eye-tracking Signals
by Sai Sukruth Bezugam et al

01-27-2021

Meta Adversarial Training
by Jan Hendrik Metzen et al

01-26-2021

Boosting Segmentation Performance across datasets using histogram specification with application to pelvic bone segmentation
by Prabhakara Subramanya Jois et al

01-27-2021

Automatic Detection of Occulted Hard X-ray Flares Using Deep-Learning Methods
by Shin-nosuke Ishikawa et al

01-28-2021

Neural Particle Image Velocimetry
by Nikolay Stulov et al

01-26-2021

On the Importance of Capturing a Sufficient Diversity of Perspective for the Classification of micro-PCBs
by Adam Byerly et al

01-26-2021

Automatic Curation of Large-Scale Datasets for Audio-Visual Representation Learning
by Sangho Lee et al

01-27-2021

Reciprocal Landmark Detection and Tracking with Extremely Few Annotations
by Jianzhe Lin et al

01-27-2021

A Multi-Scale Conditional Deep Model for Tumor Cell Ratio Counting
by Eric Cosatto et al

01-27-2021

Self-Calibrating Active Binocular Vision via Active Efficient Coding with Deep Autoencoders
by Charles Wilmot et al

01-27-2021

TorchPRISM: Principal Image Sections Mapping, a novel method for Convolutional Neural Network features visualization
by Tomasz Szandala

01-26-2021

Learning Spatial and Spatio-Temporal Pixel Aggregations for Image and Video Denoising
by Xiangyu Xu et al

01-28-2021

Neural Architecture Search with Random Labels
by Xuanyang Zhang et al

01-27-2021

Syntactically Guided Generative Embeddings for Zero-Shot Skeleton Action Recognition
by Pranay Gupta et al

01-26-2021

ResLT: Residual Learning for Long-tailed Recognition
by Jiequan Cui et al

01-27-2021

Utilizing Uncertainty Estimation in Deep Learning Segmentation of Fluorescence Microscopy Images with Missing Markers
by Alvaro Gomariz et al

01-27-2021

Easy-GT: Open-Source Software to Facilitate Making the Ground Truth for White Blood Cells Nucleus
by Seyedeh-Zahra Mousavi Kouzehkanan et al

01-26-2021

Developing emotion recognition for video conference software to support people with autism
by Marc Franzen et al

01-26-2021

Malware Detection Using Frequency Domain-Based Image Visualization and Deep Learning
by Tajuddin Manhar Mohammed et al

01-26-2021

LIGHTS: LIGHT Specularity Dataset for specular detection in Multi-view
by Mohamed Dahy Elkhouly et al

01-28-2021

VAE^2: Preventing Posterior Collapse of Variational Video Predictions in the Wild
by Yizhou Zhou et al

01-26-2021

Uncertainty aware and explainable diagnosis of retinal disease
by Amitojdeep Singh et al

01-27-2021

e-ACJ: Accurate Junction Extraction For Event Cameras
by Zhihao Liu et al

01-27-2021

Detecting Adversarial Examples by Input Transformations, Defense Perturbations, and Voting
by Federico Nesti et al

01-26-2021

Blind Image Denoising and Inpainting Using Robust Hadamard Autoencoders
by Rasika Karkare et al

01-27-2021

HDIB1M -- Handwritten Document Image Binarization 1 Million Dataset
by Kaustubh Sadekar et al

01-27-2021

Towards Improving the Consistency, Efficiency, and Flexibility of Differentiable Neural Architecture Search
by Yibo Yang et al

01-28-2021

Fusion Moves for Graph Matching
by Lisa Hutschenreiter et al

01-28-2021

Discriminative Appearance Modeling with Multi-track Pooling for Real-time Multi-object Tracking
by Chanho Kim et al

01-26-2021

Semi-synthesis: A fast way to produce effective datasets for stereo matching
by Ju He et al

01-26-2021

CoMo: A novel co-moving 3D camera system
by Andrea Cavagna et al

01-27-2021

Bayesian Nested Neural Networks for Uncertainty Calibration and Adaptive Compression
by Yufei Cui et al

01-26-2021

SkeletonVis: Interactive Visualization for Understanding Adversarial Attacks on Human Action Recognition Models
by Haekyu Park et al

01-27-2021

Effects of Image Size on Deep Learning
by Olivier Rukundo

01-26-2021

Arbitrary-Oriented Ship Detection through Center-Head Point Extraction
by Feng Zhang et al

01-26-2021

DeepOIS: Gyroscope-Guided Deep Optical Image Stabilizer Compensation
by Haipeng Li et al

01-27-2021

Edge-Labeling based Directed Gated Graph Network for Few-shot Learning
by Peixiao Zheng et al

01-26-2021

Revisiting Contrastive Learning for Few-Shot Classification
by Orchid Majumder et al

01-27-2021

An Interpretation of Regularization by Denoising and its Application with the Back-Projected Fidelity Term
by Einav Yogev-Ofer et al

01-26-2021

Leveraging 3D Information in Unsupervised Brain MRI Segmentation
by Benjamin Lambert et al

01-26-2021

Deep Video Inpainting Detection
by Peng Zhou et al

01-27-2021

Automated Crop Field Surveillance using Computer Vision
by Tejas Atul Khare et al

01-27-2021

Investigating Bi-Level Optimization for Learning and Vision from a Unified Perspective: A Survey and Beyond
by Risheng Liu et al

01-27-2021

GaitGraph: Graph Convolutional Network for Skeleton-Based Gait Recognition
by Torben Teepe et al

01-27-2021

Automatic image annotation base on Naive Bayes and Decision Tree classifiers using MPEG-7
by Jafar Majidpour et al

01-26-2021

A Survey and Analysis on Automated Glioma Brain Tumor Segmentation and Overall Patient Survival Prediction
by Rupal Agravat et al

01-26-2021

The Effect of Class Definitions on the Transferability of Adversarial Attacks Against Forensic CNNs
by Xinwei Zhao et al

01-26-2021

Defenses Against Multi-Sticker Physical Domain Attacks on Classifiers
by Xinwei Zhao et al

01-27-2021

Spatial-Channel Transformer Network for Trajectory Prediction on the Traffic Scenes
by Jingwen Zhao et al

01-27-2021

Controlling by Showing: i-Mimic: A Video-based Method to Control Robotic Arms
by Debarati B. Chakraborty et al

01-26-2021

Ensembling complex network perspectives for mild cognitive impairment detection with artificial neural networks
by Eufemia Lella et al

01-27-2021

Shape or Texture: Understanding Discriminative Features in CNNs
by Md Amirul Islam et al

01-27-2021

The Work of Art in an Age of Mechanical Generation
by Steven J. Frank

01-26-2021

Revisiting Locally Supervised Learning: an Alternative to End-to-end Training
by Yulin Wang et al

01-27-2021

Detecting Deepfake Videos Using Euler Video Magnification
by Rashmiranjan Das et al

01-26-2021

Towards Universal Physical Attacks On Cascaded Camera-Lidar 3D Object Detection Models
by Mazen Abdelfattah et al

01-26-2021

Probability Trajectory: One New Movement Description for Trajectory Prediction
by Pei Lv et al

01-26-2021

Online Body Schema Adaptation through Cost-Sensitive Active Learning
by Gonçalo Cunha et al

01-27-2021

Automatic Segmentation of Gross Target Volume of Nasopharynx Cancer using Ensemble of Multiscale Deep Neural Networks with Spatial Attention
by Haochen Mei et al

01-26-2021

AINet: Association Implantation for Superpixel Segmentation
by Yaxiong Wang et al

01-26-2021

RAPIQUE: Rapid and Accurate Video Quality Prediction of User Generated Content
by Zhengzhong Tu et al

01-26-2021

Evaluating Input Perturbation Methods for Interpreting CNNs and Saliency Map Comparison
by Lukas Brunke et al

01-26-2021

ImageCHD: A 3D Computed Tomography Image Dataset for Classification of Congenital Heart Disease
by Xiaowei Xu et al

01-26-2021

Glioblastoma Multiforme Patient Survival Prediction
by Snehal Rajput et al

01-26-2021

Nondiscriminatory Treatment: a straightforward framework for multi-human parsing
by Min Yan et al

01-26-2021

Consistent Mesh Colors for Multi-View Reconstructed 3D Scenes
by Mohamed Dahy Elkhouly et al

01-28-2021

A Petri Dish for Histopathology Image Analysis
by Jerry Wei et al

01-26-2021

Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer
by Liang Lin et al

01-29-2021

Layer-Peeled Model: Toward Understanding Well-Trained Deep Neural Networks
by Cong Fang et al

01-26-2021

New Algorithms for Computing Field of Vision over 2D Grids
by Evan R. M. Debenham et al

01-26-2021

Efficient Multi-objective Evolutionary 3D Neural Architecture Search for COVID-19 Detection with Chest CT Scans
by Xin He et al

01-26-2021

Synthetic Generation of Three-Dimensional Cancer Cell Models from Histopathological Images
by Yoav Alon et al

01-26-2021

Investigating the significance of adversarial attacks and their relation to interpretability for radar-based human activity recognition systems
by Utku Ozbulak et al

01-26-2021

Global-Local Propagation Network for RGB-D Semantic Segmentation
by Sihan Chen et al

01-26-2021

Lightweight Multi-Branch Network for Person Re-Identification
by Fabian Herzog et al

01-26-2021

Joint Forecasting of Features and Feature Motion for Dense Semantic Future Prediction
by Josip Šarić et al

01-26-2021

Learning-Based Patch-Wise Metal Segmentation with Consistency Check
by Tristan M. Gottschalk et al

01-28-2021

Reliable COVID-19 Detection Using Chest X-ray Images
by Aysen Degerli et al

01-26-2021

EPIC-Survival: End-to-end Part Inferred Clustering for Survival Analysis, Featuring Prognostic Stratification Boosting
by Hassan Muhammad et al

01-28-2021

Deep Triplet Hashing Network for Case-based Medical Image Retrieval
by Jiansheng Fang et al

01-29-2021

Automated Deep Learning Analysis of Angiography Video Sequences for Coronary Artery Disease
by Chengyang Zhou et al

01-28-2021

Multi-Threshold Attention U-Net (MTAU) based Model for Multimodal Brain Tumor Segmentation in MRI scans
by Navchetan Awasthi et al

01-29-2021

Few-Shot Learning for Road Object Detection
by Anay Majee et al

01-29-2021

Open World Compositional Zero-Shot Learning
by Massimiliano Mancini et al

01-29-2021

Robust Representation Learning with Feedback for Single Image Deraining
by Chenghao Chen et al

01-28-2021

Re Learning Memory Guided Normality for Anomaly Detection
by Kevin Stephen et al

01-29-2021

Self-Supervised Representation Learning for RGB-D Salient Object Detection
by Xiaoqi Zhao et al

01-29-2021

Surprisingly Simple Semi-Supervised Domain Adaptation with Pretraining and Consistency
by Samarth Mishra et al

01-29-2021

General-Purpose OCR Paragraph Identification by Graph Convolution Networks
by Renshen Wang et al

01-29-2021

Gaining Scale Invariance in UAV Birds Eye View Object Detection by Adaptive Resizing
by Martin Messmer et al

01-29-2021

Towards Generalising Neural Implicit Representations
by Theo W. Costain et al

01-29-2021

Leveraging domain labels for object detection from UAVs
by Benjamin Kiefer et al

01-29-2021

Spatiotemporal Dilated Convolution with Uncertain Matching for Video-based Crowd Estimation
by Yu-Jen Ma et al

01-29-2021

Complementary Pseudo Labels For Unsupervised Domain Adaptation On Person Re-identification
by Hao Feng et al

01-29-2021

The Minds Eye: Visualizing Class-Agnostic Features of CNNs
by Alexandros Stergiou

01-29-2021

Polynomial Trajectory Predictions for Improved Learning Performance
by Ido Freeman et al

01-28-2021

D3DLO: Deep 3D LiDAR Odometry
by Philipp Adis et al

01-28-2021

Position, Padding and Predictions: A Deeper Look at Position Information in CNNs
by Md Amirul Islam et al

01-29-2021

Neural networks for semantic segmentation of historical city maps: Cross-cultural performance and the impact of figurative diversity
by Rémi Petitpierre

 
Craig Smith