2021.9.20 Vision papers

 

09-16-2021

DisUnknown: Distilling Unknown Factors for Disentanglement Learning
by Sitao Xiang et al

09-15-2021

Integrating Sensing and Communication in Cellular Networks via NR Sidelink
by Dariush Salami et al

09-14-2021

Broaden the Vision: Geo-Diverse Visual Commonsense Reasoning
by Da Yin et al

09-17-2021

Cross Modification Attention Based Deliberation Model for Image Captioning
by Zheng Lian et al

09-17-2021

Transformer-Unet: Raw Image Processing with Unet
by Youyang Sha et al

09-14-2021

Improved Few-shot Segmentation by Redefinition of the Roles of Multi-level CNN Features
by Zhijie Wang et al

09-16-2021

Adaptive Hierarchical Dual Consistency for Semi-Supervised Left Atrium Segmentation on Cross-Domain Data
by Jun Chen et al

09-16-2021

MHFC: Multi-Head Feature Collaboration for Few-Shot Learning
by Shuai Shao et al

09-16-2021

Towards Non-Line-of-Sight Photography
by Jiayong Peng et al

09-16-2021

Marginal MAP Estimation for Inverse RL under Occlusion with Observer Noise
by Prasanth Sengadu Suresh et al

09-17-2021

Pointly-supervised 3D Scene Parsing with Viewpoint Bottleneck
by Liyi Luo et al

09-16-2021

Stereo Video Reconstruction Without Explicit Depth Maps for Endoscopic Surgery
by Annika Brundyn et al

09-14-2021

Multi-Scale Aligned Distillation for Low-Resolution Detection
by Lu Qi et al

09-14-2021

The pitfalls of using open data to develop deep learning solutions for COVID-19 detection in chest X-rays
by Rachael Harkness et al

09-14-2021

Semi-Supervised Wide-Angle Portraits Correction by Multi-Scale Transformer
by Fushun Zhu et al

09-15-2021

Pose Transformers (POTR): Human Motion Prediction with Non-Autoregressive Transformers
by Angel Martínez-González et al

09-16-2021

Raising context awareness in motion forecasting
by Hédi Ben-Younes et al

09-16-2021

A Machine Learning Framework for Automatic Prediction of Human Semen Motility
by Sandra Ottl et al

09-16-2021

Eformer: Edge Enhancement based Transformer for Medical Image Denoising
by Achleshwar Luthra et al

09-16-2021

Mass Segmentation in Automated 3-D Breast Ultrasound Using Dual-Path U-net
by Hamed Fayyaz et al

09-15-2021

MD-CSDNetwork: Multi-Domain Cross Stitched Network for Deepfake Detection
by Aayushi Agarwal et al

09-14-2021

ZFlow: Gated Appearance Flow-based Virtual Try-on with 3D Priors
by Ayush Chopra et al

09-14-2021

Identifying partial mouse brain microscopy images from Allen reference atlas using a contrastively learned semantic space
by Justinas Antanavicius et al

09-16-2021

Semi-Supervised Visual Representation Learning for Fashion Compatibility
by Ambareesh Revanur et al

09-16-2021

Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning
by Shikha Dubey et al

09-16-2021

A computationally efficient framework for vector representation of persistence diagrams
by Kit C. Chan et al

09-14-2021

Luminance Attentive Networks for HDR Image and Panorama Reconstruction
by Hanning Yu et al

09-15-2021

OPV2V: An Open Benchmark Dataset and Fusion Pipeline for Perception with Vehicle-to-Vehicle Communication
by Runsheng Xu et al

09-16-2021

Resolution based Feature Distillation for Cross Resolution Person Re-Identification
by Asad Munir et al

09-15-2021

Anchor DETR: Query Design for Transformer-Based Detector
by Yingming Wang et al

09-17-2021

Messing Up 3D Virtual Environments: Transferable Adversarial 3D Objects
by Enrico Meloni et al

09-15-2021

DSOR: A Scalable Statistical Filter for Removing Falling Snow from LiDAR Point Clouds in Severe Winter Weather
by Akhil Kurup et al

09-15-2021

Hybrid Local-Global Transformer for Image Dehazing
by Dong Zhao et al

09-14-2021

High-Resolution Image Harmonization via Collaborative Dual Transformations
by Wenyan Cong et al

09-16-2021

LoGG3D-Net: Locally Guided Global Descriptor Learning for 3D Place Recognition
by Kavisha Vidanapathirana et al

09-16-2021

Rotation Averaging in a Split Second: A Primal-Dual Method and a Closed-Form for Cycle Graphs
by Gabriel Moreira et al

09-14-2021

One-Class Meta-Learning: Towards Generalizable Few-Shot Open-Set Classification
by Jedrzej Kozerawski et al

09-16-2021

Torch.manual_seed(3407) is all you need: On the influence of random seeds in deep learning architectures for computer vision
by David Picard

09-14-2021

LRWR: Large-Scale Benchmark for Lip Reading in Russian language
by Evgeniy Egorov et al

09-17-2021

Diverse Generation from a Single Video Made Possible
by Niv Haim et al

09-16-2021

Urdu text in natural scene images: a new dataset and preliminary text detection
by Hazrat Ali et al

09-16-2021

Invertable Frowns: Video-to-Video Facial Emotion Translation
by Ian Magnusson et al

09-15-2021

Navigation-Oriented Scene Understanding for Robotic Autonomy: Learning to Segment Driveability in Egocentric Images
by Galadrielle Humblot-Renaux et al

09-15-2021

F-CAM: Full Resolution CAM via Guided Parametric Upscaling
by Soufiane Belharbi et al

09-16-2021

Neural Network Based Lidar Gesture Recognition for Realtime Robot Teleoperation
by Simon Chamorro et al

09-16-2021

An End-to-End Transformer Model for 3D Object Detection
by Ishan Misra et al

09-16-2021

Compact Binary Fingerprint for Image Copy Re-Ranking
by Nazar Mohammad et al

09-17-2021

Self-Supervised Neural Architecture Search for Imbalanced Datasets
by Aleksandr Timofeev et al

09-16-2021

Aesthetics and neural network image representations
by Romuald A. Janik

09-17-2021

GoG: Relation-aware Graph-over-Graph Network for Visual Dialog
by Feilong Chen et al

09-15-2021

Distract Your Attention: Multi-head Cross Attention Network for Facial Expression Recognition
by Zhengyao Wen et al

09-14-2021

Image Synthesis via Semantic Composition
by Yi Wang et al

09-15-2021

FFAVOD: Feature Fusion Architecture for Video Object Detection
by Hughes Perreault et al

09-15-2021

New Perspective on Progressive GANs Distillationfor One-class Novelty Detection
by Zhiwei Zhang et al

09-16-2021

Label Assignment Distillation for Object Detection
by Minghao Gao et al

09-14-2021

Spiking Neural Networks for Visual Place Recognition via Weighted Neuronal Assignments
by Somayeh Hussaini et al

09-16-2021

A Medical Pre-Diagnosis System for Histopathological Image of Breast Cancer
by Shiyu Fan et al

09-16-2021

SketchHairSalon: Deep Sketch-based Hair Image Synthesis
by Chufeng Xiao et al

09-14-2021

Space Time Recurrent Memory Network
by Hung Nguyen et al

09-15-2021

PointManifoldCut: Point-wise Augmentation in the Manifold for Point Clouds
by Tianfang Zhu et al

09-15-2021

UCP-Net: Unstructured Contour Points for Instance Segmentation
by Camille Dupont et al

09-16-2021

Are we ready for beyond-application high-volume data? The Reeds robot perception benchmark dataset
by Ola Benderius et al

09-14-2021

Uncertainty Quantification in Medical Image Segmentation with Multi-decoder U-Net
by Yanwu Yang et al

09-16-2021

Dense Pruning of Pointwise Convolutions in the Frequency Domain
by Mark Buckler et al

09-14-2021

Image-Based Alignment of 3D Scans
by Dolores Messer et al

09-14-2021

Cross-Region Domain Adaptation for Class-level Alignment
by Zhijie Wang et al

09-14-2021

COVID-Net MLSys: Designing COVID-Net for the Clinical Workflow
by Audrey G. Chung et al

09-16-2021

Humanly Certifying Superhuman Classifiers
by Qiongkai Xu et al

09-14-2021

Dense Deep Unfolding Network with 3D-CNN Prior for Snapshot Compressive Imaging
by Zhuoyuan Wu et al

09-15-2021

FSER: Deep Convolutional Neural Networks for Speech Emotion Recognition
by Bonaventure F. P. Dossou et al

09-15-2021

RGB-D Saliency Detection via Cascaded Mutual Information Minimization
by Jing Zhang et al

09-15-2021

A Framework for Multisensory Foresight for Embodied Agents
by Xiaohui Chen et al

09-17-2021

Including Keyword Position in Image-based Models for Act Segmentation of Historical Registers
by Mélodie Boillet et al

09-15-2021

Partner-Assisted Learning for Few-Shot Image Classification
by Jiawei Ma et al

09-15-2021

Patch-based medical image segmentation using Quantum Tensor Networks
by Raghavendra Selvan et al

09-17-2021

ActionCLIP: A New Paradigm for Video Action Recognition
by Mengmeng Wang et al

09-17-2021

CardiSort: a convolutional neural network for cross vendor automated sorting of cardiac MR images
by Ruth P Lim et al

09-17-2021

Multimodal Incremental Transformer with Visual Grounding for Visual Dialogue Generation
by Feilong Chen et al

09-17-2021

Realistic PointGoal Navigation via Auxiliary Losses and Information Bottleneck
by Guillermo Grande et al

09-17-2021

What we see and What we dont see: Imputing Occluded Crowd Structures from Robot Sensing
by Javad Amirian et al

09-16-2021

A Comparative Study of Machine Learning Methods for Predicting the Evolution of Brain Connectivity from a Baseline Timepoint
by Şeymanur Aktı et al

09-14-2021

Dodging Attack Using Carefully Crafted Natural Makeup
by Nitzan Guetta et al

09-17-2021

Semantic Snapping for Guided Multi-View Visualization Design
by Yngve S. Kristiansen et al

09-16-2021

Evaluating Continual Learning Algorithms by Generating 3D Virtual Environments
by Enrico Meloni et al

09-16-2021

Explainability Requires Interactivity
by Matthias Kirchler et al

09-16-2021

Towards agricultural autonomy: crop row detection under varying field conditions using deep learning
by Rajitha de Silva et al

09-16-2021

A Survey on Temporal Sentence Grounding in Videos
by Xiaohan Lan et al

09-14-2021

PnP-DETR: Towards Efficient Visual Analysis with Transformers
by Tao Wang et al

09-15-2021

Neural Architecture Search in operational context: a remote sensing case-study
by Anthony Cazasnoves et al

09-15-2021

Image Captioning for Effective Use of Language Models in Knowledge-Based Visual Question Answering
by Ander Salaberria et al

09-15-2021

SPIN Road Mapper: Extracting Roads from Aerial Images via Spatial and Interaction Space Graph Reasoning for Autonomous Driving
by Wele Gedara Chaminda Bandara et al

09-15-2021

ROS-X-Habitat: Bridging the ROS Ecosystem with Embodied AI
by Guanxiong Chen et al

09-15-2021

Neural Human Performer: Learning Generalizable Radiance Fields for Human Performance Rendering
by Youngjoong Kwon et al

09-16-2021

Habitat-Matterport 3D Dataset (HM3D): 1000 Large-scale 3D Environments for Embodied AI
by Santhosh K. Ramakrishnan et al

09-14-2021

Multi-Scale Input Strategies for Medulloblastoma Tumor Classification using Deep Transfer Learning
by Marcel Bengs et al

09-14-2021

3-Dimensional Deep Learning with Spatial Erasing for Unsupervised Anomaly Segmentation in Brain MRI
by Marcel Bengs et al

09-14-2021

Multi-Level Features Contrastive Networks for Unsupervised Domain Adaptation
by Le Liu et al

09-17-2021

Bio-Inspired Audio-Visual Cues Integration for Visual Attention Prediction
by Yuan Yuan et al

09-14-2021

A trainable monogenic ConvNet layer robust in front of large contrast changes in image classification
by E. Ulises Moya-Sánchez et al

09-17-2021

PIRenderer: Controllable Portrait Image Generation via Semantic Neural Rendering
by Yurui Ren et al

09-16-2021

DeepMTS: Deep Multi-task Learning for Survival Prediction in Patients with Advanced Nasopharyngeal Carcinoma using Pretreatment PET/CT
by Mingyuan Meng et al

09-15-2021

Contact-Aware Retargeting of Skinned Motion
by Ruben Villegas et al

09-14-2021

Sampling Network Guided Cross-Entropy Method for Unsupervised Point Cloud Registration
by Haobo Jiang et al

09-16-2021

A Divide-and-Merge Point Cloud Clustering Algorithm for LiDAR Panoptic Segmentation
by Yiming Zhao et al

09-15-2021

RAFT-Stereo: Multilevel Recurrent Field Transforms for Stereo Matching
by Lahav Lipson et al

09-15-2021

Learning the Regularization in DCE-MR Image Reconstruction for Functional Imaging of Kidneys
by Aziz Koçanaoğulları et al

09-15-2021

Resolution-robust Large Mask Inpainting with Fourier Convolutions
by Roman Suvorov et al

09-15-2021

3D Annotation Of Arbitrary Objects In The Wild
by Kenneth Blomqvist et al

09-14-2021

Tesla-Rapture: A Lightweight Gesture Recognition System from mmWave Radar Point Clouds
by Dariush Salami et al

09-16-2021

Lifting 2D Object Locations to 3D by Discounting LiDAR Outliers across Objects and Views
by Robert McCraith et al

09-16-2021

Exploiting Activation based Gradient Output Sparsity to Accelerate Backpropagation in CNNs
by Anup Sarma et al

09-17-2021

Autonomous Vision-based UAV Landing with Collision Avoidance using Deep Learning
by Tianpei Liao et al

09-16-2021

Heterogeneous Relational Complement for Vehicle Re-identification
by Jiajian Zhao et al

09-16-2021

Automated risk classification of colon biopsies based on semantic segmentation of histopathology images
by John-Melle Bokhorsta et al

09-16-2021

Multi-Level Visual Similarity Based Personalized Tourist Attraction Recommendation Using Geo-Tagged Photos
by Ling Chen et al

09-14-2021

Multi-modal Wound Classification using Wound Image and Location by Deep Neural Network
by D. M. Anisuzzaman et al

09-16-2021

Detection Accuracy for Evaluating Compositional Explanations of Units
by Sayo M. Makinwa et al

09-15-2021

A Wide-area, Low-latency, and Power-efficient 6-DoF Pose Tracking System for Rigid Objects
by Young-Ho Kim et al

09-17-2021

Expression Snippet Transformer for Robust Video-based Facial Expression Recognition
by Yuanyuan Liu et al

09-15-2021

A Pathology Deep Learning System Capable of Triage of Melanoma Specimens Utilizing Dermatopathologist Consensus as Ground Truth
by Sivaramakrishnan Sankarapandian et al

09-15-2021

Hybrid ICP
by Kamil Dreczkowski et al

09-14-2021

Anomaly Attribution of Multivariate Time Series using Counterfactual Reasoning
by Violeta Teodora Trifunov et al

09-15-2021

Direct and Sparse Deformable Tracking
by Jose Lamarca et al

09-15-2021

A Unified Framework for Biphasic Facial Age Translation with Noisy-Semantic Guided Generative Adversarial Networks
by Muyi Sun et al

09-15-2021

MISSFormer: An Effective Medical Image Segmentation Transformer
by Xiaohong Huang et al

09-16-2021

Few-Shot Object Detection by Attending to Per-Sample-Prototype
by Hojun Lee et al

09-16-2021

KATANA: Simple Post-Training Robustness Using Test Time Augmentations
by Gilad Cohen et al

09-16-2021

Quality-aware Cine Cardiac MRI Reconstruction and Analysis from Undersampled k-space Data
by Ines Machado et al

09-16-2021

Real Time Monocular Vehicle Velocity Estimation using Synthetic Data
by Robert McCraith et al

09-16-2021

Overview of Tencent Multi-modal Ads Video Understanding Challenge
by Zhenzhi Wang et al

09-16-2021

Learnable Multi-level Frequency Decomposition and Hierarchical Attention Mechanism for Generalized Face Presentation Attack Detection
by Meiling Fang et al

09-14-2021

Focus on Impact: Indoor Exploration with Intrinsic Motivation
by Roberto Bigazzi et al

09-14-2021

High-Fidelity GAN Inversion for Image Attribute Editing
by Tengfei Wang et al

09-15-2021

Dynamic Fusion Network for RGBT Tracking
by Jingchao Peng et al

09-14-2021

MotionHint: Self-Supervised Monocular Visual Odometry with Motion Constraints
by Cong Wang et al

09-14-2021

Hardware-aware Real-time Myocardial Segmentation Quality Control in Contrast Echocardiography
by Dewen Zeng et al

09-15-2021

Semi-supervised Contrastive Learning for Label-efficient Medical Image Segmentation
by Xinrong Hu et al

09-16-2021

Harnessing Perceptual Adversarial Patches for Crowd Counting
by Shunchang Liu et al

09-16-2021

TANet: A new Paradigm for Global Face Super-resolution via Transformer-CNN Aggregation Network
by Yuanzhi Wang et al

09-15-2021

Predicting 3D shapes, masks, and properties of materials, liquids, and objects inside transparent containers, using the TransProteus CGI dataset
by Sagi Eppel et al

09-14-2021

A Semantic Indexing Structure for Image Retrieval
by Ying Wang et al

09-15-2021

Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos
by Junhao Zhang et al

09-14-2021

ImUnity: a generalizable VAE-GAN solution for multicenter MR image harmonization
by Stenzel Cackowski et al

09-14-2021

Dynamic Attentive Graph Learning for Image Restoration
by Chong Mou et al

09-16-2021

Dense Semantic Contrast for Self-Supervised Visual Representation Learning
by Xiaoni Li et al

09-16-2021

Mask-Guided Feature Extraction and Augmentation for Ultra-Fine-Grained Visual Categorization
by Zicheng Pan et al

09-16-2021

End-to-End Partially Observable Visual Navigation in a Diverse Environment
by Bo Ai et al

09-16-2021

ObjectFolder: A Dataset of Objects with Implicit Visual, Auditory, and Tactile Representations
by Ruohan Gao et al

09-15-2021

Learning to Aggregate and Refine Noisy Labels for Visual Sentiment Analysis
by Wei Zhu et al

09-15-2021

Federated Contrastive Learning for Decentralized Unlabeled Medical Images
by Nanqing Dong et al

09-15-2021

Progressive Hard-case Mining across Pyramid Levels in Object Detection
by Binghong Wu et al

09-15-2021

DeFungi: Direct Mycological Examination of Microscopic Fungi Images
by Camilo Javier Pineda Sopo et al

09-16-2021

Context-aware Padding for Semantic Segmentation
by Yu-Hui Huang et al

09-15-2021

Sign-MAML: Efficient Model-Agnostic Meta-Learning by SignSGD
by Chen Fan et al

09-15-2021

A Multi-Task Cross-Task Learning Architecture for Ad-hoc Uncertainty Estimation in 3D Cardiac MRI Image Segmentation
by S. M. Kamrul Hasan et al

09-16-2021

Neural \{E}tendue Expander for Ultra-Wide-Angle High-Fidelity Holographic Display
by Seung-Hwan Baek et al

09-17-2021

LOF: Structure-Aware Line Tracking based on Optical Flow
by Meixiang Quan et al

09-14-2021

Automatic hippocampal surface generation via 3D U-net and active shape modeling with hybrid particle swarm optimization
by Pinyuan Zhong et al

09-15-2021

FCA: Learning a 3D Full-coverage Vehicle Camouflage for Multi-view Physical Adversarial Attack
by DonghuaWang et al

09-16-2021

Generating Dataset For Large-scale 3D Facial Emotion Recognition
by Faizan Farooq Khan et al

09-17-2021

A review of deep learning methods for MRI reconstruction
by Arghya Pal et al

09-15-2021

S3LAM: Structured Scene SLAM
by Mathieu Gonzalez et al

09-16-2021

M2RNet: Multi-modal and Multi-scale Refined Network for RGB-D Salient Object Detection
by Xian Fang et al

09-14-2021

Seeking an Optimal Approach for Computer-Aided Pulmonary Embolism Detection
by Nahid Ul Islam et al

09-15-2021

Deep Bregman Divergence for Contrastive Learning of Visual Representations
by Mina Rezaei et al

09-17-2021

GraFormer: Graph Convolution Transformer for 3D Pose Estimation
by Weixi Zhao et al

09-15-2021

METEOR: A Massive Dense & Heterogeneous Behavior Dataset for Autonomous Driving
by Rohan Chandra et al

09-14-2021

A Deep Learning Approach for Masking Fetal Gender in Ultrasound Images
by Amit Borundiya et al

 
Craig Smith