2021.1.4 Vision papers

12-31-2020

TransTrack: Multiple-Object Tracking with Transformer
by Peize Sun et al

12-31-2020

NeuralMagicEye: Learning to See and Understand the Scene Behind an Autostereogram
by Zhengxia Zou et al

12-31-2020

Neural Body: Implicit Neural Representations with Structured Latent Codes for Novel View Synthesis of Dynamic Humans
by Sida Peng et al

12-30-2020

OSTeC: One-Shot Texture Completion
by Baris Gecer et al

12-31-2020

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers
by Sixiao Zheng et al

12-29-2020

TrustMAE: A Noise-Resilient Defect Classification Framework using Memory-Augmented Auto-Encoders with Trust Regions
by Daniel Stanley Tan et al

12-29-2020

Deep Hashing for Secure Multimodal Biometrics
by Veeru Talreja et al

12-29-2020

Detecting Hate Speech in Multi-modal Memes
by Abhishek Das et al

12-30-2020

3D Human motion anticipation and classification
by Emad Barsoum et al

12-30-2020

SkiNet: A Deep Learning Solution for Skin Lesion Diagnosis with Uncertainty Estimation and Explainability
by Rajeev Kumar Singh et al

12-30-2020

Accurate Word Representations with Universal Visual Guidance
by Zhuosheng Zhang et al

12-30-2020

Bidirectional Mapping Coupled GAN for Generalized Zero-Shot Learning
by Tasfia Shermin et al

12-29-2020

Towards Unsupervised Deep Image Enhancement with Generative Adversarial Network
by Zhangkai Ni et al

12-29-2020

AILearn: An Adaptive Incremental Learning Model for Spoof Fingerprint Detection
by Shivang Agarwal et al

12-29-2020

Visual-Thermal Camera Dataset Release and Multi-Modal Alignment without Calibration Information
by Frank Mascarich et al

12-29-2020

MS-GWNN: multi-scale graph wavelet neural network for breast cancer diagnosis
by Mo Zhang et al

12-29-2020

DeepSphere: a graph-based spherical CNN
by Michaël Defferrard et al

12-29-2020

Tips and Tricks for Webly-Supervised Fine-Grained Recognition: Learning from the WebFG 2020 Challenge
by Xiu-Shen Wei et al

12-31-2020

Audio-Visual Floorplan Reconstruction
by Senthil Purushwalkam et al

12-30-2020

Automatic Polyp Segmentation using U-Net-ResNet50
by Saruar Alam et al

12-29-2020

Graph-based non-linear least squares optimization for visual place recognition in changing environments
by Stefan Schubert et al

12-29-2020

Object sorting using Faster R-CNN
by Pengchang Chen et al

12-30-2020

Provident Vehicle Detection at Night: The PVDN Dataset
by Lars Ohnemus et al

12-30-2020

Temporally-Transferable Perturbations: Efficient, One-Shot Adversarial Attacks for Online Visual Object Trackers
by Krishna Kanth Nakka et al

12-29-2020

Parzen Window Approximation on Riemannian Manifold
by Abhishek et al

12-29-2020

Learning a Dynamic Map of Visual Appearance
by Tawfiq Salem et al

12-30-2020

Some Algorithms on Exact, Approximate and Error-Tolerant Graph Matching
by Shri Prakash Dwivedi

12-29-2020

Towards Reducing Severe Defocus Spread Effects for Multi-Focus Image Fusion via an Optimization Based Strategy
by Shuang Xu et al

12-29-2020

Damaged Fingerprint Recognition by Convolutional Long Short-Term Memory Networks for Forensic Purposes
by Jaouhar Fattahi et al

12-29-2020

The VIP Gallery for Video Processing Education
by Todd Goodall et al

12-29-2020

Chasing the Tail in Monocular 3D Human Reconstruction with Prototype Memory
by Yu Rong et al

12-30-2020

FREA-Unet: Frequency-aware U-net for Modality Transfer
by Hajar Emami et al

12-29-2020

FPCC-Net: Fast Point Cloud Clustering for Instance Segmentation
by Yajun Xu et al

12-30-2020

Exploring Large Context for Cerebral Aneurysm Segmentation
by Jun Ma et al

12-30-2020

Model-Based Visual Planning with Self-Supervised Functional Distances
by Stephen Tian et al

12-29-2020

COIN: Contrastive Identifier Network for Breast Mass Diagnosis in Mammography
by Heyi Li et al

12-30-2020

Beating Attackers At Their Own Games: Adversarial Example Detection Using Adversarial Gradient Directions
by Yuhang Wu et al

12-29-2020

Image-to-Image Retrieval by Learning Similarity between Scene Graphs
by Sangwoong Yoon et al

12-31-2020

Incremental Embedding Learning via Zero-Shot Translation
by Kun Wei et al

12-29-2020

NBNet: Noise Basis Learning for Image Denoising with Subspace Projection
by Shen Cheng et al

12-30-2020

Unpaired Image Enhancement with Quality-Attention Generative Adversarial Network
by Zhangkai Ni et al

12-30-2020

RTS3D: Real-time Stereo 3D Detection from 4D Feature-Consistency Embedding Space for Autonomous Driving
by Peixuan Li et al

12-30-2020

Rethinking the Heatmap Regression for Bottom-up Human Pose Estimation
by Zhengxiong Luo et al

12-29-2020

2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition
by Hengduo Li et al

12-30-2020

MM-FSOD: Meta and metric integrated few-shot object detection
by Yuewen Li et al

12-31-2020

Language-Mediated, Object-Centric Representation Learning
by Ruocheng Wang et al

12-30-2020

Active Annotation of Informative Overlapping Frames in Video Mosaicking Applications
by Loic Peter et al

12-30-2020

DUT-LFSaliency: Versatile Dataset and Light Field-to-RGB Saliency Detection
by Yongri Piao et al

12-29-2020

Semi-supervised Cardiac Image Segmentation via Label Propagation and Style Transfer
by Yao Zhang et al

12-29-2020

SALA: Soft Assignment Local Aggregation for 3D Semantic Segmentation
by Hani Itani et al

12-30-2020

MRI brain tumor segmentation and uncertainty estimation using 3D-UNet architectures
by Laura Mora Ballestar et al

12-30-2020

SID: Incremental Learning for Anchor-Free Object Detection via Selective and Inter-Related Distillation
by Can Peng et al

12-30-2020

Fast Hyperspectral Image Recovery via Non-iterative Fusion of Dual-Camera Compressive Hyperspectral Imaging
by Wei He et al

12-31-2020

Text-Free Image-to-Speech Synthesis Using Learned Segmental Units
by Wei-Ning Hsu et al

12-30-2020

DDANet: Dual Decoder Attention Network for Automatic Polyp Segmentation
by Nikhil Kumar Tomar et al

12-30-2020

Medico Multimedia Task at MediaEval 2020: Automatic Polyp Segmentation
by Debesh Jha et al

12-31-2020

iGOS++: Integrated Gradient Optimized Saliency by Bilateral Perturbations
by Saeed Khorram et al

12-31-2020

Learned Multi-Resolution Variable-Rate Image Compression with Octave-based Residual Blocks
by Mohammad Akbari et al

12-31-2020

A CNN Approach to Simultaneously Count Plants and Detect Plantation-Rows from UAV Imagery
by Lucas Prado Osco et al

12-30-2020

New Bag of Deep Visual Words based features to classify chest x-ray images for COVID-19 diagnosis
by Chiranjibi Sitaula et al

12-30-2020

Survey of the Detection and Classification of Pulmonary Lesions via CT and X-Ray
by Yixuan Sun et al

12-31-2020

CorrNet3D: Unsupervised End-to-end Learning of Dense Correspondence for 3D Point Clouds
by Yiming Zeng et al

12-31-2020

Exploiting Shared Knowledge from Non-COVID Lesions for Annotation-Efficient COVID-19 CT Lung Infection Segmentation
by Yichi Zhang et al

12-31-2020

Estimating Uncertainty in Neural Networks for Cardiac MRI Segmentation: A Benchmark Study
by Matthew Ng et al

12-31-2020

Overview of MediaEval 2020 Predicting Media Memorability Task: What Makes a Video Memorable?
by Alba García Seco De Herrera et al

12-31-2020

Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection
by Jiajun Deng et al

12-31-2020

Real-time Webcam Heart-Rate and Variability Estimation with Clean Ground Truth for Evaluation
by Amogh Gudi et al

12-31-2020

CNN-based Single Image Crowd Counting: Network Design, Loss Function and Supervisory Signal
by Haoyue Bai et al

12-31-2020

Unsupervised Monocular Depth Reconstruction of Non-Rigid Scenes
by Ayça Takmaz et al

12-31-2020

Colonoscopy Polyp Detection: Domain Adaptation From Medical Report Images to Real-time Videos
by Zhi-Qin Zhan et al

12-31-2020

Investigating Memorability of Dynamic Media
by Phuc H. Le-Khac et al

12-31-2020

Leveraging Audio Gestalt to Predict Media Memorability
by Lorin Sweeney et al

12-31-2020

Searching a Raw Video Database using Natural Language Queries
by Sriram Krishna et al

12-31-2020

A Deep Retinal Image Quality Assessment Network with Salient Structure Priors
by Ziwen Xu et al

12-30-2020

SharpGAN: Receptive Field Block Net for Dynamic Scene Deblurring
by Hui Feng et al

12-29-2020

Advances in deep learning methods for pavement surface crack detection and identification with visible light visual images
by Kailiang Lu

12-31-2020

Illumination Estimation Challenge: experience of past two years
by Egor Ershov et al

12-31-2020

Patch-wise++ Perturbation for Adversarial Targeted Attacks
by Lianli Gao et al

12-30-2020

Knowledge Distillation with Adaptive Asymmetric Label Sharpening for Semi-supervised Fracture Detection in Chest X-rays
by Yirui Wang et al

12-29-2020

A Review of Machine Learning Techniques for Applied Eye Fundus and Tongue Digital Image Processing with Diabetes Management System
by Wei Xiang Lim et al

12-30-2020

H2NF-Net for Brain Tumor Segmentation using Multimodal MR Imaging: 2nd Place Solution to BraTS Challenge 2020 Segmentation Task
by Haozhe Jia et al

Craig Smith