2020.10.19 Vision papers

 

10-13-2020

Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision
by Hao Tan et al

10-14-2020

Deep Learning from Small Amount of Medical Data with Noisy Labels: A Meta-Learning Approach
by Görkem Algan et al

10-13-2020

Are all negatives created equal in contrastive instance discrimination?
by Tiffany et al

10-14-2020

NeRF++: Analyzing and Improving Neural Radiance Fields
by Kai Zhang et al

10-14-2020

AdaBelief Optimizer: Adapting Stepsizes by the Belief in Observed Gradients
by Juntang Zhuang et al

10-15-2020

Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs
by Ana Marasović et al

10-15-2020

The Deep Bootstrap: Good Online Learners are Good Offline Generalizers
by Preetum Nakkiran et al

10-14-2020

Just Pick a Sign: Optimizing Deep Multitask Models with Gradient Sign Dropout
by Zhao Chen et al

10-15-2020

MOTChallenge: A Benchmark for Single-camera Multiple Target Tracking
by Patrick Dendorfer et al

10-13-2020

Does my multimodal model learn cross-modal interactions? Its harder to tell than you might think!
by Jack Hessel et al

10-13-2020

LM-Reloc: Levenberg-Marquardt Based Direct Visual Relocalization
by Lukas von Stumberg et al

10-14-2020

Viewmaker Networks: Learning Views for Unsupervised Representation Learning
by Alex Tamkin et al

10-13-2020

Video Action Understanding: A Tutorial
by Matthew Hutchinson et al

10-15-2020

XPDNet for MRI Reconstruction: an Application to the fastMRI 2020 Brain Challenge
by Zaccharie Ramzi et al

10-15-2020

Empty Cities: a Dynamic-Object-Invariant Space for Visual SLAM
by Berta Bescos et al

10-15-2020

DynaSLAM II: Tightly-Coupled Multi-Object Tracking and SLAM
by Berta Bescos et al

10-14-2020

Spherical Knowledge Distillation
by Jia Guo et al

10-15-2020

Representation Learning via Invariant Causal Mechanisms
by Jovana Mitrovic et al

10-14-2020

Unsupervised Self-training Algorithm Based on Deep Learning for Optical Aerial Images Change Detection
by Yuan Zhou et al

10-15-2020

Improved Multi-Source Domain Adaptation by Preservation of Factors
by Sebastian Schrom et al

10-14-2020

Deep Ensembles for Low-Data Transfer Learning
by Basil Mustafa et al

10-15-2020

Does Data Augmentation Benefit from Split BatchNorms
by Amil Merchant et al

10-14-2020

A New Distributional Ranking Loss With Uncertainty: Illustrated in Relative Depth Estimation
by Alican Mertan et al

10-14-2020

Do End-to-end Stereo Algorithms Under-utilize Information?
by Changjiang Cai et al

10-14-2020

Back to the Future: Cycle Encoding Prediction for Self-supervised Contrastive Video Representation Learning
by Xinyu Yang et al

10-15-2020

Self-Supervised Domain Adaptation with Consistency Training
by L. Xiao et al

10-15-2020

Interpretation of Swedish Sign Language using Convolutional Neural Networks and Transfer Learning
by Gustaf Halvardsson et al

10-14-2020

RetiNerveNet: Using Recursive Deep Learning to Estimate Pointwise 24-2 Visual Field Data based on Retinal Structure
by Shounak Datta et al

10-15-2020

HS-ResNet: Hierarchical-Split Block on Convolutional Neural Network
by Pengcheng Yuan et al

10-14-2020

AMPA-Net: Optimization-Inspired Attention Neural Network for Deep Compressed Sensing
by Nanyu Li et al

10-15-2020

Respecting Domain Relations: Hypothesis Invariance for Domain Generalization
by Ziqi Wang et al

10-15-2020

Interactive Latent Interpolation on MNIST Dataset
by Mazeyar Moeini Feizabadi et al

10-15-2020

CIMON: Towards High-quality Hash Codes
by Xiao Luo et al

10-14-2020

Deep Learning Models for Predicting Wildfires from Historical Remote-Sensing Data
by Fantine Huot et al

10-15-2020

Generalizing Universal Adversarial Attacks Beyond Additive Perturbations
by Yanghao Zhang et al

10-15-2020

A Hamiltonian Monte Carlo Method for Probabilistic Adversarial Attack and Learning
by Hongjun Wang et al

10-15-2020

LiteDepthwiseNet: An Extreme Lightweight Network for Hyperspectral Image Classification
by Benlei Cui et al

10-15-2020

Unsupervised Constrative Person Re-identification
by Bo Pang et al

10-14-2020

Matching-space Stereo Networks for Cross-domain Generalization
by Changjiang Cai et al

10-14-2020

Towards Accurate Quantization and Pruning via Data-free Knowledge Transfer
by Chen Zhu et al

10-14-2020

Auto-calibration Method Using Stop Signs for Urban Autonomous Driving Applications
by Yunhai Han et al

10-14-2020

Skeleton-bridged Point Completion: From Global Inference to Local Adjustment
by Yinyu Nie et al

10-15-2020

Encoder-decoder semantic segmentation models for electroluminescence images of thin-film photovoltaic modules
by Evgenii Sovetkin et al

10-14-2020

Harnessing Uncertainty in Domain Adaptation for MRI Prostate Lesion Segmentation
by Eleni Chiou et al

10-14-2020

Identifying Wrongly Predicted Samples: A Method for Active Learning
by Rahaf Aljundi et al

10-13-2020

Measuring Visual Generalization in Continuous Control from Pixels
by Jake Grigsby et al

10-15-2020

Self-training for Few-shot Transfer Across Extreme Task Differences
by Cheng Perng Phoo et al

10-15-2020

A Human Eye-based Text Color Scheme Generation Method for Image Synthesis
by Shao Wei Wang et al

10-14-2020

AI-based BMI Inference from Facial Images: An Application to Weight Monitoring
by Hera Siddiqui et al

10-15-2020

Unsupervised Video Anomaly Detection via Flow-based Generative Modeling on Appearance and Motion Latent Features
by MyeongAh Cho et al

10-15-2020

Robust Keypoint Detection and Pose Estimation of Robot Manipulators with Self-Occlusions via Sim-to-Real Transfer
by Jingpei Lu et al

10-15-2020

An Empirical Analysis of Visual Features for Multiple Object Tracking in Urban Scenes
by Mehdi Miah et al

10-15-2020

THIN: THrowable Information Networks and Application for Facial Expression Recognition In The Wild
by Estephe Arnaud et al

10-14-2020

PP-LinkNet: Improving Semantic Segmentation of High Resolution Satellite Imagery with Multi-stage Training
by An Tran et al

10-13-2020

On Deep Learning Techniques to Boost Monocular Depth Estimation for Autonomous Navigation
by Raul de Queiroz Mendes et al

10-13-2020

Random Network Distillation as a Diversity Metric for Both Image and Text Generation
by Liam Fowl et al

10-14-2020

Pose Refinement Graph Convolutional Network for Skeleton-based Action Recognition
by Shijie Li et al

10-14-2020

Photovoltaic module segmentation and thermal analysis tool from thermal images
by L. E. Montañez et al

10-15-2020

Auto Seg-Loss: Searching Metric Surrogates for Semantic Segmentation
by Hao Li et al

10-15-2020

Boosting Image-based Mutual Gaze Detection using Pseudo 3D Gaze
by Bardia Doosti et al

10-13-2020

Experimental Quantum Generative Adversarial Networks for Image Generation
by He-Liang Huang et al

10-15-2020

A Deep Drift-Diffusion Model for Image Aesthetic Score Distribution Prediction
by Xin Jin et al

10-15-2020

Performance evaluation and application of computation based low-cost homogeneous machine learning model algorithm for image classification
by W. H. Huang

10-13-2020

Linking average- and worst-case perturbation robustness via class selectivity and dimensionality
by Matthew L. Leavitt et al

10-15-2020

Object Tracking Using Spatio-Temporal Future Prediction
by Yuan Liu et al

10-15-2020

FOSS: Multi-Person Age Estimation with Focusing on Objects and Still Seeing Surroundings
by Masakazu Yoshimura et al

10-14-2020

Better Patch Stitching for Parametric Surface Reconstruction
by Zhantao Deng et al

10-14-2020

Unsupervised Learning of Depth and Ego-Motion from Cylindrical Panoramic Video with Applications for Virtual Reality
by Alisha Sharma et al

10-13-2020

COVID-19 Imaging Data Privacy by Federated Learning Design: A Theoretical Framework
by Anwaar Ulhaq et al

10-13-2020

Coarse and fine-grained automatic cropping deep convolutional neural network
by Jingfei Chang

10-14-2020

A Patch-based Image Denoising Method Using Eigenvectors of the Geodesics Gramian Matrix
by Kelum Gajamannage et al

10-13-2020

When Wireless Communications Meet Computer Vision in Beyond 5G
by Takayuki Nishio et al

10-15-2020

LTN: Long-Term Network for Long-Term Motion Prediction
by YingQiao Wang

10-14-2020

CS2-Net: Deep Learning Segmentation of Curvilinear Structures in Medical Imaging
by Lei Mou et al

10-14-2020

3D Segmentation Networks for Excessive Numbers of Classes: Distinct Bone Segmentation in Upper Bodies
by Eva Schnider et al

10-14-2020

GreedyFool: An Imperceptible Black-box Adversarial Example Attack against Neural Networks
by Hui Liu et al

10-14-2020

Semantic Flow-guided Motion Removal Method for Robust Mapping
by Xudong Lv et al

10-16-2020

What Can You Learn from Your Muscles? Learning Visual Representation from Human Interactions
by Kiana Ehsani et al

10-14-2020

Practical Deep Raw Image Denoising on Mobile Devices
by Yuzhi Wang et al

10-13-2020

Training independent subnetworks for robust prediction
by Marton Havasi et al

10-13-2020

Multi-Scale Networks for 3D Human PoseEstimation with Inference Stage Optimization
by Cheng Yu et al

10-13-2020

Rotation Averaging with Attention Graph Neural Networks
by Joshua Thorpe et al

10-13-2020

Which Model to Transfer? Finding the Needle in the Growing Haystack
by Cedric Renggli et al

10-13-2020

Correlation Filter for UAV-Based Aerial Tracking: A Review and Experimental Evaluation
by Changhong Fu et al

10-16-2020

A Generalizable and Accessible Approach to Machine Learning with Global Satellite Imagery
by Esther Rolf et al

10-14-2020

PointManifold: Using Manifold Learning for Point Cloud Classification
by Dinghao Yang et al

10-14-2020

Efficient and high accuracy 3-D OCT angiography motion correction in pathology
by Stefan B. Ploner et al

10-13-2020

A review of 3D human pose estimation algorithms for markerless motion capture
by Yann Desmarais et al

10-13-2020

RMDL: Recalibrated multi-instance deep learning for whole slide gastric image classification
by Shujun Wang et al

10-13-2020

Exploring Efficient Volumetric Medical Image Segmentation Using 2.5D Method: An Empirical Study
by Yichi Zhang et al

10-14-2020

Adaptive-Attentive Geolocalization from few queries: a hybrid approach
by Gabriele Moreno Berton et al

10-16-2020

Auxiliary Task Reweighting for Minimum-data Learning
by Baifeng Shi et al

10-13-2020

Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration
by Zongxin Yang et al

10-16-2020

Extracting Signals of Higgs Boson From Background Noise Using Deep Neural Networks
by Muhammad Abbas et al

10-15-2020

Early-stage COVID-19 diagnosis in presence of limited posteroanterior chest X-ray images via novel Pinball-OCSVM
by Sanjay Kumar Sonbhadra et al

10-13-2020

DoFE: Domain-oriented Feature Embedding for Generalizable Fundus Image Segmentation on Unseen Datasets
by Shujun Wang et al

10-14-2020

A spatial model checker in GPU (extended version)
by Laura Bussi et al

10-14-2020

Fast meningioma segmentation in T1-weighted MRI volumes using a lightweight 3D deep learning architecture
by David Bouget et al

10-16-2020

Anisotropic Stroke Control for Multiple Artists Style Transfer
by Xuanhong Chen et al

10-14-2020

Self-Supervised Ranking for Representation Learning
by Ali Varamesh et al

10-13-2020

A Multi-Modal Method for Satire Detection using Textual and Visual Cues
by Lily Li et al

10-16-2020

How many images do I need? Understanding how sample size per class affects deep learning model performance metrics for balanced designs in autonomous wildlife monitoring
by Saleh Shahinfar et al

10-14-2020

Semantic Segmentation for Partially Occluded Apple Trees Based on Deep Learning
by Zijue Chen et al

10-13-2020

A Scale and Rotational Invariant Key-point Detector based on Sparse Coding
by Thanh Hong-Phuoc et al

10-13-2020

DORi: Discovering Object Relationship for Moment Localization of a Natural-Language Query in Video
by Cristian Rodriguez-Opazo et al

10-14-2020

A Vector-based Representation to Enhance Head Pose Estimation
by Zhiwen Cao et al

10-16-2020

Training Data Generating Networks: Linking 3D Shapes and Few-Shot Classification
by Biao Zhang et al

10-15-2020

Semantic Editing On Segmentation Map Via Multi-Expansion Loss
by Jianfeng He et al

10-14-2020

Data Augmentation for Meta-Learning
by Renkun Ni et al

10-13-2020

Three-Dimensional Lip Motion Network for Text-Independent Speaker Recognition
by Jianrong Wang et al

10-13-2020

ISTA-NAS: Efficient and Consistent Neural Architecture Search by Sparse Coding
by Yibo Yang et al

10-13-2020

Ferrograph image classification
by Peng Peng et al

10-15-2020

Data Valuation for Medical Imaging Using Shapley Value: Application on A Large-scale Chest X-ray Dataset
by Siyi Tang et al

10-15-2020

Maximum-Entropy Adversarial Data Augmentation for Improved Generalization and Robustness
by Long Zhao et al

10-15-2020

Input-Aware Dynamic Backdoor Attack
by Anh Nguyen et al

10-13-2020

Two-Stream Compare and Contrast Network for Vertebral Compression Fracture Diagnosis
by Shixiang Feng et al

10-13-2020

Self-Supervised Multi-View Synchronization Learning for 3D Pose Estimation
by Simon Jenni et al

10-15-2020

MAST: Multimodal Abstractive Summarization with Trimodal Hierarchical Attention
by Aman Khullar et al

10-16-2020

ASMFS: Adaptive-Similarity-based Multi-modality Feature Selection for Classification of Alzheimers Disease
by Yuang Shi et al

10-13-2020

Robust Two-Stream Multi-Feature Network for Driver Drowsiness Detection
by Qi Shen et al

10-13-2020

Low-rank Convex/Sparse Thermal Matrix Approximation for Infrared-based Diagnostic System
by Bardia Yousefi et al

10-13-2020

Audio-Visual Self-Supervised Terrain Type Discovery for Mobile Platforms
by Akiyoshi Kurobe et al

10-14-2020

Towards Optimal Filter Pruning with Balanced Performance and Pruning Speed
by Dong Li et al

10-15-2020

Room-Across-Room: Multilingual Vision-and-Language Navigation with Dense Spatiotemporal Grounding
by Alexander Ku et al

10-14-2020

Learning Propagation Rules for Attribution Map Generation
by Yiding Yang et al

10-14-2020

Vision-Aided Radio: User Identity Match in Radio and Video Domains Using Machine Learning
by Vinicius M. de Pinho et al

10-15-2020

Physics-informed GANs for Coastal Flood Visualization
by Björn Lütjens et al

10-13-2020

Intrapersonal Parameter Optimization for Offline Handwritten Signature Augmentation
by Teruo M. Maruyama et al

10-13-2020

Scenic: A Language for Scenario Specification and Data Generation
by Daniel J. Fremont et al

10-16-2020

Towards truly local gradients with CLAPP: Contrastive, Local And Predictive Plasticity
by Bernd Illing et al

10-14-2020

Development of Open Informal Dataset Affecting Autonomous Driving
by Yong-Gu Lee et al

10-13-2020

Making Every Label Count: Handling Semantic Imprecision by Integrating Domain Knowledge
by Clemens-Alexander Brust et al

10-14-2020

Fader Networks for domain adaptation on fMRI: ABIDE-II study
by Marina Pominova et al

10-13-2020

Land Cover Semantic Segmentation Using ResUNet
by Vasilis Pollatos et al

10-16-2020

Automated Iterative Training of Convolutional Neural Networks for Tree Skeleton Segmentation
by Keenan Granland et al

10-13-2020

MixCo: Mix-up Contrastive Learning for Visual Representation
by Sungnyun Kim et al

10-14-2020

Multi-class segmentation under severe class imbalance: A case study in roof damage assessment
by Jean-Baptiste Boin et al

10-14-2020

Relative Depth Estimation as a Ranking Problem
by Alican Mertan et al

10-14-2020

FC-DCNN: A densely connected neural network for stereo estimation
by Dominik Hirner et al

10-16-2020

HPERL: 3D Human Pose Estimation from RGB and LiDAR
by Michael Fürst et al

10-13-2020

Detecting Anomalies from Video-Sequences: a Novel Descriptor
by Giulia Orrù et al

10-14-2020

WeightAlign: Normalizing Activations by Weight Alignment
by Xiangwei Shi et al

10-15-2020

Overfitting or Underfitting? Understand Robustness Drop in Adversarial Training
by Zichao Li et al

10-16-2020

How Does Supernet Help in Neural Architecture Search?
by Yuge Zhang et al

10-16-2020

G-DARTS-A: Groups of Channel Parallel Sampling with Attention
by Zhaowen Wang et al

10-13-2020

Few-shot Action Recognition with Implicit Temporal Alignment and Pair Similarity Optimization
by Congqi Cao et al

10-16-2020

Real-Time Face & Eye Tracking and Blink Detection using Event Cameras
by Cian Ryan et al

10-15-2020

On the Exploration of Incremental Learning for Fine-grained Image Retrieval
by Wei Chen et al

10-14-2020

Privacy-Preserving Object Detection & Localization Using Distributed Machine Learning: A Case Study of Infant Eyeblink Conditioning
by Stefan Zwaard et al

10-13-2020

A Generalized Zero-Shot Framework for Emotion Recognition from Body Gestures
by Jinting Wu et al

10-13-2020

Improving Road Signs Detection performance by Combining the Features of Hough Transform and Texture
by Tarik Ayaou et al

10-13-2020

Deep Learning for Recognizing Mobile Targets in Satellite Imagery
by Mark Pritt

10-13-2020

How important are faces for person re-identification?
by Julia Dietlmeier et al

10-15-2020

Semi-Supervised Semantic Segmentation in Earth Observation: The MiniFrance Suite, Dataset Analysis and Multi-task Network Study
by Javiera Castillo-Navarro et al

10-15-2020

Human Segmentation with Dynamic LiDAR Data
by Tao Zhong et al

10-13-2020

Automation of Hemocompatibility Analysis Using Image Segmentation and a Random Forest
by Johanna C. Clauser et al

10-16-2020

New Ideas and Trends in Deep Multimodal Content Understanding: A Review
by Wei Chen et al

10-13-2020

Satellite Image Classification with Deep Learning
by Mark Pritt et al

10-13-2020

Impact of Thermal Throttling on Long-Term Visual Inference in a CPU-based Edge Device
by Théo Benoit-Cattin et al

10-16-2020

Latent Vector Recovery of Audio GANs
by Andrew Keyes et al

10-15-2020

Video Object Segmentation with Adaptive Feature Bank and Uncertain-Region Refinement
by Yongqing Liang et al

10-16-2020

Reconstructing A Large Scale 3D Face Dataset for Deep 3D Face Identification
by Cuican Yu et al

10-16-2020

VolumeNet: A Lightweight Parallel Network for Super-Resolution of Medical Volumetric Data
by Yinhao Li et al

10-15-2020

QReLU and m-QReLU: Two novel quantum activation functions to aid medical diagnostics
by L. Parisi et al

10-16-2020

On the surprising similarities between supervised and self-supervised models
by Robert Geirhos et al

10-16-2020

Learning Monocular Dense Depth from Events
by Javier Hidalgo-Carrió et al

10-16-2020

Pose And Joint-Aware Action Recognition
by Anshul Shah et al

10-16-2020

Difference-in-Differences: Bridging Normalization and Disentanglement in PG-GAN
by Xiao Liu et al

10-14-2020

Differential diagnosis and molecular stratification of gastrointestinal stromal tumors on CT images using a radiomics approach
by Martijn P. A. Starmans et al

10-16-2020

Learning Accurate Entropy Model with Global Reference for Image Compression
by Yichen Qian et al

10-16-2020

Deep Learning based Automated Forest Health Diagnosis from Aerial Images
by Chia-Yen Chiang et al

10-14-2020

Domain Shift in Computer Vision models for MRI data analysis: An Overview
by Ekaterina Kondrateva et al

10-15-2020

What is More Likely to Happen Next? Video-and-Language Future Event Prediction
by Jie Lei et al

10-16-2020

Vid-ODE: Continuous-Time Video Generation with Neural Ordinary Differential Equation
by Sunghyun Park et al

10-13-2020

LiDAM: Semi-Supervised Learning with Localized Domain Adaptation and Iterative Matching
by Qun Liu et al

10-15-2020

Impact of Action Unit Occurrence Patterns on Detection
by Saurabh Hinduja et al

10-15-2020

Egok360: A 360 Egocentric Kinetic Human Activity Video Dataset
by Keshav Bhandari et al

10-15-2020

Revisiting Optical Flow Estimation in 360 Videos
by Keshav Bhandari et al

10-15-2020

TextMage: The Automated Bangla Caption Generator Based On Deep Learning
by Abrar Hasin Kamal et al

10-13-2020

Electroencephalography signal processing based on textural features for monitoring the drivers state by a Brain-Computer Interface
by Giulia Orrù et al

10-15-2020

Why Layer-Wise Learning is Hard to Scale-up and a Possible Solution via Accelerated Downsampling
by Wenchi Ma et al

10-16-2020

Toward Accurate Person-level Action Recognition in Videos of Crowded Scenes
by Li Yuan et al

10-16-2020

Human Perception-based Evaluation Criterion for Ultra-high Resolution Cell Membrane Segmentation
by Ruohua Shi et al

10-16-2020

Manipulation-Oriented Object Perception in Clutter through Affordance Coordinate Frames
by Xiaotong Chen et al

10-16-2020

SF-UDA3D3D: Source-Free Unsupervised Domain Adaptation for LiDAR-Based 3D Object Detection
by Cristiano Saltori et al

10-16-2020

In Depth Bayesian Semantic Scene Completion
by David Gillsjö et al

10-16-2020

Volumetric Calculation of Quantization Error in 3-D Vision Systems
by Eleni Bohacek et al

10-14-2020

Taking A Closer Look at Synthesis: Fine-grained Attribute Analysis for Person Re-Identification
by Suncheng Xiang et al

10-16-2020

Towards Online Steering of Flame Spray Pyrolysis Nanoparticle Synthesis
by Maksim Levental et al

10-15-2020

Integrating Coarse Granularity Part-level Features with Supervised Global-level Features for Person Re-identification
by Xiaofei Mao et al

10-15-2020

Quantifying the Extent to Which Race and Gender Features Determine Identity in Commercial Face Recognition Algorithms
by John J. Howard et al

10-15-2020

Convolutional Neural Network for Blur Images Detection as an Alternative for Laplacian Method
by Tomasz Szandala

 
Craig Smith