2021.8.16 Vision papers

 

08-10-2021

MotionInput v2.0 supporting DirectX: A modular library of open-source gesture-based machine learning and computer vision methods for interacting and controlling existing software with a webcam
by Ashild Kummen et al

08-12-2021

COVINS: Visual-Inertial SLAM for Centralized Collaboration
by Patrik Schmuck et al

08-11-2021

SIDER: Single-Image Neural Optimization for Facial Geometric Detail Recovery
by Aggelina Chatziagapi et al

08-11-2021

A Real-Time Online Learning Framework for Joint 3D Reconstruction and Semantic Segmentation of Indoor Scenes
by Davide Menini et al

08-11-2021

Fog Simulation on Real LiDAR Point Clouds for 3D Object Detection in Adverse Weather
by Martin Hahner et al

08-12-2021

Deep Amended Gradient Descent for Efficient Spectral Reconstruction from Single RGB Images
by Zhiyu Zhu et al

08-10-2021

Optimal MRI Undersampling Patterns for Ultimate Benefit of Medical Vision Tasks
by Artem Razumov et al

08-10-2021

FLAME-in-NeRF: Neural control of Radiance Fields for Free View Face Animation
by ShahRukh Athar et al

08-13-2021

An Interpretable Algorithm for Uveal Melanoma Subtyping from Whole Slide Cytology Images
by Haomin Chen et al

08-11-2021

Semi-Supervised Domain Generalizable Person Re-Identification
by Lingxiao He et al

08-11-2021

Learning to Rearrange Voxels in Binary Segmentation Masks for Smooth Manifold Triangulation
by Jianning Li et al

08-12-2021

Trash to Treasure: Harvesting OOD Data with Cross-Modal Matching for Open-Set Semi-Supervised Learning
by Junkai Huang et al

08-12-2021

Robotic Testbed for Rendezvous and Optical Navigation: Multi-Source Calibration and Machine Learning Use Cases
by Tae Ha Park et al

08-11-2021

Improving Single-Image Defocus Deblurring: How Dual-Pixel Images Help Through Multi-Task Learning
by Abdullah Abuolaim et al

08-11-2021

Deep Learning Classification of Lake Zooplankton
by S. P. Kyathanahally et al

08-10-2021

First Order Locally Orderless Registration
by Sune Darkner et al

08-10-2021

Embodied BERT: A Transformer Model for Embodied, Language-guided Visual Task Completion
by Alessandro Suglia et al

08-10-2021

Method Towards CVPR 2021 Image Matching Challenge
by Xiaopeng Bi et al

08-12-2021

Deep Microlocal Reconstruction for Limited-Angle Tomography
by Héctor Andrade-Loarca et al

08-10-2021

Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition
by Ziwei Xu et al

08-13-2021

FedPara: Low-rank Hadamard Product Parameterization for Efficient Federated Learning
by Nam Hyeon-Woo et al

08-11-2021

FakeAVCeleb: A Novel Audio-Video Multimodal Deepfake Dataset
by Hasam Khalid et al

08-12-2021

DARTS for Inverse Problems: a Study on Hyperparameter Sensitivity
by Jonas Geiping et al

08-12-2021

Resetting the baseline: CT-based COVID-19 diagnosis with Deep Transfer Learning is not as accurate as widely thought
by Fouzia Altaf et al

08-11-2021

Weakly Supervised Medical Image Segmentation
by Pedro H. T. Gama et al

08-11-2021

NI-UDA: Graph Adversarial Domain Adaptation from Non-shared-and-Imbalanced Big Data to Small Imbalanced Applications
by Guangyi Xiao et al

08-11-2021

Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report Generation With Alternate Learning
by Guangyi Liu et al

08-12-2021

Multiview Detection with Shadow Transformer (and View-Coherent Data Augmentation)
by Yunzhong Hou et al

08-13-2021

Modal-Adaptive Gated Recoding Network for RGB-D Salient Object Detection
by Feng Dong et al

08-10-2021

BERTHop: An Effective Vision-and-Language Model for Chest X-ray Disease Diagnosis
by Masoud Monajatipoor et al

08-12-2021

Unconditional Scene Graph Generation
by Sarthak Garg et al

08-12-2021

Mobile-Former: Bridging MobileNet and Transformer
by Yinpeng Chen et al

08-12-2021

MicroNet: Improving Image Recognition with Extremely Low FLOPs
by Yunsheng Li et al

08-12-2021

Semantic Concentration for Domain Adaptation
by Shuang Li et al

08-12-2021

MT-ORL: Multi-Task Occlusion Relationship Learning
by Panhe Feng et al

08-11-2021

Representation Learning for Remote Sensing: An Unsupervised Sensor Fusion Approach
by Aidan M. Swope et al

08-11-2021

Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object Segmentation
by Xiaoqi Zhao et al

08-11-2021

Voxel-level Importance Maps for Interpretable Brain Age Estimation
by Kyriaki-Margarita Bintsi et al

08-11-2021

The Pitfalls of Sample Selection: A Case Study on Lung Nodule Classification
by Vasileios Baltatzis et al

08-11-2021

Automatic Gaze Analysis: A Survey of Deep Learning based Approaches
by Shreya Ghosh et al

08-12-2021

Learning Visual Affordance Grounding from Demonstration Videos
by Hongchen Luo et al

08-13-2021

Progressive Representative Labeling for Deep Semi-Supervised Learning
by Xiaopeng Yan et al

08-13-2021

Coupling Model-Driven and Data-Driven Methods for Remote Sensing Image Restoration and Fusion
by Huanfeng Shen et al

08-11-2021

Rethinking Coarse-to-Fine Approach in Single Image Deblurring
by Sung-Jin Cho et al

08-12-2021

m-RevNet: Deep Reversible Neural Networks with Momentum
by Duo Li et al

08-12-2021

Towers of Babel: Combining Images, Language, and 3D Geometry for Learning Multimodal Vision
by Xiaoshi Wu et al

08-11-2021

Learning Bias-Invariant Representation by Cross-Sample Mutual Information Minimization
by Wei Zhu et al

08-10-2021

On the Effect of Pruning on Adversarial Robustness
by Artur Jordao et al

08-12-2021

PixelSynth: Generating a 3D-Consistent Experience from a Single Image
by Chris Rockwell et al

08-10-2021

How Self-Supervised Learning Can be Used for Fine-Grained Head Pose Estimation?
by Mahdi Pourmirzaei et al

08-10-2021

Interpreting Generative Adversarial Networks for Interactive Image Generation
by Bolei Zhou

08-10-2021

Learning Canonical 3D Object Representation for Fine-Grained Recognition
by Sunghun Joung et al

08-12-2021

Alzheimer's Disease Diagnosis via Deep Factorization Machine Models
by Raphael Ronge et al

08-12-2021

Distributional Depth-Based Estimation of Object Articulation Models
by Ajinkya Jain et al

08-10-2021

U-Net-and-a-half: Convolutional network for biomedical image segmentation using multiple expert-driven annotations
by Yichi Zhang et al

08-13-2021

Robustness testing of AI systems: A case study for traffic sign recognition
by Christian Berghoff et al

08-10-2021

SP-GAN: Sphere-Guided 3D Shape Generation and Manipulation
by Ruihui Li et al

08-10-2021

Scalable Reverse Image Search Engine for NASA Worldview
by Abhigya Sodani et al

08-10-2021

TBNet: Two-Stream Boundary-aware Network for Generic Image Manipulation Localization
by Zan Gao et al

08-11-2021

Statistical Dependency Guided Contrastive Learning for Multiple Labeling in Prenatal Ultrasound
by Shuangchi He et al

08-10-2021

Exploiting Features with Split-and-Share Module
by Jaemin Lee et al

08-11-2021

Towards Top-Down Just Noticeable Difference Estimation of Natural Images
by Qiuping Jiang et al

08-11-2021

An Approach to Partial Observability in Games: Learning to Both Act and Observe
by Elizabeth Gilmour et al

08-11-2021

Few-Shot Segmentation with Global and Local Contrastive Learning
by Weide Liu et al

08-12-2021

AGKD-BML: Defense Against Adversarial Attack by Attention Guided Knowledge Distillation and Bi-directional Metric Learning
by Hong Wang et al

08-12-2021

UniFaceGAN: A Unified Framework for Temporally Consistent Facial Video Editing
by Meng Cao et al

08-12-2021

Conditional Temporal Variational AutoEncoder for Action Video Prediction
by Xiaogang Xu et al

08-13-2021

Pruning vs XNOR-Net: A Comprehensive Study on Deep Learning for Audio Classification in Microcontrollers
by Md Mohaimenuzzaman et al

08-13-2021

Learning Transferable Parameters for Unsupervised Domain Adaptation
by Zhongyi Han et al

08-10-2021

BIDCD - Bosch Industrial Depth Completion Dataset
by Adam Botach et al

08-13-2021

Generative Zero-Shot Learning for Semantic Segmentation of 3D Point Cloud
by Björn Michele et al

08-10-2021

FoodLogoDet-1500: A Dataset for Large-Scale Food Logo Detection via Multi-Scale Feature Decoupling Network
by Qiang Hou et al

08-10-2021

Method Towards CVPR 2021 SimLocMatch Challenge
by Xiaopeng Bi et al

08-12-2021

Deep Motion Prior for Weakly-Supervised Temporal Action Localization
by Meng Cao et al

08-12-2021

Multi-Modal MRI Reconstruction with Spatial Alignment Network
by Kai Xuan et al

08-11-2021

Learning Oculomotor Behaviors from Scanpath
by Beibin Li et al

08-11-2021

One-Sided Box Filter for Edge Preserving Image Smoothing
by Yuanhao Gong

08-11-2021

Boosting the Generalization Capability in Cross-Domain Few-shot Learning via Noise-enhanced Supervised Autoencoder
by Hanwen Liang et al

08-11-2021

Learning Action Completeness from Points for Weakly-supervised Temporal Action Localization
by Pilhyeon Lee et al

08-10-2021

Differentiable Surface Rendering via Non-Differentiable Sampling
by Forrester Cole et al

08-12-2021

LabOR: Labeling Only if Required for Domain Adaptive Semantic Segmentation
by Inkyu Shin et al

08-12-2021

Non-imaging real-time detection and tracking of fast-moving objects
by Fengming Zhou et al

08-10-2021

Reference-based Defect Detection Network
by Zhaoyang Zeng et al

08-10-2021

Box-Aware Feature Enhancement for Single Object Tracking on Point Clouds
by Chaoda Zheng et al

08-10-2021

MetaPose: Fast 3D Pose from Multiple Views without 3D Supervision
by Ben Usman et al

08-10-2021

AuraSense: Robot Collision Avoidance by Full Surface Proximity Detection
by Xiaoran Fan et al

08-13-2021

SimCVD: Simple Contrastive Voxel-Wise Representation Distillation for Semi-Supervised Medical Image Segmentation
by Chenyu You et al

08-11-2021

Instance-weighted Central Similarity for Multi-label Image Retrieval
by Zhiwei Zhang et al

08-12-2021

DIODE: Dilatable Incremental Object Detection
by Can Peng et al

08-12-2021

A Systematic Benchmarking Analysis of Transfer Learning for Medical Image Analysis
by Mohammad Reza Hosseinzadeh Taher et al

08-12-2021

Continual Neural Mapping: Learning An Implicit Scene Representation from Sequential Observations
by Zike Yan et al

08-10-2021

White blood cell subtype detection and classification
by Nalla Praveen et al

08-10-2021

Multi-Camera Trajectory Forecasting with Trajectory Tensors
by Olly Styles et al

08-12-2021

perf4sight: A toolflow to model CNN training performance on Edge GPUs
by Aditya Rajagopal et al

08-12-2021

DnD: Dense Depth Estimation in Crowded Dynamic Indoor Scenes
by Dongki Jung et al

08-12-2021

Cascade Bagging for Accuracy Prediction with Few Training Samples
by Ruyi Zhang et al

08-10-2021

Hand Pose Classification Based on Neural Networks
by Rashmi Bakshi

08-10-2021

Multigranular Visual-Semantic Embedding for Cloth-Changing Person Re-identification
by Zan Gao et al

08-10-2021

Multi-domain Collaborative Feature Representation for Robust Visual Object Tracking
by Jiqing Zhang et al

08-10-2021

CPNet: Cross-Parallel Network for Efficient Anomaly Detection
by Youngsaeng Jin et al

08-13-2021

Point-Voxel Transformer: An Efficient Approach To 3D Deep Learning
by Cheng Zhang et al

08-12-2021

HandFoldingNet: A 3D Hand Pose Estimation Network Using Multiscale-Feature Guided Folding of a 2D Hand Skeleton
by Wencan Cheng et al

08-10-2021

Learning Fair Face Representation With Progressive Cross Transformer
by Yong Li et al

08-13-2021

Bi-Temporal Semantic Reasoning for the Semantic Change Detection of HR Remote Sensing Images
by Lei Ding et al

08-12-2021

Silhouette based View embeddings for Gait Recognition under Multiple Views
by Tianrui Chai et al

08-10-2021

SnowflakeNet: Point Cloud Completion by Snowflake Point Deconvolution with Skip-Transformer
by Peng Xiang et al

08-10-2021

Domain-Aware Universal Style Transfer
by Kibeom Hong et al

08-12-2021

TF-Blender: Temporal Feature Blender for Video Object Detection
by Yiming Cui et al

08-10-2021

Self-supervised Consensus Representation Learning for Attributed Graph
by Changshu Liu et al

08-12-2021

Billion-Scale Pretraining with Vision Transformers for Multi-Task Visual Representations
by Josh Beal et al

08-10-2021

ASMR: Learning Attribute-Based Person Search with Adaptive Semantic Margin Regularizer
by Boseung Jeong et al

08-10-2021

Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition
by Tailin Chen et al

08-11-2021

Self-supervised Contrastive Learning for Irrigation Detection in Satellite Imagery
by Chitra Agastya et al

08-11-2021

Zero-Shot Domain Adaptation with a Physics Prior
by Attila Lengyel et al

08-11-2021

Towards Interpretable Deep Networks for Monocular Depth Estimation
by Zunzhi You et al

08-10-2021

Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion
by Yikai Wang et al

08-12-2021

CODEs: Chamfer Out-of-Distribution Examples against Overconfidence Issue
by Keke Tang et al

08-10-2021

A Transformer-based Math Language Model for Handwritten Math Expression Recognition
by Huy Quang Ung et al

08-13-2021

Effective semantic segmentation in Cataract Surgery: What matters most?
by Theodoros Pissas et al

08-13-2021

UMFA: A photorealistic style transfer method based on U-Net and multi-layer feature aggregation
by D. Y. Rao et al

08-12-2021

Track without Appearance: Learn Box and Tracklet Embedding with Local and Global Motion Patterns for Vehicle Tracking
by Gaoang Wang et al

08-10-2021

Semantics-STGCNN: A Semantics-guided Spatial-Temporal Graph Convolutional Network for Multi-class Trajectory Prediction
by Ben A. Rainbow et al

08-12-2021

Memory-based Semantic Segmentation for Off-road Unstructured Natural Environments
by Youngsaeng Jin et al

08-12-2021

Spatio-Temporal Human Action Recognition Model with Flexible-interval Sampling and Normalization
by Yuke et al

08-12-2021

3D-SiamRPN: An End-to-End Learning Method for Real-Time 3D Single Object Tracking Using Raw Point Cloud
by Zheng Fang et al

08-10-2021

Instance-wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation
by Weilun Wang et al

08-10-2021

TrUMAn: Trope Understanding in Movies and Animations
by Hung-Ting Su et al

08-11-2021

Attention-driven Graph Clustering Network
by Zhihao Peng et al

08-11-2021

Mutual Affine Network for Spatially Variant Kernel Estimation in Blind Image Super-Resolution
by Jingyun Liang et al

08-11-2021

Hierarchical Conditional Flow: A Unified Framework for Image Super-Resolution and Image Rescaling
by Jingyun Liang et al

08-11-2021

Video Transformer for Deepfake Detection with Incremental Learning
by Sohail A. Khan et al

08-11-2021

ConvNets vs. Transformers: Whose Visual Representations are More Transferable?
by Hong-Yu Zhou et al

08-11-2021

A Better Loss for Visual-Textual Grounding
by Davide Rigoni et al

08-10-2021

VisEvent: Reliable Object Tracking via Collaboration of Frame and Event Flows
by Xiao Wang et al

08-10-2021

Prototype Completion for Few-Shot Learning
by Baoquan Zhang et al

08-10-2021

Iterative Self-consistent Parallel Magnetic Resonance Imaging Reconstruction based on Nonlocal Low-Rank Regularization
by Ting Pan et al

08-13-2021

IFR: Iterative Fusion Based Recognizer For Low Quality Scene Text Recognition
by Zhiwei Jia et al

08-13-2021

Detection and Captioning with Unseen Object Classes
by Berkan Demirel et al

08-13-2021

3D point cloud segmentation using GIS
by Chao-Jung Liu et al

08-11-2021

Two is a crowd: tracking relations in videos
by Artem Moskalev et al

08-10-2021

Understanding Character Recognition using Visual Explanations Derived from the Human Visual System and Deep Networks
by Chetan Ralekar et al

08-12-2021

Oriented R-CNN for Object Detection
by Xingxing Xie et al

08-11-2021

Mounting Video Metadata on Transformer-based Language Model for Open-ended Video Question Answering
by Donggeon Lee et al

08-12-2021

Progressive Coordinate Transforms for Monocular 3D Object Detection
by Li Wang et al

08-12-2021

iButter: Neural Interactive Bullet Time Generator for Human Free-viewpoint Rendering
by Liao Wang et al

08-13-2021

Evaluating the Robustness of Semantic Segmentation for Autonomous Driving against Real-World Adversarial Patch Attacks
by Federico Nesti et al

08-12-2021

Deep Camera Obscura: An Image Restoration Pipeline for Lensless Pinhole Photography
by Joshua D. Rego et al

08-12-2021

Improving Ranking Correlation of Supernet with Candidates Enhancement and Progressive Training
by Ziwei Yang et al

08-10-2021

An Image-based Generator Architecture for Synthetic Image Refinement
by Alex Nasser

08-13-2021

Towards Efficient Point Cloud Graph Neural Networks Through Architectural Simplification
by Shyam A. Tailor et al

08-12-2021

Presenting an extensive lab- and field-image dataset of crops and weeds for computer vision tasks in agriculture
by Michael A. Beck et al

08-12-2021

Patchwork: Concentric Zone-based Region-wise Ground Segmentation with Ground Likelihood Estimation Using a 3D LiDAR Sensor
by Hyungtae Lim et al

08-12-2021

Vision-Language Transformer and Query Generation for Referring Segmentation
by Henghui Ding et al

08-10-2021

Deep Metric Learning for Open World Semantic Segmentation
by Jun Cen et al

08-13-2021

Full-resolution quality assessment for pansharpening
by Giuseppe Scarpa et al

08-10-2021

R4Dyn: Exploring Radar for Self-Supervised Monocular Depth Estimation of Dynamic Scenes
by Stefano Gasperini et al

08-10-2021

The Effect of the Loss on Generalization: Empirical Study on Synthetic Lung Nodule Data
by Vasileios Baltatzis et al

08-13-2021

SVC-onGoing: Signature Verification Competition
by Ruben Tolosana et al

08-12-2021

Logit Attenuating Weight Normalization
by Aman Gupta et al

08-12-2021

AdaFit: Rethinking Learning-based Normal Estimation on Point Clouds
by Runsong Zhu et al

08-10-2021

SUNet: Symmetric Undistortion Network for Rolling Shutter Correction
by Bin Fan et al

08-11-2021

ProAI: An Efficient Embedded AI Hardware for Automotive Applications - a Benchmark Study
by Sven Mantowsky et al

08-12-2021

DiagViB-6: A Diagnostic Benchmark Suite for Vision Models in the Presence of Shortcut and Generalization Opportunities
by Elias Eulig et al

08-11-2021

Deep PET/CT fusion with Dempster-Shafer theory for lymphoma segmentation
by Ling Huang et al

08-13-2021

EEEA-Net: An Early Exit Evolutionary Neural Architecture Search
by Chakkrit Termritthikun et al

08-13-2021

Conditional DETR for Fast Training Convergence
by Depu Meng et al

08-10-2021

Meta-repository of screening mammography classifiers
by Benjamin Stadnick et al

08-13-2021

A Generative Adversarial Framework for Optimizing Image Matting and Harmonization Simultaneously
by Xuqian Ren et al

08-12-2021

Correlate-and-Excite: Real-Time Stereo Matching via Guided Cost Volume Excitation
by Antyanta Bangunharcana et al

08-13-2021

Dual Path Learning for Domain Adaptation of Semantic Segmentation
by Yiting Cheng et al

08-10-2021

Simple black-box universal adversarial attacks on medical image classification based on deep neural networks
by Kazuki Koga et al

08-10-2021

Elastic Tactile Simulation Towards Tactile-Visual Perception
by Yikai Wang et al

08-12-2021

Towards Interpretable Deep Metric Learning with Structural Matching
by Wenliang Zhao et al

08-12-2021

MUSIQ: Multi-scale Image Quality Transformer
by Junjie Ke et al

08-10-2021

Joint Multi-Object Detection and Tracking with Camera-LiDAR Fusion for Autonomous Driving
by Kemiao Huang et al

08-12-2021

MISS GAN: A Multi-IlluStrator Style Generative Adversarial Network for image to illustration translation
by Noa Barzilay et al

08-11-2021

Preventing Catastrophic Forgetting and Distribution Mismatch in Knowledge Distillation via Synthetic Data
by Kuluhan Binici et al

08-10-2021

Known Operator Learning and Hybrid Machine Learning in Medical Imaging - A Review of the Past, the Present, and the Future
by Andreas Maier et al

08-11-2021

Efficient Surfel Fusion Using Normalised Information Distance
by Louis Gallagher et al

08-11-2021

Discriminative Distillation to Reduce Class Confusion in Continual Learning
by Changhong Zhong et al

08-11-2021

Distilling Holistic Knowledge with Graph Neural Networks
by Sheng Zhou et al

08-10-2021

UniNet: A Unified Scene Understanding Network and Exploring Multi-Task Relationships through the Lens of Adversarial Attacks
by NareshKumar Gurulingan et al

08-11-2021

Person Re-identification via Attention Pyramid
by Guangyi Chen et al

08-11-2021

Cervical Optical Coherence Tomography Image Classification Based on Contrastive Self-Supervised Texture Learning
by Kaiyi Chen et al

08-11-2021

Automatic Polyp Segmentation via Multi-scale Subtraction Network
by Xiaoqi Zhao et al

08-12-2021

DexMV: Imitation Learning for Dexterous Manipulation from Human Videos
by Yuzhe Qin et al

08-11-2021

Mining the Benefits of Two-stage and One-stage HOI Detection
by Aixi Zhang et al

08-11-2021

M3D-VTON: A Monocular-to-3D Virtual Try-On Network
by Fuwei Zhao et al

08-10-2021

Depth Infused Binaural Audio Generation using Hierarchical Cross-Modal Attention
by Kranti Kumar Parida et al

08-13-2021

CNN-based Two-Stage Parking Slot Detection Using Region-Specific Multi-Scale Feature Extraction
by Quang Huy Bui et al

08-13-2021

SPACE: A Simulator for Physical Interactions and Causal Learning in 3D Environments
by Jiafei Duan et al

08-13-2021

Detecting socially interacting groups using f-formation: A survey of taxonomy, methods, datasets, applications, challenges, and future research directions
by Hrishav Bakul Barua et al

08-12-2021

TVT: Transferable Vision Transformer for Unsupervised Domain Adaptation
by Jinyu Yang et al

08-11-2021

MultiTask-CenterNet (MCN): Efficient and Diverse Multitask Learning using an Anchor Free Approach
by Falk Heuer et al

 
Craig Smith