2021.9.27 Vision papers

 

09-22-2021

Vehicle Behavior Prediction and Generalization Using Imbalanced Learning Techniques
by Theodor Westny et al

09-24-2021

Few-shot Learning Based on Multi-stage Transfer and Class-Balanced Loss for Diabetic Retinopathy Grading
by Lei Shi et al

09-22-2021

TACTIC: Joint Rate-Distortion-Accuracy Optimisation for Low Bitrate Compression
by Nikolina Kubiak et al

09-23-2021

Layered Neural Atlases for Consistent Video Editing
by Yoni Kasten et al

09-24-2021

How to find a good image-text embedding for remote sensing visual question answering?
by Christel Chappuis et al

09-24-2021

Learnable Triangulation for Deep Learning-based 3D Reconstruction of Objects of Arbitrary Topology from Single RGB Images
by Tarek Ben Charrada et al

09-22-2021

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing
by Bingchuan Li et al

09-22-2021

Animal inspired Application of a Variant of Mel Spectrogram for Seismic Data Processing
by Samayan Bhattacharya et al

09-21-2021

Towards a Real-Time Facial Analysis System
by Bishwo Adhikari et al

09-21-2021

Coast Sargassum Level Estimation from Smartphone Pictures
by Uriarte-Arcia Abril Valeria et al

09-23-2021

End-to-End Dense Video Grounding via Parallel Regression
by Fengyuan Shi et al

09-22-2021

Uncertainty-Aware Training for Cardiac Resynchronisation Therapy Response Prediction
by Tareen Dawood et al

09-22-2021

LDC-VAE: A Latent Distribution Consistency Approach to Variational AutoEncoders
by Xiaoyu Chen et al

09-23-2021

A Learned Stereo Depth System for Robotic Manipulation in Homes
by Krishna Shankar et al

09-22-2021

Caption Enriched Samples for Improving Hateful Memes Detection
by Efrat Blaier et al

09-24-2021

Localizing Infinity-shaped fishes: Sketch-guided object localization in the wild
by Pau Riba et al

09-21-2021

Multi-Domain Few-Shot Learning and Dataset for Agricultural Applications
by Sai Vidyaranya Nuthalapati et al

09-21-2021

Survey on Semantic Stereo Matching / Semantic Depth Estimation
by Viny Saajan Victor et al

09-22-2021

A Quantitative Comparison of Epistemic Uncertainty Maps Applied to Multi-Class Segmentation
by Robin Camarasa et al

09-21-2021

Skeleton-Graph: Long-Term 3D Motion Prediction From 2D Observations Using Deep Spatio-Temporal Graph CNNs
by Abduallah Mohamed et al

09-21-2021

Bayesian Confidence Calibration for Epistemic Uncertainty Modelling
by Fabian Küppers et al

09-22-2021

Natural Language Video Localization with Learnable Moment Proposals
by Shaoning Xiao et al

09-22-2021

Learning to Downsample for Segmentation of Ultra-High Resolution Images
by Chen Jin et al

09-21-2021

Learning Interpretable Concept Groups in CNNs
by Saurabh Varshneya et al

09-22-2021

A Novel Factor Graph-Based Optimization Technique for Stereo Correspondence Estimation
by Hanieh Shabanian et al

09-21-2021

Scale-aware direct monocular odometry
by Carlos Campos et al

09-23-2021

Self-supervised Learning for Semi-supervised Temporal Language Grounding
by Fan Luo et al

09-23-2021

Keypoints-Based Deep Feature Fusion for Cooperative Vehicle Detection of Autonomous Driving
by Yunshuang Yuan et al

09-23-2021

Fast Point Voxel Convolution Neural Network with Selective Feature Fusion for Point Cloud Semantic Segmentation
by Xu Wang et al

09-23-2021

SPNet: Multi-Shell Kernel Convolution for Point Cloud Semantic Segmentation
by Yuyan Li et al

09-23-2021

Leveraging distributed contact force measurements for slip detection: a physics-based approach enabled by a data-driven tactile sensor
by Pietro Griffa et al

09-22-2021

Unseen Object Amodal Instance Segmentation via Hierarchical Occlusion Modeling
by Seunghyeok Back et al

09-24-2021

ImplicitVol: Sensorless 3D Ultrasound Reconstruction with Deep Implicit Representation
by Pak-Hei Yeung et al

09-21-2021

Does Vision-and-Language Pretraining Improve Lexical Grounding?
by Tian Yun et al

09-24-2021

Two-Stage Mesh Deep Learning for Automated Tooth Segmentation and Landmark Localization on 3D Intraoral Scans
by Tai-Hsien Wu et al

09-21-2021

PDFNet: Pointwise Dense Flow Network for Urban-Scene Segmentation
by Venkata Satya Sai Ajay Daliparthi

09-22-2021

A deep neural network for multi-species fish detection using multiple acoustic cameras
by Garcia Fernandez et al

09-22-2021

Cross-Modal Coherence for Text-to-Image Retrieval
by Malihe Alikhani et al

09-23-2021

Long Short View Feature Decomposition via Contrastive Video Representation Learning
by Nadine Behrmann et al

09-21-2021

DS-Net++: Dynamic Weight Slicing for Efficient Inference in CNNs and Transformers
by Changlin Li et al

09-22-2021

Towards practical object detection for weed spraying in precision agriculture
by Adrian Salazar-Gomez et al

09-22-2021

FaceEraser: Removing Facial Parts for Augmented Reality
by Miao Hua et al

09-24-2021

GSIP: Green Semantic Segmentation of Large-Scale Indoor Point Clouds
by Min Zhang et al

09-21-2021

Robust marginalization of baryonic effects for cosmological inference at the field level
by Francisco Villaescusa-Navarro et al

09-24-2021

Frequency Pooling: Shift-Equivalent and Anti-Aliasing Downsampling
by Zhendong Zhang

09-21-2021

SemCal: Semantic LiDAR-Camera Calibration using Neural MutualInformation Estimator
by Peng Jiang et al

09-21-2021

AI in Osteoporosis
by Sokratis Makrogiannis et al

09-21-2021

Generating Compositional Color Representations from Text
by Paridhi Maheshwari et al

09-21-2021

Rapid detection and recognition of whole brain activity in a freely behaving Caenorhabditis elegans
by Yuxiang Wu et al

09-21-2021

Rotor Localization and Phase Mapping of Cardiac Excitation Waves using Deep Neural Networks
by Jan Lebert et al

09-21-2021

MVM3Det: A Novel Method for Multi-view Monocular 3D Detection
by Li Haoran et al

09-22-2021

Pix2seq: A Language Modeling Framework for Object Detection
by Ting Chen et al

09-22-2021

Differentiable Surface Triangulation
by Marie-Julie Rakotosaona et al

09-21-2021

KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation
by Yongfei Liu et al

09-21-2021

Single Person Pose Estimation: A Survey
by Feng Zhang et al

09-21-2021

LOTR: Face Landmark Localization Using Localization Transformer
by Ukrit Watchareeruetai et al

09-23-2021

Revisit Geophysical Imaging in A New View of Physics-informed Generative Adversarial Learning
by Fangshu Yang et al

09-21-2021

Oriented Object Detection in Aerial Images Based on Area Ratio of Parallelogram
by Xinyu Yu et al

09-22-2021

Rational Polynomial Camera Model Warping for Deep Learning Based Satellite Multi-View Stereo Matching
by Jian Gao et al

09-22-2021

Adversarial Transfer Attacks With Unknown Data and Class Overlap
by Luke E. Richards et al

09-24-2021

Towards Autonomous Crop-Agnostic Visual Navigation in Arable Fields
by Alireza Ahmadi et al

09-24-2021

Fine-Grained Image Generation from Bangla Text Description using Attentional Generative Adversarial Network
by Md Aminul Haque Palash et al

09-24-2021

SIM2REALVIZ: Visualizing the Sim2Real Gap in Robot Ego-Pose Estimation
by Theo Jaunet et al

09-21-2021

Comparison of single and multitask learning for predicting cognitive decline based on MRI data
by Vandad Imani et al

09-22-2021

Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers
by Yi Tay et al

09-21-2021

Finding Facial Forgery Artifacts with Parts-Based Detectors
by Steven Schwarcz et al

09-23-2021

Towards Generalized and Incremental Few-Shot Object Detection
by Yiting Li et al

09-23-2021

Improving Tuberculosis (TB) Prediction using Synthetically Generated Computed Tomography (CT) Images
by Ashia Lewis et al

09-23-2021

OH-Former: Omni-Relational High-Order Transformer for Person Re-Identification
by Xianing Chen et al

09-21-2021

Unsupervised Abstract Reasoning for Ravens Problem Matrices
by Tao Zhuo et al

09-24-2021

MODNet-V: Improving Portrait Video Matting via Background Restoration
by Jiayu Sun et al

09-24-2021

RSDet++: Point-based Modulated Loss for More Accurate Rotated Object Detection
by Wen Qian et al

09-21-2021

Data-driven controllers and the need for perception systems in underwater manipulation
by James P. Oubre et al

09-21-2021

Homography augumented momentum constrastive learning for SAR image retrieval
by Seonho Park et al

09-22-2021

Incorporating Data Uncertainty in Object Tracking Algorithms
by Anish Muthali et al

09-23-2021

LGD: Label-guided Self-distillation for Object Detection
by Peizhen Zhang et al

09-22-2021

DVC-P: Deep Video Compression with Perceptual Optimizations
by Saiping Zhang et al

09-24-2021

Visual Scene Graphs for Audio Source Separation
by Moitreya Chatterjee et al

09-21-2021

CondNet: Conditional Classifier for Scene Segmentation
by Changqian Yu et al

09-22-2021

An Efficient and Scalable Collection of Fly-inspired Voting Units for Visual Place Recognition in Changing Environments
by Bruno Arcanjo et al

09-21-2021

Self-Supervised Action-Space Prediction for Automated Driving
by Faris Janjoš et al

09-23-2021

The Hilti SLAM Challenge Dataset
by Michael Helmberger et al

09-23-2021

Multi-resolution deep learning pipeline for dense large scale point clouds
by Thomas Richard et al

09-24-2021

CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models
by Yuan Yao et al

09-24-2021

Adversarial Domain Feature Adaptation for Bronchoscopic Depth Estimation
by Mert Asim Karaoglu et al

09-23-2021

Clustering performance analysis using new correlation based cluster validity indices
by Nathakhun Wiroonsri

09-24-2021

Dense Contrastive Visual-Linguistic Pretraining
by Lei Shi et al

09-24-2021

Quantifying point cloud realism through adversarially learned latent representations
by Larissa T. Triess et al

09-21-2021

An Ultra-Fast Method for Simulation of Realistic Ultrasound Images
by Mostafa Sharifzadeh et al

09-22-2021

T6D-Direct: Transformers for Multi-Object 6D Pose Direct Regression
by Arash Amini et al

09-22-2021

A Method For Adding Motion-Blur on Arbitrary Objects By using Auto-Segmentation and Color Compensation Techniques
by Michihiro Mikamo et al

09-21-2021

VPN: Video Provenance Network for Robust Content Attribution
by Alexander Black et al

09-21-2021

Learning PAC-Bayes Priors for Probabilistic Neural Networks
by Maria Perez-Ortiz et al

09-23-2021

Lifelong 3D Object Recognition and Grasp Synthesis Using Dual Memory Recurrent Self-Organization Networks
by Krishnakumar Santhakumar et al

09-23-2021

Deep Learning Strategies for Industrial Surface Defect Detection Systems
by Dominik Martin et al

09-21-2021

3D Point Cloud Completion with Geometric-Aware Adversarial Augmentation
by Mengxi Wu et al

09-21-2021

Less is More: Learning from Synthetic Data with Fine-grained Attributes for Person Re-Identification
by Suncheng Xiang et al

09-21-2021

Joint Optical Neuroimaging Denoising with Semantic Tasks
by Tianfang Zhu et al

09-21-2021

Single Image Dehazing with An Independent Detail-Recovery Network
by Yan Li et al

09-24-2021

CLIPort: What and Where Pathways for Robotic Manipulation
by Mohit Shridhar et al

09-23-2021

Pairwise Emotional Relationship Recognition in Drama Videos: Dataset and Benchmark
by Xun Gao et al

09-24-2021

Quantitative Matching of Forensic Evidence Fragments Utilizing 3D Microscopy Analysis of Fracture Surface Replicas
by Bishoy Dawood et al

09-22-2021

Hierarchical Multimodal Transformer to Summarize Videos
by Bin Zhao et al

09-23-2021

SAME: Deformable Image Registration based on Self-supervised Anatomical Embeddings
by Fengze Liu et al

09-23-2021

Weakly-Supervised Monocular Depth Estimationwith Resolution-Mismatched Data
by Jialei Xu et al

09-21-2021

Automated segmentation and extraction of posterior eye segment using OCT scans
by Bilal Hassan et al

09-22-2021

Neural network relief: a pruning algorithm based on neural activity
by Aleksandr Dekhovich et al

09-23-2021

Towards Fine-grained 3D Face Dense Registration: An Optimal Dividing and Diffusing Method
by Zhenfeng Fan et al

09-21-2021

The First Vision For Vitals (V4V) Challenge for Non-Contact Video-Based Physiological Estimation
by Ambareesh Revanur et al

09-23-2021

A Skeleton-Driven Neural Occupancy Representation for Articulated Hands
by Korrawe Karunratanakul et al

09-23-2021

Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds
by Xuemeng Yang et al

09-22-2021

Learning Contrastive Representation for Semantic Correspondence
by Taihong Xiao et al

09-24-2021

Multi-View Video-Based 3D Hand Pose Estimation
by Leyla Khaleghi et al

09-21-2021

StereOBJ-1M: Large-scale Stereo Image Dataset for 6D Object Pose Estimation
by Xingyu Liu et al

09-22-2021

Improving 360 Monocular Depth Estimation via Non-local Dense Prediction Transformer and Joint Supervised and Self-supervised Learning
by Ilwi Yun et al

09-22-2021

HybridSDF: Combining Free Form Shapes and Geometric Primitives for effective Shape Manipulation
by Subeesh Vasu et al

09-23-2021

Recent Advances of Continual Learning in Computer Vision: An Overview
by Haoxuan Qu et al

09-22-2021

Deep Variational Clustering Framework for Self-labeling of Large-scale Medical Images
by Farzin Soleymani et al

09-22-2021

Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images
by Zhenzhen Wang et al

09-24-2021

Tackling Inter-Class Similarity and Intra-Class Variance for Microscopic Image-based Classification
by Aishwarya Venkataramanan et al

09-24-2021

Unaligned Image-to-Image Translation by Learning to Reweight
by Shaoan Xie et al

09-23-2021

Holistic Semi-Supervised Approaches for EEG Representation Learning
by Guangyi Zhang et al

09-22-2021

Efficient Context-Aware Network for Abdominal Multi-organ Segmentation
by Fan Zhang et al

09-24-2021

From images in the wild to video-informed image classification
by Marc Böhlen et al

09-22-2021

Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation
by Yuanxun Lu et al

09-22-2021

Early Lane Change Prediction for Automated Driving Systems Using Multi-Task Attention-based Convolutional Neural Networks
by Sajjad Mozaffari et al

09-21-2021

Mixed-supervised segmentation: Confidence maximization helps knowledge distillation
by Bingyuan Liu et al

09-23-2021

Scene Graph Generation for Better Image Captioning?
by Maximilian Mozes et al

09-23-2021

DeepRare: Generic Unsupervised Visual Attention Models
by Phutphalla Kong et al

09-22-2021

Self-Training Based Unsupervised Cross-Modality Domain Adaptation for Vestibular Schwannoma and Cochlea Segmentation
by Hyungseob Shin et al

09-23-2021

PRANet: Point Cloud Registration with an Artificial Agent
by Lisa Tse et al

09-21-2021

KDFNet: Learning Keypoint Distance Field for 6D Object Pose Estimation
by Xingyu Liu et al

09-21-2021

Multi-Source Video Domain Adaptation with Temporal Attentive Moment Alignment
by Yuecong Xu et al

09-21-2021

Enforcing Mutual Consistency of Hard Regions for Semi-supervised Medical Image Segmentation
by Yicheng Wu et al

09-22-2021

The CAMELS Multifield Dataset: Learning the Universes Fundamental Parameters with Artificial Intelligence
by Francisco Villaescusa-Navarro et al

09-23-2021

Predicting the Timing of Camera Movements From the Kinematics of Instruments in Robotic-Assisted Surgery Using Artificial Neural Networks
by Hanna Kossowsky et al

09-24-2021

Training dataset generation for bridge game registration
by Piotr Wzorek et al

09-21-2021

TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models
by Minghao Li et al

09-23-2021

Hierarchical Memory Matching Network for Video Object Segmentation
by Hongje Seong et al

09-21-2021

Self-supervised Representation Learning for Reliable Robotic Monitoring of Fruit Anomalies
by Taeyeong Choi et al

09-23-2021

A 3D Mesh-based Lifting-and-Projection Network for Human Pose Transfer
by Jinxiang Liu et al

09-23-2021

Feasibility study of urban flood mapping using traffic signs for route optimization
by Bahareh Alizadeh et al

09-23-2021

Training Automatic View Planner for Cardiac MR Imaging via Self-Supervision by Spatial Relationship between Views
by Dong Wei et al

09-23-2021

Paint4Poem: A Dataset for Artistic Visualization of Classical Chinese Poems
by Dan Li et al

09-23-2021

Cross Attention-guided Dense Network for Images Fusion
by Zhengwen Shen et al

09-22-2021

A Benchmark Comparison of Visual Place Recognition Techniques for Resource-Constrained Embedded Platforms
by Rose Power et al

09-24-2021

ZSD-YOLO: Zero-Shot YOLO Detection using Vision-Language KnowledgeDistillation
by Johnathan Xie et al

09-24-2021

DeepStroke: An Efficient Stroke Screening Framework for Emergency Rooms with Multimodal Adversarial Deep Learning
by Tongan Cai et al

09-24-2021

Catadioptric Stereo on a Smartphone
by Kristijan Bartol et al

09-24-2021

Learning-based Noise Component Map Estimation for Image Denoising
by Sheyda Ghanbaralizadeh Bahnemiri et al

09-23-2021

MARMOT: A Deep Learning Framework for Constructing Multimodal Representations for Vision-and-Language Tasks
by Patrick Y. Wu et al

09-23-2021

End-to-End AI-based MRI Reconstruction and Lesion Detection Pipeline for Evaluation of Deep Learning Image Reconstruction
by Ruiyang Zhao et al

09-23-2021

How much human-like visual experience do current self-supervised learning algorithms need to achieve human-level object recognition?
by A. Emin Orhan

 
Craig Smith