2021.9.27 Vision papers

09-22-2021	Vehicle Behavior Prediction and Generalization Using Imbalanced Learning Techniques by Theodor Westny et al
09-24-2021	Few-shot Learning Based on Multi-stage Transfer and Class-Balanced Loss for Diabetic Retinopathy Grading by Lei Shi et al
09-22-2021	TACTIC: Joint Rate-Distortion-Accuracy Optimisation for Low Bitrate Compression by Nikolina Kubiak et al
09-23-2021	Layered Neural Atlases for Consistent Video Editing by Yoni Kasten et al
09-24-2021	How to find a good image-text embedding for remote sensing visual question answering? by Christel Chappuis et al
09-24-2021	Learnable Triangulation for Deep Learning-based 3D Reconstruction of Objects of Arbitrary Topology from Single RGB Images by Tarek Ben Charrada et al
09-22-2021	DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing by Bingchuan Li et al
09-22-2021	Animal inspired Application of a Variant of Mel Spectrogram for Seismic Data Processing by Samayan Bhattacharya et al
09-21-2021	Towards a Real-Time Facial Analysis System by Bishwo Adhikari et al
09-21-2021	Coast Sargassum Level Estimation from Smartphone Pictures by Uriarte-Arcia Abril Valeria et al
09-23-2021	End-to-End Dense Video Grounding via Parallel Regression by Fengyuan Shi et al
09-22-2021	Uncertainty-Aware Training for Cardiac Resynchronisation Therapy Response Prediction by Tareen Dawood et al
09-22-2021	LDC-VAE: A Latent Distribution Consistency Approach to Variational AutoEncoders by Xiaoyu Chen et al
09-23-2021	A Learned Stereo Depth System for Robotic Manipulation in Homes by Krishna Shankar et al
09-22-2021	Caption Enriched Samples for Improving Hateful Memes Detection by Efrat Blaier et al
09-24-2021	Localizing Infinity-shaped fishes: Sketch-guided object localization in the wild by Pau Riba et al
09-21-2021	Multi-Domain Few-Shot Learning and Dataset for Agricultural Applications by Sai Vidyaranya Nuthalapati et al
09-21-2021	Survey on Semantic Stereo Matching / Semantic Depth Estimation by Viny Saajan Victor et al
09-22-2021	A Quantitative Comparison of Epistemic Uncertainty Maps Applied to Multi-Class Segmentation by Robin Camarasa et al
09-21-2021	Skeleton-Graph: Long-Term 3D Motion Prediction From 2D Observations Using Deep Spatio-Temporal Graph CNNs by Abduallah Mohamed et al
09-21-2021	Bayesian Confidence Calibration for Epistemic Uncertainty Modelling by Fabian Küppers et al
09-22-2021	Natural Language Video Localization with Learnable Moment Proposals by Shaoning Xiao et al
09-22-2021	Learning to Downsample for Segmentation of Ultra-High Resolution Images by Chen Jin et al
09-21-2021	Learning Interpretable Concept Groups in CNNs by Saurabh Varshneya et al
09-22-2021	A Novel Factor Graph-Based Optimization Technique for Stereo Correspondence Estimation by Hanieh Shabanian et al
09-21-2021	Scale-aware direct monocular odometry by Carlos Campos et al
09-23-2021	Self-supervised Learning for Semi-supervised Temporal Language Grounding by Fan Luo et al
09-23-2021	Keypoints-Based Deep Feature Fusion for Cooperative Vehicle Detection of Autonomous Driving by Yunshuang Yuan et al
09-23-2021	Fast Point Voxel Convolution Neural Network with Selective Feature Fusion for Point Cloud Semantic Segmentation by Xu Wang et al
09-23-2021	SPNet: Multi-Shell Kernel Convolution for Point Cloud Semantic Segmentation by Yuyan Li et al
09-23-2021	Leveraging distributed contact force measurements for slip detection: a physics-based approach enabled by a data-driven tactile sensor by Pietro Griffa et al
09-22-2021	Unseen Object Amodal Instance Segmentation via Hierarchical Occlusion Modeling by Seunghyeok Back et al
09-24-2021	ImplicitVol: Sensorless 3D Ultrasound Reconstruction with Deep Implicit Representation by Pak-Hei Yeung et al
09-21-2021	Does Vision-and-Language Pretraining Improve Lexical Grounding? by Tian Yun et al
09-24-2021	Two-Stage Mesh Deep Learning for Automated Tooth Segmentation and Landmark Localization on 3D Intraoral Scans by Tai-Hsien Wu et al
09-21-2021	PDFNet: Pointwise Dense Flow Network for Urban-Scene Segmentation by Venkata Satya Sai Ajay Daliparthi
09-22-2021	A deep neural network for multi-species fish detection using multiple acoustic cameras by Garcia Fernandez et al
09-22-2021	Cross-Modal Coherence for Text-to-Image Retrieval by Malihe Alikhani et al
09-23-2021	Long Short View Feature Decomposition via Contrastive Video Representation Learning by Nadine Behrmann et al
09-21-2021	DS-Net++: Dynamic Weight Slicing for Efficient Inference in CNNs and Transformers by Changlin Li et al
09-22-2021	Towards practical object detection for weed spraying in precision agriculture by Adrian Salazar-Gomez et al
09-22-2021	FaceEraser: Removing Facial Parts for Augmented Reality by Miao Hua et al
09-24-2021	GSIP: Green Semantic Segmentation of Large-Scale Indoor Point Clouds by Min Zhang et al
09-21-2021	Robust marginalization of baryonic effects for cosmological inference at the field level by Francisco Villaescusa-Navarro et al
09-24-2021	Frequency Pooling: Shift-Equivalent and Anti-Aliasing Downsampling by Zhendong Zhang
09-21-2021	SemCal: Semantic LiDAR-Camera Calibration using Neural MutualInformation Estimator by Peng Jiang et al
09-21-2021	AI in Osteoporosis by Sokratis Makrogiannis et al
09-21-2021	Generating Compositional Color Representations from Text by Paridhi Maheshwari et al
09-21-2021	Rapid detection and recognition of whole brain activity in a freely behaving Caenorhabditis elegans by Yuxiang Wu et al
09-21-2021	Rotor Localization and Phase Mapping of Cardiac Excitation Waves using Deep Neural Networks by Jan Lebert et al
09-21-2021	MVM3Det: A Novel Method for Multi-view Monocular 3D Detection by Li Haoran et al
09-22-2021	Pix2seq: A Language Modeling Framework for Object Detection by Ting Chen et al
09-22-2021	Differentiable Surface Triangulation by Marie-Julie Rakotosaona et al
09-21-2021	KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation by Yongfei Liu et al
09-21-2021	Single Person Pose Estimation: A Survey by Feng Zhang et al
09-21-2021	LOTR: Face Landmark Localization Using Localization Transformer by Ukrit Watchareeruetai et al
09-23-2021	Revisit Geophysical Imaging in A New View of Physics-informed Generative Adversarial Learning by Fangshu Yang et al
09-21-2021	Oriented Object Detection in Aerial Images Based on Area Ratio of Parallelogram by Xinyu Yu et al
09-22-2021	Rational Polynomial Camera Model Warping for Deep Learning Based Satellite Multi-View Stereo Matching by Jian Gao et al
09-22-2021	Adversarial Transfer Attacks With Unknown Data and Class Overlap by Luke E. Richards et al
09-24-2021	Towards Autonomous Crop-Agnostic Visual Navigation in Arable Fields by Alireza Ahmadi et al
09-24-2021	Fine-Grained Image Generation from Bangla Text Description using Attentional Generative Adversarial Network by Md Aminul Haque Palash et al
09-24-2021	SIM2REALVIZ: Visualizing the Sim2Real Gap in Robot Ego-Pose Estimation by Theo Jaunet et al
09-21-2021	Comparison of single and multitask learning for predicting cognitive decline based on MRI data by Vandad Imani et al
09-22-2021	Scale Efficiently: Insights from Pre-training and Fine-tuning Transformers by Yi Tay et al
09-21-2021	Finding Facial Forgery Artifacts with Parts-Based Detectors by Steven Schwarcz et al
09-23-2021	Towards Generalized and Incremental Few-Shot Object Detection by Yiting Li et al
09-23-2021	Improving Tuberculosis (TB) Prediction using Synthetically Generated Computed Tomography (CT) Images by Ashia Lewis et al
09-23-2021	OH-Former: Omni-Relational High-Order Transformer for Person Re-Identification by Xianing Chen et al
09-21-2021	Unsupervised Abstract Reasoning for Ravens Problem Matrices by Tao Zhuo et al
09-24-2021	MODNet-V: Improving Portrait Video Matting via Background Restoration by Jiayu Sun et al
09-24-2021	RSDet++: Point-based Modulated Loss for More Accurate Rotated Object Detection by Wen Qian et al
09-21-2021	Data-driven controllers and the need for perception systems in underwater manipulation by James P. Oubre et al
09-21-2021	Homography augumented momentum constrastive learning for SAR image retrieval by Seonho Park et al
09-22-2021	Incorporating Data Uncertainty in Object Tracking Algorithms by Anish Muthali et al
09-23-2021	LGD: Label-guided Self-distillation for Object Detection by Peizhen Zhang et al
09-22-2021	DVC-P: Deep Video Compression with Perceptual Optimizations by Saiping Zhang et al
09-24-2021	Visual Scene Graphs for Audio Source Separation by Moitreya Chatterjee et al
09-21-2021	CondNet: Conditional Classifier for Scene Segmentation by Changqian Yu et al
09-22-2021	An Efficient and Scalable Collection of Fly-inspired Voting Units for Visual Place Recognition in Changing Environments by Bruno Arcanjo et al
09-21-2021	Self-Supervised Action-Space Prediction for Automated Driving by Faris Janjoš et al
09-23-2021	The Hilti SLAM Challenge Dataset by Michael Helmberger et al
09-23-2021	Multi-resolution deep learning pipeline for dense large scale point clouds by Thomas Richard et al
09-24-2021	CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models by Yuan Yao et al
09-24-2021	Adversarial Domain Feature Adaptation for Bronchoscopic Depth Estimation by Mert Asim Karaoglu et al
09-23-2021	Clustering performance analysis using new correlation based cluster validity indices by Nathakhun Wiroonsri
09-24-2021	Dense Contrastive Visual-Linguistic Pretraining by Lei Shi et al
09-24-2021	Quantifying point cloud realism through adversarially learned latent representations by Larissa T. Triess et al
09-21-2021	An Ultra-Fast Method for Simulation of Realistic Ultrasound Images by Mostafa Sharifzadeh et al
09-22-2021	T6D-Direct: Transformers for Multi-Object 6D Pose Direct Regression by Arash Amini et al
09-22-2021	A Method For Adding Motion-Blur on Arbitrary Objects By using Auto-Segmentation and Color Compensation Techniques by Michihiro Mikamo et al
09-21-2021	VPN: Video Provenance Network for Robust Content Attribution by Alexander Black et al
09-21-2021	Learning PAC-Bayes Priors for Probabilistic Neural Networks by Maria Perez-Ortiz et al
09-23-2021	Lifelong 3D Object Recognition and Grasp Synthesis Using Dual Memory Recurrent Self-Organization Networks by Krishnakumar Santhakumar et al
09-23-2021	Deep Learning Strategies for Industrial Surface Defect Detection Systems by Dominik Martin et al
09-21-2021	3D Point Cloud Completion with Geometric-Aware Adversarial Augmentation by Mengxi Wu et al
09-21-2021	Less is More: Learning from Synthetic Data with Fine-grained Attributes for Person Re-Identification by Suncheng Xiang et al
09-21-2021	Joint Optical Neuroimaging Denoising with Semantic Tasks by Tianfang Zhu et al
09-21-2021	Single Image Dehazing with An Independent Detail-Recovery Network by Yan Li et al
09-24-2021	CLIPort: What and Where Pathways for Robotic Manipulation by Mohit Shridhar et al
09-23-2021	Pairwise Emotional Relationship Recognition in Drama Videos: Dataset and Benchmark by Xun Gao et al
09-24-2021	Quantitative Matching of Forensic Evidence Fragments Utilizing 3D Microscopy Analysis of Fracture Surface Replicas by Bishoy Dawood et al
09-22-2021	Hierarchical Multimodal Transformer to Summarize Videos by Bin Zhao et al
09-23-2021	SAME: Deformable Image Registration based on Self-supervised Anatomical Embeddings by Fengze Liu et al
09-23-2021	Weakly-Supervised Monocular Depth Estimationwith Resolution-Mismatched Data by Jialei Xu et al
09-21-2021	Automated segmentation and extraction of posterior eye segment using OCT scans by Bilal Hassan et al
09-22-2021	Neural network relief: a pruning algorithm based on neural activity by Aleksandr Dekhovich et al
09-23-2021	Towards Fine-grained 3D Face Dense Registration: An Optimal Dividing and Diffusing Method by Zhenfeng Fan et al
09-21-2021	The First Vision For Vitals (V4V) Challenge for Non-Contact Video-Based Physiological Estimation by Ambareesh Revanur et al
09-23-2021	A Skeleton-Driven Neural Occupancy Representation for Articulated Hands by Korrawe Karunratanakul et al
09-23-2021	Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds by Xuemeng Yang et al
09-22-2021	Learning Contrastive Representation for Semantic Correspondence by Taihong Xiao et al
09-24-2021	Multi-View Video-Based 3D Hand Pose Estimation by Leyla Khaleghi et al
09-21-2021	StereOBJ-1M: Large-scale Stereo Image Dataset for 6D Object Pose Estimation by Xingyu Liu et al
09-22-2021	Improving 360 Monocular Depth Estimation via Non-local Dense Prediction Transformer and Joint Supervised and Self-supervised Learning by Ilwi Yun et al
09-22-2021	HybridSDF: Combining Free Form Shapes and Geometric Primitives for effective Shape Manipulation by Subeesh Vasu et al
09-23-2021	Recent Advances of Continual Learning in Computer Vision: An Overview by Haoxuan Qu et al
09-22-2021	Deep Variational Clustering Framework for Self-labeling of Large-scale Medical Images by Farzin Soleymani et al
09-22-2021	Label Cleaning Multiple Instance Learning: Refining Coarse Annotations on Single Whole-Slide Images by Zhenzhen Wang et al
09-24-2021	Tackling Inter-Class Similarity and Intra-Class Variance for Microscopic Image-based Classification by Aishwarya Venkataramanan et al
09-24-2021	Unaligned Image-to-Image Translation by Learning to Reweight by Shaoan Xie et al
09-23-2021	Holistic Semi-Supervised Approaches for EEG Representation Learning by Guangyi Zhang et al
09-22-2021	Efficient Context-Aware Network for Abdominal Multi-organ Segmentation by Fan Zhang et al
09-24-2021	From images in the wild to video-informed image classification by Marc Böhlen et al
09-22-2021	Live Speech Portraits: Real-Time Photorealistic Talking-Head Animation by Yuanxun Lu et al
09-22-2021	Early Lane Change Prediction for Automated Driving Systems Using Multi-Task Attention-based Convolutional Neural Networks by Sajjad Mozaffari et al
09-21-2021	Mixed-supervised segmentation: Confidence maximization helps knowledge distillation by Bingyuan Liu et al
09-23-2021	Scene Graph Generation for Better Image Captioning? by Maximilian Mozes et al
09-23-2021	DeepRare: Generic Unsupervised Visual Attention Models by Phutphalla Kong et al
09-22-2021	Self-Training Based Unsupervised Cross-Modality Domain Adaptation for Vestibular Schwannoma and Cochlea Segmentation by Hyungseob Shin et al
09-23-2021	PRANet: Point Cloud Registration with an Artificial Agent by Lisa Tse et al
09-21-2021	KDFNet: Learning Keypoint Distance Field for 6D Object Pose Estimation by Xingyu Liu et al
09-21-2021	Multi-Source Video Domain Adaptation with Temporal Attentive Moment Alignment by Yuecong Xu et al
09-21-2021	Enforcing Mutual Consistency of Hard Regions for Semi-supervised Medical Image Segmentation by Yicheng Wu et al
09-22-2021	The CAMELS Multifield Dataset: Learning the Universes Fundamental Parameters with Artificial Intelligence by Francisco Villaescusa-Navarro et al
09-23-2021	Predicting the Timing of Camera Movements From the Kinematics of Instruments in Robotic-Assisted Surgery Using Artificial Neural Networks by Hanna Kossowsky et al
09-24-2021	Training dataset generation for bridge game registration by Piotr Wzorek et al
09-21-2021	TrOCR: Transformer-based Optical Character Recognition with Pre-trained Models by Minghao Li et al
09-23-2021	Hierarchical Memory Matching Network for Video Object Segmentation by Hongje Seong et al
09-21-2021	Self-supervised Representation Learning for Reliable Robotic Monitoring of Fruit Anomalies by Taeyeong Choi et al
09-23-2021	A 3D Mesh-based Lifting-and-Projection Network for Human Pose Transfer by Jinxiang Liu et al
09-23-2021	Feasibility study of urban flood mapping using traffic signs for route optimization by Bahareh Alizadeh et al
09-23-2021	Training Automatic View Planner for Cardiac MR Imaging via Self-Supervision by Spatial Relationship between Views by Dong Wei et al
09-23-2021	Paint4Poem: A Dataset for Artistic Visualization of Classical Chinese Poems by Dan Li et al
09-23-2021	Cross Attention-guided Dense Network for Images Fusion by Zhengwen Shen et al
09-22-2021	A Benchmark Comparison of Visual Place Recognition Techniques for Resource-Constrained Embedded Platforms by Rose Power et al
09-24-2021	ZSD-YOLO: Zero-Shot YOLO Detection using Vision-Language KnowledgeDistillation by Johnathan Xie et al
09-24-2021	DeepStroke: An Efficient Stroke Screening Framework for Emergency Rooms with Multimodal Adversarial Deep Learning by Tongan Cai et al
09-24-2021	Catadioptric Stereo on a Smartphone by Kristijan Bartol et al
09-24-2021	Learning-based Noise Component Map Estimation for Image Denoising by Sheyda Ghanbaralizadeh Bahnemiri et al
09-23-2021	MARMOT: A Deep Learning Framework for Constructing Multimodal Representations for Vision-and-Language Tasks by Patrick Y. Wu et al
09-23-2021	End-to-End AI-based MRI Reconstruction and Lesion Detection Pipeline for Evaluation of Deep Learning Image Reconstruction by Ruiyang Zhao et al
09-23-2021	How much human-like visual experience do current self-supervised learning algorithms need to achieve human-level object recognition? by A. Emin Orhan

Craig SmithSeptember 27, 2021