2021.11.15 Vision papers

11-11-2021	Full-Body Visual Self-Modeling of Robot Morphologies by Boyuan Chen et al
11-12-2021	Closed-Loop Data Transcription to an LDR via Minimaxing Rate Reduction by Xili Dai et al
11-12-2021	Meta-Teacher For Face Anti-Spoofing by Yunxiao Qin et al
11-10-2021	Self-Supervised Real-time Video Stabilization by Jinsoo Choi et al
11-10-2021	Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation by Chuang Lin et al
11-09-2021	Sparse Adversarial Video Attacks with Spatial Transformations by Ronghui Mu et al
11-10-2021	Selective Synthetic Augmentation with HistoGAN for Improved Histopathology Image Classification by Yuan Xue et al
11-10-2021	A Multi-attribute Controllable Generative Model for Histopathology Image Synthesis by Jiarong Ye et al
11-11-2021	Stacked U-Nets with Self-Assisted Priors Towards Robust Correction of Rigid Motion Artifact in Brain MRI by Mohammed A. Al-masni et al
11-12-2021	Temporally-Consistent Surface Reconstruction using Metrically-Consistent Atlases by Jan Bednarik et al
11-10-2021	Towards Live Video Analytics with On-Drone Deeper-yet-Compatible Compression by Junpeng Guo et al
11-10-2021	CLIP2TV: An Empirical Study on Transformer-based Methods for Video-Text Retrieval by Zijian Gao et al
11-10-2021	Palette: Image-to-Image Diffusion Models by Chitwan Saharia et al
11-09-2021	Unsupervised Spiking Instance Segmentation on Event Data using STDP by Paul Kirkland et al
11-11-2021	Open surgery tool classification and hand utilization using a multi-camera system by Kristina Basiev et al
11-11-2021	A Survey of Visual Transformers by Yang Liu et al
11-09-2021	Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition by Gnana Praveen R et al
11-11-2021	Dynamic Iterative Refinement for Efficient 3D Hand Pose Estimation by John Yang et al
11-09-2021	Ethically aligned Deep Learning: Unbiased Facial Aesthetic Prediction by Michael Danner et al
11-09-2021	Sliced Recursive Transformer by Zhiqiang Shen et al
11-09-2021	Understanding the Generalization Benefit of Model Invariance from a Data Perspective by Sicheng Zhu et al
11-11-2021	Dense Unsupervised Learning for Video Segmentation by Nikita Araslanov et al
11-09-2021	Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity by Pritam Sarkar et al
11-11-2021	Expert Human-Level Driving in Gran Turismo Sport Using Deep Reinforcement Learning with Image-based Representation by Ryuji Imamura et al
11-09-2021	MMD-ReID: A Simple but Effective Solution for Visible-Thermal Person ReID by Chaitra Jambigi et al
11-10-2021	Traffic4cast -- Large-scale Traffic Prediction using 3DResNet and Sparse-UNet by Bo Wang et al
11-11-2021	Discovering and Explaining the Representation Bottleneck of DNNs by Huiqi Deng et al
11-10-2021	Theoretical and empirical analysis of a fast algorithm for extracting polygons from signed distance bounds by Nenad Markuš
11-09-2021	Exploiting Robust Unsupervised Video Person Re-identification by Xianghao Zang et al
11-10-2021	Feature Generation for Long-tail Classification by Rahul Vigneswaran et al
11-12-2021	Robust Analytics for Video-Based Gait Biometrics by Ebenezer R. H. P. Isaac
11-12-2021	Fully Automatic Page Turning on Real Scores by Florian Henkel et al
11-11-2021	Graph Relation Transformer: Incorporating pairwise object features into the Transformer architecture by Michael Yang et al
11-10-2021	Leveraging Geometry for Shape Estimation from a Single RGB Image by Florian Langer et al
11-12-2021	Learning to Break Deep Perceptual Hashing: The Use Case NeuralHash by Lukas Struppek et al
11-10-2021	FabricFlowNet: Bimanual Cloth Manipulation with a Flow-based Policy by Thomas Weng et al
11-09-2021	Automated Pulmonary Embolism Detection from CTPA Images Using an End-to-End Convolutional Neural Network by Yi Lin et al
11-09-2021	Using The Feedback of Dynamic Active-Pixel Vision Sensor (Davis) to Prevent Slip in Real Time by Armin Masoumian et al
11-11-2021	Towards Domain-Independent and Real-Time Gesture Recognition Using mmWave Signal by Yadong Li et al
11-10-2021	Semantic-aware Representation Learning Via Probability Contrastive Loss by Junjie Li et al
11-12-2021	A comprehensive study of clustering a class of 2D shapes by Agnieszka Kaliszewska et al
11-12-2021	Frequency learning for structured CNN filters with Gaussian fractional derivatives by Nikhil Saldanha et al
11-11-2021	On the Equivalence between Neural Network and Support Vector Machine by Yilan Chen et al
11-11-2021	CodEx: A Modular Framework for Joint Temporal De-blurring and Tomographic Reconstruction by Soumendu Majee et al
11-11-2021	Towards Axiomatic, Hierarchical, and Symbolic Explanation for Deep Models by Jie Ren et al
11-11-2021	Neuromuscular Control of the Face-Head-Neck Biomechanical Complex With Learning-Based Expression Transfer From Images and Videos by Xiao S. Zeng et al
11-10-2021	SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval by Minyoung Kim
11-10-2021	Keys to Accurate Feature Extraction Using Residual Spiking Neural Networks by Alex Vicente-Sola et al
11-10-2021	Hybrid Saturation Restoration for LDR Images of HDR Scenes by Chaobing Zheng et al
11-12-2021	Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data by Liming Jiang et al
11-10-2021	Trustworthy Medical Segmentation with Uncertainty Estimation by Giuseppina Carannante et al
11-10-2021	FINO: Flow-based Joint Image and Noise Model by Lanqing Guo et al
11-09-2021	Residual Quantity in Percentage of Factory Machines Using Computer Vision and Mathematical Methods by Seunghyeon Kim et al
11-10-2021	A Histopathology Study Comparing Contrastive Semi-Supervised and Fully Supervised Learning by Lantian Zhang et al
11-09-2021	Early Myocardial Infarction Detection over Multi-view Echocardiography by Aysen Degerli et al
11-09-2021	Robust deep learning-based semantic organ segmentation in hyperspectral images by Silvia Seidlitz et al
11-09-2021	Pipeline for 3D reconstruction of the human body from AR/VR headset mounted egocentric cameras by Shivam Grover et al
11-11-2021	Automatically identifying a mobile phone user's position within a vehicle by Matt Knutson et al
11-12-2021	Multimodal Virtual Point 3D Detection by Tianwei Yin et al
11-09-2021	Analysis of PDE-based binarization model for degraded document images by Uche A. Nnolim
11-12-2021	Small or Far Away? Exploiting Deep Super-Resolution and Altitude Data for Aerial Animal Surveillance by Mowen Xue et al
11-12-2021	The channel-spatial attention-based vision transformer network for automated, accurate prediction of crop nitrogen status from UAV imagery by Xin Zhang et al
11-11-2021	The Emergence of Objectness: Learning Zero-Shot Segmentation from Videos by Runtao Liu et al
11-10-2021	TomoSLAM: factor graph optimization for rotation angle refinement in microtomography by Mark Griguletskii et al
11-10-2021	Multimodal End-to-End Group Emotion Recognition using Cross-Modal Attention by Lev Evtodienko
11-10-2021	Evaluation of Deep Learning Topcoders Method for Neuron Individualization in Histological Macaque Brain Section by Huaqian Wu et al
11-10-2021	Improving Structured Text Recognition with Regular Expression Biasing by Baoguang Shi et al
11-09-2021	PIMIP: An Open Source Platform for Pathology Information Management and Integration by Jialun Wu et al
11-09-2021	Data Augmentation Can Improve Robustness by Sylvestre-Alvise Rebuffi et al
11-09-2021	Monocular Human Shape and Pose with Dense Mesh-borne Local Image Features by Shubhendu Jena et al
11-12-2021	NRC-GAMMA: Introducing a Novel Large Gas Meter Image Dataset by Ashkan Ebadi et al
11-10-2021	Self-Compression in Bayesian Neural Networks by Giuseppina Carannante et al
11-10-2021	Robust Learning via Ensemble Density Propagation in Deep Neural Networks by Giuseppina Carannante et al
11-10-2021	Advancing Brain Metastases Detection in T1-Weighted Contrast-Enhanced 3D MRI using Noisy Student-based Training by Engin Dikici et al
11-09-2021	Does Thermal data make the detection systems more reliable? by Shruthi Gowda et al
11-09-2021	Approaching the Limit of Image Rescaling via Flow Guidance by Shang Li et al
11-09-2021	Efficient Data Compression for 3D Sparse TPC via Bicephalous Convolutional Autoencoder by Yi Huang et al
11-11-2021	Learning Signal-Agnostic Manifolds of Neural Fields by Yilun Du et al
11-09-2021	Space-Time Memory Network for Sounding Object Localization in Videos by Sizhe Li et al
11-12-2021	Attention Guided Cosine Margin For Overcoming Class-Imbalance in Few-Shot Road Object Detection by Ashutosh Agarwal et al
11-10-2021	Single image dehazing via combining the prior knowledge and CNNs by Yuwen Li et al
11-10-2021	Fast T2w/FLAIR MRI Acquisition by Optimal Sampling of Information Complementary to Pre-acquired T1w MRI by Junwei Yang et al
11-12-2021	Diversity-Promoting Human Motion Interpolation via Conditional Variational Auto-Encoder by Chunzhi Gu et al
11-11-2021	Indian Licence Plate Dataset in the wild by Sanchit Tanwar et al
11-12-2021	Sci-Net: a Scale Invariant Model for Building Detection from Aerial Images by Hasan Nasrallah et al
11-10-2021	ICDAR 2021 Competition on Document Visual Question Answering by Rubèn Tito et al
11-10-2021	3D modelling of survey scene from images enhanced with a multi-exposure fusion by Kwok-Leung Chan et al
11-10-2021	Deep Attention-guided Graph Clustering with Dual Self-supervision by Zhihao Peng et al
11-12-2021	Monte Carlo dropout increases model repeatability by Andreanne Lemay et al
11-10-2021	Self-Supervised Multi-Object Tracking with Cross-Input Consistency by Favyen Bastani et al
11-10-2021	csBoundary: City-scale Road-boundary Detection in Aerial Images for High-definition Maps by Zhenhua Xu et al
11-11-2021	Masked Autoencoders Are Scalable Vision Learners by Kaiming He et al
11-09-2021	Designing and Analyzing the PID and Fuzzy Control System for an Inverted Pendulum by Armin Masoumian et al
11-10-2021	Structure from Silence: Learning Scene Structure from Ambient Sound by Ziyang Chen et al
11-10-2021	Advances in Neural Rendering by Ayush Tewari et al
11-09-2021	Towards Active Vision for Action Localization with Reactive Control and Predictive Learning by Shubham Trehan et al
11-10-2021	Robust reconstructions by multi-scale/irregular tangential covering by Antoine Vacavant et al
11-10-2021	A soft thumb-sized vision-based sensor with accurate all-round force perception by Huanbo Sun et al
11-10-2021	Learning to ignore: rethinking attention in CNNs by Firas Laakom et al
11-10-2021	Efficient Neural Network Training via Forward and Backward Propagation Sparsification by Xiao Zhou et al
11-10-2021	Synthetic Document Generator for Annotation-free Layout Recognition by Natraj Raman et al
11-09-2021	Deep Convolution Network Based Emotion Analysis for Automatic Detection of Mild Cognitive Impairment in the Elderly by Zixiang Fei et al
11-09-2021	View Birdification in the Crowd: Ground-Plane Localization from Perceived Movements by Mai Nishimura et al
11-12-2021	AlphaRotate: A Rotation Detection Benchmark using TensorFlow by Xue Yang et al
11-11-2021	Clicking Matters: Towards Interactive Human Parsing by Yutong Gao et al
11-11-2021	Unsupervised Part Discovery from Contrastive Reconstruction by Subhabrata Choudhury et al
11-09-2021	Leveraging blur information for plenoptic camera calibration by Mathieu Labussière et al
11-09-2021	Bilinear pooling and metric learning network for early Alzheimer's disease identification with FDG-PET images by Wenju Cui et al
11-09-2021	Video Text Tracking With a Spatio-Temporal Complementary Model by Yuzhe Gao et al
11-09-2021	Dual Prototypical Contrastive Learning for Few-shot Semantic Segmentation by Hyeongjun Kwon et al
11-09-2021	MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel Maps by Muhammad Awais et al
11-09-2021	Are Transformers More Robust Than CNNs? by Yutong Bai et al
11-09-2021	Incremental Meta-Learning via Episodic Replay Distillation for Few-Shot Image Recognition by Kai Wang et al
11-09-2021	MAC-ReconNet: A Multiple Acquisition Context based Convolutional Neural Network for MR Image Reconstruction using Dynamic Weight Prediction by Sriprabha Ramanarayanan et al
11-12-2021	Transformer-based Image Compression by Ming Lu et al
11-10-2021	The Impact of Changes in Resolution on the Persistent Homology of Images by Teresa Heiss et al
11-10-2021	Dance In the Wild: Monocular Human Animation with Neural Dynamic Appearance Synthesis by Tuanfeng Y. Wang et al
11-09-2021	Handwritten Digit Recognition Using Improved Bounding Box Recognition Technique by Arkaprabha Basu et al
11-09-2021	A Structure Feature Algorithm for Multi-modal Forearm Registration by Jiaxin Li et al
11-11-2021	Related Work on Image Quality Assessment by Dongxu Wang
11-11-2021	A Novel Approach for Deterioration and Damage Identification in Building Structures Based on Stockwell-Transform and Deep Convolutional Neural Network by Vahid Reza Gharehbaghi et al
11-12-2021	Identifying On-road Scenarios Predictive of ADHD using Driving Simulator Time Series Data by David Grethlein et al
11-12-2021	Deep-learning in the bioimaging wild: Handling ambiguous data with deepflash2 by Matthias Griebel et al
11-09-2021	Learning to Disentangle Scenes for Person Re-identification by Xianghao Zang et al
11-10-2021	An Extensive Study of User Identification via Eye Movements across Multiple Datasets by Sahar Mahdie Klim Al Zaidawi et al
11-10-2021	Explanatory Analysis and Rectification of the Pitfalls in COVID-19 Datasets by Samyak Prajapati et al
11-09-2021	GDCA: GAN-based single image super resolution with Dual discriminators and Channel Attention by Thanh Nguyen et al
11-11-2021	Fine-Grained Image Analysis with Deep Learning: A Survey by Xiu-Shen Wei et al
11-10-2021	Multi-Scale Single Image Dehazing Using Laplacian and Gaussian Pyramids by Zhengguo Li et al
11-11-2021	6D Pose Estimation with Combined Deep Learning and 3D Vision Techniques for a Fast and Accurate Object Grasping by Tuan-Tang Le et al
11-11-2021	Multiple Hypothesis Hypergraph Tracking for Posture Identification in Embryonic Caenorhabditis elegans by Andrew Lauziere et al
11-12-2021	Self-supervised GAN Detector by Yonghyun Jeong et al
11-09-2021	Object-Centric Representation Learning with Generative Spatial-Temporal Factorization by Li Nanbo et al
11-11-2021	Spatio-Temporal Scene-Graph Embedding for Autonomous Vehicle Collision Prediction by Arnav V. Malawade et al
11-10-2021	Multimodal Approach for Metadata Extraction from German Scientific Publications by Azeddine Bouabdallah et al

Craig Smith, November 15, 2021