2021.11.15 Vision papers

 

11-11-2021

Full-Body Visual Self-Modeling of Robot Morphologies
by Boyuan Chen et al

11-12-2021

Closed-Loop Data Transcription to an LDR via Minimaxing Rate Reduction
by Xili Dai et al

11-12-2021

Meta-Teacher For Face Anti-Spoofing
by Yunxiao Qin et al

11-10-2021

Self-Supervised Real-time Video Stabilization
by Jinsoo Choi et al

11-10-2021

Multimodal Transformer with Variable-length Memory for Vision-and-Language Navigation
by Chuang Lin et al

11-09-2021

Sparse Adversarial Video Attacks with Spatial Transformations
by Ronghui Mu et al

11-10-2021

Selective Synthetic Augmentation with HistoGAN for Improved Histopathology Image Classification
by Yuan Xue et al

11-10-2021

A Multi-attribute Controllable Generative Model for Histopathology Image Synthesis
by Jiarong Ye et al

11-11-2021

Stacked U-Nets with Self-Assisted Priors Towards Robust Correction of Rigid Motion Artifact in Brain MRI
by Mohammed A. Al-masni et al

11-12-2021

Temporally-Consistent Surface Reconstruction using Metrically-Consistent Atlases
by Jan Bednarik et al

11-10-2021

Towards Live Video Analytics with On-Drone Deeper-yet-Compatible Compression
by Junpeng Guo et al

11-10-2021

CLIP2TV: An Empirical Study on Transformer-based Methods for Video-Text Retrieval
by Zijian Gao et al

11-10-2021

Palette: Image-to-Image Diffusion Models
by Chitwan Saharia et al

11-09-2021

Unsupervised Spiking Instance Segmentation on Event Data using STDP
by Paul Kirkland et al

11-11-2021

Open surgery tool classification and hand utilization using a multi-camera system
by Kristina Basiev et al

11-11-2021

A Survey of Visual Transformers
by Yang Liu et al

11-09-2021

Cross Attentional Audio-Visual Fusion for Dimensional Emotion Recognition
by Gnana Praveen R et al

11-11-2021

Dynamic Iterative Refinement for Efficient 3D Hand Pose Estimation
by John Yang et al

11-09-2021

Ethically aligned Deep Learning: Unbiased Facial Aesthetic Prediction
by Michael Danner et al

11-09-2021

Sliced Recursive Transformer
by Zhiqiang Shen et al

11-09-2021

Understanding the Generalization Benefit of Model Invariance from a Data Perspective
by Sicheng Zhu et al

11-11-2021

Dense Unsupervised Learning for Video Segmentation
by Nikita Araslanov et al

11-09-2021

Self-Supervised Audio-Visual Representation Learning with Relaxed Cross-Modal Temporal Synchronicity
by Pritam Sarkar et al

11-11-2021

Expert Human-Level Driving in Gran Turismo Sport Using Deep Reinforcement Learning with Image-based Representation
by Ryuji Imamura et al

11-09-2021

MMD-ReID: A Simple but Effective Solution for Visible-Thermal Person ReID
by Chaitra Jambigi et al

11-10-2021

Traffic4cast -- Large-scale Traffic Prediction using 3DResNet and Sparse-UNet
by Bo Wang et al

11-11-2021

Discovering and Explaining the Representation Bottleneck of DNNs
by Huiqi Deng et al

11-10-2021

Theoretical and empirical analysis of a fast algorithm for extracting polygons from signed distance bounds
by Nenad Markuš

11-09-2021

Exploiting Robust Unsupervised Video Person Re-identification
by Xianghao Zang et al

11-10-2021

Feature Generation for Long-tail Classification
by Rahul Vigneswaran et al

11-12-2021

Robust Analytics for Video-Based Gait Biometrics
by Ebenezer R. H. P. Isaac

11-12-2021

Fully Automatic Page Turning on Real Scores
by Florian Henkel et al

11-11-2021

Graph Relation Transformer: Incorporating pairwise object features into the Transformer architecture
by Michael Yang et al

11-10-2021

Leveraging Geometry for Shape Estimation from a Single RGB Image
by Florian Langer et al

11-12-2021

Learning to Break Deep Perceptual Hashing: The Use Case NeuralHash
by Lukas Struppek et al

11-10-2021

FabricFlowNet: Bimanual Cloth Manipulation with a Flow-based Policy
by Thomas Weng et al

11-09-2021

Automated Pulmonary Embolism Detection from CTPA Images Using an End-to-End Convolutional Neural Network
by Yi Lin et al

11-09-2021

Using The Feedback of Dynamic Active-Pixel Vision Sensor (Davis) to Prevent Slip in Real Time
by Armin Masoumian et al

11-11-2021

Towards Domain-Independent and Real-Time Gesture Recognition Using mmWave Signal
by Yadong Li et al

11-10-2021

Semantic-aware Representation Learning Via Probability Contrastive Loss
by Junjie Li et al

11-12-2021

A comprehensive study of clustering a class of 2D shapes
by Agnieszka Kaliszewska et al

11-12-2021

Frequency learning for structured CNN filters with Gaussian fractional derivatives
by Nikhil Saldanha et al

11-11-2021

On the Equivalence between Neural Network and Support Vector Machine
by Yilan Chen et al

11-11-2021

CodEx: A Modular Framework for Joint Temporal De-blurring and Tomographic Reconstruction
by Soumendu Majee et al

11-11-2021

Towards Axiomatic, Hierarchical, and Symbolic Explanation for Deep Models
by Jie Ren et al

11-11-2021

Neuromuscular Control of the Face-Head-Neck Biomechanical Complex With Learning-Based Expression Transfer From Images and Videos
by Xiao S. Zeng et al

11-10-2021

SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval
by Minyoung Kim

11-10-2021

Keys to Accurate Feature Extraction Using Residual Spiking Neural Networks
by Alex Vicente-Sola et al

11-10-2021

Hybrid Saturation Restoration for LDR Images of HDR Scenes
by Chaobing Zheng et al

11-12-2021

Deceive D: Adaptive Pseudo Augmentation for GAN Training with Limited Data
by Liming Jiang et al

11-10-2021

Trustworthy Medical Segmentation with Uncertainty Estimation
by Giuseppina Carannante et al

11-10-2021

FINO: Flow-based Joint Image and Noise Model
by Lanqing Guo et al

11-09-2021

Residual Quantity in Percentage of Factory Machines Using Computer Vision and Mathematical Methods
by Seunghyeon Kim et al

11-10-2021

A Histopathology Study Comparing Contrastive Semi-Supervised and Fully Supervised Learning
by Lantian Zhang et al

11-09-2021

Early Myocardial Infarction Detection over Multi-view Echocardiography
by Aysen Degerli et al

11-09-2021

Robust deep learning-based semantic organ segmentation in hyperspectral images
by Silvia Seidlitz et al

11-09-2021

Pipeline for 3D reconstruction of the human body from AR/VR headset mounted egocentric cameras
by Shivam Grover et al

11-11-2021

Automatically identifying a mobile phone user's position within a vehicle
by Matt Knutson et al

11-12-2021

Multimodal Virtual Point 3D Detection
by Tianwei Yin et al

11-09-2021

Analysis of PDE-based binarization model for degraded document images
by Uche A. Nnolim

11-12-2021

Small or Far Away? Exploiting Deep Super-Resolution and Altitude Data for Aerial Animal Surveillance
by Mowen Xue et al

11-12-2021

The channel-spatial attention-based vision transformer network for automated, accurate prediction of crop nitrogen status from UAV imagery
by Xin Zhang et al

11-11-2021

The Emergence of Objectness: Learning Zero-Shot Segmentation from Videos
by Runtao Liu et al

11-10-2021

TomoSLAM: factor graph optimization for rotation angle refinement in microtomography
by Mark Griguletskii et al

11-10-2021

Multimodal End-to-End Group Emotion Recognition using Cross-Modal Attention
by Lev Evtodienko

11-10-2021

Evaluation of Deep Learning Topcoders Method for Neuron Individualization in Histological Macaque Brain Section
by Huaqian Wu et al

11-10-2021

Improving Structured Text Recognition with Regular Expression Biasing
by Baoguang Shi et al

11-09-2021

PIMIP: An Open Source Platform for Pathology Information Management and Integration
by Jialun Wu et al

11-09-2021

Data Augmentation Can Improve Robustness
by Sylvestre-Alvise Rebuffi et al

11-09-2021

Monocular Human Shape and Pose with Dense Mesh-borne Local Image Features
by Shubhendu Jena et al

11-12-2021

NRC-GAMMA: Introducing a Novel Large Gas Meter Image Dataset
by Ashkan Ebadi et al

11-10-2021

Self-Compression in Bayesian Neural Networks
by Giuseppina Carannante et al

11-10-2021

Robust Learning via Ensemble Density Propagation in Deep Neural Networks
by Giuseppina Carannante et al

11-10-2021

Advancing Brain Metastases Detection in T1-Weighted Contrast-Enhanced 3D MRI using Noisy Student-based Training
by Engin Dikici et al

11-09-2021

Does Thermal data make the detection systems more reliable?
by Shruthi Gowda et al

11-09-2021

Approaching the Limit of Image Rescaling via Flow Guidance
by Shang Li et al

11-09-2021

Efficient Data Compression for 3D Sparse TPC via Bicephalous Convolutional Autoencoder
by Yi Huang et al

11-11-2021

Learning Signal-Agnostic Manifolds of Neural Fields
by Yilun Du et al

11-09-2021

Space-Time Memory Network for Sounding Object Localization in Videos
by Sizhe Li et al

11-12-2021

Attention Guided Cosine Margin For Overcoming Class-Imbalance in Few-Shot Road Object Detection
by Ashutosh Agarwal et al

11-10-2021

Single image dehazing via combining the prior knowledge and CNNs
by Yuwen Li et al

11-10-2021

Fast T2w/FLAIR MRI Acquisition by Optimal Sampling of Information Complementary to Pre-acquired T1w MRI
by Junwei Yang et al

11-12-2021

Diversity-Promoting Human Motion Interpolation via Conditional Variational Auto-Encoder
by Chunzhi Gu et al

11-11-2021

Indian Licence Plate Dataset in the wild
by Sanchit Tanwar et al

11-12-2021

Sci-Net: a Scale Invariant Model for Building Detection from Aerial Images
by Hasan Nasrallah et al

11-10-2021

ICDAR 2021 Competition on Document Visual Question Answering
by Rubèn Tito et al

11-10-2021

3D modelling of survey scene from images enhanced with a multi-exposure fusion
by Kwok-Leung Chan et al

11-10-2021

Deep Attention-guided Graph Clustering with Dual Self-supervision
by Zhihao Peng et al

11-12-2021

Monte Carlo dropout increases model repeatability
by Andreanne Lemay et al

11-10-2021

Self-Supervised Multi-Object Tracking with Cross-Input Consistency
by Favyen Bastani et al

11-10-2021

csBoundary: City-scale Road-boundary Detection in Aerial Images for High-definition Maps
by Zhenhua Xu et al

11-11-2021

Masked Autoencoders Are Scalable Vision Learners
by Kaiming He et al

11-09-2021

Designing and Analyzing the PID and Fuzzy Control System for an Inverted Pendulum
by Armin Masoumian et al

11-10-2021

Structure from Silence: Learning Scene Structure from Ambient Sound
by Ziyang Chen et al

11-10-2021

Advances in Neural Rendering
by Ayush Tewari et al

11-09-2021

Towards Active Vision for Action Localization with Reactive Control and Predictive Learning
by Shubham Trehan et al

11-10-2021

Robust reconstructions by multi-scale/irregular tangential covering
by Antoine Vacavant et al

11-10-2021

A soft thumb-sized vision-based sensor with accurate all-round force perception
by Huanbo Sun et al

11-10-2021

Learning to ignore: rethinking attention in CNNs
by Firas Laakom et al

11-10-2021

Efficient Neural Network Training via Forward and Backward Propagation Sparsification
by Xiao Zhou et al

11-10-2021

Synthetic Document Generator for Annotation-free Layout Recognition
by Natraj Raman et al

11-09-2021

Deep Convolution Network Based Emotion Analysis for Automatic Detection of Mild Cognitive Impairment in the Elderly
by Zixiang Fei et al

11-09-2021

View Birdification in the Crowd: Ground-Plane Localization from Perceived Movements
by Mai Nishimura et al

11-12-2021

AlphaRotate: A Rotation Detection Benchmark using TensorFlow
by Xue Yang et al

11-11-2021

Clicking Matters: Towards Interactive Human Parsing
by Yutong Gao et al

11-11-2021

Unsupervised Part Discovery from Contrastive Reconstruction
by Subhabrata Choudhury et al

11-09-2021

Leveraging blur information for plenoptic camera calibration
by Mathieu Labussière et al

11-09-2021

Bilinear pooling and metric learning network for early Alzheimer's disease identification with FDG-PET images
by Wenju Cui et al

11-09-2021

Video Text Tracking With a Spatio-Temporal Complementary Model
by Yuzhe Gao et al

11-09-2021

Dual Prototypical Contrastive Learning for Few-shot Semantic Segmentation
by Hyeongjun Kwon et al

11-09-2021

MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel Maps
by Muhammad Awais et al

11-09-2021

Are Transformers More Robust Than CNNs?
by Yutong Bai et al

11-09-2021

Incremental Meta-Learning via Episodic Replay Distillation for Few-Shot Image Recognition
by Kai Wang et al

11-09-2021

MAC-ReconNet: A Multiple Acquisition Context based Convolutional Neural Network for MR Image Reconstruction using Dynamic Weight Prediction
by Sriprabha Ramanarayanan et al

11-12-2021

Transformer-based Image Compression
by Ming Lu et al

11-10-2021

The Impact of Changes in Resolution on the Persistent Homology of Images
by Teresa Heiss et al

11-10-2021

Dance In the Wild: Monocular Human Animation with Neural Dynamic Appearance Synthesis
by Tuanfeng Y. Wang et al

11-09-2021

Handwritten Digit Recognition Using Improved Bounding Box Recognition Technique
by Arkaprabha Basu et al

11-09-2021

A Structure Feature Algorithm for Multi-modal Forearm Registration
by Jiaxin Li et al

11-11-2021

Related Work on Image Quality Assessment
by Dongxu Wang

11-11-2021

A Novel Approach for Deterioration and Damage Identification in Building Structures Based on Stockwell-Transform and Deep Convolutional Neural Network
by Vahid Reza Gharehbaghi et al

11-12-2021

Identifying On-road Scenarios Predictive of ADHD using Driving Simulator Time Series Data
by David Grethlein et al

11-12-2021

Deep-learning in the bioimaging wild: Handling ambiguous data with deepflash2
by Matthias Griebel et al

11-09-2021

Learning to Disentangle Scenes for Person Re-identification
by Xianghao Zang et al

11-10-2021

An Extensive Study of User Identification via Eye Movements across Multiple Datasets
by Sahar Mahdie Klim Al Zaidawi et al

11-10-2021

Explanatory Analysis and Rectification of the Pitfalls in COVID-19 Datasets
by Samyak Prajapati et al

11-09-2021

GDCA: GAN-based single image super resolution with Dual discriminators and Channel Attention
by Thanh Nguyen et al

11-11-2021

Fine-Grained Image Analysis with Deep Learning: A Survey
by Xiu-Shen Wei et al

11-10-2021

Multi-Scale Single Image Dehazing Using Laplacian and Gaussian Pyramids
by Zhengguo Li et al

11-11-2021

6D Pose Estimation with Combined Deep Learning and 3D Vision Techniques for a Fast and Accurate Object Grasping
by Tuan-Tang Le et al

11-11-2021

Multiple Hypothesis Hypergraph Tracking for Posture Identification in Embryonic Caenorhabditis elegans
by Andrew Lauziere et al

11-12-2021

Self-supervised GAN Detector
by Yonghyun Jeong et al

11-09-2021

Object-Centric Representation Learning with Generative Spatial-Temporal Factorization
by Li Nanbo et al

11-11-2021

Spatio-Temporal Scene-Graph Embedding for Autonomous Vehicle Collision Prediction
by Arnav V. Malawade et al

11-10-2021

Multimodal Approach for Metadata Extraction from German Scientific Publications
by Azeddine Bouabdallah et al

 
Craig Smith