2022.1.24 Vision papers

 

01-20-2022

Stitch it in Time: GAN-Based Facial Editing of Real Videos
by Rotem Tzaban et al

01-20-2022

Omnivore: A Single Model for Many Visual Modalities
by Rohit Girdhar et al

01-18-2022

Online Deep Learning based on Auto-Encoder
by Si-si Zhang et al

01-20-2022

Learning Pixel Trajectories with Multiscale Contrastive Random Walks
by Zhangxing Bian et al

01-20-2022

SPAMs: Structured Implicit Parametric Models
by Pablo Palafox et al

01-19-2022

Estimating Egocentric 3D Human Pose in the Wild with External Weak Supervision
by Jian Wang et al

01-20-2022

MeMViT: Memory-Augmented Multiscale Vision Transformer for Efficient Long-Term Video Recognition
by Chao-Yuan Wu et al

01-20-2022

End-to-end Generative Pretraining for Multimodal Video Captioning
by Paul Hongsuck Seo et al

01-21-2022

Point-NeRF: Point-based Neural Radiance Fields
by Qiangeng Xu et al

01-19-2022

ConDor: Self-Supervised Canonicalization of 3D Pose for Partial Shapes
by Rahul Sajnani et al

01-20-2022

The Elements of Temporal Sentence Grounding in Videos: A Survey and Future Directions
by Hao Zhang et al

01-19-2022

Nonlinear Unknown Input Observability and Unknown Input Reconstruction: The General Analytical Solution
by Agostino Martinelli

01-19-2022

CAST: Character labeling in Animation using Self-supervision by Tracking
by Oron Nir et al

01-20-2022

Real-time Rendering for Integral Imaging Light Field Displays Based on a Voxel-Pixel Lookup Table
by Quanzhen Wan

01-20-2022

AirPose: Multi-View Fusion Network for Aerial 3D Human Pose and Shape Estimation
by Nitin Saini et al

01-19-2022

Towards a General Deep Feature Extractor for Facial Expression Recognition
by Liam Schoneveld et al

01-20-2022

Deep Unsupervised Contrastive Hashing for Large-Scale Cross-Modal Text-Image Retrieval in Remote Sensing
by Georgii Mikriukov et al

01-20-2022

Revisiting Weakly Supervised Pre-Training of Visual Perception Models
by Mannat Singh et al

01-20-2022

DIVA-DAF: A Deep Learning Framework for Historical Document Image Analysis
by Lars Vögtlin et al

01-19-2022

Look Closer: Bridging Egocentric and Third-Person Views with Transformers for Robotic Manipulation
by Rishabh Jangir et al

01-20-2022

A Joint Morphological Profiles and Patch Tensor Change Detection for Hyperspectral Imagery
by Zengfu Hou et al

01-20-2022

TerViT: An Efficient Ternary Vision Transformer
by Sheng Xu et al

01-19-2022

Virtual Coil Augmentation Technology for MRI via Deep Learning
by Cailian Yang et al

01-19-2022

Experimental Large-Scale Jet Flames Geometrical Features Extraction for Risk Management Using Infrared Images and Deep Learning Segmentation Methods
by Carmina Pérez-Guerrero et al

01-19-2022

Weakly Supervised Semantic Segmentation of Remote Sensing Images for Tree Species Classification Based on Explanation Methods
by Steve Ahlswede et al

01-20-2022

Predicting Vegetation Stratum Occupancy from Airborne LiDAR Data with Deep Learning
by Ekaterina Kalinicheva et al

01-20-2022

PRMI: A Dataset of Minirhizotron Images for Diverse Plant Root Study
by Weihuang Xu et al

01-18-2022

STURE: Spatial-Temporal Mutual Representation Learning for Robust Data Association in Online Multi-Object Tracking
by Haidong Wang et al

01-19-2022

Visualization and Analysis of Wearable Health Data From COVID-19 Patients
by Susanne K. Suter et al

01-19-2022

Self-supervised Video Representation Learning with Cascade Positive Retrieval
by Cheng-En Wu et al

01-20-2022

Physically Embodied Deep Image Optimisation
by Daniela Mihai et al

01-20-2022

WPPNets: Unsupervised CNN Training with Wasserstein Patch Priors for Image Superresolution
by Fabian Altekrüger et al

01-20-2022

Modeling and hexahedral meshing of arterial networks from centerlines
by Méghane Decroocq et al

01-19-2022

CELESTIAL: Classification Enabled via Labelless Embeddings with Self-supervised Telescope Image Analysis Learning
by Suhas Kotha et al

01-20-2022

Domain Generalization via Frequency-based Feature Disentanglement and Interaction
by Jingye Wang et al

01-20-2022

What can we learn from misclassified ImageNet images?
by Shixian Wen et al

01-20-2022

A Computational Model for Machine Thinking
by Slimane Larabi

01-20-2022

GeoFill: Reference-Based Image Inpainting of Scenes with Complex Geometry
by Yunhan Zhao et al

01-19-2022

ASL Video Corpora & Sign Bank: Resources Available through the American Sign Language Linguistic Research Project (ASLLRP)
by Carol Neidle et al

01-19-2022

Superpixel Pre-Segmentation of HER2 Slides for Efficient Annotation
by Mathias Öttl et al

01-20-2022

Lightweight Salient Object Detection in Optical Remote Sensing Images via Feature Correlation
by Gongyang Li et al

01-19-2022

GASCN: Graph Attention Shape Completion Network
by Haojie Huang et al

01-19-2022

Learning-by-Novel-View-Synthesis for Full-Face Appearance-based 3D Gaze Estimation
by Jiawei Qin et al

01-20-2022

CP-Net: Contour-Perturbed Reconstruction Network for Self-Supervised Point Cloud Learning
by Mingye Xu et al

01-18-2022

KappaFace: Adaptive Additive Angular Margin Loss for Deep Face Recognition
by Chingis Oinar et al

01-19-2022

TransFuse: A Unified Transformer-based Image Fusion Framework using Self-supervised Learning
by Linhao Qu et al

01-19-2022

A pipeline for automated processing of Corona KH-4 (1962-1972) stereo imagery
by Sajid Ghuffar et al

01-20-2022

HumanIBR: High Quality Image-based Rendering of Challenging Human Performers using Sparse Views
by Tiansong Zhou et al

01-19-2022

Self-Supervised Deep Blind Video Super-Resolution
by Haoran Bai et al

01-19-2022

A Survey on Training Challenges in Generative Adversarial Networks for Biomedical Image Analysis
by Muhammad Muneeb Saad et al

01-19-2022

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth
by Doyeon Kim et al

01-18-2022

AI-based Carcinoma Detection and Classification Using Histopathological Images: A Systematic Review
by Swathi Prabhua et al

01-19-2022

Enhanced Performance of Pre-Trained Networks by Matched Augmentation Distributions
by Touqeer Ahmad et al

01-19-2022

DMF-Net: Dual-Branch Multi-Scale Feature Fusion Network for copy forgery identification of anti-counterfeiting QR code
by Zhongyuan Guo et al

01-19-2022

A Confidence-based Iterative Solver of Depths and Surface Normals for Deep Multi-view Stereo
by Wang Zhao et al

01-19-2022

Real-time Recognition of Yoga Poses using computer Vision for Smart Health Care
by Abhishek Sharma et al

01-19-2022

Simpler is better: spectral regularization and up-sampling techniques for variational autoencoders
by Sara Björk et al

01-18-2022

Attentional Feature Refinement and Alignment Network for Aircraft Detection in SAR Imagery
by Yan Zhao et al

01-18-2022

RePre: Improving Self-Supervised Vision Transformer with Reconstructive Pre-training
by Luya Wang et al

01-19-2022

WebUAV-3M: A Benchmark Unveiling the Power of Million-Scale Deep UAV Tracking
by Chunhui Zhang et al

01-19-2022

The Role of Facial Expressions and Emotion in ASL
by Lee Kezar et al

01-18-2022

Deep Learning Based Framework for Iranian License Plate Detection and Recognition
by Mojtaba Shahidi Zandi et al

01-21-2022

Dangerous Cloaking: Natural Trigger based Backdoor Attacks on Object Detectors in the Physical World
by Hua Ma et al

01-18-2022

TriCoLo: Trimodal Contrastive Loss for Fine-grained Text to Shape Retrieval
by Yue Ruan et al

01-18-2022

Pruning-aware Sparse Regularization for Network Pruning
by Nanfei Jiang et al

01-18-2022

Weakly Supervised Contrastive Learning for Better Severity Scoring of Lung Ultrasound
by Gautam Rajendrakumar Gare et al

01-19-2022

Variable Augmented Network for Invertible MR Coil Compression
by Xianghao Liao et al

01-18-2022

Lung Swapping Autoencoder: Learning a Disentangled Structure-texture Representation of Chest Radiographs
by Lei Zhou et al

01-19-2022

ROS georegistration: Aerial Multi-spectral Image Simulator for the Robot Operating System
by Andrew R. Willis et al

01-19-2022

Object Detection in Autonomous Vehicles: Status and Open Challenges
by Abhishek Balasubramaniam et al

01-20-2022

Watermarking Pre-trained Encoders in Contrastive Learning
by Yutong Wu et al

01-20-2022

SoftDropConnect (SDC) -- Effective and Efficient Quantification of the Network Uncertainty in Deep MR Image Analysis
by Qing Lyu et al

01-18-2022

Poseur: Direct Human Pose Regression with Transformers
by Weian Mao et al

01-19-2022

Learned Cone-Beam CT Reconstruction Using Neural Ordinary Differential Equations
by Mareike Thies et al

01-18-2022

Swin-Pose: Swin Transformer Based Human Pose Estimation
by Zinan Xiong et al

01-19-2022

BLINC: Lightweight Bimodal Learning for Low-Complexity VVC Intra Coding
by Farhad Pakdaman et al

01-19-2022

Using Self-Supervised Pretext Tasks for Active Learning
by John Seon Keun Yi et al

01-21-2022

Distance-Ratio-Based Formulation for Metric Learning
by Hyeongji Kim et al

01-20-2022

A Visual Analytics Approach to Building Logistic Regression Models and its Application to Health Records
by Erasmo Artur et al

01-19-2022

Open Source Handwritten Text Recognition on Medieval Manuscripts using Mixed Models and Document-Specific Finetuning
by Christian Reul et al

01-18-2022

When Facial Expression Recognition Meets Few-Shot Learning: A Joint and Alternate Learning Framework
by Xinyi Zou et al

01-21-2022

Conceptor Learning for Class Activation Mapping
by Guangwu Qian et al

01-18-2022

OSSID: Online Self-Supervised Instance Detection by (and for) Pose Estimation
by Qiao Gu et al

01-18-2022

Resistance Training using Prior Bias: toward Unbiased Scene Graph Generation
by Chao Chen et al

01-18-2022

Adaptive Weighted Guided Image Filtering for Depth Enhancement in Shape-From-Focus
by Yuwen Li et al

01-21-2022

VIPriors 2: Visual Inductive Priors for Data-Efficient Deep Learning Challenges
by Attila Lengyel et al

01-18-2022

Pistol: Pupil Invisible Supportive Tool to extract Pupil, Iris, Eye Opening, Eye Movements, Pupil and Iris Gaze Vector, and 2D as well as 3D Gaze
by Wolfgang Fuhl et al

01-18-2022

The Role of Pleura and Adipose in Lung Ultrasound AI
by Gautam Rajendrakumar Gare et al

01-18-2022

Deformable One-Dimensional Object Detection for Routing and Manipulation
by Azarakhsh Keipour et al

01-21-2022

What Can Machine Vision Do for Lymphatic Histopathology Image Analysis: A Comprehensive Review
by Xiaoqi Li et al

01-21-2022

SparseAlign: A Super-Resolution Algorithm for Automatic Marker Localization and Deformation Estimation in Cryo-Electron Tomography
by Poulami Somanya Ganguly et al

01-19-2022

Q-ViT: Fully Differentiable Quantization for Vision Transformer
by Zhexin Li et al

01-19-2022

Semi-automatic 3D Object Keypoint Annotation and Detection for the Masses
by Kenneth Blomqvist et al

01-21-2022

Multi-view Monocular Depth and Uncertainty Prediction with Deep SfM in Dynamic Environments
by Christian Homeyer et al

01-20-2022

An Empirical Investigation of Model-to-Model Distribution Shifts in Trained Convolutional Filters
by Paul Gavrikov et al

01-20-2022

Vertical Federated Edge Learning with Distributed Integrated Sensing and Communication
by Peixi Liu et al

01-19-2022

High-fidelity 3D Model Compression based on Key Spheres
by Yuanzhan Li et al

01-21-2022

SegTransVAE: Hybrid CNN -- Transformer with Regularization for medical image segmentation
by Quan-Dung Pham et al

01-21-2022

Enhancing Pseudo Label Quality for Semi-SupervisedDomain-Generalized Medical Image Segmentation
by Huifeng Yao et al

01-20-2022

FaceOcc: A Diverse, High-quality Face Occlusion Dataset for Human Face Extraction
by Xiangnan Yin et al

01-19-2022

Improving Specificity in Mammography Using Cross-correlation between Wavelet and Fourier Transform
by Liuhua Zhang

01-21-2022

Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization
by Can Wang et al

01-21-2022

Contrastive and Selective Hidden Embeddings for Medical Image Segmentation
by Zhuowei Li et al

01-21-2022

Object Detection in Aerial Images: What Improves the Accuracy?
by Hashmat Shadab Malik et al

01-21-2022

Classroom Slide Narration System
by Jobin K. V. et al

01-19-2022

GroupGazer: A Tool to Compute the Gaze per Participant in Groups with integrated Calibration to Map the Gaze Online to a Screen or Beamer Projection
by Wolfgang Fuhl

01-21-2022

A Comprehensive Study of Vision Transformers on Dense Prediction Tasks
by Kishaan Jeeveswaran et al

01-21-2022

ERS: a novel comprehensive endoscopy image dataset for machine learning, compliant with the MST 3.0 specification
by Jan Cychnerski et al

01-21-2022

Exploring Fusion Strategies for Accurate RGBT Visual Object Tracking
by Zhangyong Tang et al

01-21-2022

AiTLAS: Artificial Intelligence Toolbox for Earth Observation
by Ivica Dimitrovski et al

01-21-2022

Dynamic Deep Convolutional Candlestick Learner
by Jun-Hao Chen et al

01-21-2022

Improving Across-Dataset Brain Tissue Segmentation Using Transformer
by Vishwanatha M. Rao et al

01-21-2022

Reliable Detection of Doppelg\angers based on Deep Face Representations
by Christian Rathgeb et al

01-18-2022

DDU-Net: Dual-Decoder-U-Net for Road Extraction Using High-Resolution Remote Sensing Images
by Ying Wang et al

01-20-2022

Steerable Pyramid Transform Enables Robust Left Ventricle Quantification
by Xiangyang Zhu et al

01-21-2022

Fast Differentiable Matrix Square Root
by Yue Song et al

 
Craig Smith