2021.12.27 Vision papers

 

12-23-2021

BANMo: Building Animatable 3D Neural Models from Many Casual Videos
by Gengshan Yang et al

12-21-2021

StyleSDF: High-Resolution 3D-Consistent Image and Geometry Generation
by Roy Or-El et al

12-21-2021

MPViT: Multi-Path Vision Transformer for Dense Prediction
by Youngwan Lee et al

12-22-2021

NICE-SLAM: Neural Implicit Scalable Encoding for SLAM
by Zihan Zhu et al

12-21-2021

JoJoGAN: One Shot Face Stylization
by Min Jin Chong et al

12-22-2021

Open-Vocabulary Image Segmentation
by Golnaz Ghiasi et al

12-22-2021

Cost Aggregation Is All You Need for Few-Shot Segmentation
by Sunghwan Hong et al

12-23-2021

LaTr: Layout-Aware Transformer for Scene-Text VQA
by Ali Furkan Biten et al

12-21-2021

Implicit Neural Video Compression
by Yunfan Zhang et al

12-21-2021

Max-Margin Contrastive Learning
by Anshul Shah et al

12-22-2021

Learning and Crafting for the Wide Multiple Baseline Stereo
by Dmytro Mishkin

12-23-2021

ELSA: Enhanced Local Self-Attention for Vision Transformer
by Jingkai Zhou et al

12-23-2021

SLIP: Self-supervision meets Language-Image Pre-training
by Norman Mu et al

12-23-2021

DD-NeRF: Double-Diffusion Neural Radiance Field as a Generalizable Implicit Body Representation
by Guangming Yao et al

12-21-2021

Watch It Move: Unsupervised Discovery of 3D Joints for Re-Posing of Articulated Objects
by Atsuhiro Noguchi et al

12-23-2021

TagLab: A human-centric AI system for interactive semantic segmentation
by Gaia Pavoni et al

12-22-2021

Revisiting Transformation Invariant Geometric Deep Learning: Are Initial Representations All You Need?
by Ziwei Zhang et al

12-23-2021

3D Skeleton-based Few-shot Action Recognition with JEANIE is not so Na\ive
by Lei Wang et al

12-23-2021

PyCIL: A Python Toolbox for Class-Incremental Learning
by Da-Wei Zhou et al

12-21-2021

INTRPRT: A Systematic Review of and Guidelines for Designing and Validating Transparent AI in Medical Image Analysis
by Haomin Chen et al

12-23-2021

SeMask: Semantically Masked Transformers for Semantic Segmentation
by Jitesh Jain et al

12-22-2021

Multi-modal 3D Human Pose Estimation with 2D Weak Supervision in Autonomous Driving
by Jingxiao Zheng et al

12-23-2021

KFWC: A Knowledge-Driven Deep Learning Model for Fine-grained Classification of Wet-AMD
by Haihong E et al

12-23-2021

A Practical Data-Free Approach to One-shot Federated Learning with Heterogeneity
by Jie Zhang et al

12-21-2021

MOSAIC: Mobile Segmentation via decoding Aggregated Information and encoded Context
by Weijun Wang et al

12-23-2021

AI-based Reconstruction for Fast MRI -- A Systematic Review and Meta-analysis
by Yutong Chen et al

12-21-2021

PrimSeq: a deep learning-based pipeline to quantitate rehabilitation training
by Avinash Parnandi et al

12-21-2021

Deep Learning Based 3D Point Cloud Regression for Estimating Forest Biomass
by Stefan Oehmcke et al

12-22-2021

Multimodal Analysis of memes for sentiment extraction
by Nayan Varma Alluri et al

12-21-2021

A Theoretical View of Linear Backpropagation and Its Convergence
by Ziang Li et al

12-23-2021

Adaptive Modeling Against Adversarial Attacks
by Zhiwen Yan et al

12-21-2021

Shape from Polarization for Complex Scenes in the Wild
by Chenyang Lei et al

12-23-2021

Attentive Multi-View Deep Subspace Clustering Net
by Run-kun Lu et al

12-23-2021

FedFR: Joint Optimization Federated Framework for Generic and Personalized Face Recognition
by Chih-Ting Liu et al

12-22-2021

Learning Hierarchical Attention for Weakly-supervised Chest X-Ray Abnormality Localization and Diagnosis
by Xi Ouyang et al

12-23-2021

DILF-EN framework for Class-Incremental Learning
by Mohammed Asad Karim et al

12-23-2021

Assessing the Impact of Attention and Self-Attention Mechanisms on the Classification of Skin Lesions
by Rafael Pedro et al

12-21-2021

Learned ISTA with Error-based Thresholding for Adaptive Sparse Coding
by Ziang Li et al

12-21-2021

Multi-Modality Distillation via Learning the teachers modality-level Gram Matrix
by Peng Liu

12-21-2021

Efficient Registration of Forest Point Clouds by Global Matching of Relative Stem Positions
by Xufei Wang et al

12-22-2021

Robust and Precise Facial Landmark Detection by Self-Calibrated Pose Attention Network
by Jun Wan et al

12-22-2021

More is Better: A Novel Multi-view Framework for Domain Generalization
by Jian Zhang et al

12-23-2021

InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition
by Andreea Glavan et al

12-23-2021

Your Face Mirrors Your Deepest Beliefs-Predicting Personality and Morals through Facial Emotion Recognition
by P. A. Gloor et al

12-23-2021

PandaSet: Advanced Sensor Suite Dataset for Autonomous Driving
by Pengchuan Xiao et al

12-21-2021

Can We Use Neural Regularization to Solve Depth Super-Resolution?
by Milena Gazdieva et al

12-23-2021

Pose Adaptive Dual Mixup for Few-Shot Single-View 3D Reconstruction
by Ta-Ying Cheng et al

12-21-2021

Geometry-Aware Unsupervised Domain Adaptation
by You-Wei Luo et al

12-22-2021

Few-Shot Object Detection: A Survey
by Mona Köhler et al

12-22-2021

Dual Path Structural Contrastive Embeddings for Learning Novel Objects
by Bingbin Li et al

12-22-2021

A Random Point Initialization Approach to Image Segmentation with Variational Level-sets
by J. N. Mueller et al

12-21-2021

Transferable End-to-end Room Layout Estimation via Implicit Encoding
by Hao Zhao et al

12-21-2021

Mapping industrial poultry operations at scale with deep learning and aerial imagery
by Caleb Robinson et al

12-21-2021

SOIT: Segmenting Objects with Instance-Aware Transformers
by Xiaodong Yu et al

12-23-2021

NinjaDesc: Content-Concealing Visual Descriptors via Adversarial Learning
by Tony Ng et al

12-21-2021

Exploring Credibility Scoring Metrics of Perception Systems for Autonomous Driving
by Viren Khandal et al

12-21-2021

Cloud Sphere: A 3D Shape Representation via Progressive Deformation
by Zongji Wang et al

12-22-2021

Recur, Attend or Convolve? Frame Dependency Modeling Matters for Cross-Domain Robustness in Action Recognition
by Sofia Broomé et al

12-21-2021

fMRI Neurofeedback Learning Patterns are Predictive of Personal and Clinical Traits
by Rotem Leibovitz et al

12-23-2021

Iteratively Selecting an Easy Reference Frame Makes Unsupervised Video Object Segmentation Easier
by Youngjo Lee et al

12-21-2021

RepMLPNet: Hierarchical Vision MLP with Re-parameterized Locality
by Xiaohan Ding et al

12-23-2021

On the relationship between calibrated predictors and unbiased volume estimation
by Teodora Popordanoska et al

12-23-2021

Manifold Learning Benefits GANs
by Yao Ni et al

12-22-2021

YOLO-Z: Improving small object detection in YOLOv5 for autonomous vehicles
by Aduen Benjumea et al

12-22-2021

Simple and Effective Balance of Contrastive Losses
by Arnaud Sors et al

12-22-2021

Maximum Entropy on Erroneous Predictions (MEEP): Improving model calibration for medical image segmentation
by Agostina Larrazabal et al

12-22-2021

MC-DGCNN: A Novel DNN Architecture for Multi-Category Point Set Classification
by Majid Farhadloo et al

12-21-2021

Convolutional neural network based on transfer learning for breast cancer screening
by Hussin Ragb et al

12-21-2021

Learned Queries for Efficient Local Attention
by Moab Arar et al

12-22-2021

Adaptive Contrast for Image Regression in Computer-Aided Disease Assessment
by Weihang Dai et al

12-22-2021

High-Accuracy RGB-D Face Recognition via Segmentation-Aware Face Depth Estimation and Mask-Guided Attention Network
by Meng-Tzu Chiu et al

12-22-2021

Multimodal Personality Recognition using Cross-Attention Transformer and Behaviour Encoding
by Tanay Agrawal et al

12-21-2021

Multispectral image fusion by super pixel statistics
by Nati Ofir

12-22-2021

Binary Image Skeletonization Using 2-Stage U-Net
by Mohamed A. Ghanem et al

12-23-2021

Boosting Generative Zero-Shot Learning by Synthesizing Diverse Features with Attribute Augmentation
by Xiaojie Zhao et al

12-23-2021

Data-efficient learning for 3D mirror symmetry detection
by Yancong Lin et al

12-23-2021

Cross Modal Retrieval with Querybank Normalisation
by Simion-Vlad Bogolin et al

12-22-2021

Fine-grained Multi-Modal Self-Supervised Learning
by Duo Wang et al

12-22-2021

Automatic Estimation of Anthropometric Human Body Measurements
by Dana Škorvánková et al

12-21-2021

EPNet++: Cascade Bi-directional Fusion for Multi-Modal 3D Object Detection
by Zhe Liu et al

12-21-2021

Improving Robustness with Image Filtering
by Matteo Terzi et al

12-22-2021

Multi-View Partial (MVP) Point Cloud Challenge 2021 on Completion and Registration: Methods and Results
by Liang Pan et al

12-23-2021

Predi\c{c}\~ao da Idade Cerebral a partir de Imagens de Resson\^ancia Magn\etica utilizando Redes Neurais Convolucionais
by Victor H. R. Oliveira et al

12-21-2021

High-Fidelity Point Cloud Completion with Low-Resolution Recovery and Noise-Aware Upsampling
by Ren-Wu Li et al

12-21-2021

Expansion-Squeeze-Excitation Fusion Network for Elderly Activity Recognition
by Xiangbo Shu et al

12-21-2021

PONet: Robust 3D Human Pose Estimation via Learning Orientations Only
by Jue Wang et al

12-23-2021

Digital Editions as Distant Supervision for Layout Analysis of Printed Books
by Alejandro H. Toselli et al

12-23-2021

InDuDoNet+: A Model-Driven Interpretable Dual Domain Network for Metal Artifact Reduction in CT Images
by Hong Wang et al

12-22-2021

Generalized Local Optimality for Video Steganalysis in Motion Vector Domain
by Liming Zhai et al

12-22-2021

Fusion of medical imaging and electronic health records with attention and multi-head machanisms
by Cheng Jiang et al

12-22-2021

Looking Beyond Corners: Contrastive Learning of Visual Representations for Keypoint Detection and Description Extraction
by Henrique Siqueira et al

12-22-2021

Multi-Centroid Representation Network for Domain Adaptive Person Re-ID
by Yuhang Wu et al

12-22-2021

Comparing radiologists gaze and saliency maps generated by interpretability methods for chest x-rays
by Ricardo Bigolin Lanfredi et al

12-22-2021

Community Detection in Medical Image Datasets: Using Wavelets and Spectral Methods
by Roozbeh Yousefzadeh

12-21-2021

EyePAD++: A Distillation-based approach for joint Eye Authentication and Presentation Attack Detection using Periocular Images
by Prithviraj Dhar et al

12-22-2021

Meta-Learning and Self-Supervised Pretraining for Real World Image Translation
by Ileana Rugina et al

12-21-2021

iSegFormer: Interactive Image Segmentation with Transformers
by Qin Liu

12-21-2021

Point spread function estimation for blind image deblurring problems based on framelet transform
by Reza Parvaz

12-22-2021

Deep learning for brain metastasis detection and segmentation in longitudinal MRI data
by Yixing Huang et al

12-21-2021

Learning Human Motion Prediction via Stochastic Differential Equations
by Kedi Lyu et al

12-22-2021

BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View
by Junjie Huang et al

12-21-2021

Contrastive Object Detection Using Knowledge Graph Embeddings
by Christopher Lang et al

12-22-2021

Exploring Inter-frequency Guidance of Image for Lightweight Gaussian Denoising
by Zhuang Jia

12-22-2021

Entropy Regularized Iterative Weighted Shrinkage-Thresholding Algorithm (ERIWSTA): An Application to CT Image Restoration
by Bingxue Wu et al

12-22-2021

Improved 2D Keypoint Detection in Out-of-Balance and Fall Situations -- combining input rotations and a kinematic model
by Michael Zwölfer et al

12-23-2021

FourierMask: Instance Segmentation using Fourier Mapping in Implicit Neural Networks
by Hamd ul Moqeet Riaz et al

12-21-2021

Generalizable Cross-modality Medical Image Segmentation via Style Augmentation and Dual Normalization
by Ziqi Zhou et al

12-21-2021

Review of Face Presentation Attack Detection Competitions
by Zitong Yu et al

12-22-2021

CLEVR3D: Compositional Language and Elementary Visual Reasoning for Question Answering in 3D Real-World Scenes
by Xu Yan et al

12-21-2021

GOAL: Generating 4D Whole-Body Motion for Hand-Object Grasping
by Omid Taheri et al

12-22-2021

Human Activity Recognition on wrist-worn accelerometers using self-supervised neural networks
by Niranjan Sridhar et al

12-21-2021

ADJUST: A Dictionary-Based Joint Reconstruction and Unmixing Method for Spectral Tomography
by Mathé T. Zeegers et al

12-23-2021

Towards Universal GAN Image Detection
by Davide Cozzolino et al

12-22-2021

Class-aware Sounding Objects Localization via Audiovisual Correspondence
by Di Hu et al

12-22-2021

Leveraging Synthetic Data in Object Detection on Unmanned Aerial Vehicles
by Benjamin Kiefer et al

12-21-2021

GAN Based Boundary Aware Classifier for Detecting Out-of-distribution Samples
by Sen Pei et al

12-23-2021

Omni-Seg: A Single Dynamic Network for Multi-label Renal Pathology Image Segmentation using Partially Labeled Data
by Ruining Deng et al

12-21-2021

Leveraging Image Complexity in Macro-Level Neural Network Design for Medical Image Segmentation
by Tariq M. Khan et al

12-22-2021

Improved skin lesion recognition by a Self-Supervised Curricular Deep Learning approach
by Kirill Sirotkin et al

12-22-2021

Ghost-dil-NetVLAD: A Lightweight Neural Network for Visual Place Recognition
by Qingyuan Gong et al

12-21-2021

RC-Net: A Convolutional Neural Network for Retinal Vessel Segmentation
by Tariq M Khan et al

12-21-2021

PointCaps: Raw Point Cloud Processing using Capsule Networks with Euclidean Distance Routing
by Dishanika Denipitiyage et al

12-22-2021

Barely-Supervised Learning: Semi-Supervised Learning with very few labeled images
by Thomas Lucas et al

12-21-2021

AdaptPose: Cross-Dataset Adaptation for 3D Human Pose Estimation by Learnable Motion Generation
by Mohsen Gholami et al

12-22-2021

Can Deep Neural Networks be Converted to Ultra Low-Latency Spiking Neural Networks?
by Gourav Datta et al

12-22-2021

Geodesic squared exponential kernel for non-rigid shape registration
by Florent Jousse et al

12-21-2021

Teacher-Student Architecture for Mixed Supervised Lung Tumor Segmentation
by Vemund Fredriksen et al

12-22-2021

DA-FDFtNet: Dual Attention Fake Detection Fine-tuning Network to Detect Various AI-Generated Fake Images
by Young Oh Bang et al

12-22-2021

Bottom-up approaches for multi-person pose estimation and its applications: A brief review
by Milan Kresović et al

12-21-2021

Decompose the Sounds and Pixels, Recompose the Events
by Varshanth R. Rao et al

12-22-2021

Reflash Dropout in Image Super-Resolution
by Xiangtao Kong et al

12-22-2021

Neuroevolution deep learning architecture search for estimation of river surface elevation from photogrammetric Digital Surface Models
by Radosław Szostak et al

12-23-2021

Comparison and Analysis of Image-to-Image Generative Adversarial Networks: A Survey
by Sagar Saxena et al

12-21-2021

A novel approach for the automated segmentation and volume quantification of cardiac fats on computed tomography
by Érick Oliveira Rodrigues et al

12-22-2021

Few-shot Font Generation with Weakly Supervised Localized Representations
by Song Park et al

12-22-2021

A Discriminative Single-Shot Segmentation Network for Visual Object Tracking
by Alan Lukežič et al

12-21-2021

Real-time Street Human Motion Capture
by Yanquan Chen et al

12-21-2021

MIA-Former: Efficient and Robust Vision Transformers via Multi-grained Input-Adaptation
by Zhongzhi Yu et al

12-21-2021

Anomaly Clustering: Grouping Images into Coherent Clusters of Anomaly Types
by Kihyuk Sohn et al

12-21-2021

Distribution-aware Margin Calibration for Semantic Segmentation in Images
by Litao Yu et al

12-22-2021

NVS-MonoDepth: Improving Monocular Depth Prediction with Novel View Synthesis
by Zuria Bauer et al

12-21-2021

Input-Specific Robustness Certification for Randomized Smoothing
by Ruoxin Chen et al

 
Craig Smith