2022.2.21 Vision papers

 

02-16-2022

Vision Models Are More Robust And Fair When Pretrained On Uncurated Images Without Supervision
by Priya Goyal et al

02-16-2022

A Data-Augmentation Is Worth A Thousand Samples: Exact Quantification From Analytical Augmented Sample Moments
by Randall Balestriero et al

02-16-2022

AKB-48: A Real-World Articulated Object Knowledge Base
by Liu Liu et al

02-16-2022

Limitations of Neural Collapse for Understanding Generalization in Deep Learning
by Like Hui et al

02-15-2022

General-purpose, long-context autoregressive modeling with Perceiver AR
by Curtis Hawthorne et al

02-16-2022

Anomalib: A Deep Learning Library for Anomaly Detection
by Samet Akcay et al

02-15-2022

Don't Lie to Me! Robust and Efficient Explainability with Verified Perturbation Analysis
by Thomas Fel et al

02-16-2022

Ditto: Building Digital Twins of Articulated Objects from Interaction
by Zhenyu Jiang et al

02-16-2022

Learning Smooth Neural Functions via Lipschitz Regularization
by Hsueh-Ti Derek Liu et al

02-15-2022

ViNTER: Image Narrative Generation with Emotion-Arc-Aware Transformer
by Kohei Uehara et al

02-15-2022

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval
by Licheng Yu et al

02-17-2022

Feels Bad Man: Dissecting Automated Hateful Meme Detection Through the Lens of Facebook's Challenge
by Catherine Jennifer et al

02-15-2022

Not All Patches are What You Need: Expediting Vision Transformers via Token Reorganizations
by Youwei Liang et al

02-16-2022

Bias in Automated Image Colorization: Metrics and Error Types
by Frank Stapel et al

02-17-2022

V2X-Sim: A Virtual Collaborative Perception Dataset for Autonomous Driving
by Yiming Li et al

02-17-2022

Grammar-Based Grounded Lexicon Learning
by Jiayuan Mao et al

02-15-2022

Fairness Indicators for Systematic Assessments of Visual Feature Extractors
by Priya Goyal et al

02-16-2022

Generative modeling with projected entangled-pair states
by Tom Vieijra et al

02-17-2022

Fourier PlenOctrees for Dynamic Radiance Field Rendering in Real-time
by Liao Wang et al

02-17-2022

A study of deep perceptual metrics for image quality assessment
by Rémi Kazmierczak et al

02-15-2022

Ab-initio Contrast Estimation and Denoising of Cryo-EM Images
by Yunpeng Shi et al

02-17-2022

General Cyclical Training of Neural Networks
by Leslie N. Smith

02-17-2022

OmniSyn: Synthesizing 360 Videos with Wide-baseline Panoramas
by David Li et al

02-16-2022

Planckian jitter: enhancing the color quality of self-supervised visual representations
by Simone Zini et al

02-16-2022

When Did It Happen? Duration-informed Temporal Localization of Narrated Actions in Vlogs
by Oana Ignat et al

02-16-2022

Evaluation and Analysis of Different Aggregation and Hyperparameter Selection Methods for Federated Brain Tumor Segmentation
by Ece Isik-Polat et al

02-16-2022

OpenKBP-Opt: An international and reproducible evaluation of 76 knowledge-based planning pipelines
by Aaron Babier et al

02-16-2022

Meta Knowledge Distillation
by Jihao Liu et al

02-16-2022

Can Deep Learning be Applied to Model-Based Multi-Object Tracking?
by Juliano Pinto et al

02-15-2022

Neural Architecture Search for Dense Prediction Tasks in Computer Vision
by Thomas Elsken et al

02-18-2022

Autoencoding Low-Resolution MRI for Semantically Smooth Interpolation of Anisotropic MRI
by Jörg Sander et al

02-17-2022

A hybrid 2-stage vision transformer for AI-assisted 5 class pathologic diagnosis of gastric endoscopic biopsies
by Yujin Oh et al

02-17-2022

A Study of Designing Compact Audio-Visual Wake Word Spotting System Based on Iterative Fine-Tuning in Neural Network Pruning
by Hengshun Zhou et al

02-17-2022

CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving
by Yinuo Zhao et al

02-18-2022

VCVTS: Multi-speaker Video-to-Speech synthesis via cross-modal knowledge transfer from voice conversion
by Disong Wang et al

02-17-2022

Two-Stage Architectural Fine-Tuning with Neural Architecture Search using Early-Stopping in Image Classification
by Youngkee Kim et al

02-17-2022

CSCNet: Contextual Semantic Consistency Network for Trajectory Prediction in Crowded Spaces
by Beihao Xia et al

02-17-2022

Dynamic Object Comprehension: A Framework For Evaluating Artificial Visual Perception
by Scott Y. L. Chin et al

02-16-2022

CortexODE: Learning Cortical Surface Reconstruction by Neural ODEs
by Qiang Ma et al

02-18-2022

VLP: A Survey on Vision-Language Pre-training
by Feilong Chen et al

02-16-2022

Learning to Generalize across Domains on Single Test Samples
by Zehao Xiao et al

02-16-2022

A multi-reconstruction study of breast density estimation using Deep Learning
by Vikash Gupta et al

02-15-2022

DualConv: Dual Convolutional Kernels for Lightweight Deep Neural Networks
by Jiachen Zhong et al

02-17-2022

Detecting and Learning the Unknown in Semantic Segmentation
by Robin Chan et al

02-16-2022

IPD: An Incremental Prototype based DBSCAN for large-scale data with cluster representatives
by Jayasree Saha et al

02-16-2022

Cross-Modal Common Representation Learning with Triplet Loss Functions
by Felix Ott et al

02-16-2022

Diagnosing Batch Normalization in Class Incremental Learning
by Minghao Zhou et al

02-15-2022

A Unified Framework for Masked and Mask-Free Face Recognition via Feature Rectification
by Shaozhe Hao et al

02-17-2022

Point Cloud Generation with Continuous Conditioning
by Larissa T. Triess et al

02-16-2022

PENCIL: Deep Learning with Noisy Labels
by Kun Yi et al

02-17-2022

KINet: Keypoint Interaction Networks for Unsupervised Forward Modeling
by Alireza Rezazadeh et al

02-17-2022

Survey on Self-supervised Representation Learning Using Image Transformations
by Muhammad Ali et al

02-17-2022

CLS: Cross Labeling Supervision for Semi-Supervised Learning
by Yao Yao et al

02-16-2022

ADAM Challenge: Detecting Age-related Macular Degeneration from Fundus Images
by Huihui Fang et al

02-16-2022

Phase Aberration Robust Beamformer for Planewave US Using Self-Supervised Learning
by Shujaat Khan et al

02-17-2022

TransCG: A Large-Scale Real-World Dataset for Transparent Object Depth Completion and Grasping
by Hongjie Fang et al

02-18-2022

Unsupervised Multiple-Object Tracking with a Dynamical Variational Autoencoder
by Xiaoyu Lin et al

02-17-2022

Synthetic data for unsupervised polyp segmentation
by Enric Moreu et al

02-17-2022

EBHI: A New Enteroscope Biopsy Histopathological H&E Image Dataset for Image Classification Evaluation
by Weiming Hu et al

02-18-2022

Generalizing Aggregation Functions in GNNs: High-Capacity GNNs via Nonlinear Neighborhood Aggregators
by Beibei Wang et al

02-17-2022

An overview of deep learning in medical imaging
by Imran Ul Haq

02-16-2022

Visual attention analysis of pathologists examining whole slide images of Prostate cancer
by Souradeep Chakraborty et al

02-15-2022

Few-shot semantic segmentation via mask aggregation
by Wei Ao et al

02-17-2022

End-to-end Neuron Instance Segmentation based on Weakly Supervised Efficient UNet and Morphological Post-processing
by Huaqian Wu et al

02-15-2022

Reducing Overconfidence Predictions for Autonomous Driving Perception
by Gledson Melotti et al

02-18-2022

Critical Checkpoints for Evaluating Defence Models Against Adversarial Attack and Robustness
by Kanak Tekwani et al

02-17-2022

PCB Component Detection using Computer Vision for Hardware Assurance
by Wenwei Zhao et al

02-17-2022

TraSeTR: Track-to-Segment Transformer with Contrastive Query for Instance-level Instrument Segmentation in Robotic Surgery
by Zixu Zhao et al

02-15-2022

Lie Point Symmetry Data Augmentation for Neural PDE Solvers
by Johannes Brandstetter et al

02-15-2022

Applying adversarial networks to increase the data efficiency and reliability of Self-Driving Cars
by Aakash Kumar

02-17-2022

Domain Randomization for Object Counting
by Enric Moreu et al

02-17-2022

Mirror-Yolo: An attention-based instance segmentation and detection model for mirrors
by Fengze Li et al

02-15-2022

Beyond Natural Motion: Exploring Discontinuity for Video Frame Interpolation
by Sangjin Lee et al

02-16-2022

ActionFormer: Localizing Moments of Actions with Transformers
by Chenlin Zhang et al

02-17-2022

Adiabatic Quantum Computing for Multi Object Tracking
by Jan-Nico Zaech et al

02-15-2022

Multimodal Driver Referencing: A Comparison of Pointing to Objects Inside and Outside the Vehicle
by Abdul Rafey Aftab et al

02-17-2022

Visual Ground Truth Construction as Faceted Classification
by Fausto Giunchiglia et al

02-15-2022

Exploring the Devil in Graph Spectral Domain for 3D Point Cloud Attacks
by Qianjiang Hu et al

02-16-2022

FPIC: A Novel Semantic Dataset for Optical PCB Assurance
by Nathan Jessurun et al

02-15-2022

Deep Constrained Least Squares for Blind Image Super-Resolution
by Ziwei Luo et al

02-17-2022

3D-Aware Indoor Scene Synthesis with Depth Priors
by Zifan Shi et al

02-18-2022

MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution Imagery
by Ahmad Khaliq et al

02-16-2022

Image translation of Ultrasound to Pseudo Anatomical Display Using Artificial Intelligence
by Lilach Barkat et al

02-15-2022

A Survey of Semen Quality Evaluation in Microscopic Videos Using Computer Assisted Sperm Analysis
by Wenwei Zhao et al

02-15-2022

Segmentation and Risk Score Prediction of Head and Neck Cancers in PET/CT Volumes with 3D U-Net and Cox Proportional Hazard Neural Networks
by Fereshteh Yousefirizi et al

02-15-2022

RNGDet: Road Network Graph Detection by Transformer in Aerial Images
by Zhenhua Xu et al

02-15-2022

Spatial Transformer K-Means
by Romain Cosentino et al

02-17-2022

R2-D2: Repetitive Reprediction Deep Decipher for Semi-Supervised Deep Learning
by Guo-Hua Wang et al

02-17-2022

Point cloud completion on structured feature map with feedback network
by Zejia Su et al

02-17-2022

Anatomically Parameterized Statistical Shape Model: Explaining Morphometry through Statistical Learning
by Arnaud Boutillon et al

02-15-2022

Review of the Fingerprint Liveness Detection (LivDet) competition series: from 2009 to 2021
by Marco Micheletto et al

02-15-2022

Texture Aware Autoencoder Pre-training And Pairwise Learning Refinement For Improved Iris Recognition
by Manashi Chakraborty et al

02-16-2022

How to Fill the Optimum Set? Population Gradient Descent with Harmless Diversity
by Chengyue Gong et al

02-16-2022

360 Depth Estimation in the Wild -- The Depth360 Dataset and the SegFuse Network
by Qi Feng et al

02-16-2022

Shift-Memory Network for Temporal Scene Segmentation
by Guo Cheng et al

02-17-2022

Single UHD Image Dehazing via Interpretable Pyramid Network
by Boxue Xiao et al

02-17-2022

Domain Adaptation for Underwater Image Enhancement via Content and Style Separation
by Yu-Wei Chen et al

02-15-2022

Deeply-Supervised Knowledge Distillation
by Shiya Luo et al

02-17-2022

How Well Do Self-Supervised Methods Perform in Cross-Domain Few-Shot Learning?
by Yiyi Zhang et al

02-15-2022

A precortical module for robust CNNs to light variations
by R. Fioresi et al

02-15-2022

Deep Learning-based Anomaly Detection on X-ray Images of Fuel Cell Electrodes
by Simon B. Jensen et al

02-17-2022

Semantically Proportional Patchmix for Few-Shot Learning
by Jingquan Wang et al

02-18-2022

Exploring Adversarially Robust Training for Unsupervised Domain Adaptation
by Shao-Yuan Lo et al

02-17-2022

LG-LSQ: Learned Gradient Linear Symmetric Quantization
by Shih-Ting Lin et al

02-17-2022

An Active and Contrastive Learning Framework for Fine-Grained Off-Road Semantic Segmentation
by Biao Gao et al

02-17-2022

TAFNet: A Three-Stream Adaptive Fusion Network for RGB-T Crowd Counting
by Haihan Tang et al

02-17-2022

REFUGE2 Challenge: Treasure for Multi-Domain Learning in Glaucoma Assessment
by Huihui Fang et al

02-16-2022

Unified smoke and fire detection in an evolutionary framework with self-supervised progressive data augment
by Hang Zhang et al

02-15-2022

Label fusion and training methods for reliable representation of inter-rater uncertainty
by Andreanne Lemay et al

02-16-2022

Practical Network Acceleration with Tiny Sets
by Guo-Hua Wang et al

02-18-2022

Iterative Learning for Instance Segmentation
by Tuomas Sormunen et al

02-16-2022

Neural Marionette: Unsupervised Learning of Motion Skeleton and Latent Dynamics from Volumetric Video
by Jinseok Bae et al

02-15-2022

Balancing Domain Experts for Long-Tailed Camera-Trap Recognition
by Byeongjun Park et al

02-18-2022

Incorporating Texture Information into Dimensionality Reduction for High-Dimensional Images
by Alexander Vieth et al

02-18-2022

A Machine Learning Paradigm for Studying Pictorial Realism: Are Constable's Clouds More Real than His Contemporaries?
by Zhuomin Zhang et al

02-18-2022

Lightweight Multi-Drone Detection and 3D-Localization via YOLO
by Aryan Sharma et al

02-16-2022

Contextualize differential privacy in image database: a lightweight image differential privacy approach based on principle component analysis inverse
by Shiliang Zhang et al

02-15-2022

HAA4D: Few-Shot Human Atomic Action Recognition via 3D Spatio-Temporal Skeletal Alignment
by Mu-Ruei Tseng et al

02-15-2022

MeshLeTemp: Leveraging the Learnable Vertex-Vertex Relationship to Generalize Human Pose and Mesh Reconstruction for In-the-Wild Scenes
by Trung Q. Tran et al

02-17-2022

Classification of ADHD Patients by Kernel Hierarchical Extreme Learning Machine
by Sartaj Ahmed Salman et al

02-15-2022

Using Social Media Images for Building Function Classification
by Eike Jens Hoffmann et al

02-15-2022

Cross-view and Cross-domain Underwater Localization based on Optical Aerial and Acoustic Underwater Images
by Matheus M. Dos Santos et al

02-17-2022

Level set based particle filter driven by optical flow: an application to track the salt boundary from X-ray CT time-series
by Karim Makki et al

02-18-2022

Towards better understanding and better generalization of few-shot classification in histology images with contrastive learning
by Jiawei Yang et al

02-18-2022

(2.5+1)D Spatio-Temporal Scene Graphs for Video Question Answering
by Anoop Cherian et al

02-18-2022

Task Specific Attention is one more thing you need for object detection
by Sang Yon Lee

02-15-2022

Hyper-relationship Learning Network for Scene Graph Generation
by Yibing Zhan et al

02-17-2022

Colonoscopy polyp detection with massive endoscopic images
by Jialin Yu et al

02-16-2022

Learning to Detect People on the Fly: A Bio-inspired Event-based Visual System for Drones
by Ali Safa et al

02-15-2022

Random Walks for Adversarial Meshes
by Amir Belder et al

02-17-2022

A Wavelet-based Dual-stream Network for Underwater Image Enhancement
by Ziyin Ma et al

02-15-2022

PCRP: Unsupervised Point Cloud Object Retrieval and Pose Estimation
by Pranav Kadam et al

02-16-2022

Label Propagation for Annotation-Efficient Nuclei Segmentation from Pathology Images
by Yi Lin et al

02-16-2022

Less is More: Surgical Phase Recognition from Timestamp Supervision
by Zixun Wang et al

02-17-2022

Joint Learning of Frequency and Spatial Domains for Dense Predictions
by Shaocheng Jia et al

02-17-2022

Realistic Blur Synthesis for Learning Image Deblurring
by Jaesung Rim et al

02-16-2022

FUN-SIS: a Fully UNsupervised approach for Surgical Instrument Segmentation
by Luca Sestini et al

02-16-2022

Learning to Adapt to Light
by Kai-Fu Yang et al

02-15-2022

Improving the repeatability of deep learning models with Monte Carlo dropout
by Andreanne Lemay et al

02-17-2022

A Comprehensive Survey with Quantitative Comparison of Image Analysis Methods for Microorganism Biovolume Measurements
by Jiawei Zhang et al

02-17-2022

Graph Convolutional Networks for Multi-modality Medical Imaging: Methods, Architectures, and Clinical Applications
by Kexin Ding et al

02-18-2022

Towards Simple and Accurate Human Pose Estimation with Stair Network
by Chenru Jiang et al

02-15-2022

Energy-Efficient Parking Analytics System using Deep Reinforcement Learning
by Yoones Rezaei et al

02-15-2022

SODAR: Segmenting Objects by Dynamically Aggregating Neighboring Mask Representations
by Tao Wang et al

02-16-2022

A Developmentally-Inspired Examination of Shape versus Texture Bias in Machines
by Alexa R. Tartaglini et al

02-15-2022

On Representation Learning with Feedback
by Hao Li

02-15-2022

ScoreNet: Learning Non-Uniform Attention and Augmentation for Transformer-Based Histopathological Image Classification
by Thomas Stegmüller et al

02-15-2022

Post-Training Quantization for Cross-Platform Learned Image Compression
by Dailan He et al

02-17-2022

Prior image-based medical image reconstruction using a style-based generative adversarial network
by Varun A. Kelkar et al

02-17-2022

When, Why, and Which Pretrained GANs Are Useful?
by Timofey Grigoryev et al

02-17-2022

On Guiding Visual Attention with Language Specification
by Suzanne Petryk et al

02-18-2022

Spatio-Temporal Outdoor Lighting Aggregation on Image Sequences using Transformer Networks
by Haebom Lee et al

02-18-2022

Guide Local Feature Matching by Overlap Estimation
by Ying Chen et al

02-15-2022

A Subjective Quality Study for Video Frame Interpolation
by Duolikun Danier et al

02-16-2022

Flexible-Modal Face Anti-Spoofing: A Benchmark
by Zitong Yu et al

02-15-2022

Beyond Deterministic Translation for Unsupervised Domain Adaptation
by Eleni Chiou et al

02-15-2022

Normalized K-Means for Noise-Insensitive Multi-Dimensional Feature Learning
by Nicholas Pellegrino et al

02-15-2022

Enhancing Deformable Convolution based Video Frame Interpolation with Coarse-to-fine 3D CNN
by Duolikun Danier et al

02-15-2022

Misinformation Detection in Social Media Video Posts
by Kehan Wang et al

02-17-2022

Machine learning models and facial regions videos for estimating heart rate: a review on Patents, Datasets and Literature
by Tiago Palma Pagano et al

02-15-2022

Privacy Preserving Visual Question Answering
by Cristian-Paul Bara et al

02-16-2022

Cyclical Focal Loss
by Leslie N. Smith

02-15-2022

Deep Learning-Assisted Co-registration of Full-Spectral Autofluorescence Lifetime Microscopic Images with H&E-Stained Histology Images
by Qiang Wang et al

02-17-2022

Deep Transfer Learning on Satellite Imagery Improves Air Quality Estimates in Developing Nations
by Nishant Yadav et al

02-15-2022

Self-Supervised Class-Cognizant Few-Shot Classification
by Ojas Kishore Shirekar et al

02-17-2022

Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study
by Giovanni Cioffi et al

02-17-2022

Developing Imperceptible Adversarial Patches to Camouflage Military Assets From Computer Vision Enabled Technologies
by Christopher Wise et al

 
Craig Smith