2022.1.17 Vision papers

 

01-11-2022

HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning
by Andrey Zhmoginov et al

01-11-2022

In Defense of the Unitary Scalarization for Deep Multi-Task Learning
by Vitaly Kurin et al

01-11-2022

HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video
by Chung-Yi Weng et al

01-13-2022

Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet?
by Nenad Tomasev et al

01-13-2022

SeamlessGAN: Self-Supervised Synthesis of Tileable Texture Maps
by Carlos Rodriguez-Pardo et al

01-13-2022

GradMax: Growing Neural Networks using Gradient Information
by Utku Evci et al

01-12-2022

Robust Contrastive Learning against Noisy Views
by Ching-Yao Chuang et al

01-14-2022

When less is more: Simplifying inputs aids neural network understanding
by Robin Tibor Schirrmeister et al

01-11-2022

Multiview Transformers for Video Recognition
by Shen Yan et al

01-12-2022

Get your Foes Fooled: Proximal Gradient Split Learning for Defense against Model Inversion Attacks on IoMT data
by Sunder Ali Khowaja et al

01-12-2022

BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations
by Daiqing Li et al

01-11-2022

Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents
by Ethan Weber et al

01-13-2022

Stereo Magnification with Multi-Layer Images
by Taras Khakhulin et al

01-12-2022

Spatial-Temporal Map Vehicle Trajectory Detection Using Dynamic Mode Decomposition and Res-UNet+ Neural Networks
by Tianya T. Zhang et al

01-13-2022

CLIP-Event: Connecting Text and Images with Event Structures
by Manling Li et al

01-13-2022

Conditional Variational Autoencoder with Balanced Pre-training for Generative Adversarial Networks
by Yuchong Yao et al

01-11-2022

gDNA: Towards Generative Detailed Neural Avatars
by Xu Chen et al

01-13-2022

Self-semantic contour adaptation for cross modality brain tumor segmentation
by Xiaofeng Liu et al

01-13-2022

Weakly Supervised Scene Text Detection using Deep Reinforcement Learning
by Emanuel Metzenthin et al

01-13-2022

A Thousand Words Are Worth More Than a Picture: Natural Language-Centric Outside-Knowledge Visual Question Answering
by Feng Gao et al

01-11-2022

Learning to Denoise Raw Mobile UI Layouts for Improving Datasets at Scale
by Gang Li et al

01-12-2022

Real-Time Style Modelling of Human Locomotion via Feature-Wise Transformations and Local Motion Phases
by Ian Mason et al

01-12-2022

Early Diagnosis of Parkinsons Disease by Analyzing Magnetic Resonance Imaging Brain Scans and Patient Characteristics
by Sabrina Zhu

01-12-2022

Virtual Elastic Objects
by Hsiao-yu Chen et al

01-13-2022

Boundary-aware Self-supervised Learning for Video Scene Segmentation
by Jonghwan Mun et al

01-12-2022

Neural Residual Flow Fields for Efficient Video Representations
by Daniel Rho et al

01-12-2022

Optimizing Prediction of MGMT Promoter Methylation from MRI Scans using Adversarial Learning
by Sauman Das

01-12-2022

Beyond the Visible: A Survey on Cross-spectral Face Recognition
by David Anghelone et al

01-13-2022

Technical Report for ICCV 2021 Challenge SSLAD-Track3B: Transformers Are Better Continual Learners
by Duo Li et al

01-13-2022

BridgeFormer: Bridging Video-text Retrieval with Multiple Choice Questions
by Yuying Ge et al

01-11-2022

MobileFaceSwap: A Lightweight Framework for Video Face Swapping
by Zhiliang Xu et al

01-11-2022

Captcha Attack: Turning Captchas Against Humanity
by Mauro Conti et al

01-11-2022

Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training
by Yehao Li et al

01-13-2022

Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching
by Yunpeng Shi et al

01-12-2022

Towards Adversarially Robust Deep Image Denoising
by Hanshu Yan et al

01-11-2022

Classification of Beer Bottles using Object Detection and Transfer Learning
by Philipp Hohlfeld et al

01-13-2022

Recursive Least Squares for Training and Pruning Convolutional Neural Networks
by Tianzong Yu et al

01-12-2022

Uniformer: Unified Transformer for Efficient Spatiotemporal Representation Learning
by Kunchang Li et al

01-11-2022

Neural Capacitance: A New Perspective of Neural Network Selection via Edge Dynamics
by Chunheng Jiang et al

01-11-2022

Dynamical Audio-Visual Navigation: Catching Unheard Moving Sound Sources in Unmapped 3D Environments
by Abdelrahman Younes

01-11-2022

MobilePhys: Personalized Mobile Camera-Based Contactless Physiological Sensing
by Xin Liu et al

01-13-2022

VoLux-GAN: A Generative Model for 3D Face Synthesis with HDRI Relighting
by Feitong Tan et al

01-12-2022

Collision Detection: An Improved Deep Learning Approach Using SENet and ResNext
by Aloukik Aditya et al

01-13-2022

Beyond Simple Meta-Learning: Multi-Purpose Models for Multi-Domain, Active and Continual Few-Shot Learning
by Peyman Bateni et al

01-13-2022

Automatic Sparse Connectivity Learning for Neural Networks
by Zhimin Tang et al

01-12-2022

Adversarially Robust Classification by Conditional Generative Model Inversion
by Mitra Alirezaei et al

01-13-2022

S2FPR: Crowd Counting via Self-Supervised Coarse to Fine Feature Pyramid Ranking
by Jiaqi Gao et al

01-14-2022

Unsupervised Temporal Video Grounding with Deep Semantic Clustering
by Daizong Liu et al

01-13-2022

EMT-NET: Efficient multitask network for computer-aided diagnosis of breast cancer
by Jiaqiao Shi et al

01-12-2022

MoViDNN: A Mobile Platform for Evaluating Video Quality Enhancement with Deep Neural Networks
by Ekrem Çetinkaya et al

01-12-2022

Knee Cartilage Defect Assessment by Graph Representation and Surface Convolution
by Zixu Zhuang et al

01-11-2022

Feature Extraction Framework based on Contrastive Learning with Adaptive Positive and Negative Samples
by Hongjie Zhang

01-11-2022

Emotion Estimation from EEG -- A Dual Deep Learning Approach Combined with Saliency
by Victor Delvigne et al

01-13-2022

TransVOD: End-to-end Video Object Detection with Spatial-Temporal Transformers
by Qianyu Zhou et al

01-12-2022

Partial-Attribution Instance Segmentation for Astronomical Source Detection and Deblending
by Ryan Hausen et al

01-12-2022

Predicting Alzheimers Disease Using 3DMgNet
by Yelu Gao et al

01-11-2022

Optimization Planning for 3D ConvNets
by Zhaofan Qiu et al

01-14-2022

A New Deep Hybrid Boosted and Ensemble Learning-based Brain Tumor Analysis using MRI
by Mirza Mumtaz Zahoor et al

01-12-2022

MAg: a simple learning-based patient-level aggregation method for detecting microsatellite instability from whole-slide images
by Kaifeng Pang et al

01-12-2022

ECONet: Efficient Convolutional Online Likelihood Network for Scribble-based Interactive Segmentation
by Muhammad Asad et al

01-11-2022

Drone Object Detection Using RGB/IR Fusion
by Lizhi Yang et al

01-12-2022

A Survey on Masked Facial Detection Methods and Datasets for Fighting Against COVID-19
by Bingshu Wang et al

01-12-2022

Unlocking large-scale crop field delineation in smallholder farming systems with transfer learning and weak supervision
by Sherrie Wang et al

01-12-2022

SCSNet: An Efficient Paradigm for Learning Simultaneously Image Colorization and Super-Resolution
by Jiangning Zhang et al

01-11-2022

Image quality measurements and denoising using Fourier Ring Correlations
by J. Kaczmar-Michalska et al

01-13-2022

Fantastic Data and How to Query Them
by Trung-Kien Tran et al

01-13-2022

CFNet: Learning Correlation Functions for One-Stage Panoptic Segmentation
by Yifeng Chen et al

01-12-2022

Depth Estimation from Single-shot Monocular Endoscope Image Using Image Domain Adaptation And Edge-Aware Depth Estimation
by Masahiro Oda et al

01-12-2022

AI Singapore Trusted Media Challenge Dataset
by Weiling Chen et al

01-13-2022

Flexible Style Image Super-Resolution using Conditional Objective
by Seung Ho Park et al

01-13-2022

Fully Adaptive Bayesian Algorithm for Data Analysis, FABADA
by Pablo M Sanchez-Alarcon et al

01-13-2022

RealGait: Gait Recognition for Person Re-Identification
by Shaoxiong Zhang et al

01-13-2022

Hand-Object Interaction Reasoning
by Jian Ma et al

01-12-2022

Structure and position-aware graph neural network for airway labeling
by Weiyi Xie et al

01-13-2022

SimReg: Regression as a Simple Yet Effective Tool for Self-supervised Knowledge Distillation
by K L Navaneet et al

01-12-2022

Sparsely Annotated Object Detection: A Region-based Semi-supervised Approach
by Sai Saketh Rambhatla et al

01-13-2022

On Adversarial Robustness of Trajectory Prediction for Autonomous Vehicles
by Qingzhao Zhang et al

01-13-2022

Deep Leaning-Based Ultra-Fast Stair Detection
by Chen Wang et al

01-13-2022

Unsupervised Domain Adaptation for Cross-Modality Retinal Vessel Segmentation via Disentangling Representation Style Transfer and Collaborative Consistency Learning
by Linkai Peng et al

01-11-2022

SmartDet: Context-Aware Dynamic Control of Edge Task Offloading for Mobile Object Detection
by Davide Callegaro et al

01-11-2022

On the Efficacy of Co-Attention Transformer Layers in Visual Question Answering
by Ankur Sikarwar et al

01-11-2022

Similarity-based Gray-box Adversarial Attack Against Deep Face Recognition
by Hanrui Wang et al

01-12-2022

Maximizing Self-supervision from Thermal Image for Effective Self-supervised Learning of Depth and Ego-motion
by Ukcheol Shin et al

01-12-2022

SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds
by Qingyong Hu et al

01-13-2022

Realistic Endoscopic Image Generation Method Using Virtual-to-real Image-domain Translation
by Masahiro Oda et al

01-13-2022

MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning
by Yuying Ge et al

01-14-2022

Semi-automated Virtual Unfolded View Generation Method of Stomach from CT Volumes
by Masahiro Oda et al

01-14-2022

AWSnet: An Auto-weighted Supervision Attention Network for Myocardial Scar and Edema Segmentation in Multi-sequence Cardiac Magnetic Resonance Images
by Kai-Ni Wang et al

01-12-2022

Roadside Lidar Vehicle Detection and Tracking Using Range And Intensity Background Subtraction
by Tianya Zhang et al

01-13-2022

SnapshotNet: Self-supervised Feature Learning for Point Cloud Data Segmentation Using Minimal Labeled Data
by Xingye Li et al

01-13-2022

Learning Semantic Abstraction of Shape via 3D Region of Interest
by Haiyue Fang et al

01-12-2022

Coarse-to-Fine Embedded PatchMatch and Multi-Scale Dynamic Aggregation for Reference-based Super-Resolution
by Bin Xia et al

01-14-2022

A Novel Skeleton-Based Human Activity Discovery Technique Using Particle Swarm Optimization with Gaussian Mutation
by Parham Hadikhani et al

01-14-2022

HYLDA: End-to-end Hybrid Learning Domain Adaptation for LiDAR Semantic Segmentation
by Eduardo R. Corral-Soto et al

01-14-2022

Saliency Constrained Arbitrary Image Style Transfer using SIFT and DCNN
by HuiHuang Zhao et al

01-13-2022

Learning Enhancement of CNNs via Separation Index Maximizing at the First Convolutional Layer
by Ali Karimi et al

01-12-2022

Globally Optimal Multi-Scale Monocular Hand-Eye Calibration Using Dual Quaternions
by Thomas Wodtko et al

01-14-2022

HardBoost: Boosting Zero-Shot Learning with Hard Classes
by Bo Liu et al

01-13-2022

STEdge: Self-training Edge Detection with Multi-layer Teaching and Regularization
by Yunfan Ye et al

01-14-2022

Emergence of Machine Language: Towards Symbolic Intelligence with Neural Networks
by Yuqi Wang et al

01-12-2022

Toddler-Guidance Learning: Impacts of Critical Period on Multimodal AI Agents
by Junseok Park et al

01-11-2022

Pyramid Fusion Transformer for Semantic Segmentation
by Zipeng Qin et al

01-12-2022

OCSampler: Compressing Videos to One Clip with Single-step Sampling
by Jintao Lin et al

01-14-2022

SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions
by Ali Samadzadeh et al

01-11-2022

Where Is My Mind (looking at)? Predicting Visual Attention from Brain Activity
by Victor Delvigne et al

01-13-2022

Multi-granularity Association Learning Framework for on-the-fly Fine-Grained Sketch-based Image Retrieval
by Dawei Dai et al

01-14-2022

Determination of building flood risk maps from LiDAR mobile mapping data
by Yu Feng et al

01-11-2022

On Exploring Pose Estimation as an Auxiliary Learning Task for Visible-Infrared Person Re-identification
by Yunqi Miao et al

01-11-2022

Unsupervised Domain Adaptive Person Re-id with Local-enhance and Prototype Dictionary Learning
by Haopeng Hou

01-13-2022

Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals
by Lijun Yu et al

01-13-2022

MMNet: Muscle motion-guided network for micro-expression recognition
by Hanting Li et al

01-11-2022

Smart Director: An Event-Driven Directing System for Live Broadcasting
by Yingwei Pan et al

01-11-2022

Efficient Non-Local Contrastive Attention for Image Super-Resolution
by Bin Xia et al

01-11-2022

COROLLA: An Efficient Multi-Modality Fusion Framework with Supervised Contrastive Learning for Glaucoma Grading
by Zhiyuan Cai et al

01-12-2022

MDS-Net: A Multi-scale Depth Stratification Based Monocular 3D Object Detection Algorithm
by Zhouzhen Xie et al

01-11-2022

DM-VIO: Delayed Marginalization Visual-Inertial Odometry
by Lukas von Stumberg et al

01-11-2022

Motion-Focused Contrastive Learning of Video Representations
by Rui Li et al

01-11-2022

Representing Videos as Discriminative Sub-graphs for Action Recognition
by Dong Li et al

01-13-2022

Manifoldron: Direct Space Partition via Manifold Discovery
by Dayang Wang et al

01-11-2022

Towards Lightweight Neural Animation: Exploration of Neural Network Pruning in Mixture of Experts-based Animation Models
by Antoine Maiorca et al

01-11-2022

MDPose: Human Skeletal Motion Reconstruction Using WiFi Micro-Doppler Signatures
by Chong Tang et al

01-11-2022

Region-based Layout Analysis of Music Score Images
by Francisco J. Castellanos et al

01-11-2022

Overview of the HECKTOR Challenge at MICCAI 2021: Automatic Head and Neck Tumor Segmentation and Outcome Prediction in PET/CT Images
by Vincent Andrearczyk et al

01-14-2022

ViT2Hash: Unsupervised Information-Preserving Hashing
by Qinkang Gong et al

01-14-2022

Multimodal registration of FISH and nanoSIMS images using convolutional neural network models
by Xiaojia He et al

01-11-2022

Condensing a Sequence to One Informative Frame for Video Recognition
by Zhaofan Qiu et al

01-12-2022

Semantic Labeling of Human Action For Visually Impaired And Blind People Scene Interaction
by Leyla Benhamida et al

01-11-2022

Boosting Video Representation Learning with Multi-Faceted Integration
by Zhaofan Qiu et al

01-13-2022

Density Estimation from Schlieren Images through Machine Learning
by Bryn Noel Ubald et al

 
Craig Smith