2022.1.17 Vision papers

01-11-2022	HyperTransformer: Model Generation for Supervised and Semi-Supervised Few-Shot Learning by Andrey Zhmoginov et al
01-11-2022	In Defense of the Unitary Scalarization for Deep Multi-Task Learning by Vitaly Kurin et al
01-11-2022	HumanNeRF: Free-viewpoint Rendering of Moving People from Monocular Video by Chung-Yi Weng et al
01-13-2022	Pushing the limits of self-supervised ResNets: Can we outperform supervised learning without labels on ImageNet? by Nenad Tomasev et al
01-13-2022	SeamlessGAN: Self-Supervised Synthesis of Tileable Texture Maps by Carlos Rodriguez-Pardo et al
01-13-2022	GradMax: Growing Neural Networks using Gradient Information by Utku Evci et al
01-12-2022	Robust Contrastive Learning against Noisy Views by Ching-Yao Chuang et al
01-14-2022	When less is more: Simplifying inputs aids neural network understanding by Robin Tibor Schirrmeister et al
01-11-2022	Multiview Transformers for Video Recognition by Shen Yan et al
01-12-2022	Get your Foes Fooled: Proximal Gradient Split Learning for Defense against Model Inversion Attacks on IoMT data by Sunder Ali Khowaja et al
01-12-2022	BigDatasetGAN: Synthesizing ImageNet with Pixel-wise Annotations by Daiqing Li et al
01-11-2022	Incidents1M: a large-scale dataset of images with natural disasters, damage, and incidents by Ethan Weber et al
01-13-2022	Stereo Magnification with Multi-Layer Images by Taras Khakhulin et al
01-12-2022	Spatial-Temporal Map Vehicle Trajectory Detection Using Dynamic Mode Decomposition and Res-UNet+ Neural Networks by Tianya T. Zhang et al
01-13-2022	CLIP-Event: Connecting Text and Images with Event Structures by Manling Li et al
01-13-2022	Conditional Variational Autoencoder with Balanced Pre-training for Generative Adversarial Networks by Yuchong Yao et al
01-11-2022	gDNA: Towards Generative Detailed Neural Avatars by Xu Chen et al
01-13-2022	Self-semantic contour adaptation for cross modality brain tumor segmentation by Xiaofeng Liu et al
01-13-2022	Weakly Supervised Scene Text Detection using Deep Reinforcement Learning by Emanuel Metzenthin et al
01-13-2022	A Thousand Words Are Worth More Than a Picture: Natural Language-Centric Outside-Knowledge Visual Question Answering by Feng Gao et al
01-11-2022	Learning to Denoise Raw Mobile UI Layouts for Improving Datasets at Scale by Gang Li et al
01-12-2022	Real-Time Style Modelling of Human Locomotion via Feature-Wise Transformations and Local Motion Phases by Ian Mason et al
01-12-2022	Early Diagnosis of Parkinsons Disease by Analyzing Magnetic Resonance Imaging Brain Scans and Patient Characteristics by Sabrina Zhu
01-12-2022	Virtual Elastic Objects by Hsiao-yu Chen et al
01-13-2022	Boundary-aware Self-supervised Learning for Video Scene Segmentation by Jonghwan Mun et al
01-12-2022	Neural Residual Flow Fields for Efficient Video Representations by Daniel Rho et al
01-12-2022	Optimizing Prediction of MGMT Promoter Methylation from MRI Scans using Adversarial Learning by Sauman Das
01-12-2022	Beyond the Visible: A Survey on Cross-spectral Face Recognition by David Anghelone et al
01-13-2022	Technical Report for ICCV 2021 Challenge SSLAD-Track3B: Transformers Are Better Continual Learners by Duo Li et al
01-13-2022	BridgeFormer: Bridging Video-text Retrieval with Multiple Choice Questions by Yuying Ge et al
01-11-2022	MobileFaceSwap: A Lightweight Framework for Video Face Swapping by Zhiliang Xu et al
01-11-2022	Captcha Attack: Turning Captchas Against Humanity by Mauro Conti et al
01-11-2022	Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training by Yehao Li et al
01-13-2022	Scalable Cluster-Consistency Statistics for Robust Multi-Object Matching by Yunpeng Shi et al
01-12-2022	Towards Adversarially Robust Deep Image Denoising by Hanshu Yan et al
01-11-2022	Classification of Beer Bottles using Object Detection and Transfer Learning by Philipp Hohlfeld et al
01-13-2022	Recursive Least Squares for Training and Pruning Convolutional Neural Networks by Tianzong Yu et al
01-12-2022	Uniformer: Unified Transformer for Efficient Spatiotemporal Representation Learning by Kunchang Li et al
01-11-2022	Neural Capacitance: A New Perspective of Neural Network Selection via Edge Dynamics by Chunheng Jiang et al
01-11-2022	Dynamical Audio-Visual Navigation: Catching Unheard Moving Sound Sources in Unmapped 3D Environments by Abdelrahman Younes
01-11-2022	MobilePhys: Personalized Mobile Camera-Based Contactless Physiological Sensing by Xin Liu et al
01-13-2022	VoLux-GAN: A Generative Model for 3D Face Synthesis with HDRI Relighting by Feitong Tan et al
01-12-2022	Collision Detection: An Improved Deep Learning Approach Using SENet and ResNext by Aloukik Aditya et al
01-13-2022	Beyond Simple Meta-Learning: Multi-Purpose Models for Multi-Domain, Active and Continual Few-Shot Learning by Peyman Bateni et al
01-13-2022	Automatic Sparse Connectivity Learning for Neural Networks by Zhimin Tang et al
01-12-2022	Adversarially Robust Classification by Conditional Generative Model Inversion by Mitra Alirezaei et al
01-13-2022	S22FPR: Crowd Counting via Self-Supervised Coarse to Fine Feature Pyramid Ranking by Jiaqi Gao et al
01-14-2022	Unsupervised Temporal Video Grounding with Deep Semantic Clustering by Daizong Liu et al
01-13-2022	EMT-NET: Efficient multitask network for computer-aided diagnosis of breast cancer by Jiaqiao Shi et al
01-12-2022	MoViDNN: A Mobile Platform for Evaluating Video Quality Enhancement with Deep Neural Networks by Ekrem Çetinkaya et al
01-12-2022	Knee Cartilage Defect Assessment by Graph Representation and Surface Convolution by Zixu Zhuang et al
01-11-2022	Feature Extraction Framework based on Contrastive Learning with Adaptive Positive and Negative Samples by Hongjie Zhang
01-11-2022	Emotion Estimation from EEG -- A Dual Deep Learning Approach Combined with Saliency by Victor Delvigne et al
01-13-2022	TransVOD: End-to-end Video Object Detection with Spatial-Temporal Transformers by Qianyu Zhou et al
01-12-2022	Partial-Attribution Instance Segmentation for Astronomical Source Detection and Deblending by Ryan Hausen et al
01-12-2022	Predicting Alzheimers Disease Using 3DMgNet by Yelu Gao et al
01-11-2022	Optimization Planning for 3D ConvNets by Zhaofan Qiu et al
01-14-2022	A New Deep Hybrid Boosted and Ensemble Learning-based Brain Tumor Analysis using MRI by Mirza Mumtaz Zahoor et al
01-12-2022	MAg: a simple learning-based patient-level aggregation method for detecting microsatellite instability from whole-slide images by Kaifeng Pang et al
01-12-2022	ECONet: Efficient Convolutional Online Likelihood Network for Scribble-based Interactive Segmentation by Muhammad Asad et al
01-11-2022	Drone Object Detection Using RGB/IR Fusion by Lizhi Yang et al
01-12-2022	A Survey on Masked Facial Detection Methods and Datasets for Fighting Against COVID-19 by Bingshu Wang et al
01-12-2022	Unlocking large-scale crop field delineation in smallholder farming systems with transfer learning and weak supervision by Sherrie Wang et al
01-12-2022	SCSNet: An Efficient Paradigm for Learning Simultaneously Image Colorization and Super-Resolution by Jiangning Zhang et al
01-11-2022	Image quality measurements and denoising using Fourier Ring Correlations by J. Kaczmar-Michalska et al
01-13-2022	Fantastic Data and How to Query Them by Trung-Kien Tran et al
01-13-2022	CFNet: Learning Correlation Functions for One-Stage Panoptic Segmentation by Yifeng Chen et al
01-12-2022	Depth Estimation from Single-shot Monocular Endoscope Image Using Image Domain Adaptation And Edge-Aware Depth Estimation by Masahiro Oda et al
01-12-2022	AI Singapore Trusted Media Challenge Dataset by Weiling Chen et al
01-13-2022	Flexible Style Image Super-Resolution using Conditional Objective by Seung Ho Park et al
01-13-2022	Fully Adaptive Bayesian Algorithm for Data Analysis, FABADA by Pablo M Sanchez-Alarcon et al
01-13-2022	RealGait: Gait Recognition for Person Re-Identification by Shaoxiong Zhang et al
01-13-2022	Hand-Object Interaction Reasoning by Jian Ma et al
01-12-2022	Structure and position-aware graph neural network for airway labeling by Weiyi Xie et al
01-13-2022	SimReg: Regression as a Simple Yet Effective Tool for Self-supervised Knowledge Distillation by K L Navaneet et al
01-12-2022	Sparsely Annotated Object Detection: A Region-based Semi-supervised Approach by Sai Saketh Rambhatla et al
01-13-2022	On Adversarial Robustness of Trajectory Prediction for Autonomous Vehicles by Qingzhao Zhang et al
01-13-2022	Deep Leaning-Based Ultra-Fast Stair Detection by Chen Wang et al
01-13-2022	Unsupervised Domain Adaptation for Cross-Modality Retinal Vessel Segmentation via Disentangling Representation Style Transfer and Collaborative Consistency Learning by Linkai Peng et al
01-11-2022	SmartDet: Context-Aware Dynamic Control of Edge Task Offloading for Mobile Object Detection by Davide Callegaro et al
01-11-2022	On the Efficacy of Co-Attention Transformer Layers in Visual Question Answering by Ankur Sikarwar et al
01-11-2022	Similarity-based Gray-box Adversarial Attack Against Deep Face Recognition by Hanrui Wang et al
01-12-2022	Maximizing Self-supervision from Thermal Image for Effective Self-supervised Learning of Depth and Ego-motion by Ukcheol Shin et al
01-12-2022	SensatUrban: Learning Semantics from Urban-Scale Photogrammetric Point Clouds by Qingyong Hu et al
01-13-2022	Realistic Endoscopic Image Generation Method Using Virtual-to-real Image-domain Translation by Masahiro Oda et al
01-13-2022	MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning by Yuying Ge et al
01-14-2022	Semi-automated Virtual Unfolded View Generation Method of Stomach from CT Volumes by Masahiro Oda et al
01-14-2022	AWSnet: An Auto-weighted Supervision Attention Network for Myocardial Scar and Edema Segmentation in Multi-sequence Cardiac Magnetic Resonance Images by Kai-Ni Wang et al
01-12-2022	Roadside Lidar Vehicle Detection and Tracking Using Range And Intensity Background Subtraction by Tianya Zhang et al
01-13-2022	SnapshotNet: Self-supervised Feature Learning for Point Cloud Data Segmentation Using Minimal Labeled Data by Xingye Li et al
01-13-2022	Learning Semantic Abstraction of Shape via 3D Region of Interest by Haiyue Fang et al
01-12-2022	Coarse-to-Fine Embedded PatchMatch and Multi-Scale Dynamic Aggregation for Reference-based Super-Resolution by Bin Xia et al
01-14-2022	A Novel Skeleton-Based Human Activity Discovery Technique Using Particle Swarm Optimization with Gaussian Mutation by Parham Hadikhani et al
01-14-2022	HYLDA: End-to-end Hybrid Learning Domain Adaptation for LiDAR Semantic Segmentation by Eduardo R. Corral-Soto et al
01-14-2022	Saliency Constrained Arbitrary Image Style Transfer using SIFT and DCNN by HuiHuang Zhao et al
01-13-2022	Learning Enhancement of CNNs via Separation Index Maximizing at the First Convolutional Layer by Ali Karimi et al
01-12-2022	Globally Optimal Multi-Scale Monocular Hand-Eye Calibration Using Dual Quaternions by Thomas Wodtko et al
01-14-2022	HardBoost: Boosting Zero-Shot Learning with Hard Classes by Bo Liu et al
01-13-2022	STEdge: Self-training Edge Detection with Multi-layer Teaching and Regularization by Yunfan Ye et al
01-14-2022	Emergence of Machine Language: Towards Symbolic Intelligence with Neural Networks by Yuqi Wang et al
01-12-2022	Toddler-Guidance Learning: Impacts of Critical Period on Multimodal AI Agents by Junseok Park et al
01-11-2022	Pyramid Fusion Transformer for Semantic Segmentation by Zipeng Qin et al
01-12-2022	OCSampler: Compressing Videos to One Clip with Single-step Sampling by Jintao Lin et al
01-14-2022	SRVIO: Super Robust Visual Inertial Odometry for dynamic environments and challenging Loop-closure conditions by Ali Samadzadeh et al
01-11-2022	Where Is My Mind (looking at)? Predicting Visual Attention from Brain Activity by Victor Delvigne et al
01-13-2022	Multi-granularity Association Learning Framework for on-the-fly Fine-Grained Sketch-based Image Retrieval by Dawei Dai et al
01-14-2022	Determination of building flood risk maps from LiDAR mobile mapping data by Yu Feng et al
01-11-2022	On Exploring Pose Estimation as an Auxiliary Learning Task for Visible-Infrared Person Re-identification by Yunqi Miao et al
01-11-2022	Unsupervised Domain Adaptive Person Re-id with Local-enhance and Prototype Dictionary Learning by Haopeng Hou
01-13-2022	Argus++: Robust Real-time Activity Detection for Unconstrained Video Streams with Overlapping Cube Proposals by Lijun Yu et al
01-13-2022	MMNet: Muscle motion-guided network for micro-expression recognition by Hanting Li et al
01-11-2022	Smart Director: An Event-Driven Directing System for Live Broadcasting by Yingwei Pan et al
01-11-2022	Efficient Non-Local Contrastive Attention for Image Super-Resolution by Bin Xia et al
01-11-2022	COROLLA: An Efficient Multi-Modality Fusion Framework with Supervised Contrastive Learning for Glaucoma Grading by Zhiyuan Cai et al
01-12-2022	MDS-Net: A Multi-scale Depth Stratification Based Monocular 3D Object Detection Algorithm by Zhouzhen Xie et al
01-11-2022	DM-VIO: Delayed Marginalization Visual-Inertial Odometry by Lukas von Stumberg et al
01-11-2022	Motion-Focused Contrastive Learning of Video Representations by Rui Li et al
01-11-2022	Representing Videos as Discriminative Sub-graphs for Action Recognition by Dong Li et al
01-13-2022	Manifoldron: Direct Space Partition via Manifold Discovery by Dayang Wang et al
01-11-2022	Towards Lightweight Neural Animation : Exploration of Neural Network Pruning in Mixture of Experts-based Animation Models by Antoine Maiorca et al
01-11-2022	MDPose: Human Skeletal Motion Reconstruction Using WiFi Micro-Doppler Signatures by Chong Tang et al
01-11-2022	Region-based Layout Analysis of Music Score Images by Francisco J. Castellanos et al
01-11-2022	Overview of the HECKTOR Challenge at MICCAI 2021: Automatic Head and Neck Tumor Segmentation and Outcome Prediction in PET/CT Images by Vincent Andrearczyk et al
01-14-2022	ViT2Hash: Unsupervised Information-Preserving Hashing by Qinkang Gong et al
01-14-2022	Multimodal registration of FISH and nanoSIMS images using convolutional neural network models by Xiaojia He et al
01-11-2022	Condensing a Sequence to One Informative Frame for Video Recognition by Zhaofan Qiu et al
01-12-2022	Semantic Labeling of Human Action For Visually Impaired And Blind People Scene Interaction by Leyla Benhamida et al
01-11-2022	Boosting Video Representation Learning with Multi-Faceted Integration by Zhaofan Qiu et al
01-13-2022	Density Estimation from Schlieren Images through Machine Learning by Bryn Noel Ubald et al

Craig SmithJanuary 17, 2022