2021.11.8 Vision papers

11-05-2021	SSA: Semantic Structure Aware Inference for Weakly Pixel-Wise Dense Predictions without Cost by Yanpeng Sun et al
11-05-2021	Learning of Frequency-Time Attention Mechanism for Automatic Modulation Recognition by Shangao Lin et al
11-05-2021	Edge Tracing using Gaussian Process Regression by Jamie Burke et al
11-04-2021	Multi-scale 2D Representation Learning for weakly-supervised moment retrieval by Ding Li et al
11-04-2021	LVIS Challenge Track Technical Report 1st Place Solution: Distribution Balanced and Boundary Refinement for Large Vocabulary Instance Segmentation by WeiFu Fu et al
11-02-2021	A Critical Study on the Recent Deep Learning Based Semi-Supervised Video Anomaly Detection Methods by Mohammad Baradaran et al
11-02-2021	PolyTrack: Tracking with Bounding Polygons by Gaspar Faure et al
11-03-2021	Deep Point Set Resampling via Gradient Fields by Haolan Chen et al
11-03-2021	Sequence-to-Sequence Modeling for Action Identification at High Temporal Resolution by Aakash Kaku et al
11-03-2021	An Empirical Study of Training End-to-End Vision-and-Language Transformers by Zi-Yi Dou et al
11-05-2021	A Deep Learning Generative Model Approach for Image Synthesis of Plant Leaves by Alessandro Benfenati et al
11-05-2021	Versatile Learned Video Compression by Runsen Feng et al
11-05-2021	Seamless Satellite-image Synthesis by Jialin Zhu et al
11-04-2021	GraN-GAN: Piecewise Gradient Normalization for Generative Adversarial Networks by Vineeth S. Bhaskara et al
11-05-2021	Interpreting Representation Quality of DNNs for 3D Point Cloud Processing by Wen Shen et al
11-05-2021	Synchronized Smartphone Video Recording System of Depth and RGB Image Frames with Sub-millisecond Precision by Marsel Faizullin et al
11-05-2021	Single Image Deraining Network with Rain Embedding Consistency and Layered LSTM by Yizhou Li et al
11-02-2021	CPSeg: Cluster-free Panoptic Segmentation of 3D LiDAR Point Clouds by Enxu Li et al
11-04-2021	Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples by Kanghyun Choi et al
11-04-2021	Towards dynamic multi-modal phenotyping using chest radiographs and physiological data by Nasir Hayat et al
11-04-2021	Facial Emotion Recognition using Deep Residual Networks in Real-World Environments by Panagiotis Tzirakis et al
11-03-2021	Breast Cancer Classification Using: Pixel Interpolation by Osama Rezq Shahin et al
11-02-2021	WORD: Revisiting Organs Segmentation in the Whole Abdominal Region by Xiangde Luo et al
11-02-2021	Skin Cancer Classification using Inception Network and Transfer Learning by Priscilla Benedetti et al
11-03-2021	FAST: Searching for a Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation by Zhe Chen et al
11-05-2021	Spatial-Temporal Residual Aggregation for High Resolution Video Inpainting by Vishnu Sanjay Ramiya Srinivasan et al
11-05-2021	A Unified Game-Theoretic Interpretation of Adversarial Robustness by Jie Ren et al
11-02-2021	HHP-Net: A light Heteroscedastic neural network for Head Pose estimation with uncertainty by Giorgio Cantarini et al
11-02-2021	Explainable Medical Image Segmentation via Generative Adversarial Networks and Layer-wise Relevance Propagation by Awadelrahman M. A. Ahmed et al
11-02-2021	A Pixel-Level Meta-Learner for Weakly Supervised Few-Shot Semantic Segmentation by Yuan-Hao Lee et al
11-05-2021	Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval by Zhihao Fan et al
11-03-2021	The Klarna Product Page Dataset: A Realistic Benchmark for Web Representation Learning by Alexandra Hotti et al
11-03-2021	Subpixel Heatmap Regression for Facial Landmark Localization by Adrian Bulat et al
11-05-2021	Hepatic vessel segmentation based on 3Dswin-transformer with inductive biased multi-head self-attention by Mian Wu et al
11-03-2021	Improving Pose Estimation through Contextual Activity Fusion by David Poulton et al
11-04-2021	Towards Panoptic 3D Parsing for Single Image in the Wild by Sainan Liu et al
11-04-2021	Online Continual Learning via Multiple Deep Metric Learning and Uncertainty-guided Episodic Memory Replay -- 3rd Place Solution for ICCV 2021 Workshop SSLAD Track 3A Continual Object Classification by Muhammad Rifki Kurniawan et al
11-05-2021	AGPCNet: Attention-Guided Pyramid Context Networks for Infrared Small Target Detection by Tianfang Zhang et al
11-02-2021	Relational Self-Attention: What's Missing in Attention for Video Understanding by Manjin Kim et al
11-03-2021	Video Salient Object Detection via Contrastive Features and Attention Modules by Yi-Wen Chen et al
11-04-2021	The role of MRI physics in brain segmentation CNNs: achieving acquisition invariance and instructive uncertainties by Pedro Borges et al
11-03-2021	Discriminator Synthesis: On reusing the other half of Generative Adversarial Networks by Diego Porres
11-03-2021	A cross-modal fusion network based on self-attention and residual structure for multimodal emotion recognition by Ziwang Fu et al
11-03-2021	Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems by Swarnabja Bhaumik et al
11-04-2021	StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing Synthesis by Peter Schaldenbrand et al
11-05-2021	TermiNeRF: Ray Termination Prediction for Efficient Neural Rendering by Martin Piala et al
11-05-2021	Normalizing Flow as a Flexible Fidelity Objective for Photo-Realistic Super-resolution by Andreas Lugmayr et al
11-02-2021	3-D PET Image Generation with tumour masks using TGAN by Robert V Bergen et al
11-02-2021	A dataset for multi-sensor drone detection by Fredrik Svanström et al
11-03-2021	Deep-Learning-Based Single-Image Height Reconstruction from Very-High-Resolution SAR Intensity Data by Michael Recla et al
11-02-2021	Body Size and Depth Disambiguation in Multi-Person Reconstruction from Single Images by Nicolas Ugrinovic et al
11-04-2021	Skeleton-Split Framework using Spatial Temporal Graph Convolutional Networks for Action Recognition by Motasem Alsawadi et al
11-03-2021	Unified 3D Mesh Recovery of Humans and Animals by Learning Animal Exercise by Kim Youwang et al
11-04-2021	Addressing Multiple Salient Object Detection via Dual-Space Long-Range Dependencies by Bowen Deng et al
11-03-2021	Resampling and super-resolution of hexagonally sampled images using deep learning by Dylan Flaute et al
11-04-2021	Towards Smart Monitored AM: Open Source in-Situ Layer-wise 3D Printing Image Anomaly Detection Using Histograms of Oriented Gradients and a Physics-Based Rendering Engine by Aliaksei Petsiuk et al
11-04-2021	Attention on Classification for Fire Segmentation by Milad Niknejad et al
11-05-2021	The Curious Layperson: Fine-Grained Image Recognition without Expert Labels by Subhabrata Choudhury et al
11-02-2021	Out of distribution detection for skin and malaria images by Muhammad Zaida et al
11-02-2021	A high performance fingerprint liveness detection method based on quality related features by Javier Galbally et al
11-05-2021	Semantic Consistency in Image-to-Image Translation for Unsupervised Domain Adaptation by Stephan Brehm et al
11-03-2021	Beyond PRNU: Learning Robust Device-Specific Fingerprint for Source Camera Identification by Manisha et al
11-04-2021	FEAFA+: An Extended Well-Annotated Dataset for Facial Expression Analysis and 3D Facial Animation by Wei Gan et al
11-02-2021	Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos by Zongmian Li et al
11-05-2021	BBC-Oxford British Sign Language Dataset by Samuel Albanie et al
11-04-2021	Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning by Wenlong Huang et al
11-04-2021	PDBL: Improving Histopathological Tissue Classification with Plug-and-Play Pyramidal Deep-Broad Learning by Jiatai Lin et al
11-05-2021	DriveGuard: Robustification of Automated Driving Systems with Deep Spatio-Temporal Convolutional Autoencoder by Andreas Papachristodoulou et al
11-05-2021	Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers by Yanhong Zeng et al
11-05-2021	Event-based Motion Segmentation by Cascaded Two-Level Multi-Model Fitting by Xiuyuan Lu et al
11-05-2021	Cross Modality 3D Navigation Using Reinforcement Learning and Neural Style Transfer by Cesare Magnetti et al
11-05-2021	Pathological Analysis of Blood Cells Using Deep Learning Techniques by Virender Ranga et al
11-03-2021	VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts by Wenhui Wang et al
11-03-2021	A Comparison of Deep Learning Models for the Prediction of Hand Hygiene Videos by Rashmi Bakshi
11-02-2021	BiosecurID: a multimodal biometric database by Julian Fierrez et al
11-04-2021	TimeMatch: Unsupervised Cross-Region Adaptation by Temporal Shift Estimation by Joachim Nyborg et al
11-02-2021	PatchGame: Learning to Signal Mid-level Patches in Referential Games by Kamal Gupta et al
11-04-2021	Nondestructive Testing of Composite Fibre Materials with Hyperspectral Imaging: Evaluative Studies in the EU H2020 FibreEUse Project by Yijun Yan et al
11-03-2021	LTD: Low Temperature Distillation for Robust Adversarial Training by Erh-Chung Chen et al
11-05-2021	Remote Sensing Image Super-resolution and Object Detection: Benchmark and State of the Art by Yi Wang et al
11-05-2021	KORSAL: Key-point Detection based Online Real-Time Spatio-Temporal Action Localization by Kalana Abeywardena et al
11-03-2021	Learned Image Compression for Machine Perception by Felipe Codevilla et al
11-04-2021	Unsupervised Learning of Compositional Energy Concepts by Yilun Du et al
11-04-2021	A deep ensemble approach to X-ray polarimetry by A. L. Peirson et al
11-02-2021	Deep learning for identification and face, gender, expression recognition under constraints by Ahmad B. Hassanat et al
11-02-2021	Revisiting spatio-temporal layouts for compositional action recognition by Gorjan Radevski et al
11-02-2021	Detect-and-Segment: a Deep Learning Approach to Automate Wound Image Segmentation by Gaetano Scebba et al
11-02-2021	StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN by Min Jin Chong et al
11-03-2021	HS3: Learning with Proper Task Complexity in Hierarchically Supervised Semantic Segmentation by Shubhankar Borse et al
11-02-2021	Trajectory Prediction with Graph-based Dual-scale Context Fusion by Lu Zhang et al
11-03-2021	Roadmap on Signal Processing for Next Generation Measurement Systems by D. K. Iakovidis et al
11-03-2021	Multi-Glimpse Network: A Robust and Efficient Classification Architecture based on Recurrent Downsampled Attention by Sia Huat Tan et al
11-03-2021	Multi-Cue Adaptive Emotion Recognition Network by Willams Costa et al
11-05-2021	Frequency-Aware Physics-Inspired Degradation Model for Real-World Image Super-Resolution by Zhenxing Dong et al
11-04-2021	Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image by Feng Liu et al
11-04-2021	Generalized Radiograph Representation Learning via Cross-supervision between Images and Free-text Radiology Reports by Hong-Yu Zhou et al
11-03-2021	ProSTformer: Pre-trained Progressive Space-Time Self-attention Model for Traffic Flow Forecasting by Xiao Yan et al
11-03-2021	Certainty Volume Prediction for Unsupervised Domain Adaptation by Tobias Ringwald et al
11-04-2021	Bootstrap Your Object Detector via Mixed Training by Mengde Xu et al
11-02-2021	Personalized One-Shot Lipreading for an ALS Patient by Bipasha Sen et al
11-02-2021	LogAvgExp Provides a Principled and Performant Global Pooling Operator by Scott C. Lowe et al
11-04-2021	A semi-automatic ultrasound image analysis system for the grading diagnosis of COVID-19 pneumonia by Yuanyuan Wang et al
11-03-2021	Automatic ultrasound vessel segmentation with deep spatiotemporal context learning by Baichuan Jiang et al
11-04-2021	Testing using Privileged Information by Adapting Features with Statistical Dependence by Kwang In Kim et al
11-04-2021	Stable and Compact Face Recognition via Unlabeled Data Driven Sparse Representation-Based Classification by Xiaohui Yang et al
11-05-2021	Sampling Equivariant Self-attention Networks for Object Detection in Aerial Images by Guo-Ye Yang et al
11-05-2021	Solving Traffic4Cast Competition with U-Net and Temporal Domain Adaptation by Vsevolod Konyakhin et al
11-04-2021	Unsupervised Change Detection of Extreme Events Using ML On-Board by Vít Růžička et al
11-04-2021	MixSiam: A Mixture-based Approach to Self-supervised Representation Learning by Xiaoyang Guo et al
11-04-2021	Temporal Fusion Based Multi-scale Semantic Segmentation for Detecting Concealed Baggage Threats by Muhammed Shafay et al
11-03-2021	Building Damage Mapping with Self-Positive Unlabeled Learning by Junshi Xia et al
11-04-2021	Fast Camouflaged Object Detection via Edge-based Reversible Re-calibration Network by Ge-Peng Ji et al
11-03-2021	FaceQvec: Vector Quality Assessment for Face Biometrics based on ISO Compliance by Javier Hernandez-Ortega et al
11-03-2021	Influence of image noise on crack detection performance of deep convolutional neural networks by Riccardo Chianese et al
11-03-2021	Dual Progressive Prototype Network for Generalized Zero-Shot Learning by Chaoqun Wang et al
11-03-2021	Slapping Cats, Bopping Heads, and Oreo Shakes: Understanding Indicators of Virality in TikTok Short Videos by Chen Ling et al
11-05-2021	Visualizing the Emergence of Intermediate Visual Patterns in DNNs by Mingjie Li et al
11-04-2021	EditGAN: High-Precision Semantic Image Editing by Huan Ling et al
11-02-2021	Fitness Landscape Footprint: A Framework to Compare Neural Architecture Search Problems by Kalifou René Traoré et al
11-02-2021	ISP-Agnostic Image Reconstruction for Under-Display Cameras by Miao Qi et al
11-05-2021	Segmentation of 2D Brain MR Images by Angad Ripudaman Singh Bajwa
11-03-2021	On the Frequency Bias of Generative Models by Katja Schwarz et al
11-03-2021	LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs by Christoph Schuhmann et al
11-03-2021	Panoptic 3D Scene Reconstruction From a Single RGB Image by Manuel Dahnert et al
11-04-2021	Technical Report: Disentangled Action Parsing Networks for Accurate Part-level Action Parsing by Xuanhan Wang et al
11-05-2021	MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry by Joan P. Company-Corcoles et al
11-05-2021	A bone suppression model ensemble to improve COVID-19 detection in chest X-rays by Sivaramakrishnan Rajaraman et al
11-04-2021	Deep Learning Methods for Daily Wildfire Danger Forecasting by Ioannis Prapas et al
11-03-2021	Partial supervision for the FeTA challenge 2021 by Lucas Fidon et al
11-02-2021	MixFace: Improving Face Verification Focusing on Fine-grained Conditions by Junuk Jung et al
11-02-2021	Meta-Learning the Search Distribution of Black-Box Random Search Based Adversarial Attacks by Maksym Yatsura et al
11-02-2021	Absolute distance prediction based on deep learning object detection and monocular depth estimation models by Armin Masoumian et al
11-02-2021	Human Attention in Fine-grained Classification by Yao Rong et al
11-02-2021	A Tri-attention Fusion Guided Multi-modal Segmentation Network by Tongxue Zhou et al
11-02-2021	Boundary Distribution Estimation to Precise Object Detection by Haoran Zhou et al
11-05-2021	FBNet: Feature Balance Network for Urban-Scene Segmentation by Lei Gan et al
11-04-2021	Extended Abstract Version: CNN-based Human Detection System for UAVs in Search and Rescue by Nikite Mesvan
11-05-2021	Recognizing Vector Graphics without Rasterization by Xinyang Jiang et al
11-04-2021	Tea Chrysanthemum Detection under Unstructured Environments Using the TC-YOLO Model by Chao Qi et al
11-03-2021	Rethinking the Image Feature Biases Exhibited by Deep CNN Models by Dawei Dai et al
11-04-2021	When Neural Networks Using Different Sensors Create Similar Features by Hugues Moreau et al
11-03-2021	Understanding Cross Domain Presentation Attack Detection for Visible Face Recognition by Jennifer Hamblin et al
11-04-2021	Multi-Spectral Multi-Image Super-Resolution of Sentinel-2 with Radiometric Consistency Losses and Its Effect on Building Delineation by Muhammed Razzak et al
11-03-2021	ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle by Amr Gomaa et al
11-05-2021	Structure-aware Image Inpainting with Two Parallel Streams by Zhilin Huang et al
11-02-2021	Adversarially Perturbed Wavelet-based Morphed Face Generation by Kelsey O'Haire et al
11-03-2021	Categorical Difference and Related Brain Regions of the Attentional Blink Effect by Renzhou Gui et al
11-03-2021	Recent Advancements in Self-Supervised Paradigms for Visual Feature Representation by Mrinal Anand et al
11-03-2021	An Entropy-guided Reinforced Partial Convolutional Network for Zero-Shot Learning by Yun Li et al
11-03-2021	Efficient 3D Deep LiDAR Odometry by Guangming Wang et al

Craig Smith, November 8, 2021