2021.11.8 Vision papers

 

11-05-2021

SSA: Semantic Structure Aware Inference for Weakly Pixel-Wise Dense Predictions without Cost
by Yanpeng Sun et al

11-05-2021

Learning of Frequency-Time Attention Mechanism for Automatic Modulation Recognition
by Shangao Lin et al

11-05-2021

Edge Tracing using Gaussian Process Regression
by Jamie Burke et al

11-04-2021

Multi-scale 2D Representation Learning for weakly-supervised moment retrieval
by Ding Li et al

11-04-2021

LVIS Challenge Track Technical Report 1st Place Solution: Distribution Balanced and Boundary Refinement for Large Vocabulary Instance Segmentation
by WeiFu Fu et al

11-02-2021

A Critical Study on the Recent Deep Learning Based Semi-Supervised Video Anomaly Detection Methods
by Mohammad Baradaran et al

11-02-2021

PolyTrack: Tracking with Bounding Polygons
by Gaspar Faure et al

11-03-2021

Deep Point Set Resampling via Gradient Fields
by Haolan Chen et al

11-03-2021

Sequence-to-Sequence Modeling for Action Identification at High Temporal Resolution
by Aakash Kaku et al

11-03-2021

An Empirical Study of Training End-to-End Vision-and-Language Transformers
by Zi-Yi Dou et al

11-05-2021

A Deep Learning Generative Model Approach for Image Synthesis of Plant Leaves
by Alessandro Benfenati et al

11-05-2021

Versatile Learned Video Compression
by Runsen Feng et al

11-05-2021

Seamless Satellite-image Synthesis
by Jialin Zhu et al

11-04-2021

GraN-GAN: Piecewise Gradient Normalization for Generative Adversarial Networks
by Vineeth S. Bhaskara et al

11-05-2021

Interpreting Representation Quality of DNNs for 3D Point Cloud Processing
by Wen Shen et al

11-05-2021

Synchronized Smartphone Video Recording System of Depth and RGB Image Frames with Sub-millisecond Precision
by Marsel Faizullin et al

11-05-2021

Single Image Deraining Network with Rain Embedding Consistency and Layered LSTM
by Yizhou Li et al

11-02-2021

CPSeg: Cluster-free Panoptic Segmentation of 3D LiDAR Point Clouds
by Enxu Li et al

11-04-2021

Qimera: Data-free Quantization with Synthetic Boundary Supporting Samples
by Kanghyun Choi et al

11-04-2021

Towards dynamic multi-modal phenotyping using chest radiographs and physiological data
by Nasir Hayat et al

11-04-2021

Facial Emotion Recognition using Deep Residual Networks in Real-World Environments
by Panagiotis Tzirakis et al

11-03-2021

Breast Cancer Classification Using: Pixel Interpolation
by Osama Rezq Shahin et al

11-02-2021

WORD: Revisiting Organs Segmentation in the Whole Abdominal Region
by Xiangde Luo et al

11-02-2021

Skin Cancer Classification using Inception Network and Transfer Learning
by Priscilla Benedetti et al

11-03-2021

FAST: Searching for a Faster Arbitrarily-Shaped Text Detector with Minimalist Kernel Representation
by Zhe Chen et al

11-05-2021

Spatial-Temporal Residual Aggregation for High Resolution Video Inpainting
by Vishnu Sanjay Ramiya Srinivasan et al

11-05-2021

A Unified Game-Theoretic Interpretation of Adversarial Robustness
by Jie Ren et al

11-02-2021

HHP-Net: A light Heteroscedastic neural network for Head Pose estimation with uncertainty
by Giorgio Cantarini et al

11-02-2021

Explainable Medical Image Segmentation via Generative Adversarial Networks and Layer-wise Relevance Propagation
by Awadelrahman M. A. Ahmed et al

11-02-2021

A Pixel-Level Meta-Learner for Weakly Supervised Few-Shot Semantic Segmentation
by Yuan-Hao Lee et al

11-05-2021

Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval
by Zhihao Fan et al

11-03-2021

The Klarna Product Page Dataset: A Realistic Benchmark for Web Representation Learning
by Alexandra Hotti et al

11-03-2021

Subpixel Heatmap Regression for Facial Landmark Localization
by Adrian Bulat et al

11-05-2021

Hepatic vessel segmentation based on 3Dswin-transformer with inductive biased multi-head self-attention
by Mian Wu et al

11-03-2021

Improving Pose Estimation through Contextual Activity Fusion
by David Poulton et al

11-04-2021

Towards Panoptic 3D Parsing for Single Image in the Wild
by Sainan Liu et al

11-04-2021

Online Continual Learning via Multiple Deep Metric Learning and Uncertainty-guided Episodic Memory Replay -- 3rd Place Solution for ICCV 2021 Workshop SSLAD Track 3A Continual Object Classification
by Muhammad Rifki Kurniawan et al

11-05-2021

AGPCNet: Attention-Guided Pyramid Context Networks for Infrared Small Target Detection
by Tianfang Zhang et al

11-02-2021

Relational Self-Attention: What's Missing in Attention for Video Understanding
by Manjin Kim et al

11-03-2021

Video Salient Object Detection via Contrastive Features and Attention Modules
by Yi-Wen Chen et al

11-04-2021

The role of MRI physics in brain segmentation CNNs: achieving acquisition invariance and instructive uncertainties
by Pedro Borges et al

11-03-2021

Discriminator Synthesis: On reusing the other half of Generative Adversarial Networks
by Diego Porres

11-03-2021

A cross-modal fusion network based on self-attention and residual structure for multimodal emotion recognition
by Ziwang Fu et al

11-03-2021

Event and Activity Recognition in Video Surveillance for Cyber-Physical Systems
by Swarnabja Bhaumik et al

11-04-2021

StyleCLIPDraw: Coupling Content and Style in Text-to-Drawing Synthesis
by Peter Schaldenbrand et al

11-05-2021

TermiNeRF: Ray Termination Prediction for Efficient Neural Rendering
by Martin Piala et al

11-05-2021

Normalizing Flow as a Flexible Fidelity Objective for Photo-Realistic Super-resolution
by Andreas Lugmayr et al

11-02-2021

3-D PET Image Generation with tumour masks using TGAN
by Robert V Bergen et al

11-02-2021

A dataset for multi-sensor drone detection
by Fredrik Svanström et al

11-03-2021

Deep-Learning-Based Single-Image Height Reconstruction from Very-High-Resolution SAR Intensity Data
by Michael Recla et al

11-02-2021

Body Size and Depth Disambiguation in Multi-Person Reconstruction from Single Images
by Nicolas Ugrinovic et al

11-04-2021

Skeleton-Split Framework using Spatial Temporal Graph Convolutional Networks for Action Recognition
by Motasem Alsawadi et al

11-03-2021

Unified 3D Mesh Recovery of Humans and Animals by Learning Animal Exercise
by Kim Youwang et al

11-04-2021

Addressing Multiple Salient Object Detection via Dual-Space Long-Range Dependencies
by Bowen Deng et al

11-03-2021

Resampling and super-resolution of hexagonally sampled images using deep learning
by Dylan Flaute et al

11-04-2021

Towards Smart Monitored AM: Open Source in-Situ Layer-wise 3D Printing Image Anomaly Detection Using Histograms of Oriented Gradients and a Physics-Based Rendering Engine
by Aliaksei Petsiuk et al

11-04-2021

Attention on Classification for Fire Segmentation
by Milad Niknejad et al

11-05-2021

The Curious Layperson: Fine-Grained Image Recognition without Expert Labels
by Subhabrata Choudhury et al

11-02-2021

Out of distribution detection for skin and malaria images
by Muhammad Zaida et al

11-02-2021

A high performance fingerprint liveness detection method based on quality related features
by Javier Galbally et al

11-05-2021

Semantic Consistency in Image-to-Image Translation for Unsupervised Domain Adaptation
by Stephan Brehm et al

11-03-2021

Beyond PRNU: Learning Robust Device-Specific Fingerprint for Source Camera Identification
by Manisha et al

11-04-2021

FEAFA+: An Extended Well-Annotated Dataset for Facial Expression Analysis and 3D Facial Animation
by Wei Gan et al

11-02-2021

Estimating 3D Motion and Forces of Human-Object Interactions from Internet Videos
by Zongmian Li et al

11-05-2021

BBC-Oxford British Sign Language Dataset
by Samuel Albanie et al

11-04-2021

Generalization in Dexterous Manipulation via Geometry-Aware Multi-Task Learning
by Wenlong Huang et al

11-04-2021

PDBL: Improving Histopathological Tissue Classification with Plug-and-Play Pyramidal Deep-Broad Learning
by Jiatai Lin et al

11-05-2021

DriveGuard: Robustification of Automated Driving Systems with Deep Spatio-Temporal Convolutional Autoencoder
by Andreas Papachristodoulou et al

11-05-2021

Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers
by Yanhong Zeng et al

11-05-2021

Event-based Motion Segmentation by Cascaded Two-Level Multi-Model Fitting
by Xiuyuan Lu et al

11-05-2021

Cross Modality 3D Navigation Using Reinforcement Learning and Neural Style Transfer
by Cesare Magnetti et al

11-05-2021

Pathological Analysis of Blood Cells Using Deep Learning Techniques
by Virender Ranga et al

11-03-2021

VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts
by Wenhui Wang et al

11-03-2021

A Comparison of Deep Learning Models for the Prediction of Hand Hygiene Videos
by Rashmi Bakshi

11-02-2021

BiosecurID: a multimodal biometric database
by Julian Fierrez et al

11-04-2021

TimeMatch: Unsupervised Cross-Region Adaptation by Temporal Shift Estimation
by Joachim Nyborg et al

11-02-2021

PatchGame: Learning to Signal Mid-level Patches in Referential Games
by Kamal Gupta et al

11-04-2021

Nondestructive Testing of Composite Fibre Materials with Hyperspectral Imaging: Evaluative Studies in the EU H2020 FibreEUse Project
by Yijun Yan et al

11-03-2021

LTD: Low Temperature Distillation for Robust Adversarial Training
by Erh-Chung Chen et al

11-05-2021

Remote Sensing Image Super-resolution and Object Detection: Benchmark and State of the Art
by Yi Wang et al

11-05-2021

KORSAL: Key-point Detection based Online Real-Time Spatio-Temporal Action Localization
by Kalana Abeywardena et al

11-03-2021

Learned Image Compression for Machine Perception
by Felipe Codevilla et al

11-04-2021

Unsupervised Learning of Compositional Energy Concepts
by Yilun Du et al

11-04-2021

A deep ensemble approach to X-ray polarimetry
by A. L. Peirson et al

11-02-2021

Deep learning for identification and face, gender, expression recognition under constraints
by Ahmad B. Hassanat et al

11-02-2021

Revisiting spatio-temporal layouts for compositional action recognition
by Gorjan Radevski et al

11-02-2021

Detect-and-Segment: a Deep Learning Approach to Automate Wound Image Segmentation
by Gaetano Scebba et al

11-02-2021

StyleGAN of All Trades: Image Manipulation with Only Pretrained StyleGAN
by Min Jin Chong et al

11-03-2021

HS3: Learning with Proper Task Complexity in Hierarchically Supervised Semantic Segmentation
by Shubhankar Borse et al

11-02-2021

Trajectory Prediction with Graph-based Dual-scale Context Fusion
by Lu Zhang et al

11-03-2021

Roadmap on Signal Processing for Next Generation Measurement Systems
by D. K. Iakovidis et al

11-03-2021

Multi-Glimpse Network: A Robust and Efficient Classification Architecture based on Recurrent Downsampled Attention
by Sia Huat Tan et al

11-03-2021

Multi-Cue Adaptive Emotion Recognition Network
by Willams Costa et al

11-05-2021

Frequency-Aware Physics-Inspired Degradation Model for Real-World Image Super-Resolution
by Zhenxing Dong et al

11-04-2021

Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image
by Feng Liu et al

11-04-2021

Generalized Radiograph Representation Learning via Cross-supervision between Images and Free-text Radiology Reports
by Hong-Yu Zhou et al

11-03-2021

ProSTformer: Pre-trained Progressive Space-Time Self-attention Model for Traffic Flow Forecasting
by Xiao Yan et al

11-03-2021

Certainty Volume Prediction for Unsupervised Domain Adaptation
by Tobias Ringwald et al

11-04-2021

Bootstrap Your Object Detector via Mixed Training
by Mengde Xu et al

11-02-2021

Personalized One-Shot Lipreading for an ALS Patient
by Bipasha Sen et al

11-02-2021

LogAvgExp Provides a Principled and Performant Global Pooling Operator
by Scott C. Lowe et al

11-04-2021

A semi-automatic ultrasound image analysis system for the grading diagnosis of COVID-19 pneumonia
by Yuanyuan Wang et al

11-03-2021

Automatic ultrasound vessel segmentation with deep spatiotemporal context learning
by Baichuan Jiang et al

11-04-2021

Testing using Privileged Information by Adapting Features with Statistical Dependence
by Kwang In Kim et al

11-04-2021

Stable and Compact Face Recognition via Unlabeled Data Driven Sparse Representation-Based Classification
by Xiaohui Yang et al

11-05-2021

Sampling Equivariant Self-attention Networks for Object Detection in Aerial Images
by Guo-Ye Yang et al

11-05-2021

Solving Traffic4Cast Competition with U-Net and Temporal Domain Adaptation
by Vsevolod Konyakhin et al

11-04-2021

Unsupervised Change Detection of Extreme Events Using ML On-Board
by Vít Růžička et al

11-04-2021

MixSiam: A Mixture-based Approach to Self-supervised Representation Learning
by Xiaoyang Guo et al

11-04-2021

Temporal Fusion Based Multi-scale Semantic Segmentation for Detecting Concealed Baggage Threats
by Muhammed Shafay et al

11-03-2021

Building Damage Mapping with Self-Positive Unlabeled Learning
by Junshi Xia et al

11-04-2021

Fast Camouflaged Object Detection via Edge-based Reversible Re-calibration Network
by Ge-Peng Ji et al

11-03-2021

FaceQvec: Vector Quality Assessment for Face Biometrics based on ISO Compliance
by Javier Hernandez-Ortega et al

11-03-2021

Influence of image noise on crack detection performance of deep convolutional neural networks
by Riccardo Chianese et al

11-03-2021

Dual Progressive Prototype Network for Generalized Zero-Shot Learning
by Chaoqun Wang et al

11-03-2021

Slapping Cats, Bopping Heads, and Oreo Shakes: Understanding Indicators of Virality in TikTok Short Videos
by Chen Ling et al

11-05-2021

Visualizing the Emergence of Intermediate Visual Patterns in DNNs
by Mingjie Li et al

11-04-2021

EditGAN: High-Precision Semantic Image Editing
by Huan Ling et al

11-02-2021

Fitness Landscape Footprint: A Framework to Compare Neural Architecture Search Problems
by Kalifou René Traoré et al

11-02-2021

ISP-Agnostic Image Reconstruction for Under-Display Cameras
by Miao Qi et al

11-05-2021

Segmentation of 2D Brain MR Images
by Angad Ripudaman Singh Bajwa

11-03-2021

On the Frequency Bias of Generative Models
by Katja Schwarz et al

11-03-2021

LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs
by Christoph Schuhmann et al

11-03-2021

Panoptic 3D Scene Reconstruction From a Single RGB Image
by Manuel Dahnert et al

11-04-2021

Technical Report: Disentangled Action Parsing Networks for Accurate Part-level Action Parsing
by Xuanhan Wang et al

11-05-2021

MSC-VO: Exploiting Manhattan and Structural Constraints for Visual Odometry
by Joan P. Company-Corcoles et al

11-05-2021

A bone suppression model ensemble to improve COVID-19 detection in chest X-rays
by Sivaramakrishnan Rajaraman et al

11-04-2021

Deep Learning Methods for Daily Wildfire Danger Forecasting
by Ioannis Prapas et al

11-03-2021

Partial supervision for the FeTA challenge 2021
by Lucas Fidon et al

11-02-2021

MixFace: Improving Face Verification Focusing on Fine-grained Conditions
by Junuk Jung et al

11-02-2021

Meta-Learning the Search Distribution of Black-Box Random Search Based Adversarial Attacks
by Maksym Yatsura et al

11-02-2021

Absolute distance prediction based on deep learning object detection and monocular depth estimation models
by Armin Masoumian et al

11-02-2021

Human Attention in Fine-grained Classification
by Yao Rong et al

11-02-2021

A Tri-attention Fusion Guided Multi-modal Segmentation Network
by Tongxue Zhou et al

11-02-2021

Boundary Distribution Estimation to Precise Object Detection
by Haoran Zhou et al

11-05-2021

FBNet: Feature Balance Network for Urban-Scene Segmentation
by Lei Gan et al

11-04-2021

Extended Abstract Version: CNN-based Human Detection System for UAVs in Search and Rescue
by Nikite Mesvan

11-05-2021

Recognizing Vector Graphics without Rasterization
by Xinyang Jiang et al

11-04-2021

Tea Chrysanthemum Detection under Unstructured Environments Using the TC-YOLO Model
by Chao Qi et al

11-03-2021

Rethinking the Image Feature Biases Exhibited by Deep CNN Models
by Dawei Dai et al

11-04-2021

When Neural Networks Using Different Sensors Create Similar Features
by Hugues Moreau et al

11-03-2021

Understanding Cross Domain Presentation Attack Detection for Visible Face Recognition
by Jennifer Hamblin et al

11-04-2021

Multi-Spectral Multi-Image Super-Resolution of Sentinel-2 with Radiometric Consistency Losses and Its Effect on Building Delineation
by Muhammed Razzak et al

11-03-2021

ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle
by Amr Gomaa et al

11-05-2021

Structure-aware Image Inpainting with Two Parallel Streams
by Zhilin Huang et al

11-02-2021

Adversarially Perturbed Wavelet-based Morphed Face Generation
by Kelsey O'Haire et al

11-03-2021

Categorical Difference and Related Brain Regions of the Attentional Blink Effect
by Renzhou Gui et al

11-03-2021

Recent Advancements in Self-Supervised Paradigms for Visual Feature Representation
by Mrinal Anand et al

11-03-2021

An Entropy-guided Reinforced Partial Convolutional Network for Zero-Shot Learning
by Yun Li et al

11-03-2021

Efficient 3D Deep LiDAR Odometry
by Guangming Wang et al

 
Craig Smith