2021.12.13 Vision papers

 

12-09-2021

Plenoxels: Radiance Fields without Neural Networks
by Alex Yu et al

12-08-2021

InvGAN: Invertible GANs
by Partha Ghosh et al

12-09-2021

GAN-Supervised Dense Visual Alignment
by William Peebles et al

12-09-2021

Extending the WILDS Benchmark for Unsupervised Adaptation
by Shiori Sagawa et al

12-07-2021

Grounded Language-Image Pre-training
by Liunian Harold Li et al

12-09-2021

Multimodal Conditional Image Synthesis with Product-of-Experts GANs
by Xun Huang et al

12-07-2021

CMA-CLIP: Cross-Modality Attention CLIP for Image-Text Classification
by Huidong Liu et al

12-08-2021

MLP Architectures for Vision-and-Language Modeling: An Empirical Study
by Yixin Nie et al

12-08-2021

FLAVA: A Foundational Language And Vision Alignment Model
by Amanpreet Singh et al

12-07-2021

Ref-NeRF: Structured View-Dependent Appearance for Neural Radiance Fields
by Dor Verbin et al

12-09-2021

HairCLIP: Design Your Hair by Text and Reference Image
by Tianyi Wei et al

12-08-2021

Everything at Once -- Multi-modal Fusion Transformer for Video Retrieval
by Nina Shvetsova et al

12-09-2021

CLIP-NeRF: Text-and-Image Driven Manipulation of Neural Radiance Fields
by Can Wang et al

12-08-2021

Whats Behind the Couch? Directed Ray Distance Functions (DRDF) for 3D Scene Reconstruction
by Nilesh Kulkarni et al

12-09-2021

Neural Radiance Fields for Outdoor Scene Relighting
by Viktor Rudnev et al

12-09-2021

PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning
by Yining Hong et al

12-09-2021

Fast Point Transformer
by Chunghyun Park et al

12-08-2021

Tracking People by Predicting 3D Appearance, Location & Pose
by Jathushan Rajasegaran et al

12-08-2021

Prompting Visual-Language Models for Efficient Video Understanding
by Chen Ju et al

12-08-2021

Contrastive Instruction-Trajectory Learning for Vision-Language Navigation
by Xiwen Liang et al

12-09-2021

Self-Supervised Image-to-Text and Text-to-Image Synthesis
by Anindya Sundar Das et al

12-09-2021

A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer
by Weijia Wu et al

12-08-2021

Do Pedestrians Pay Attention? Eye Contact Detection in the Wild
by Younes Belkada et al

12-09-2021

Neural Descriptor Fields: SE(3)-Equivariant Object Representations for Manipulation
by Anthony Simeonov et al

12-08-2021

BACON: Band-limited Coordinate Networks for Multiscale Scene Representation
by David B. Lindell et al

12-09-2021

Latent Space Explanation by Intervention
by Itai Gat et al

12-08-2021

Symmetry Perception by Deep Networks: Inadequacy of Feed-Forward Architectures and Improvements with Recurrent Connections
by Shobhita Sundaram et al

12-07-2021

Activation to Saliency: Forming High-Quality Labels for Unsupervised Salient Object Detection
by Huajun Zhou et al

12-08-2021

Adverse Weather Image Translation with Asymmetric and Uncertainty-aware GAN
by Jeong-gi Kwak et al

12-09-2021

Semi-Supervised Medical Image Segmentation via Cross Teaching between CNN and Transformer
by Xiangde Luo et al

12-10-2021

CityNeRF: Building NeRF at City Scale
by Yuanbo Xiangli et al

12-08-2021

Exploring Temporal Granularity in Self-Supervised Video Representation Learning
by Rui Qian et al

12-09-2021

PixMix: Dreamlike Pictures Comprehensively Improve Safety Measures
by Dan Hendrycks et al

12-07-2021

A Survey on Intrinsic Images: Delving Deep Into Lambert and Beyond
by Elena Garces et al

12-08-2021

DualFormer: Local-Global Stratified Transformer for Efficient Video Recognition
by Yuxuan Liang et al

12-08-2021

A Hierarchical Spatio-Temporal Graph Convolutional Neural Network for Anomaly Detection in Videos
by Xianlin Zeng et al

12-08-2021

Shortest Paths in Graphs with Matrix-Valued Edges: Concepts, Algorithm and Application to 3D Multi-Shape Analysis
by Viktoria Ehm et al

12-08-2021

Revisiting Contrastive Learning through the Lens of Neighborhood Component Analysis: an Integrated Framework
by Ching-Yun Ko et al

12-09-2021

Evaluating saliency methods on artificial data with different background types
by Céline Budding et al

12-08-2021

Feature Statistics Mixing Regularization for Generative Adversarial Networks
by Junho Kim et al

12-07-2021

Evaluating Generic Auto-ML Tools for Computational Pathology
by Lars Ole Schwen et al

12-09-2021

Does Redundancy in AI Perception Systems Help to Test for Super-Human Automated Driving Performance?
by Hanno Gottschalk et al

12-08-2021

Come-Closer-Diffuse-Faster: Accelerating Conditional Diffusion Models for Inverse Problems through Stochastic Contraction
by Hyungjin Chung et al

12-08-2021

SNEAK: Synonymous Sentences-Aware Adversarial Attack on Natural Language Video Localization
by Wenbo Gou et al

12-07-2021

Parallel Discrete Convolutions on Adaptive Particle Representations of Images
by Joel Jonsson et al

12-09-2021

Generating Useful Accident-Prone Driving Scenarios via a Learned Traffic Prior
by Davis Rempe et al

12-09-2021

BLT: Bidirectional Layout Transformer for Controllable Layout Generation
by Xiang Kong et al

12-09-2021

MAGMA -- Multimodal Augmentation of Generative Models through Adapter-based Finetuning
by Constantin Eichenberg et al

12-07-2021

Bootstrapping ViTs: Towards Liberating Vision Transformers from Pre-training
by Haofei Zhang et al

12-08-2021

Self-Supervised Models are Continual Learners
by Enrico Fini et al

12-10-2021

UNIST: Unpaired Neural Implicit Shape Translation Network
by Qimin Chen et al

12-09-2021

Critical configurations for two projective views, a new approach
by Martin Bråtelund

12-08-2021

Geometry-Guided Progressive NeRF for Generalizable and Efficient Neural Human Rendering
by Mingfei Chen et al

12-07-2021

Unsupervised Representation Learning via Neural Activation Coding
by Yookoon Park et al

12-08-2021

CoSSL: Co-Learning of Representation and Classifier for Imbalanced Semi-Supervised Learning
by Yue Fan et al

12-07-2021

Nuclei Segmentation in Histopathology Images using Deep Learning with Local and Global Views
by Mahdi Arab Loodaricheh et al

12-08-2021

Transformer-Based Approach for Joint Handwriting and Named Entity Recognition in Historical documents
by Ahmed Cheikh Rouhoua et al

12-09-2021

DVHN: A Deep Hashing Framework for Large-scale Vehicle Re-identification
by Yongbiao Chen et al

12-08-2021

Trajectory-Constrained Deep Latent Visual Attention for Improved Local Planning in Presence of Heterogeneous Terrain
by Stefan Wapnick et al

12-09-2021

Locally Shifted Attention With Early Global Integration
by Shelly Sheynin et al

12-10-2021

HeadNeRF: A Real-time NeRF-based Parametric Head Model
by Yang Hong et al

12-08-2021

Audio-Visual Synchronisation in the wild
by Honglie Chen et al

12-09-2021

Auto-X3D: Ultra-Efficient Video Understanding via Finer-Grained Neural Architecture Search
by Yifan Jiang et al

12-09-2021

Superpixel-Based Building Damage Detection from Post-earthquake Very High Resolution Imagery Using Deep Neural Networks
by Jun Wang et al

12-09-2021

FaceFormer: Speech-Driven 3D Facial Animation with Transformers
by Yingruo Fan et al

12-08-2021

Reverse image filtering using total derivative approximation and accelerated gradient descent
by Fernando J. Galetto et al

12-09-2021

CLIP2StyleGAN: Unsupervised Extraction of StyleGAN Edit Directions
by Rameen Abdal et al

12-07-2021

Low-rank Tensor Decomposition for Compression of Convolutional Neural Networks Using Funnel Regularization
by Bo-Shiuan Chu et al

12-08-2021

Topology-aware Convolutional Neural Network for Efficient Skeleton-based Action Recognition
by Kailin Xu et al

12-10-2021

More Control for Free! Image Synthesis with Semantic Diffusion Guidance
by Xihui Liu et al

12-09-2021

RamBoAttack: A Robust Query Efficient Deep Neural Network Decision Exploit
by Viet Quoc Vo et al

12-08-2021

Neural Points: Point Cloud Representation with Neural Fields
by Wanquan Feng et al

12-09-2021

Mimicking the Oracle: An Initial Phase Decorrelation Approach for Class Incremental Learning
by Yujun Shi et al

12-08-2021

Contrastive Learning with Large Memory Bank and Negative Embedding Subtraction for Accurate Copy Detection
by Shuhei Yokoo

12-07-2021

Gaussian map predictions for 3D surface feature localisation and counting
by Justin Le Louëdec et al

12-09-2021

One-dimensional Deep Low-rank and Sparse Network for Accelerated MRI
by Zi Wang et al

12-09-2021

Mutual Adversarial Training: Learning together is better than going alone
by Jiang Liu et al

12-09-2021

Searching Parameterized AP Loss for Object Detection
by Chenxin Tao et al

12-07-2021

Time-Equivariant Contrastive Video Representation Learning
by Simon Jenni et al

12-08-2021

Binary Change Guided Hyperspectral Multiclass Change Detection
by Meiqi Hu et al

12-08-2021

Learn2Reg: comprehensive multi-task medical image registration challenge, dataset and evaluation in the era of deep learning
by Alessa Hering et al

12-09-2021

Model Doctor: A Simple Gradient Aggregation Strategy for Diagnosing and Treating CNN Classifiers
by Zunlei Feng et al

12-07-2021

Fully Attentional Network for Semantic Segmentation
by Qi Song et al

12-07-2021

A Contrastive Distillation Approach for Incremental Semantic Segmentation in Aerial Images
by Edoardo Arnaudo et al

12-09-2021

Injecting Semantic Concepts into End-to-End Image Captioning
by Zhiyuan Fang et al

12-10-2021

Couplformer:Rethinking Vision Transformer with Coupling Attention Map
by Hai Lan et al

12-08-2021

VISOLO: Grid-Based Space-Time Aggregation for Efficient Online Video Instance Segmentation
by Su Ho Han et al

12-08-2021

Boosting Contrastive Learning with Relation Knowledge Distillation
by Kai Zheng et al

12-08-2021

Burn After Reading: Online Adaptation for Cross-domain Streaming Data
by Luyu Yang et al

12-09-2021

Adaptive Methods for Aggregated Domain Generalization
by Xavier Thomas et al

12-09-2021

3D-VField: Learning to Adversarially Deform Point Clouds for Robust 3D Object Detection
by Alexander Lehner et al

12-09-2021

Exploring the Equivalence of Siamese Self-Supervised Learning via A Unified Gradient Framework
by Chenxin Tao et al

12-08-2021

Assessing a Single Image in Reference-Guided Image Synthesis
by Jiayi Guo et al

12-07-2021

DeepFace-EMD: Re-ranking Using Patch-wise Earth Movers Distance Improves Out-Of-Distribution Face Identification
by Hai Phan et al

12-08-2021

Progressive Multi-stage Interactive Training in Mobile Network for Fine-grained Recognition
by Zhenxin Wu et al

12-09-2021

Learning with Nested Scene Modeling and Cooperative Architecture Search for Low-Light Vision
by Risheng Liu et al

12-08-2021

Garment4D: Garment Reconstruction from Point Cloud Sequences
by Fangzhou Hong et al

12-08-2021

BA-Net: Bridge Attention for Deep Convolutional Neural Networks
by Yue Zhao et al

12-09-2021

Explainability of the Implications of Supervised and Unsupervised Face Image Quality Estimations Through Activation Map Variation Analyses in Face Recognition Models
by Biying Fu et al

12-08-2021

SoK: Anti-Facial Recognition Technology
by Emily Wenger et al

12-08-2021

Enhancing Food Intake Tracking in Long-Term Care with Automated Food Imaging and Nutrient Intake Tracking (AFINI-T) Technology
by Kaylen J. Pfisterer et al

12-08-2021

A Unified Architecture of Semantic Segmentation and Hierarchical Generative Adversarial Networks for Expression Manipulation
by Rumeysa Bodur et al

12-09-2021

Amicable Aid: Turning Adversarial Attack to Benefit Classification
by Juyeop Kim et al

12-09-2021

Self-Supervised Keypoint Discovery in Behavioral Videos
by Jennifer J. Sun et al

12-07-2021

GPCO: An Unsupervised Green Point Cloud Odometry Method
by Pranav Kadam et al

12-07-2021

Generation of Non-Deterministic Synthetic Face Datasets Guided by Identity Priors
by Marcel Grimmer et al

12-07-2021

Fully Context-Aware Image Inpainting with a Learned Semantic Pyramid
by Wendong Zhang et al

12-08-2021

Implicit Neural Representations for Image Compression
by Yannick Strümpler et al

12-10-2021

Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation
by Tianyi Liu et al

12-08-2021

GCA-Net : Utilizing Gated Context Attention for Improving Image Forgery Localization and Detection
by Sowmen Das et al

12-09-2021

Robust Weakly Supervised Learning for COVID-19 Recognition Using Multi-Center CT Images
by Qinghao Ye et al

12-08-2021

Unimodal Face Classification with Multimodal Training
by Wenbin Teng et al

12-08-2021

Transformaly -- Two (Feature Spaces) Are Better Than One
by Matan Jacob Cohen et al

12-10-2021

VUT: Versatile UI Transformer for Multi-Modal Multi-Task User Interface Modeling
by Yang Li et al

12-07-2021

CG-NeRF: Conditional Generative Neural Radiance Fields
by Kyungmin Jo et al

12-08-2021

A novel multi-view deep learning approach for BI-RADS and density assessment of mammograms
by Huyen T. X. Nguyen et al

12-09-2021

PE-former: Pose Estimation Transformer
by Paschalis Panteleris et al

12-10-2021

Predicting Physical World Destinations for Commands Given to Self-Driving Cars
by Dusan Grujicic et al

12-10-2021

Critical configurations for three projective views
by Martin Bråtelund

12-09-2021

BLPnet: A New DNN model for Automatic License Plate Detection with Bengali OCR
by Md Saif Hassan Onim et al

12-08-2021

Adversarial Parametric Pose Prior
by Andrey Davydov et al

12-07-2021

Domain Generalization via Progressive Layer-wise and Channel-wise Dropout
by Jintao Guo et al

12-07-2021

Variance-Aware Weight Initialization for Point Convolutional Neural Networks
by Pedro Hermosilla et al

12-08-2021

SimulSLT: End-to-End Simultaneous Sign Language Translation
by Aoxiong Yin et al

12-09-2021

Progressive Attention on Multi-Level Dense Difference Maps for Generic Event Boundary Detection
by Jiaqi Tang et al

12-07-2021

Suppressing Static Visual Cues via Normalizing Flows for Self-Supervised Video Representation Learning
by Manlin Zhang et al

12-09-2021

HBReID: Harder Batch for Re-identification
by Wen Li et al

12-07-2021

Auxiliary Learning for Self-Supervised Video Representation via Similarity-based Knowledge Distillation
by Amirhossein Dadashzadeh et al

12-07-2021

Vehicle trajectory prediction works, but not everywhere
by Mohammadhossein Bahari et al

12-08-2021

SoK: Vehicle Orientation Representations for Deep Rotation Estimation
by Huahong Tu et al

12-09-2021

AdaStereo: An Efficient Domain-Adaptive Stereo Matching Approach
by Xiao Song et al

12-08-2021

Learning Auxiliary Monocular Contexts Helps Monocular 3D Object Detection
by Xianpeng Liu et al

12-07-2021

A Robust Completed Local Binary Pattern (RCLBP) for Surface Defect Detection
by Nana Kankam Gyimah et al

12-09-2021

Contextualized Spatio-Temporal Contrastive Learning with Self-Supervision
by Liangzhe Yuan et al

12-07-2021

Wild ToFu: Improving Range and Quality of Indirect Time-of-Flight Depth with RGB Fusion in Challenging Environments
by HyunJun Jung et al

12-08-2021

On visual self-supervision and its effect on model robustness
by Michal Kucer et al

12-09-2021

CaSP: Class-agnostic Semi-Supervised Pretraining for Detection and Segmentation
by Lu Qi et al

12-07-2021

Few-Shot Image Classification Along Sparse Graphs
by Joseph F Comer et al

12-08-2021

Unsupervised Complementary-aware Multi-process Fusion for Visual Place Recognition
by Stephen Hausler et al

12-09-2021

Implicit Feature Refinement for Instance Segmentation
by Lufan Ma et al

12-08-2021

Recurrent Glimpse-based Decoder for Detection with Transformer
by Zhe Chen et al

12-09-2021

Spatio-temporal Relation Modeling for Few-shot Action Recognition
by Anirudh Thatipelli et al

12-07-2021

Flexible Networks for Learning Physical Dynamics of Deformable Objects
by Jinhyung Park et al

12-09-2021

Exploring Event-driven Dynamic Context for Accident Scene Segmentation
by Jiaming Zhang et al

12-09-2021

IterMVS: Iterative Probability Estimation for Efficient Multi-View Stereo
by Fangjinhua Wang et al

12-07-2021

Unsupervised Learning of Compositional Scene Representations from Multiple Unspecified Viewpoints
by Jinyang Yuan et al

12-09-2021

A Shared Representation for Photorealistic Driving Simulators
by Saeed Saadatnejad et al

12-09-2021

ScaleNet: A Shallow Architecture for Scale Estimation
by Axel Barroso-Laguna et al

12-09-2021

Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-guided Feature Imitation
by Gang Li et al

12-08-2021

DMRVisNet: Deep Multi-head Regression Network for Pixel-wise Visibility Estimation Under Foggy Weather
by Jing You et al

12-07-2021

A Generic Approach for Enhancing GANs by Regularized Latent Optimization
by Yufan Zhou et al

12-07-2021

Image Enhancement via Bilateral Learning
by Saeedeh Rezaee et al

12-07-2021

Regularity Learning via Explicit Distribution Modeling for Skeletal Video Anomaly Detection
by Shoubin Yu et al

12-07-2021

Traversing within the Gaussian Typical Set: Differentiable Gaussianization Layers for Inverse Problems Augmented by Normalizing Flows
by Dongzhuo Li et al

12-09-2021

Illumination and Temperature-Aware Multispectral Networks for Edge-Computing-Enabled Pedestrian Detection
by Yifan Zhuang et al

12-08-2021

FPPN: Future Pseudo-LiDAR Frame Prediction for Autonomous Driving
by Xudong Huang et al

12-08-2021

Classification-Then-Grounding: Reformulating Video Scene Graphs as Temporal Bipartite Graphs
by Kaifeng Gao et al

12-08-2021

SimIPU: Simple 2D Image and 3D Point Cloud Unsupervised Pre-Training for Spatial-Aware Visual Representations
by Zhenyu Li et al

12-10-2021

Artificial Intellgence -- Application in Life Sciences and Beyond. The Upper Rhine Artificial Intelligence Symposium UR-AI 2021
by Karl-Herbert Schäfer et al

12-07-2021

ADD: Frequency Attention and Multi-View based Knowledge Distillation to Detect Low-Quality Compressed Deepfake Images
by Binh M. Le et al

12-07-2021

Polarimetric Pose Prediction
by Daoyi Gao et al

12-09-2021

Few-Shot Keypoint Detection as Task Adaptation via Latent Embeddings
by Mel Vecerik et al

12-10-2021

Preemptive Image Robustification for Protecting Users against Man-in-the-Middle Adversarial Attacks
by Seungyong Moon et al

12-08-2021

STAF: A Spatio-Temporal Attention Fusion Network for Few-shot Video Classification
by Rex Liu et al

12-07-2021

STC-mix: Space, Time, Channel mixing for Self-supervised Video Representation
by Srijan Das et al

12-07-2021

ViewCLR: Learning Self-supervised Video Representation for Unseen Viewpoints
by Srijan Das et al

12-07-2021

MS-TCT: Multi-Scale Temporal ConvTransformer for Action Detection
by Rui Dai et al

12-07-2021

Causal Imitative Model for Autonomous Driving
by Mohammad Reza Samsami et al

12-07-2021

Dilated convolution with learnable spacings
by Ismail Khalfaoui Hassani et al

12-07-2021

Presentation Attack Detection Methods based on Gaze Tracking and Pupil Dynamic: A Comprehensive Survey
by Jalil Nourmohammadi Khiarak

12-07-2021

GraDIRN: Learning Iterative Gradient Descent-based Energy Minimization for Deformable Image Registration
by Huaqi Qiu et al

12-07-2021

Scalable 3D Semantic Segmentation for Gun Detection in CT Scans
by Marius Memmel et al

12-07-2021

BT-Unet: A self-supervised learning framework for biomedical image segmentation using Barlow Twins with U-Net models
by Narinder Singh Punn et al

12-07-2021

Contrastive Learning from Extremely Augmented Skeleton Sequences for Self-supervised Action Recognition
by Tianyu Guo et al

12-07-2021

E22(GO)MOTION: Motion Augmented Event Stream for Egocentric Action Recognition
by Chiara Plizzari et al

12-09-2021

PRA-Net: Point Relation-Aware Network for 3D Point Cloud Analysis
by Silin Cheng et al

12-08-2021

Dynamic multi feature-class Gaussian process models
by Jean-Rassaire Fouefack et al

12-08-2021

SIRfyN: Single Image Relighting from your Neighbors
by D. A. Forsyth et al

12-08-2021

Revisiting Global Statistics Aggregation for Improving Image Restoration
by Xiaojie Chu et al

12-08-2021

Multiscale Softmax Cross Entropy for Fovea Localization on Color Fundus Photography
by Yuli Wu et al

12-07-2021

Image classifiers can not be made robust to small perturbations
by Zheng Dai et al

12-07-2021

Saliency Diversified Deep Ensemble for Robustness to Adversaries
by Alex Bogun et al

12-07-2021

Vision-Cloud Data Fusion for ADAS: A Lane Change Prediction Case Study
by Yongkang Liu et al

12-07-2021

Handwritten Mathematical Expression Recognition via Attention Aggregation based Bi-directional Mutual Learning
by Xiaohang Bian et al

12-07-2021

SalFBNet: Learning Pseudo-Saliency Distribution via Feedback Convolutional Networks
by Guanqun Ding et al

12-08-2021

Dual Cluster Contrastive learning for Person Re-Identification
by Hantao Yao et al

12-08-2021

Segment and Complete: Defending Object Detectors against Adversarial Patch Attacks with Robust Patch Detection
by Jiang Liu et al

12-08-2021

Style Mixing and Patchwise Prototypical Matching for One-Shot Unsupervised Domain Adaptive Semantic Segmentation
by Xinyi Wu et al

12-07-2021

TCGL: Temporal Contrastive Graph for Self-supervised Video Representation Learning
by Yang Liu et al

12-09-2021

3D Medical Point Transformer: Introducing Convolution to Attention Networks for Medical Point Cloud Analysis
by Jianhui Yu et al

12-07-2021

Image Compressed Sensing Using Non-local Neural Network
by Wenxue Cui et al

12-07-2021

Gram-SLD: Automatic Self-labeling and Detection for Instance Objects
by Rui Wang et al

12-08-2021

A Simple and efficient deep Scanpath Prediction
by Mohamed Amine Kerkouri et al

12-10-2021

Cross-Modal Transferable Adversarial Attacks from Images to Videos
by Zhipeng Wei et al

12-08-2021

Extending nn-UNet for brain tumor segmentation
by Huan Minh Luu et al

12-08-2021

Feature matching for multi-epoch historical aerial images
by Lulin Zhang et al

12-09-2021

Representing 3D Shapes with Probabilistic Directed Distance Fields
by Tristan Aumentado-Armstrong et al

12-07-2021

A Conditional Point Diffusion-Refinement Paradigm for 3D Point Cloud Completion
by Zhaoyang Lyu et al

12-07-2021

Learning Pixel-Adaptive Weights for Portrait Photo Retouching
by Binglu Wang et al

12-09-2021

KartalOl: Transfer learning using deep neural network for iris segmentation and localization: New dataset for iris segmentation
by Jalil Nourmohammadi Khiarak et al

12-10-2021

A Deep Learning Based Automated Hand Hygiene Training System
by Mobina Shahbandeh et al

12-07-2021

GaTector: A Unified Framework for Gaze Object Prediction
by Binglu Wang et al

12-10-2021

The Large Labelled Logo Dataset (L3D): A Multipurpose and Hand-Labelled Continuously Growing Dataset
by Asier Gutiérrez-Fandiño et al

12-10-2021

Hyperdimensional Feature Fusion for Out-Of-Distribution Detection
by Samuel Wilson et al

12-09-2021

Uncertainty, Edge, and Reverse-Attention Guided Generative Adversarial Network for Automatic Building Detection in Remotely Sensed Images
by Somrita Chattopadhyay et al

12-10-2021

Roominoes: Generating Novel 3D Floor Plans From Existing 3D Rooms
by Kai Wang et al

12-10-2021

Deep Learning based Framework for Automatic Diagnosis of Glaucoma based on analysis of Focal Notching in the Optic Nerve Head
by Sneha Dasgupta et al

12-09-2021

IFR-Explore: Learning Inter-object Functional Relationships in 3D Indoor Scenes
by Qi Li et al

12-07-2021

Which images to label for few-shot medical landmark detection?
by Quan Quan et al

12-07-2021

DCAN: Improving Temporal Action Detection via Dual Context Aggregation
by Guo Chen et al

12-09-2021

7th AI Driving Olympics: 1st Place Report for Panoptic Tracking
by Rohit Mohan et al

12-08-2021

Constrained Mean Shift Using Distant Yet Related Neighbors for Representation Learning
by Ajinkya Tejankar et al

12-09-2021

Sparse-View CT Reconstruction using Recurrent Stacked Back Projection
by Wenrui Li et al

12-10-2021

Network Compression via Central Filter
by Yuanzhi Duan et al

12-10-2021

Tradeoffs Between Contrastive and Supervised Learning: An Empirical Study
by Ananya Karthik et al

12-07-2021

SSAT: A Symmetric Semantic-Aware Transformer Network for Makeup Transfer and Removal
by Zhaoyang Sun et al

12-09-2021

MantissaCam: Learning Snapshot High-dynamic-range Imaging with Perceptually-based In-pixel Irradiance Encoding
by Haley M. So et al

12-09-2021

Hidden Path Selection Network for Semantic Segmentation of Remote Sensing Images
by Kunping Yang et al

12-09-2021

Road Extraction from Overhead Images with Graph Neural Networks
by Gaetan Bahl et al

12-09-2021

Progressive Seed Generation Auto-encoder for Unsupervised Point Cloud Learning
by Juyoung Yang et al

12-09-2021

The Many Faces of Anger: A Multicultural Video Dataset of Negative Emotions in the Wild (MFA-Wild)
by Roya Javadi et al

12-10-2021

Sparse Depth Completion with Semantic Mesh Deformation Optimization
by Bing Zhou et al

12-10-2021

DeepRLS: A Recurrent Network Architecture with Least Squares Implicit Layers for Non-blind Image Deconvolution
by Iaroslav Koshelev et al

12-10-2021

Optimizing Edge Detection for Image Segmentation with Multicut Penalties
by Steffen Jung et al

12-09-2021

Dynamic hardware system for cascade SVM classification of melanoma
by Shereen Afifi et al

12-09-2021

DiffuseMorph: Unsupervised Deformable Image Registration Along Continuous Trajectory Using Diffusion Models
by Boah Kim et al

12-10-2021

Mask-invariant Face Recognition through Template-level Knowledge Distillation
by Marco Huber et al

12-10-2021

Label, Verify, Correct: A Simple Few Shot Object Detection Method
by Prannay Kaul et al

12-09-2021

Image-to-Image Translation-based Data Augmentation for Robust EV Charging Inlet Detection
by Yeonjun Bang et al

12-09-2021

LCTR: On Awakening the Local Continuity of Transformer for Weakly Supervised Object Localization
by Zhiwei Chen et al

12-10-2021

Seq-Masks: Bridging the gap between appearance and gait modeling for video-based person re-identification
by Zhigang Chang et al

12-09-2021

Long-Range Thermal 3D Perception in Low Contrast Environments
by Andrey Filippov et al

12-10-2021

GPU-accelerated image alignment for object detection in industrial applications
by Trung-Son Le et al

12-10-2021

Rethinking the Two-Stage Framework for Grounded Situation Recognition
by Meng Wei et al

12-09-2021

3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map
by Prarthana Bhattacharyya et al

12-10-2021

Global Attention Mechanism: Retain Information to Enhance Channel-Spatial Interactions
by Yichao Liu et al

12-10-2021

Towards Full-to-Empty Room Generation with Structure-Aware Feature Encoding and Soft Semantic Region-Adaptive Normalization
by Vasileios Gkitsas et al

12-10-2021

Graph-based Generative Face Anonymisation with Pose Preservation
by Nicola Dall'Asen et al

12-10-2021

PERF: Performant, Explicit Radiance Fields
by Sverker Rasmuson et al

12-09-2021

Transfer learning using deep neural networks for Ear Presentation Attack Detection: New Database for PAD
by Jalil Nourmohammadi Khiarak

12-10-2021

DronePose: The identification, segmentation, and orientation detection of drones via neural networks
by Stirling Scholes et al

12-10-2021

Visual Transformers with Primal Object Queries for Multi-Label Image Classification
by Vacit Oguz Yazici et al

12-10-2021

Discrete neural representations for explainable anomaly detection
by Stanislaw Szymanowicz et al

12-09-2021

Deep Recurrent Neural Network with Multi-scale Bi-directional Propagation for Video Deblurring
by Chao Zhu et al

12-10-2021

Camera Condition Monitoring and Readjustment by means of Noise and Blur
by Maik Wischow et al

12-10-2021

Exploring Pixel-level Self-supervision for Weakly Supervised Semantic Segmentation
by Sung-Hoon Yoon et al

12-09-2021

Attention-based Transformation from Latent Features to Point Clouds
by Kaiyi Zhang et al

12-10-2021

Neural Belief Propagation for Scene Graph Generation
by Daqi Liu et al

12-09-2021

Surrogate-based cross-correlation for particle image velocimetry
by Yong Lee et al

12-09-2021

Report-Guided Automatic Lesion Annotation for Deep Learning-Based Prostate Cancer Detection in bpMRI
by Joeran S. Bosma et al

12-09-2021

Self-Ensemling for 3D Point Cloud Domain Adaption
by Qing Li et al

12-10-2021

Multimedia Datasets for Anomaly Detection: A Survey
by Pratibha Kumari et al

12-08-2021

Edge-aware Guidance Fusion Network for RGB Thermal Scene Parsing
by Wujie Zhou et al

12-09-2021

Learning Deep Context-Sensitive Decomposition for Low-Light Image Enhancement
by Long Ma et al

 
Craig Smith