2022.3.28 Vision papers

 

03-24-2022

Make-A-Scene: Scene-Based Text-to-Image Generation with Human Priors
by Oran Gafni et al

03-23-2022

R3M: A Universal Visual Representation for Robot Manipulation
by Suraj Nair et al

03-22-2022

GradViT: Gradient Inversion of Vision Transformers
by Ali Hatamizadeh et al

03-23-2022

Learning to generate line drawings that convey geometry and semantics
by Caroline Chan et al

03-22-2022

Focal Modulation Networks
by Jianwei Yang et al

03-22-2022

Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions
by Jing Gu et al

03-24-2022

Neural Neighbor Style Transfer
by Nicholas Kolkin et al

03-24-2022

SharpContour: A Contour-based Boundary Refinement Approach for Efficient and Accurate Instance Segmentation
by Chenming Zhu et al

03-23-2022

Learning Motion-Dependent Appearance for High-Fidelity Rendering of Dynamic Humans from a Single Camera
by Jae Shin Yoon et al

03-23-2022

Revisiting Multi-Scale Feature Fusion for Semantic Segmentation
by Tianjian Meng et al

03-22-2022

Self-supervision through Random Segments with Autoregressive Coding (RandSAC)
by Tianyu Hua et al

03-22-2022

WuDaoMM: A large-scale Multi-Modal Dataset for Pre-training models
by Sha Yuan et al

03-22-2022

Dataset Distillation by Matching Training Trajectories
by George Cazenavette et al

03-23-2022

How Do You Do It? Fine-Grained Action Understanding with Pseudo-Adverbs
by Hazel Doughty et al

03-24-2022

Hierarchical Nearest Neighbor Graph Embedding for Efficient Dimensionality Reduction
by M. Saquib Sarfraz et al

03-24-2022

Text to Mesh Without 3D Supervision Using Limit Subdivision
by Nasir Khalid et al

03-24-2022

NPBG++: Accelerating Neural Point-Based Graphics
by Ruslan Rakhimov et al

03-22-2022

Open-Vocabulary DETR with Conditional Matching
by Yuhang Zang et al

03-24-2022

Bailando: 3D Dance Generation by Actor-Critic GPT with Choreographic Memory
by Li Siyao et al

03-22-2022

Visual Prompt Tuning
by Menglin Jia et al

03-23-2022

NeuMan: Neural Human Radiance Field from a Single Video
by Wei Jiang et al

03-22-2022

Improving Generalization in Federated Learning by Seeking Flat Minima
by Debora Caldarola et al

03-22-2022

Generating natural images with direct Patch Distributions Matching
by Ariel Elnekave et al

03-23-2022

VideoMAE: Masked Autoencoders are Data-Efficient Learners for Self-Supervised Video Pre-Training
by Zhan Tong et al

03-23-2022

Industrial Style Transfer with Large-scale Geometric Warping and Content Preservation
by Jinchao Yang et al

03-24-2022

Is Geometry Enough for Matching in Visual Localization?
by Qunjie Zhou et al

03-24-2022

Learning Dense Correspondence from Synthetic Environments
by Mithun Lal et al

03-24-2022

BigDetection: A Large-scale Benchmark for Improved Object Detector Pre-training
by Likun Cai et al

03-22-2022

Practical Stereo Matching via Cascaded Recurrent Network with Adaptive Correlation
by Jiankun Li et al

03-22-2022

A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning
by Hugo Berg et al

03-22-2022

A Real-time Junk Food Recognition System based on Machine Learning
by Sirajum Munira Shifat et al

03-25-2022

Efficient-VDVAE: Less is more
by Louay Hazami et al

03-24-2022

Global Tracking Transformers
by Xingyi Zhou et al

03-25-2022

3D GAN Inversion for Controllable Portrait Image Animation
by Connor Z. Lin et al

03-24-2022

Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer
by Shuai Yang et al

03-23-2022

Interpretable Prediction of Lung Squamous Cell Carcinoma Recurrence With Self-supervised Learning
by Weicheng Zhu et al

03-25-2022

AutoAvatar: Autoregressive Neural Fields for Dynamic Avatar Modeling
by Ziqian Bai et al

03-23-2022

When Accuracy Meets Privacy: Two-Stage Federated Transfer Learning Framework in Classification of Medical Images on Limited Data: A COVID-19 Case Study
by Alexandros Shikun Zhang et al

03-24-2022

CVF-SID: Cyclic multi-Variate Function for Self-Supervised Image Denoising by Disentangling Noise from Image
by Reyhaneh Neshatavar et al

03-24-2022

Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation
by Xian Liu et al

03-23-2022

Maximum Spatial Perturbation Consistency for Unpaired Image-to-Image Translation
by Yanwu Xu et al

03-22-2022

HOP: History-and-Order Aware Pre-training for Vision-and-Language Navigation
by Yanyuan Qiao et al

03-22-2022

CM-GAN: Image Inpainting with Cascaded Modulation GAN and Object-Aware Training
by Haitian Zheng et al

03-23-2022

Random Forest Regression for continuous affect using Facial Action Units
by Saurabh Hinduja et al

03-24-2022

Beyond Fixation: Dynamic Window Visual Transformer
by Pengzhen Ren et al

03-23-2022

Physics-Driven Deep Learning for Computational Magnetic Resonance Imaging
by Kerstin Hammernik et al

03-23-2022

Evaluation of Non-Invasive Thermal Imaging for detection of Viability of Onchocerciasis worms
by Ronak Dedhiya et al

03-22-2022

Enabling faster and more reliable sonographic assessment of gestational age through machine learning
by Chace Lee et al

03-22-2022

Lymphocyte Classification in Hyperspectral Images of Ovarian Cancer Tissue Biopsy Samples
by Benjamin Paulson et al

03-23-2022

MR Image Denoising and Super-Resolution Using Regularized Reverse Diffusion
by Hyungjin Chung et al

03-24-2022

NPC: Neuron Path Coverage via Characterizing Decision Logic of Deep Neural Networks
by Xiaofei Xie et al

03-22-2022

Learning from All Vehicles
by Dian Chen et al

03-22-2022

IDEA-Net: Dynamic 3D Point Cloud Interpolation via Deep Embedding Alignment
by Yiming Zeng et al

03-24-2022

Open-set Recognition via Augmentation-based Similarity Learning
by Sepideh Esmaeilpour et al

03-24-2022

A Representation Separation Perspective to Correspondences-free Unsupervised 3D Point Cloud Registration
by Zhiyuan Zhang et al

03-23-2022

GriTS: Grid table similarity metric for table structure recognition
by Brandon Smock et al

03-24-2022

Interpretable Prediction of Pulmonary Hypertension in Newborns using Echocardiograms
by Hanna Ragnarsdottir et al

03-24-2022

EPro-PnP: Generalized End-to-End Probabilistic Perspective-n-Points for Monocular Object Pose Estimation
by Hansheng Chen et al

03-24-2022

Dexterous Imitation Made Easy: A Learning-Based Framework for Efficient Dexterous Manipulation
by Sridhar Pandian Arunachalam et al

03-24-2022

A Deep-Discrete Learning Framework for Spherical Surface Registration
by Mohamed A. Suliman et al

03-24-2022

Direct evaluation of progression or regression of disease burden in brain metastatic disease with Deep Neuroevolution
by Joseph Stember et al

03-24-2022

RayTran: 3D pose estimation and shape reconstruction of multiple objects from videos with ray-traced transformers
by Michał J. Tyszkiewicz et al

03-23-2022

The Challenges of Continuous Self-Supervised Learning
by Senthil Purushwalkam et al

03-22-2022

PlaneMVS: 3D Plane Reconstruction from Multi-View Stereo
by Jiachen Liu et al

03-22-2022

Convolutional Neural Network to Restore Low-Dose Digital Breast Tomosynthesis Projections in a Variance Stabilization Domain
by Rodrigo de Barros Vimieiro et al

03-24-2022

Towards Exemplar-Free Continual Learning in Vision Transformers: an Account of Attention, Functional and Weight Regularization
by Francesco Pelosin et al

03-22-2022

Was that so hard? Estimating human classification difficulty
by Morten Rieger Hannemose et al

03-22-2022

Fast on-line signature recognition based on VQ with time modeling
by Juan-Manuel Pascual-Gaspar et al

03-23-2022

Self-Supervised Robust Scene Flow Estimation via the Alignment of Probability Density Functions
by Pan He et al

03-22-2022

{\phi}-SfT: Shape-from-Template with a Physics-Based Deformation Model
by Navami Kairanda et al

03-23-2022

Towards Semi-Supervised Deep Facial Expression Recognition with An Adaptive Confidence Margin
by Hangyu Li et al

03-23-2022

Binary Morphological Neural Network
by Theodore Aouad et al

03-23-2022

A Deep Learning Framework to Reconstruct Face under Mask
by Gourango Modak et al

03-22-2022

Meta-attention for ViT-backed Continual Learning
by Mengqi Xue et al

03-24-2022

VRNet: Learning the Rectified Virtual Corresponding Points for 3D Point Cloud Registration
by Zhiyuan Zhang et al

03-23-2022

Enhancing Classifier Conservativeness and Robustness by Polynomiality
by Ziqi Wang et al

03-23-2022

Learning Scene Flow in 3D Point Clouds with Noisy Pseudo Labels
by Bing Li et al

03-23-2022

Improving the Fairness of Chest X-ray Classifiers
by Haoran Zhang et al

03-23-2022

Biceph-Net: A robust and lightweight framework for the diagnosis of Alzheimers disease using 2D-MRI scans and deep similarity learning
by A. H. Rashid et al

03-23-2022

Learning to Censor by Noisy Sampling
by Ayush Chopra et al

03-24-2022

Multitask Emotion Recognition Model with Knowledge Distillation and Task Discriminator
by Euiseok Jeong et al

03-22-2022

Look for the Change: Learning Object States and State-Modifying Actions from Untrimmed Web Videos
by Tomáš Souček et al

03-24-2022

Multi-modal Emotion Estimation for in-the-wild Videos
by Liyu Meng et al

03-24-2022

Coarse-to-Fine Cascaded Networks with Smooth Predicting for Video Facial Expression Recognition
by Fanglei Xue et al

03-23-2022

Negative Selection by Clustering for Contrastive Learning in Human Activity Recognition
by Jinqiang Wang et al

03-23-2022

A Hybrid Mesh-neural Representation for 3D Transparent Object Reconstruction
by Jiamin Xu et al

03-24-2022

IA-FaceS: A Bidirectional Method for Semantic Face Editing
by Wenjing Huang et al

03-25-2022

Chaos is a Ladder: A New Theoretical Understanding of Contrastive Learning via Augmentation Overlap
by Yifei Wang et al

03-23-2022

Cell segmentation from telecentric bright-field transmitted light microscopic images using a Residual Attention U-Net: a case study on HeLa line
by Ali Ghaznavi et al

03-23-2022

Q-FW: A Hybrid Classical-Quantum Frank-Wolfe for Quadratic Binary Optimization
by Alp Yurtsever et al

03-22-2022

Generative Modeling Helps Weak Supervision (and Vice Versa)
by Benedikt Boecking et al

03-24-2022

FitCLIP: Refining Large-Scale Pretrained Image-Text Models for Zero-Shot Video Understanding Tasks
by Santiago Castro et al

03-23-2022

Sparse Instance Activation for Real-Time Instance Segmentation
by Tianheng Cheng et al

03-22-2022

Pixel VQ-VAEs for Improved Pixel Art Representation
by Akash Saravanan et al

03-24-2022

Facial Expression Recognition based on Multi-head Cross Attention Network
by Jae-Yeop Jeong et al

03-23-2022

Computed Tomography Reconstruction using Generative Energy-Based Priors
by Martin Zach et al

03-24-2022

Feature visualization for convolutional neural network models trained on neuroimaging data
by Fabian Eitel et al

03-22-2022

A Broad Study of Pre-training for Domain Generalization and Adaptation
by Donghyun Kim et al

03-23-2022

Unsupervised Simultaneous Learning for Camera Re-Localization and Depth Estimation from Video
by Shun Taguchi et al

03-23-2022

A Method of Data Augmentation to Train a Small Area Fingerprint Recognition Deep Neural Network with a Normal Fingerprint Database
by JuSong Kim

03-22-2022

Improving Neural Predictivity in the Visual Cortex with Gated Recurrent Connections
by Simone Azeglio et al

03-23-2022

StructToken : Rethinking Semantic Segmentation with Structural Prior
by Fangjian Lin et al

03-22-2022

Weakly-Supervised Salient Object Detection Using Point Supervison
by Shuyong Gao et al

03-24-2022

Transformer Compressed Sensing via Global Image Tokens
by Marlon Bran Lorenzana et al

03-22-2022

Channel Self-Supervision for Online Knowledge Distillation
by Shixiao Fan et al

03-22-2022

A Novel Framework for Assessment of Learning-based Detectors in Realistic Conditions with Application to Deepfake Detection
by Yuhang Lu et al

03-23-2022

SMEMO: Social Memory for Trajectory Forecasting
by Francesco Marchetti et al

03-22-2022

AP-BSN: Self-Supervised Denoising for Real-World Images via Asymmetric PD and Blind-Spot Network
by Wooseok Lee et al

03-24-2022

RNNPose: Recurrent 6-DoF Object Pose Refinement with Robust Correspondence Field Estimation and Pose Optimization
by Yan Xu et al

03-23-2022

Event-Based Dense Reconstruction Pipeline
by Kun Xiao et al

03-22-2022

Fine-Grained Scene Graph Generation with Data Transfer
by Ao Zhang et al

03-23-2022

Activation-Based Sampling for Pixel- to Image-Level Aggregation in Weakly-Supervised Segmentation
by Arvi Jonnarth et al

03-25-2022

Polarization Multiplexed Diffractive Computing: All-Optical Implementation of a Group of Linear Transformations Through a Polarization-Encoded Diffractive Network
by Jingxi Li et al

03-24-2022

Deep learning for laboratory earthquake prediction and autoregressive forecasting of fault zone stress
by Laura Laurenti et al

03-24-2022

Compound Domain Generalization via Meta-Knowledge Encoding
by Chaoqi Chen et al

03-23-2022

Deep Frequency Filtering for Domain Generalization
by Shiqi Lin et al

03-23-2022

Refine-Net: Normal Refinement Neural Network for Noisy Point Clouds
by Haoran Zhou et al

03-22-2022

Multi-layer Clustering-based Residual Sparsifying Transform for Low-dose CT Image Reconstruction
by Xikai Yang et al

03-25-2022

On the performance of preconditioned methods to solve LpLp-norm phase unwrapping
by Ricardo Legarda-Saenz et al

03-24-2022

Learning Disentangled Representation for One-shot Progressive Face Swapping
by Qi Li et al

03-24-2022

R-DFCIL: Relation-Guided Representation Learning for Data-Free Class Incremental Learning
by Qiankun Gao et al

03-25-2022

A Unified Contrastive Energy-based Model for Understanding the Generative Ability of Adversarial Training
by Yifei Wang et al

03-25-2022

CNN LEGO: Disassembling and Assembling Convolutional Neural Network
by Jiacong Hu et al

03-22-2022

Reinforcement-based frugal learning for satellite image change detection
by Sebastien Deschamps et al

03-24-2022

Egocentric Prediction of Action Target in 3D
by Yiming Li et al

03-24-2022

Facial Action Unit Recognition With Multi-models Ensembling
by Wenqiang Jiang et al

03-24-2022

SIFT and SURF based feature extraction for the anomaly detection
by Simon Bilik et al

03-22-2022

Deep Portrait Delighting
by Joshua Weir et al

03-24-2022

Video Instance Segmentation via Multi-scale Spatio-temporal Split Attention Transformer
by Omkar Thawakar et al

03-22-2022

Semi-Supervised Hybrid Spine Network for Segmentation of Spine MR Images
by Meiyan Huang et al

03-23-2022

MT-UDA: Towards Unsupervised Cross-modality Medical Image Segmentation with Limited Source Labels
by Ziyuan Zhao et al

03-24-2022

Moving Window Regression: A Novel Approach to Ordinal Regression
by Nyeong-Ho Shin et al

03-23-2022

A Multi-Characteristic Learning Method with Micro-Doppler Signatures for Pedestrian Identification
by Yu Xiang et al

03-23-2022

U-Boost NAS: Utilization-Boosted Differentiable Neural Architecture Search
by Ahmet Caner Yüzügüler et al

03-22-2022

Mask Usage Recognition using Vision Transformer with Transfer Learning and Data Augmentation
by Hensel Donato Jahja et al

03-24-2022

DyRep: Bootstrapping Training with Dynamic Re-parameterization
by Tao Huang et al

03-24-2022

Compositional Temporal Grounding with Structured Variational Cross-Graph Correspondence Learning
by Juncheng Li et al

03-24-2022

A Simulation Benchmark for Vision-based Autonomous Navigation
by Lauri Suomela et al

03-22-2022

A New Approach to Improve Learning-based Deepfake Detection in Realistic Conditions
by Yuhang Lu et al

03-24-2022

Physics-based Learning of Parameterized Thermodynamics from Real-time Thermography
by Hamza El-Kebir et al

03-22-2022

CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation
by Feng Wang et al

03-24-2022

AziNorm: Exploiting the Radial Symmetry of Point Cloud for Azimuth-Normalized 3D Perception
by Shaoyu Chen et al

03-24-2022

Focus-and-Detect: A Small Object Detection Framework for Aerial Images
by Onur Can Koyun et al

03-24-2022

A Preliminary Research on Space Situational Awareness Based on Event Cameras
by Kun Xiao et al

03-24-2022

Steganalysis of Image with Adaptively Parametric Activation
by Hai Su et al

03-24-2022

Self-supervised Video-centralised Transformer for Video Face Clustering
by Yujiang Wang et al

03-23-2022

Self-supervised Learning of Adversarial Example: Towards Good Generalizations for Deepfake Detection
by Liang Chen et al

03-24-2022

A Perturbation Constrained Adversarial Attack for Evaluating the Robustness of Optical Flow
by Jenny Schmalfuss et al

03-23-2022

Transformer-based Multimodal Information Fusion for Facial Expression Analysis
by Wei Zhang et al

03-22-2022

Unsupervised Anomaly Detection in Medical Images with a Memory-augmented Multi-level Cross-attentional Masked Autoencoder
by Yu Tian et al

03-22-2022

Adaptive Patch Exiting for Scalable Single Image Super-Resolution
by Shizun Wang et al

03-23-2022

UMT: Unified Multi-modal Transformers for Joint Video Moment Retrieval and Highlight Detection
by Ye Liu et al

03-22-2022

DepthGAN: GAN-based Depth Generation of Indoor Scenes from Semantic Layouts
by Yidi Li et al

03-23-2022

Subjective and Objective Analysis of Streamed Gaming Videos
by Xiangxu Yu et al

03-22-2022

4D-OR: Semantic Scene Graphs for OR Domain Modeling
by Ege Özsoy et al

03-22-2022

Unified Negative Pair Generation toward Well-discriminative Feature Space for Face Recognition
by Junuk Jung et al

03-23-2022

HMFS: Hybrid Masking for Few-Shot Segmentation
by Seonghyeon Moon et al

03-23-2022

ViT-FOD: A Vision Transformer based Fine-grained Object Discriminator
by Zi-Chao Zhang et al

03-24-2022

Neural Reflectance for Shape Recovery with Shadow Handling
by Junxuan Li et al

03-24-2022

Privileged Attribution Constrained Deep Networks for Facial Expression Recognition
by Jules Bonnard et al

03-23-2022

Robust Text Line Detection in Historical Documents: Learning and Evaluation Methods
by Mélodie Boillet et al

03-22-2022

Cross-View Panorama Image Synthesis
by Songsong Wu et al

03-22-2022

Contrastive Transformer-based Multiple Instance Learning for Weakly Supervised Polyp Frame Detection
by Yu Tian et al

03-23-2022

AIMusicGuru: Music Assisted Human Pose Correction
by Snehesh Shrestha et al

03-22-2022

Remember Intentions: Retrospective-Memory-based Trajectory Prediction
by Chenxin Xu et al

03-22-2022

Ray3D: ray-based 3D human pose estimation for monocular absolute 3D localization
by Yu Zhan et al

03-22-2022

Under the Hood of Transformer Networks for Trajectory Forecasting
by Luca Franco et al

03-23-2022

Real-time Object Detection for Streaming Perception
by Jinrong Yang et al

03-23-2022

CroMo: Cross-Modal Learning for Monocular Depth Estimation
by Yannick Verdié et al

03-23-2022

Scale-Equivalent Distillation for Semi-Supervised Object Detection
by Qiushan Guo et al

03-23-2022

DR.VIC: Decomposition and Reasoning for Video Individual Counting
by Tao Han et al

03-23-2022

3D Adapted Random Forest Vision (3DARFV) for Untangling Heterogeneous-Fabric Exceeding Deep Learning Semantic Segmentation Efficiency at the Utmost Accuracy
by Omar Alfarisi et al

03-24-2022

Continuous Emotion Recognition using Visual-audio-linguistic information: A Technical Report for ABAW3
by Su Zhang et al

03-24-2022

Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering
by Chengyang Fang et al

03-23-2022

Autofocus for Event Cameras
by Shijie Lin et al

03-23-2022

Ev-TTA: Test-Time Adaptation for Event-Based Object Recognition
by Junho Kim et al

03-25-2022

Versatile Multi-Modal Pre-Training for Human-Centric Perception
by Fangzhou Hong et al

03-23-2022

Self-supervised HDR Imaging from Motion and Exposure Cues
by Michal Nazarczuk et al

03-24-2022

WarpingGAN: Warping Multiple Uniform Priors for Adversarial 3D Point Cloud Generation
by Yingzhi Tang et al

03-23-2022

Domain-Generalized Textured Surface Anomaly Detection
by Shang-Fu Chen et al

03-23-2022

Lane detection with Position Embedding
by Jun Xie et al

03-23-2022

DAN: a Segmentation-free Document Attention Network for Handwritten Document Recognition
by Denis Coquenet et al

03-22-2022

ProgressiveMotionSeg: Mutually Reinforced Framework for Event-Based Motion Segmentation
by Jinze Chen et al

03-24-2022

Practical Blind Denoising via Swin-Conv-UNet and Data Synthesis
by Kai Zhang et al

03-23-2022

Affective Feedback Synthesis Towards Multimodal Text and Image Data
by Puneet Kumar et al

03-22-2022

Rebalanced Siamese Contrastive Mining for Long-Tailed Recognition
by Zhisheng Zhong et al

03-23-2022

On the (Limited) Generalization of MasterFace Attacks and Its Relation to the Capacity of Face Representations
by Philipp Terhörst et al

03-24-2022

Bayesian Nonparametric Submodular Video Partition for Robust Anomaly Detection
by Hitesh Sapkota et al

03-24-2022

Multiple Emotion Descriptors Estimation at the ABAW3 Challenge
by Didan Deng

03-24-2022

Keypoints Tracking via Transformer Networks
by Oleksii Nasypanyi et al

03-24-2022

Semantic Image Manipulation with Background-guided Internal Learning
by Zhongping Zhang et al

03-23-2022

An Attention-based Method for Action Unit Detection at the 3rd ABAW Competition
by Duy Le Hoai et al

03-23-2022

Your Attention Deserves Attention: A Self-Diversified Multi-Channel Attention for Facial Action Analysis
by Xiaotian Li et al

03-23-2022

Multidimensional Belief Quantification for Label-Efficient Meta-Learning
by Deep Pandey et al

03-23-2022

Training-free Transformer Architecture Search
by Qinqin Zhou et al

03-24-2022

Searching for fingerspelled content in American Sign Language
by Bowen Shi et al

03-22-2022

QS-Craft: Learning to Quantize, Scrabble and Craft for Conditional Human Motion Animation
by Yuxin Hong et al

03-24-2022

X-ray Dissectography Improves Lung Nodule Detection
by Chuang Niu et al

03-24-2022

The Fixed Sub-Center: A Better Way to Capture Data Complexity
by Zhemin Zhang et al

03-22-2022

High-resolution Iterative Feedback Network for Camouflaged Object Detection
by Xiaobin Hu et al

03-23-2022

Multi-label Transformer for Action Unit Detection
by Gauthier Tallec et al

03-22-2022

Exploring and Evaluating Image Restoration Potential in Dynamic Scenes
by Cheng Zhang et al

03-22-2022

Mixed Differential Privacy in Computer Vision
by Aditya Golatkar et al

03-23-2022

Towards Efficient and Elastic Visual Question Answering with Doubly Slimmable Transformer
by Zhou Yu et al

03-22-2022

Dense Residual Networks for Gaze Mapping on Indian Roads
by Chaitanya Kapoor et al

03-25-2022

Neural Networks with Divisive normalization for image segmentation with application in cityscapes dataset
by Pablo Hernández-Cámara et al

03-22-2022

Leveraging Textures in Zero-shot Understanding of Fine-Grained Domains
by Chenyun Wu et al

03-22-2022

Unifying Motion Deblurring and Frame Interpolation with Events
by Xiang Zhang et al

03-22-2022

Adaptive Transformers for Robust Few-shot Cross-domain Face Anti-spoofing
by Hsin-Ping Huang et al

03-22-2022

Frugal Learning of Virtual Exemplars for Label-Efficient Satellite Image Change Detection
by Hichem Sahbi et al

03-24-2022

Transformers Meet Visual Learning Understanding: A Comprehensive Review
by Yuting Yang et al

03-22-2022

Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition
by Wondimu Dikubab et al

03-22-2022

Detection, Recognition, and Tracking: A Survey
by Shiyao Chen et al

03-23-2022

Unsupervised Salient Object Detection with Spectral Cluster Voting
by Gyungin Shin et al

03-25-2022

Interpretation of Chest x-rays affected by bullets using deep transfer learning
by Shaheer Khan et al

03-22-2022

GOSS: Towards Generalized Open-set Semantic Segmentation
by Jie Hong et al

03-23-2022

What to Hide from Your Students: Attention-Guided Masked Image Modeling
by Ioannis Kakogeorgiou et al

03-24-2022

Quantum Motion Segmentation
by Federica Arrigoni et al

03-23-2022

DynamicEarthNet: Daily Multi-Spectral Satellite Dataset for Semantic Change Segmentation
by Aysim Toker et al

03-25-2022

Stochastic Trajectory Prediction via Motion Indeterminacy Diffusion
by Tianpei Gu et al

03-23-2022

Hyper-Spectral Imaging for Overlapping Plastic Flakes Segmentation
by Guillem Martinez et al

03-25-2022

The TerraByte Client: providing access to terabytes of plant data
by Michael A. Beck et al

03-25-2022

Non-Probability Sampling Network for Stochastic Human Trajectory Prediction
by Inhwan Bae et al

03-25-2022

ST-FL: Style Transfer Preprocessing in Federated Learning for COVID-19 Segmentation
by Antonios Georgiadis et al

03-24-2022

Making Heads or Tails: Towards Semantically Consistent Visual Counterfactuals
by Simon Vandenhende et al

03-22-2022

End-to-End Learned Block-Based Image Compression with Block-Level Masked Convolutions and Asymptotic Closed Loop Training
by Fatih Kamisli

03-24-2022

Expression Classification using Concatenation of Deep Neural Network for the 3rd ABAW3 Competition
by Kim Ngan Phan et al

03-23-2022

Adaptively Re-weighting Multi-Loss Untrained Transformer for Sparse-View Cone-Beam CT Reconstruction
by Minghui Wu et al

03-22-2022

Convolutional Neural Network-based Efficient Dense Point Cloud Generation using Unsigned Distance Fields
by Abol Basher et al

03-24-2022

Probing Representation Forgetting in Supervised and Unsupervised Continual Learning
by MohammadReza Davari et al

03-25-2022

MDsrv -- visual sharing and analysis of molecular dynamics simulations
by Michelle Kampfrath et al

03-25-2022

Facial Expression Recognition with Swin Transformer
by Jun-Hwa Kim et al

03-25-2022

Interactive Style Transfer: All is Your Palette
by Zheng Lin et al

03-24-2022

Weakly-Supervised End-to-End CAD Retrieval to Scan Objects
by Tim Beyer et al

03-25-2022

Compare learning: bi-attention network for few-shot learning
by Li Ke et al

03-22-2022

Semantic State Estimation in Cloth Manipulation Tasks
by Georgies Tzelepis et al

03-25-2022

PANDORA: Polarization-Aided Neural Decomposition Of Radiance
by Akshat Dave et al

03-25-2022

SeCo: Separating Unknown Musical Visual Sounds with Consistency Guidance
by Xinchi Zhou et al

03-22-2022

SSD-KD: A Self-supervised Diverse Knowledge Distillation Method for Lightweight Skin Lesion Classification Using Dermoscopic Images
by Yongwei Wang et al

03-25-2022

Continual Test-Time Domain Adaptation
by Qin Wang et al

03-23-2022

Efficient Few-Shot Object Detection via Knowledge Inheritance
by Ze Yang et al

03-24-2022

Effectively leveraging Multi-modal Features for Movie Genre Classification
by Zhongping Zhang et al

03-24-2022

Intrinsic Bias Identification on Medical Image Datasets
by Shijie Zhang et al

03-22-2022

DTFD-MIL: Double-Tier Feature Distillation Multiple Instance Learning for Histopathology Whole Slide Image Classification
by Hongrun Zhang et al

03-22-2022

Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework
by Botao Ye et al

03-22-2022

Unsupervised Deraining: Where Contrastive Learning Meets Self-similarity
by Ye Yuntong et al

03-25-2022

Lightweight Graph Convolutional Networks with Topologically Consistent Magnitude Pruning
by Hichem Sahbi

03-24-2022

Microstructure Surface Reconstruction from SEM Images: An Alternative to Digital Image Correlation (DIC)
by Khalid El-Awady

03-24-2022

FAMLP: A Frequency-Aware MLP-Like Architecture For Domain Generalization
by Kecheng Zheng et al

03-24-2022

An Ensemble Approach for Facial Expression Analysis in Video
by Hong-Hai Nguyen et al

03-22-2022

FxP-QNet: A Post-Training Quantizer for the Design of Mixed Low-Precision DNNs with Dynamic Fixed-Point Representation
by Ahmad Shawahna et al

03-22-2022

WayFAST: Traversability Predictive Navigation for Field Robots
by Mateus Valverde Gasparino et al

03-25-2022

Navigable Proximity Graph-Driven Native Hybrid Queries with Structured and Unstructured Constraints
by Mengzhao Wang et al

03-24-2022

Noisy Boundaries: Lemon or Lemonade for Semi-supervised Instance Segmentation?
by Zhenyu Wang et al

03-25-2022

Deformable Butterfly: A Highly Structured and Sparse Linear Transform
by Rui Lin et al

03-24-2022

Self-Supervised Predictive Learning: A Negative-Free Method for Sound Source Localization in Visual Scenes
by Zengjie Song et al

03-22-2022

TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers
by Xuyang Bai et al

03-22-2022

FrameHopper: Selective Processing of Video Frames in Detection-driven Real-Time Video Analytics
by Md Adnan Arefeen et al

03-25-2022

Vision Transformer Compression with Structured Pruning and Low Rank Approximation
by Ankur Kumar

03-25-2022

StretchBEV: Stretching Future Instance Prediction Spatially and Temporally
by Adil Kaan Akan et al

03-25-2022

Analysis of the Production Strategy of Mask Types in the COVID-19 Environment
by Xiangri Lu et al

03-25-2022

Searching for Network Width with Bilaterally Coupled Network
by Xiu Su et al

03-25-2022

A Visual Navigation Perspective for Category-Level Object Pose Estimation
by Jiaxin Guo et al

03-24-2022

Multi-scale and Cross-scale Contrastive Learning for Semantic Segmentation
by Theodoros Pissas et al

03-25-2022

Analysis of the use of color and its emotional relationship in visual creations based on experiences during the context of the COVID-19 pandemic
by César González-Martín et al

03-25-2022

Implicit Neural Representations for Variable Length Human Motion Generation
by Pablo Cervantes et al

03-25-2022

Unsupervised Image Deraining: Optimization Model Driven Deep CNN
by Changfeng Yu et al

03-22-2022

Satellite Infrastructure/Mission Tradeoffs
by Matthew Ciolino

03-22-2022

Learning Geodesic-Aware Local Features from RGB-D Images
by Guilherme Potje et al

03-25-2022

Salt Detection Using Segmentation of Seismic Image
by Mrinmoy Sarkar

03-25-2022

Efficient Visual Tracking via Hierarchical Cross-Attention Transformer
by Xin Chen et al

03-25-2022

Multimodal Pre-training Based on Graph Attention Network for Document Understanding
by Zhenrong Zhang et al

03-25-2022

High-Performance Transformer Tracking
by Xin Chen et al

03-25-2022

Spatially Multi-conditional Image Generation
by Ritika Chakraborty et al

03-25-2022

Visual-based Safe Landing for UAVs in Populated Areas: Real-time Validation in Virtual Environments
by Hector Tovanche-Picon et al

03-25-2022

Playing Lottery Tickets in Style Transfer Models
by Meihao Kong et al

03-25-2022

Continuous Dynamic-NeRF: Spline-NeRF
by Julian Knodt

03-24-2022

Multi-modal Multi-label Facial Action Unit Detection with Transformer
by Lingfeng Wang et al

03-24-2022

Weakly-Supervised Online Action Segmentation in Multi-View Instructional Videos
by Reza Ghoddoosian et al

03-24-2022

Human Gait Recognition Using Bag of Words Feature Representation Method
by Nasrin Bayat et al

03-24-2022

MonoDETR: Depth-aware Transformer for Monocular 3D Object Detection
by Renrui Zhang et al

03-25-2022

CAD: Co-Adapting Discriminative Features for Improved Few-Shot Classification
by Philip Chikontwe et al

03-25-2022

RD-Optimized Trit-Plane Coding of Deep Compressed Image Latent Tensors
by Seungmin Jeon et al

03-25-2022

Improving Adversarial Transferability with Spatial Momentum
by Guoqiu Wang et al

03-25-2022

Dense Continuous-Time Optical Flow from Events and Frames
by Mathias Gehrig et al

03-25-2022

Semi-supervised and Deep learning Frameworks for Video Classification and Key-frame Identification
by Sohini Roychowdhury

03-25-2022

PCA-Based Knowledge Distillation Towards Lightweight and Content-Style Balanced Photorealistic Style Transfer Models
by Tai-Yin Chiu et al

03-25-2022

Unsupervised Pre-training for Temporal Action Localization Tasks
by Can Zhang et al

03-25-2022

Adjacent Context Coordination Network for Salient Object Detection in Optical Remote Sensing Images
by Gongyang Li et al

03-25-2022

MDAN: Multi-level Dependent Attention Network for Visual Emotion Analysis
by Liwen Xu et al

03-25-2022

Fast Hybrid Image Retargeting
by Daniel Valdez-Balderas et al

03-24-2022

Continuous-Time Audiovisual Fusion with Recurrence vs. Attention for In-The-Wild Affect Recognition
by Vincent Karas et al

03-25-2022

Learning to Adapt to Unseen Abnormal Activities under Weak Supervision
by Jaeyoo Park et al

03-25-2022

Class-Incremental Learning for Action Recognition in Videos
by Jaeyoo Park et al

03-25-2022

Stabilizing Adversarially Learned One-Class Novelty Detection Using Pseudo Anomalies
by Muhammad Zaigham Zaheer et al

03-24-2022

Frame-level Prediction of Facial Expressions, Valence, Arousal and Action Units for Mobile Devices
by Andrey V. Savchenko

03-24-2022

BCOT: A Markerless High-Precision 3D Object Tracking Benchmark
by Jiachen Li et al

03-25-2022

FReSCO: Flow Reconstruction and Segmentation for low latency Cardiac Output monitoring using deep artifact suppression and segmentation
by Olivier Jaubert et al

03-25-2022

Rope3D: TheRoadside Perception Dataset for Autonomous Driving and Monocular 3D Object Detection Task
by Xiaoqing Ye et al

03-22-2022

Learning Patch-to-Cluster Attention in Vision Transformer
by Ryan Grainger et al

03-25-2022

Clustering Aided Weakly Supervised Training to Detect Anomalous Events in Surveillance Videos
by Muhammad Zaigham Zaheer et al

03-24-2022

Point2Seq: Detecting 3D Objects as Sequences
by Yujing Xue et al

03-25-2022

Digital Fingerprinting of Microstructures
by Michael D. White et al

03-25-2022

Contrastive learning of Class-agnostic Activation Map for Weakly Supervised Object Localization and Semantic Segmentation
by Jinheng Xie et al

03-25-2022

Give Me Your Attention: Dot-Product Attention Considered Harmful for Adversarial Patch Robustness
by Giulio Lovisotto et al

03-24-2022

CrossFormer: Cross Spatio-Temporal Transformer for 3D Human Pose Estimation
by Mohammed Hassanin et al

03-24-2022

Occluded Human Mesh Recovery
by Rawal Khirodkar et al

03-24-2022

Repairing Group-Level Errors for DNNs Using Weighted Regularization
by Ziyuan Zhong et al

 
Craig Smith