2022.4.11 Vision papers

 

04-07-2022

The Effects of Regularization and Data Augmentation are Class Dependent
by Randall Balestriero et al

04-07-2022

Video Diffusion Models
by Jonathan Ho et al

04-06-2022

KNN-Diffusion: Image Generation via Large-Scale Retrieval
by Oron Ashual et al

04-06-2022

Last Layer Re-Training is Sufficient for Robustness to Spurious Correlations
by Polina Kirichenko et al

04-06-2022

3D face reconstruction with dense landmarks
by Erroll Wood et al

04-07-2022

Unified Contrastive Learning in Image-Text-Label Space
by Jianwei Yang et al

04-06-2022

Unleashing Vanilla Vision Transformer with Masked Image Modeling for Object Detection
by Yuxin Fang et al

04-07-2022

AutoRF: Learning 3D Object Radiance Fields from Single View Observations
by Norman Müller et al

04-07-2022

SunStage: Portrait Reconstruction and Relighting using the Sun as a Light Stage
by Yifan Wang et al

04-05-2022

ObjectFolder 2.0: A Multisensory Object Dataset for Sim2Real Transfer
by Ruohan Gao et al

04-06-2022

Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality
by Tristan Thrush et al

04-07-2022

Long Video Generation with Time-Agnostic VQGAN and Time-Sensitive Transformer
by Songwei Ge et al

04-05-2022

iSDF: Real-Time Neural Signed Distance Fields for Robot Perception
by Joseph Ortiz et al

04-05-2022

Neural Convolutional Surfaces
by Luca Morreale et al

04-06-2022

AUV-Net: Learning Aligned UV Maps for Texture Transfer and Synthesis
by Zhiqin Chen et al

04-05-2022

IRON: Inverse Rendering by Optimizing Neural SDFs and Materials from Photometric Images
by Kai Zhang et al

04-07-2022

Habitat-Web: Learning Embodied Object-Search Strategies from Human Demonstrations at Scale
by Ram Ramrakhya et al

04-07-2022

Solving ImageNet: a Unified Scheme for Training any Backbone to Top Results
by Tal Ridnik et al

04-05-2022

MixFormer: Mixing Features across Windows and Dimensions
by Qiang Chen et al

04-07-2022

Unsupervised Image-to-Image Translation with Generative Prior
by Shuai Yang et al

04-05-2022

Texturify: Generating Textures on 3D Shape Surfaces
by Yawar Siddiqui et al

04-06-2022

SqueezeNeRF: Further factorized FastNeRF for memory-efficient inference
by Krishna Wadhwani et al

04-07-2022

Unsupervised Prompt Learning for Vision-Language Models
by Tony Huang et al

04-05-2022

Text2LIVE: Text-Driven Layered Image and Video Editing
by Omer Bar-Tal et al

04-07-2022

What You See is What You Get: Distributional Generalization for Algorithm Design in Deep Learning
by Bogdan Kulynych et al

04-06-2022

ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound
by Yan-Bo Lin et al

04-05-2022

A Generative Deep Learning Approach to Stochastic Downscaling of Precipitation Forecasts
by Lucy Harris et al

04-07-2022

Visualizing Deep Neural Networks with Topographic Activation Maps
by Andreas Krug et al

04-05-2022

Predicting and Explaining Mobile UI Tappability with Vision Modeling and Saliency Analysis
by Eldon Schoop et al

04-06-2022

Influence of Color Spaces for Deep Learning Image Colorization
by Coloma Ballester et al

04-06-2022

Towards An End-to-End Framework for Flow-Guided Video Inpainting
by Zhen Li et al

04-06-2022

Temporal Alignment Networks for Long-term Video
by Tengda Han et al

04-07-2022

Learning to Compose Soft Prompts for Compositional Zero-Shot Learning
by Nihal V. Nayak et al

04-06-2022

Fusing finetuned models for better pretraining
by Leshem Choshen et al

04-07-2022

Pin the Memory: Learning to Generalize Semantic Segmentation
by Jin Kim et al

04-07-2022

Pre-train, Self-train, Distill: A simple recipe for Supersizing 3D Reconstruction
by Kalyan Vasudev Alwala et al

04-07-2022

DaViT: Dual Attention Vision Transformers
by Mingyu Ding et al

04-05-2022

A deep learning framework for the detection and quantification of drusen and reticular pseudodrusen on optical coherence tomography
by Roy Schwartz et al

04-05-2022

Rethinking Visual Geo-localization for Large-Scale Applications
by Gabriele Berton et al

04-05-2022

The Probabilistic Normal Epipolar Constraint for Frame-To-Frame Rotation Optimization under Uncertain Feature Positions
by Dominik Muhle et al

04-05-2022

CLEVR-X: A Visual Reasoning Dataset for Natural Language Explanations
by Leonard Salewski et al

04-07-2022

Deep Visual Geo-localization Benchmark
by Gabriele Berton et al

04-05-2022

Leveraging Equivariant Features for Absolute Pose Regression
by Mohamed Adel Musallam et al

04-06-2022

Statistical Model Criticism of Variational Auto-Encoders
by Claartje Barkhof et al

04-05-2022

VocaLiST: An Audio-Visual Synchronisation Model for Lips and Voices
by Venkatesh S. Kadandale et al

04-06-2022

Emotional Speech Recognition with Pre-trained Deep Visual Models
by Waleed Ragheb et al

04-05-2022

Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation
by Wangbo Zhao et al

04-05-2022

SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering
by Vipul Gupta et al

04-08-2022

Dancing under the stars: video denoising in starlight
by Kristina Monakhova et al

04-05-2022

Multi-View Transformer for 3D Visual Grounding
by Shijia Huang et al

04-08-2022

Probabilistic Representations for Video Contrastive Learning
by Jungin Park et al

04-06-2022

Adversarial Machine Learning Attacks Against Video Anomaly Detection Systems
by Furkan Mumcu et al

04-05-2022

SNUG: Self-Supervised Neural Dynamic Garments
by Igor Santesteban et al

04-06-2022

Detecting key Soccer match events to create highlights using Computer Vision
by Narayana Darapaneni et al

04-07-2022

Optimizing the Long-Term Behaviour of Deep Reinforcement Learning for Pushing and Grasping
by Rodrigo Chau

04-07-2022

Class-Incremental Learning with Strong Pre-trained Models
by Tz-Ying Wu et al

04-06-2022

Universal Representations: A Unified Look at Multiple Task and Domain Learning
by Wei-Hong Li et al

04-06-2022

Simple and Effective Synthesis of Indoor 3D Scenes
by Jing Yu Koh et al

04-06-2022

Open-Source Tools for Behavioral Video Analysis: Setup, Methods, and Development
by Kevin Luxem et al

04-07-2022

MDA GAN: Adversarial-Learning-based 3-D Seismic Data Interpolation and Reconstruction for Complex Missing
by Yimin Dou et al

04-05-2022

Bimodal Distributed Binarized Neural Networks
by Tal Rozen et al

04-05-2022

SE(3)-Equivariant Attention Networks for Shape Reconstruction in Function Space
by Evangelos Chatzipantazis et al

04-07-2022

Equivariance Discovery by Learned Parameter-Sharing
by Raymond A. Yeh et al

04-05-2022

Learning Pneumatic Non-Prehensile Manipulation with a Mobile Blower
by Jimmy Wu et al

04-06-2022

BFRnet: A deep learning-based MR background field removal method for QSM of the brain containing significant pathological susceptibility sources
by Xuanyu Zhu et al

04-06-2022

Analysis of Different Losses for Deep Learning Image Colorization
by Coloma Ballester et al

04-06-2022

Exploring Cross-Domain Pretrained Model for Hyperspectral Image Classification
by Hyungtae Lee et al

04-05-2022

A lightweight and accurate YOLO-like network for small target detection in Aerial Imagery
by Alessandro Betti

04-05-2022

Learning Generalizable Dexterous Manipulation from Human Grasp Affordance
by Yueh-Hua Wu et al

04-05-2022

Action-Conditioned Contrastive Policy Pretraining
by Qihang Zhang et al

04-05-2022

Split Hierarchical Variational Compression
by Tom Ryder et al

04-07-2022

Many-to-many Splatting for Efficient Video Frame Interpolation
by Ping Hu et al

04-07-2022

A Learnable Variational Model for Joint Multimodal MRI Reconstruction and Synthesis
by Wanyu Bian et al

04-07-2022

Multi-Sample ζ-mixup: Richer, More Realistic Synthetic Samples from a p-Series Interpolant
by Kumar Abhishek et al

04-06-2022

Expression-preserving face frontalization improves visually assisted speech processing
by Zhiqi Kang et al

04-05-2022

Complex-Valued Autoencoders for Object Discovery
by Sindy Löwe et al

04-07-2022

HunYuan_tvr for Text-Video Retrivial
by Shaobo Min et al

04-05-2022

Lost in Latent Space: Disentangled Models and the Challenge of Combinatorial Generalisation
by Milton L. Montero et al

04-06-2022

DiffCloud: Real-to-Sim from Point Clouds with Differentiable Simulation and Rendering of Deformable Objects
by Priya Sundaresan et al

04-07-2022

FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment
by Jinglin Xu et al

04-07-2022

MHMS: Multimodal Hierarchical Multimedia Summarization
by Jielin Qiu et al

04-07-2022

Detection of Distracted Driver using Convolution Neural Network
by Narayana Darapaneni et al

04-05-2022

P3Depth: Monocular Depth Estimation with a Piecewise Planarity Prior
by Vaishakh Patil et al

04-05-2022

Explainable Deep Learning Algorithm for Distinguishing Incomplete Kawasaki Disease by Coronary Artery Lesions on Echocardiographic Imaging
by Haeyun Lee et al

04-07-2022

HIT-UAV: A High-altitude Infrared Thermal Dataset for Unmanned Aerial Vehicles
by Jiashun Suo et al

04-05-2022

Spread Spurious Attribute: Improving Worst-group Accuracy with Spurious Attribute Estimation
by Junhyun Nam et al

04-07-2022

Surface Vision Transformers: Flexible Attention-Based Modelling of Biomedical Surfaces
by Simon Dahan et al

04-07-2022

Implementing a Real-Time, YOLOv5 based Social Distancing Measuring System for Covid-19
by Narayana Darapaneni et al

04-07-2022

Event Transformer. A sparse-aware solution for efficient event data processing
by Alberto Sabater et al

04-07-2022

A Pathology-Based Machine Learning Method to Assist in Epithelial Dysplasia Diagnosis
by Karoline da Rocha et al

04-07-2022

Incremental Prototype Prompt-tuning with Pre-trained Representation for Class Incremental Learning
by Jieren Deng et al

04-05-2022

LatentGAN Autoencoder: Learning Disentangled Latent Distribution
by Sanket Kalwar et al

04-07-2022

Efficient Multiscale Object-based Superpixel Framework
by Felipe Belém et al

04-05-2022

Pyramid Frequency Network with Spatial Attention Residual Refinement Module for Monocular Depth Estimation
by Zhengyang Lu et al

04-07-2022

A Comprehensive Review of Sign Language Recognition: Different Types, Modalities, and Datasets
by Dr. M. Madhiarasan et al

04-07-2022

Multi-Task Distributed Learning using Vision Transformer with Random Patch Permutation
by Sangjoon Park et al

04-07-2022

Coarse-to-Fine Feature Mining for Video Semantic Segmentation
by Guolei Sun et al

04-06-2022

Late multimodal fusion for image and audio music transcription
by María Alfaro-Contreras et al

04-05-2022

SALISA: Saliency-based Input Sampling for Efficient Video Object Detection
by Babak Ehteshami Bejnordi et al

04-07-2022

Predicting Solar Flares Using CNN and LSTM on Two Solar Cycles of Active Region Data
by Zeyu Sun et al

04-07-2022

ProbNVS: Fast Novel View Synthesis with Learned Probability-Guided Sampling
by Yuemei Zhou et al

04-06-2022

Super-resolved multi-temporal segmentation with deep permutation-invariant networks
by Diego Valsesia et al

04-06-2022

Rolling Colors: Adversarial Laser Exploits against Traffic Light Recognition
by Chen Yan et al

04-07-2022

Task-Aware Active Learning for Endoscopic Image Analysis
by Shrawan Kumar Thapa et al

04-06-2022

Georeferencing of Photovoltaic Modules from Aerial Infrared Videos using Structure-from-Motion
by Lukas Bommes et al

04-07-2022

Adapting CLIP For Phrase Localization Without Further Training
by Jiahao Li et al

04-08-2022

From 2D Images to 3D Model: Weakly Supervised Multi-View Face Reconstruction with Deep Fusion
by Weiguang Zhao et al

04-06-2022

Face recognition in a transformed domain
by Marcos Faundez-Zanuy

04-07-2022

Total Variation Optimization Layers for Computer Vision
by Raymond A. Yeh et al

04-06-2022

Cloning Outfits from Real-World Images to 3D Characters for Generalizable Person Re-Identification
by Yanan Wang et al

04-05-2022

Learning Video Salient Object Detection Progressively from Unlabeled Videos
by Binwei Xu et al

04-08-2022

Controllable Missingness from Uncontrollable Missingness: Joint Learning Measurement Policy and Imputation
by Seongwook Yoon et al

04-06-2022

CAIPI in Practice: Towards Explainable Interactive Medical Image Classification
by Emanuel Slany et al

04-06-2022

An Empirical Study of Remote Sensing Pretraining
by Di Wang et al

04-05-2022

A Transformer-Based Contrastive Learning Approach for Few-Shot Sign Language Recognition
by Silvan Ferreira et al

04-06-2022

LilNetX: Lightweight Networks with EXtreme Model Compression and Structured Sparsification
by Sharath Girish et al

04-08-2022

SuperNet in Neural Architecture Search: A Taxonomic Survey
by Stephen Cha et al

04-07-2022

Learning Online Multi-Sensor Depth Fusion
by Erik Sandström et al

04-07-2022

ECCV Caption: Correcting False Negatives by Collecting Machine-and-Human-verified Image-Caption Associations for MS-COCO
by Sanghyuk Chun et al

04-07-2022

SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation
by Yi Wei et al

04-07-2022

Pan-cancer computational histopathology reveals tumor mutational burden status through weakly-supervised deep learning
by Siteng Chen et al

04-05-2022

Real-time Hyperspectral Imaging in Hardware via Trained Metasurface Encoders
by Maksim Makarenko et al

04-05-2022

Adversarial Robustness through the Lens of Convolutional Filters
by Paul Gavrikov et al

04-05-2022

Joint Learning of Feature Extraction and Cost Aggregation for Semantic Correspondence
by Jiwon Kim et al

04-06-2022

The Swiss Army Knife for Image-to-Image Translation: Multi-Task Diffusion Models
by Julia Wolleb et al

04-05-2022

Dual-AI: Dual-path Actor Interaction Learning for Group Activity Recognition
by Mingfei Han et al

04-07-2022

Zero-Shot Category-Level Object Pose Estimation
by Walter Goodwin et al

04-05-2022

Federated Cross Learning for Medical Image Segmentation
by Xuanang Xu et al

04-05-2022

Overcoming Catastrophic Forgetting in Incremental Object Detection via Elastic Response Distillation
by Tao Feng et al

04-06-2022

Audio-Visual Person-of-Interest DeepFake Detection
by Davide Cozzolino et al

04-06-2022

Fine-Grained Predicates Learning for Scene Graph Generation
by Xinyu Lyu et al

04-06-2022

CCAT-NET: A Novel Transformer Based Semi-supervised Framework for Covid-19 Lung Lesion Segmentation
by Mingyang Liu et al

04-05-2022

Audio-visual multi-channel speech separation, dereverberation and recognition
by Guinan Li et al

04-05-2022

Vision Transformer Equipped with Neural Resizer on Facial Expression Recognition Task
by Hyeonbin Hwang et al

04-07-2022

Pneumonia Detection in Chest X-Rays using Neural Networks
by Narayana Darapaneni et al

04-05-2022

Detecting Cloud-Based Phishing Attacks by Combining Deep Learning Models
by Medha Atre et al

04-05-2022

Emphasis on the Minimization of False Negatives or False Positives in Binary Classification
by Sanskriti Singh

04-07-2022

Learning to Sieve: Prediction of Grading Curves from Images of Concrete Aggregate
by Max Coenen et al

04-05-2022

RODD: A Self-Supervised Approach for Robust Out-of-Distribution Detection
by Umar Khalid et al

04-05-2022

When Sparsity Meets Dynamic Convolution
by Shwai He et al

04-06-2022

Learning to Generate Realistic Noisy Images via Pixel-level Noise-aware Adversarial Training
by Yuanhao Cai et al

04-05-2022

Arbitrary-Scale Image Synthesis
by Evangelos Ntavelis et al

04-05-2022

RBGNet: Ray-based Grouping for 3D Object Detection
by Haiyang Wang et al

04-05-2022

Leveraging Disentangled Representations to Improve Vision-Based Keystroke Inference Attacks Under Low Data
by John Lim et al

04-07-2022

Sparse Optical Flow-Based Line Feature Tracking
by Qiang Fu et al

04-08-2022

Prediction of COVID-19 using chest X-ray images
by Narayana Darapaneni et al

04-07-2022

Gravitationally Lensed Black Hole Emission Tomography
by Aviad Levis et al

04-07-2022

Using Multiple Self-Supervised Tasks Improves Model Robustness
by Matthew Lawhon et al

04-07-2022

Context-Sensitive Temporal Feature Learning for Gait Recognition
by Xiaohu Huang et al

04-06-2022

EfficientCellSeg: Efficient Volumetric Cell Segmentation Using Context Aware Pseudocoloring
by Royden Wagner et al

04-05-2022

Automatic Image Content Extraction: Operationalizing Machine Learning in Humanistic Photographic Studies of Large Visual Archives
by Anssi Männistö et al

04-06-2022

Aesthetic Text Logo Synthesis via Content-aware Layout Inferring
by Yizhi Wang et al

04-07-2022

Automated Design of Salient Object Detection Algorithms with Brain Programming
by Gustavo Olague et al

04-07-2022

MC-UNet Multi-module Concatenation based on U-shape Network for Retinal Blood Vessels Segmentation
by Ting Zhang et al

04-06-2022

Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network
by Byung-Kwan Lee et al

04-05-2022

Learning to Reduce Information Bottleneck for Object Detection in Aerial Images
by Yuchen Shen et al

04-07-2022

Canonical Mean Filter for Almost Zero-Shot Multi-Task classification
by Yong Li et al

04-08-2022

Deep Learning-Based Intra Mode Derivation for Versatile Video Coding
by Linwei Zhu et al

04-06-2022

End-to-End Instance Edge Detection
by Xueyan Zou et al

04-06-2022

The Pedestrian next to the Lamppost Adaptive Object Graphs for Better Instantaneous Mapping
by Avishkar Saha et al

04-06-2022

Implicit Motion-Compensated Network for Unsupervised Video Object Segmentation
by Lin Xi et al

04-08-2022

SnapMode: An Intelligent and Distributed Large-Scale Fashion Image Retrieval Platform Based On Big Data and Deep Generative Adversarial Network Technologies
by Narges Norouzi et al

04-06-2022

FocalClick: Towards Practical Interactive Image Segmentation
by Xi Chen et al

04-06-2022

ShowFace: Coordinated Face Inpainting with Memory-Disentangled Refinement Networks
by Zhuojie Wu et al

04-05-2022

Birds of A Feather Flock Together: Category-Divergence Guidance for Domain Adaptive Segmentation
by Bo Yuan et al

04-07-2022

PSTR: End-to-End One-Step Person Search With Transformers
by Jiale Cao et al

04-06-2022

SMU-Net: Style matching U-Net for brain tumor segmentation with missing modalities
by Reza Azad et al

04-06-2022

Low-Dose CT Denoising via Sinogram Inner-Structure Transformer
by Liutao Yang et al

04-06-2022

Flexible Sampling for Long-tailed Skin Lesion Classification
by Lie Ju et al

04-05-2022

Gait Recognition in the Wild with Dense 3D Representations and A Benchmark
by Jinkai Zheng et al

04-08-2022

CD²-pFed: Cyclic Distillation-guided Channel Decoupling for Model Personalization in Federated Learning
by Yiqing Shen et al

04-06-2022

Semi-DRDNet Semi-supervised Detail-recovery Image Deraining Network via Unpaired Contrastive Learning
by Yiyang Shen et al

04-06-2022

Video Demoireing with Relation-Based Temporal Consistency
by Peng Dai et al

04-06-2022

Intervertebral Disc Labeling With Learning Shape Information, A Look Once Approach
by Reza Azad et al

04-06-2022

DBF: Dynamic Belief Fusion for Combining Multiple Object Detectors
by Hyungtae Lee et al

04-07-2022

Evaluating Procedures for Establishing Generative Adversarial Network-based Stochastic Image Models in Medical Imaging
by Varun A. Kelkar et al

04-07-2022

L2G: A Simple Local-to-Global Knowledge Transfer Framework for Weakly Supervised Semantic Segmentation
by Peng-Tao Jiang et al

04-07-2022

Convolutional Neural Network for Early Pulmonary Embolism Detection via Computed Tomography Pulmonary Angiography
by Ching-Yuan Yu et al

04-06-2022

An Empirical Study of End-to-End Temporal Action Detection
by Xiaolong Liu et al

04-06-2022

Just-Noticeable-Difference Based Edge Map Quality Measure
by Ijaz Ahmad et al

04-06-2022

S-R2F2U-Net: A single-stage model for teeth segmentation
by Mrinal Kanti Dhar et al

04-07-2022

Semantic Representation and Dependency Learning for Multi-Label Image Recognition
by Tao Pu et al

04-07-2022

Practical Digital Disguises: Leveraging Face Swaps to Protect Patient Privacy
by Ethan Wilson et al

04-05-2022

CHORE: Contact, Human and Object REconstruction from a single RGB image
by Xianghui Xie et al

04-05-2022

Style-Hallucinated Dual Consistency Learning for Domain Generalized Semantic Segmentation
by Yuyang Zhao et al

04-07-2022

Powering Finetuning in Few-shot Learning: Domain-Agnostic Feature Adaptation with Rectified Class Prototypes
by Ran Tao et al

04-06-2022

Retrieval-based Spatially Adaptive Normalization for Semantic Image Synthesis
by Yupeng Shi et al

04-06-2022

Demonstrate Once, Imitate Immediately (DOME): Learning Visual Servoing for One-Shot Imitation Learning
by Eugene Valassakis et al

04-05-2022

Detector-Free Weakly Supervised Group Activity Recognition
by Dongkeun Kim et al

04-06-2022

Domain-Agnostic Prior for Transfer Semantic Segmentation
by Xinyue Huo et al

04-06-2022

SEAL: A Large-scale Video Dataset of Multi-grained Spatio-temporally Action Localization
by Shimin Chen et al

04-06-2022

Contextual Attention Mechanism, SRGAN Based Inpainting System for Eliminating Interruptions from Images
by Narayana Darapaneni et al

04-06-2022

IterVM: Iterative Vision Modeling Module for Scene Text Recognition
by Xiaojie Chu et al

04-08-2022

Reusing the Task-specific Classifier as a Discriminator: Discriminator-free Adversarial Domain Adaptation
by Lin Chen et al

04-05-2022

An efficient real-time target tracking algorithm using adaptive feature fusion
by Yanyan Liu et al

04-08-2022

Multimodal Quasi-AutoRegression: Forecasting the visual popularity of new fashion products
by Stefanos I. Papadopoulos et al

04-06-2022

Banana Sub-Family Classification and Quality Prediction using Computer Vision
by Narayana Darapaneni et al

04-06-2022

Learning to Anticipate Future with Dynamic Context Removal
by Xinyu Xu et al

04-05-2022

Grounding of the Functional Object-Oriented Network in Industrial Tasks
by Rafik Ayari et al

04-06-2022

Faster-TAD: Towards Temporal Action Detection with Proposal Generation and Classification in a Unified Network
by Shimin Chen et al

04-06-2022

LEAD: Self-Supervised Landmark Estimation by Aligning Distributions of Feature Similarity
by Tejan Karmali et al

04-07-2022

Swarm behavior tracking based on a deep vision algorithm
by Meihong Wu et al

04-07-2022

BankNote-Net: Open dataset for assistive universal currency recognition
by Felipe Oviedo et al

04-05-2022

Learning Optimal K-space Acquisition and Reconstruction using Physics-Informed Neural Networks
by Wei Peng et al

04-07-2022

Multi-objective optimization determines when, which and how to fuse deep networks: an application to predict COVID-19 outcomes
by Valerio Guarrasi et al

04-07-2022

Deep Learning for Real Time Satellite Pose Estimation on Low Power Edge TPU
by Alessandro Lotti et al

04-06-2022

BMD: A General Class-balanced Multicentric Dynamic Prototype Strategy for Source-free Domain Adaptation
by Sanqing Qu et al

04-06-2022

Follow My Eye: Using Gaze to Supervise Computer-Aided Diagnosis
by Sheng Wang et al

04-06-2022

Multi-Scale Memory-Based Video Deblurring
by Bo Ji et al

04-06-2022

The Self-Optimal-Transport Feature Transform
by Daniel Shalam et al

04-08-2022

Spatiotemporal Augmentation on Selective Frequencies for Video Representation Learning
by Jinhyung Kim et al

04-08-2022

Multi-scale temporal network for continuous sign language recognition
by Qidan Zhu et al

04-08-2022

General Incremental Learning with Domain-aware Categorical Representations
by Jiangwei Xie et al

04-08-2022

Spatial Transformer Network on Skeleton-based Gait Recognition
by Cun Zhang et al

04-06-2022

Learning from Untrimmed Videos: Self-Supervised Video Representation Learning with Hierarchical Consistency
by Zhiwu Qing et al

04-08-2022

Vision Transformers for Single Image Dehazing
by Yuda Song et al

04-08-2022

Engagement Detection with Multi-Task Training in E-Learning Environments
by Onur Copur et al

04-08-2022

Biometric identification by means of hand geometry and a neural net classifier
by Marcos Faundez-Zanuy et al

04-08-2022

Deep Hyperspectral-Depth Reconstruction Using Single Color-Dot Projection
by Chunyu Li et al

04-05-2022

DT2I: Dense Text-to-Image Generation from Region Descriptions
by Stanislav Frolov et al

04-06-2022

Hierarchical Self-supervised Representation Learning for Movie Understanding
by Fanyi Xiao et al

04-05-2022

Depth-Guided Sparse Structure-from-Motion for Movies and TV Shows
by Sheng Liu et al

04-08-2022

Study of a committee of neural networks for biometric hand-geometry recognition
by Marcos Faundez-Zanuy

04-08-2022

Does Robustness on ImageNet Transfer to Downstream Tasks?
by Yutaro Yamada et al

04-06-2022

UIGR: Unified Interactive Garment Retrieval
by Xiao Han et al

04-05-2022

Hospital-Agnostic Image Representation Learning in Digital Pathology
by Milad Sikaroudi et al

04-05-2022

Zero-shot Blind Image Denoising via Implicit Neural Representations
by Chaewon Kim et al

04-08-2022

A Generic Image Retrieval Method for Date Estimation of Historical Document Collections
by Adrià Molina et al

04-06-2022

Towards Robust Adaptive Object Detection under Noisy Annotations
by Xinyu Liu et al

04-07-2022

TorMentor: Deterministic dynamic-path, data augmentations with fractals
by Anguelos Nicolaou et al

04-05-2022

Real-time Online Multi-Object Tracking in Compressed Domain
by Qiankun Liu et al

04-06-2022

Instance Segmentation of Unlabeled Modalities via Cyclic Segmentation GAN
by Leander Lauenburg et al

04-06-2022

PP-LiteSeg: A Superior Real-Time Semantic Segmentation Model
by Juncai Peng et al

04-06-2022

AutoCOR: Autonomous Condylar Offset Ratio Calculator on TKA-Postoperative Lateral Knee X-ray
by Gulsade Rabia Cakmak et al

04-06-2022

DSGN++: Exploiting Visual-Spatial Relation for Stereo-based 3D Detectors
by Yilun Chen et al

04-08-2022

Efficient tracking of team sport players with few game-specific annotations
by Adrien Maglo et al

04-06-2022

Thermal to Visible Image Synthesis under Atmospheric Turbulence
by Kangfu Mei et al

04-05-2022

Multi-Weight Respecification of Scan-specific Learning for Parallel Imaging
by Hui Tao et al

04-05-2022

Training-Free Robust Multimodal Learning via Sample-Wise Jacobian Regularization
by Zhengqi Gao et al

04-08-2022

Optical tracking in team sports
by Pegah Rahimian et al

04-05-2022

Semi-supervised Semantic Segmentation with Error Localization Network
by Donghyeon Kwon et al

04-06-2022

OSCARS: An Outlier-Sensitive Content-Based Radiography Retrieval System
by Xiaoyuan Guo et al

04-08-2022

Dynamic super-resolution in particle tracking problems
by Ping Liu et al

04-08-2022

Investigating Spherical Epipolar Rectification for Multi-View Stereo 3D Reconstruction
by Mostafa Elhashash et al

04-07-2022

DAD-3DHeads: A Large-scale Dense, Accurate and Diverse Dataset for 3D Head Alignment from a Single Image
by Tetiana Martyniuk et al

04-06-2022

Mitosis domain generalization in histopathology images -- The MIDOG challenge
by Marc Aubreville et al

04-06-2022

Drivers attention detection: a systematic literature review
by Luiz G. Véras et al

04-08-2022

Identifying Ambiguous Similarity Conditions via Semantic Matching
by Han-Jia Ye et al

04-07-2022

Adaptive-Gravity: A Defense Against Adversarial Samples
by Ali Mirzaeian et al

04-08-2022

Invariant Descriptors for Intrinsic Reflectance Optimization
by Anil S. Baslamisli et al

04-08-2022

Points to Patches: Enabling the Use of Self-Attention for 3D Shape Recognition
by Axel Berg et al

04-06-2022

Sampling-based Fast Gradient Rescaling Method for Highly Transferable Adversarial Attacks
by Xu Han et al

04-08-2022

Sat2lod2: A Software For Automated Lod-2 Modeling From Satellite-Derived Orthophoto And Digital Surface Model
by Shengxi Gui et al

04-08-2022

Constrained Bundle Adjustment for Structure From Motion Using Uncalibrated Multi-Camera Systems
by Debao Huang et al

04-08-2022

Visible-Thermal UAV Tracking: A Large-Scale Benchmark and New Baseline
by Pengyu Zhang et al

04-08-2022

Particle Videos Revisited: Tracking Through Occlusions Using Point Trajectories
by Adam W. Harley et al

04-08-2022

A Video Anomaly Detection Framework based on Appearance-Motion Semantics Representation Consistency
by Xiangyu Huang et al

04-07-2022

Transfer Attacks Revisited: A Large-Scale Empirical Study in Real Computer Vision Settings
by Yuhao Mao et al

04-08-2022

Underwater Image Enhancement Using Pre-trained Transformer
by Abderrahmene Boudiaf et al

04-05-2022

A Dempster-Shafer approach to trustworthy AI with application to fetal brain MRI segmentation
by Lucas Fidon et al

04-08-2022

A Novel Intrinsic Image Decomposition Method to Recover Albedo for Aerial Images in Photogrammetry Processing
by Shuang Song et al

04-08-2022

POSTER: A Pyramid Cross-Fusion Transformer Network for Facial Expression Recognition
by Ce Zheng et al

04-08-2022

On Distinctive Image Captioning via Comparing and Reweighting
by Jiuniu Wang et al

04-07-2022

TemporalUV: Capturing Loose Clothing with Temporally Coherent UV Coordinates
by You Xie et al

04-06-2022

PlutoNet: An Efficient Polyp Segmentation Network
by Tugberk Erol et al

04-07-2022

Identification of Autism spectrum disorder based on a novel feature selection method and Variational Autoencoder
by Fangyu Zhang et al

 
Craig Smith