2021.9.13 Vision papers

 

09-09-2021

IICNet: A Generic Framework for Reversible Image Conversion
by Ka Leong Cheng et al

09-07-2021

PP-OCRv2: Bag of Tricks for Ultra Lightweight OCR System
by Yuning Du et al

09-07-2021

Sensor-Augmented Egocentric-Video Captioning with Dynamic Modal Attention
by Katsuyuki Nakamura et al

09-08-2021

OSSR-PID: One-Shot Symbol Recognition in P&ID Sheets using Path Sampling and GCN
by Shubham Paliwal et al

09-09-2021

TxT: Crossmodal End-to-End Learning with Transformers
by Jan-Martin O. Steitz et al

09-09-2021

Talk-to-Edit: Fine-Grained Facial Editing via Dialog
by Yuming Jiang et al

09-09-2021

UCTransNet: Rethinking the Skip Connections in U-Net from a Channel-wise Perspective with Transformer
by Haonan Wang et al

09-07-2021

Perceptual Learned Video Compression with Recurrent Conditional GAN
by Ren Yang et al

09-07-2021

Brand Label Albedo Extraction of eCommerce Products using Generative Adversarial Network
by Suman Sapkota et al

09-08-2021

Toward Real-World Super-Resolution via Adaptive Downsampling Models
by Sanghyun Son et al

09-08-2021

Unfolding Taylors Approximations for Image Restoration
by Man Zhou et al

09-07-2021

Multi-Branch Deep Radial Basis Function Networks for Facial Emotion Recognition
by Fernanda Hernández-Luquin et al

09-07-2021

ICCAD Special Session Paper: Quantum-Classical Hybrid Machine Learning for Image Classification
by Mahabubul Alam et al

09-07-2021

nnFormer: Interleaved Transformer for Volumetric Segmentation
by Hong-Yu Zhou et al

09-08-2021

Which and Where to Focus: A Simple yet Accurate Framework for Arbitrary-Shaped Nearby Text Detection in Scene Images
by Youhui Guo et al

09-09-2021

Per Garment Capture and Synthesis for Real-time Virtual Try-on
by Toby Chong et al

09-07-2021

Learning Fast Sample Re-weighting Without Reward Data
by Zizhao Zhang et al

09-09-2021

Tiny CNN for feature point description for document analysis: approach and dataset
by A. Sheshkus et al

09-09-2021

Multilingual Audio-Visual Smartphone Dataset And Evaluation
by Hareesh Mandalapu et al

09-07-2021

Self-supervised Tumor Segmentation through Layer Decomposition
by Xiaoman Zhang et al

09-08-2021

Egocentric View Hand Action Recognition by Leveraging Hand Surface and Hand Grasp Type
by Sangpil Kim et al

09-08-2021

FIDNet: LiDAR Point Cloud Semantic Segmentation with Fully Interpolation Decoding
by Yiming Zhao et al

09-08-2021

Temporal RoI Align for Video Object Recognition
by Tao Gong et al

09-08-2021

FaceCook: Face Generation Based on Linear Scaling Factors
by Tianren Wang et al

09-07-2021

Rethinking Common Assumptions to Mitigate Racial Bias in Face Recognition Datasets
by Matthew Gwilliam et al

09-10-2021

Residual 3D Scene Flow Learning with Context-Aware Feature Extraction
by Guangming Wang et al

09-07-2021

Unpaired Adversarial Learning for Single Image Deraining with Rain-Space Contrastive Constraints
by Xiang Chen et al

09-07-2021

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting
by Rui Liu et al

09-07-2021

Smart Traffic Monitoring System using Computer Vision and Edge Computing
by Guanxiong Liu et al

09-09-2021

PhysGNN: A Physics-Driven Graph Neural Network Based Model for Predicting Soft Tissue Deformation in Image-Guided Neurosurgery
by Yasmin Salehi et al

09-07-2021

Fishr: Invariant Gradient Variances for Out-of-distribution Generalization
by Alexandre Rame et al

09-09-2021

ErfAct: Non-monotonic smooth trainable Activation Functions
by Koushik Biswas et al

09-07-2021

Evaluation of an Audio-Video Multimodal Deepfake Dataset using Unimodal and Multimodal Detectors
by Hasam Khalid et al

09-07-2021

Grassmannian Graph-attentional Landmark Selection for Domain Adaptation
by Bin Sun et al

09-07-2021

Rendezvous: Attention Mechanisms for the Recognition of Surgical Action Triplets in Endoscopic Videos
by Chinedu Innocent Nwoye et al

09-08-2021

Time Alignment using Lip Images for Frame-based Electrolaryngeal Voice Conversion
by Yi-Syuan Liou et al

09-08-2021

Adaptive Few-Shot Learning PoC Ultrasound COVID-19 Diagnostic System
by Michael Karnes et al

09-09-2021

EVOQUER: Enhancing Temporal Grounding with Video-Pivoted BackQuery Generation
by Yanjun Gao et al

09-07-2021

Melatect: A Machine Learning Model Approach For Identifying Malignant Melanoma in Skin Growths
by Vidushi Meel et al

09-08-2021

Pose-guided Inter- and Intra-part Relational Transformer for Occluded Person Re-Identification
by Zhongxing Ma et al

09-08-2021

Shuffled Patch-Wise Supervision for Presentation Attack Detection
by Alperen Kantarcı et al

09-09-2021

NEAT: Neural Attention Fields for End-to-End Autonomous Driving
by Kashyap Chitta et al

09-10-2021

Automatic Displacement and Vibration Measurement in Laboratory Experiments with A Deep Learning Method
by Yongsheng Bai et al

09-08-2021

Scaled ReLU Matters for Training Vision Transformers
by Pichao Wang et al

09-08-2021

fastMRI+: Clinical Pathology Annotations for Knee and Brain Fully Sampled Multi-Coil MRI Data
by Ruiyang Zhao et al

09-09-2021

Fair Conformal Predictors for Applications in Medical Imaging
by Charles Lu et al

09-10-2021

PIP: Physical Interaction Prediction via Mental Imagery with Span Selection
by Jiafei Duan et al

09-08-2021

Improving Building Segmentation for Off-Nadir Satellite Imagery
by Hanxiang Hao et al

09-10-2021

EfficientCLIP: Efficient Cross-Modal Pre-training by Ensemble Confident Learning and Language Modeling
by Jue Wang et al

09-10-2021

Face-NMS: A Core-set Selection Approach for Efficient Face Recognition
by Yunze Chen et al

09-07-2021

CovarianceNet: Conditional Generative Model for Correct Covariance Prediction in Human Motion Prediction
by Aleksey Postnikov et al

09-09-2021

HSMD: An object motion detection algorithm using a Hybrid Spiking Neural Network Architecture
by Pedro Machado et al

09-07-2021

Learning to Combine the Modalities of Language and Video for Temporal Moment Localization
by Jungkyoo Shin et al

09-10-2021

TADA: Taxonomy Adaptive Domain Adaptation
by Rui Gong et al

09-10-2021

View Blind-spot as Inpainting: Self-Supervised Denoising with Mask Guided Residual Convolution
by Yuhongze Zhou et al

09-10-2021

Mesh convolutional neural networks for wall shear stress estimation in 3D artery models
by Julian Suk et al

09-07-2021

Resolving gas bubbles ascending in liquid metal from low-SNR neutron radiography images
by Mihails Birjukovs et al

09-09-2021

Automatic Portrait Video Matting via Context Motion Network
by Qiqi Hou et al

09-10-2021

Line as a Visual Sentence: Context-aware Line Descriptor for Visual Localization
by Sungho Yoon et al

09-08-2021

Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking
by Whye Kit Fong et al

09-09-2021

Dynamic Modeling of Hand-Object Interactions via Tactile Sensing
by Qiang Zhang et al

09-09-2021

Preservational Learning Improves Self-supervised Medical Image Models by Reconstructing Diverse Contexts
by Hong-Yu Zhou et al

09-09-2021

Object recognition for robotics from tactile time series data utilising different neural network architectures
by Wolfgang Bottcher et al

09-09-2021

Taming Self-Supervised Learning for Presentation Attack Detection: In-Image De-Folding and Out-of-Image De-Mixing
by Haozhe Liu et al

09-08-2021

Axial multi-layer perceptron architecture for automatic segmentation of choroid plexus in multiple sclerosis
by Marius Schmidt-Mengin et al

09-07-2021

Improving Phenotype Prediction using Long-Range Spatio-Temporal Dynamics of Functional Connectivity
by Simon Dahan et al

09-08-2021

Identification of Social-Media Platform of Videos through the Use of Shared Features
by Luca Maiano et al

09-10-2021

An Empirical Study of GPT-3 for Few-Shot Knowledge-Based VQA
by Zhengyuan Yang et al

09-09-2021

ConvMLP: Hierarchical Convolutional MLPs for Vision
by Jiachen Li et al

09-08-2021

Digitize-PID: Automatic Digitization of Piping and Instrumentation Diagrams
by Shubham Paliwal et al

09-09-2021

Towards Transferable Adversarial Attacks on Vision Transformers
by Zhipeng Wei et al

09-10-2021

Detection of GAN-synthesized street videos
by Omran Alamayreh et al

09-09-2021

Single Image 3D Object Estimation with Primitive Graph Networks
by Qian He et al

09-08-2021

Improving Deep Metric Learning by Divide and Conquer
by Artsiom Sanakoyeu et al

09-07-2021

Simple Video Generation using Neural ODEs
by David Kanaa et al

09-07-2021

Self-Supervised Representation Learning using Visual Field Expansion on Digital Pathology
by Joseph Boyd et al

09-07-2021

Certifiable Outlier-Robust Geometric Perception: Exact Semidefinite Relaxations and Scalable Global Optimization
by Heng Yang et al

09-09-2021

IFBiD: Inference-Free Bias Detection
by Ignacio Serna et al

09-10-2021

Saliency Guided Experience Packing for Replay in Continual Learning
by Gobinda Saha et al

09-09-2021

Neural-IMLS: Learning Implicit Moving Least-Squares for Surface Reconstruction from Unoriented Point clouds
by Zixiong Wang et al

09-09-2021

Is Attention Better Than Matrix Decomposition?
by Zhengyang Geng et al

09-08-2021

Modified Supervised Contrastive Learning for Detecting Anomalous Driving Behaviours
by Shehroz S. Khan et al

09-08-2021

Deriving Explanation of Deep Visual Saliency Models
by Sai Phani Kumar Malladi et al

09-10-2021

Emerging AI Security Threats for Autonomous Cars -- Case Studies
by Shanthi Lekkala et al

09-07-2021

DeepFakes: Detecting Forged and Synthetic Media Content Using Machine Learning
by Sm Zobaed et al

09-07-2021

GCsT: Graph Convolutional Skeleton Transformer for Action Recognition
by Ruwen Bai et al

09-07-2021

Journalistic Guidelines Aware News Image Captioning
by Xuewen Yang et al

09-07-2021

Capturing the objects of vision with neural networks
by Benjamin Peters et al

09-08-2021

Learning Local-Global Contextual Adaptation for Fully End-to-End Bottom-Up Human Pose Estimation
by Nan Xue et al

09-10-2021

ReconfigISP: Reconfigurable Camera Image Processing Pipeline
by Ke Yu et al

09-09-2021

Copy-Move Image Forgery Detection Based on Evolving Circular Domains Coverage
by Shilin Lu et al

09-08-2021

Panoptic SegFormer
by Zhiqi Li et al

09-08-2021

Multi-Tensor Network Representation for High-Order Tensor Completion
by Chang Nie et al

09-08-2021

Disentangling Alzheimers disease neurodegeneration from typical brain aging using machine learning
by Gyujoon Hwang et al

09-08-2021

LiDARTouch: Monocular metric depth estimation with a few-beam LiDAR
by Florent Bartoccioni et al

09-10-2021

Temporally Coherent Person Matting Trained on Fake-Motion Dataset
by Ivan Molodetskikh et al

09-08-2021

SSEGEP: Small SEGment Emphasized Performance evaluation metric for medical image segmentation
by Ammu R et al

09-07-2021

RoadAtlas: Intelligent Platform for Automated Road Defect Detection and Asset Management
by Zhuoxiao Chen et al

09-09-2021

ACFNet: Adaptively-Cooperative Fusion Network for RGB-D Salient Object Detection
by Jinchao Zhu

09-07-2021

Fair Comparison: Quantifying Variance in Resultsfor Fine-grained Visual Categorization
by Matthew Gwilliam et al

09-09-2021

Vision-and-Language or Vision-for-Language? On Cross-Modal Influence in Multimodal Transformers
by Stella Frank et al

09-08-2021

Unsupervised clothing change adaptive person ReID
by Ziyue Zhang et al

09-09-2021

PIMNet: A Parallel, Iterative and Mimicking Network for Scene Text Recognition
by Zhi Qiao et al

09-07-2021

Efficient ADMM-based Algorithms for Convolutional Sparse Coding
by Farshad G. Veshki et al

09-07-2021

Learning to Discriminate Information for Online Action Detection: Analysis and Application
by Sumin Lee et al

09-08-2021

RGB-D Salient Object Detection with Ubiquitous Target Awareness
by Yifan Zhao et al

09-08-2021

Mask is All You Need: Rethinking Mask R-CNN for Dense and Arbitrary-Shaped Scene Text Detection
by Xugong Qin et al

09-07-2021

Master Face Attacks on Face Recognition Systems
by Huy H. Nguyen et al

09-09-2021

CrowdDriven: A New Challenging Dataset for Outdoor Visual Localization
by Ara Jafarzadeh et al

09-08-2021

SORNet: Spatial Object-Centric Representations for Sequential Manipulation
by Wentao Yuan et al

09-09-2021

Reconstructing and grounding narrated instructional videos in 3D
by Dimitri Zhukov et al

09-09-2021

Application of the Singular Spectrum Analysis on electroluminescence images of thin-film photovoltaic modules
by Evgenii Sovetkin et al

09-09-2021

ACP++: Action Co-occurrence Priors for Human-Object Interaction Detection
by Dong-Jin Kim et al

09-09-2021

Towards Robust Cross-domain Image Understanding with Unsupervised Noise Removal
by Lei Zhu et al

09-08-2021

Energy-Efficient Mobile Robot Control via Run-time Monitoring of Environmental Complexity and Computing Workload
by Sherif A. S. Mohamed et al

09-07-2021

YouRefIt: Embodied Reference Understanding with Language and Gesture
by Yixin Chen et al

09-10-2021

Unsupervised Change Detection in Hyperspectral Images using Feature Fusion Deep Convolutional Autoencoders
by Debasrita Chakraborty et al

09-08-2021

On Recognizing Occluded Faces in the Wild
by Mustafa Ekrem Erakın et al

09-10-2021

Temporal Pyramid Transformer with Multimodal Interaction for Video Question Answering
by Min Peng et al

09-08-2021

Automated LoD-2 Model Reconstruction from Very-HighResolution Satellite-derived Digital Surface Model and Orthophoto
by Shengxi Gui et al

09-09-2021

Leveraging Local Domains for Image-to-Image Translation
by Anthony Dell'Eva et al

09-07-2021

Fine-grained Hand Gesture Recognition in Multi-viewpoint Hand Hygiene
by Huy Q. Vo et al

09-09-2021

Improving Video-Text Retrieval by Multi-Stream Corpus Alignment and Dual Softmax Loss
by Xing Cheng et al

09-09-2021

Fine-grained Data Distribution Alignment for Post-Training Quantization
by Yunshan Zhong et al

09-09-2021

Towards Fully Automated Segmentation of Rat Cardiac MRI by Leveraging Deep Learning Frameworks
by Daniel Fernandez-Llaneza et al

09-07-2021

GTT-Net: Learned Generalized Trajectory Triangulation
by Xiangyu Xu et al

09-10-2021

Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding
by Zhenzhi Wang et al

09-10-2021

Spatio-Temporal Recurrent Networks for Event-Based Optical Flow Estimation
by Ziluo Ding et al

09-09-2021

Efficiently Identifying Task Groupings for Multi-Task Learning
by Christopher Fifty et al

09-10-2021

Panoptic Narrative Grounding
by C. González et al

09-07-2021

Support Vector Machine for Handwritten Character Recognition
by Jomy John

09-08-2021

Cross-Site Severity Assessment of COVID-19 from CT Images via Domain Adaptation
by Geng-Xin Xu et al

09-09-2021

Learning Cross-Scale Visual Representations for Real-Time Image Geo-Localization
by Tianyi Zhang et al

09-07-2021

Knowledge Distillation Using Hierarchical Self-Supervision Augmented Distribution
by Chuanguang Yang et al

09-09-2021

Continuous Event-Line Constraint for Closed-Form Velocity Initialization
by Peng Xin et al

09-09-2021

Deep Hough Voting for Robust Global Registration
by Junha Lee et al

09-09-2021

S3G-ARM: Highly Compressive Visual Self-localization from Sequential Semantic Scene Graph Using Absolute and Relative Measurements
by Mitsuki Yoshida et al

09-09-2021

Self Supervision to Distillation for Long-Tailed Visual Recognition
by Tianhao Li et al

09-09-2021

M5Product: A Multi-modal Pretraining Benchmark for E-commercial Product Downstream Tasks
by Xiao Dong et al

09-08-2021

Tactile Image-to-Image Disentanglement of Contact Geometry from Motion-Induced Shear
by Anupam K. Gupta et al

09-08-2021

Level Set Binocular Stereo with Occlusions
by Jialiang Wang et al

09-08-2021

Recalibrating the KITTI Dataset Camera Setup for Improved Odometry Accuracy
by Igor Cvišić et al

09-08-2021

Elastic Significant Bit Quantization and Acceleration for Deep Neural Networks
by Cheng Gong et al

09-08-2021

Matching in the Dark: A Dataset for Matching Image Pairs of Low-light Scenes
by W. Song et al

09-10-2021

LibFewShot: A Comprehensive Library for Few-shot Learning
by Wenbin Li et al

09-07-2021

MRI Reconstruction Using Deep Energy-Based Model
by Yu Guan et al

09-07-2021

FDA: Feature Decomposition and Aggregation for Robust Airway Segmentation
by Minghui Zhang et al

09-09-2021

Energy Attack: On Transferring Adversarial Examples
by Ruoxi Shi et al

 
Craig Smith