
Query:

Scholar name: Meng Deyu

RCDNet: An Interpretable Rain Convolutional Dictionary Network for Single Image Deraining SCIE Scopus
Journal Article | 2023 | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS
SCOPUS Cited Count: 7

Abstract :

As a common weather phenomenon, rain streaks adversely degrade image quality and tend to negatively affect the performance of outdoor computer vision systems. Hence, removing rain from an image has become an important issue in the field. To handle such an ill-posed single image deraining task, in this article, we specifically build a novel deep architecture, called the rain convolutional dictionary network (RCDNet), which embeds the intrinsic priors of rain streaks and has clear interpretability. Specifically, we first establish a rain convolutional dictionary (RCD) model for representing rain streaks and utilize the proximal gradient descent technique to design an iterative algorithm containing only simple operators for solving the model. By unfolding it, we then build the RCDNet, in which every network module has clear physical meaning and corresponds to an operation of the algorithm. This good interpretability greatly facilitates visualization and analysis of what happens inside the network and why it works well at inference time. Moreover, taking into account the domain gap issue in real scenarios, we further design a novel dynamic RCDNet, where the rain kernels can be dynamically inferred corresponding to input rainy images and then help shrink the space for rain layer estimation with few rain maps, so as to ensure fine generalization performance when rain types are inconsistent between training and testing data. By end-to-end training of such an interpretable network, all involved rain kernels and proximal operators can be automatically extracted, faithfully characterizing the features of both the rain and clean background layers and thus naturally leading to better deraining performance. Comprehensive experiments on a series of representative synthetic and real datasets substantiate the superiority of our method, especially its good generality to diverse testing scenarios and the interpretability of all its modules, compared with state-of-the-art single image derainers both visually and quantitatively.
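The abstract's core recipe, writing the rain layer as a dictionary model, solving it by proximal gradient descent, and unfolding the iterations into network stages, can be illustrated with a much-simplified sketch. This is not the authors' RCDNet: it uses a plain matrix dictionary `D` instead of convolutional rain kernels, and a fixed soft-threshold instead of learned proximal modules, but each loop pass plays the role of one unfolded stage.

```python
import numpy as np

def soft_threshold(x, tau):
    # proximal operator of tau * ||.||_1 (the step a learned prox module would replace)
    return np.sign(x) * np.maximum(np.abs(x) - tau, 0.0)

def unfolded_proximal_gradient(D, y, n_stages=200, tau=0.1):
    """Each loop pass corresponds to one unfolded 'stage': a gradient step on
    0.5 * ||y - D m||^2 followed by a proximal (shrinkage) step."""
    L = np.linalg.norm(D, 2) ** 2          # Lipschitz constant of the gradient
    m = np.zeros(D.shape[1])
    for _ in range(n_stages):
        m = soft_threshold(m - D.T @ (D @ m - y) / L, tau / L)
    return m

rng = np.random.default_rng(0)
D = rng.standard_normal((20, 50))
m_true = np.zeros(50)
m_true[[3, 17, 41]] = [1.5, -2.0, 0.8]
y = D @ m_true                             # observation generated by 3 atoms
m_hat = unfolded_proximal_gradient(D, y)
print(np.count_nonzero(np.abs(m_hat) > 0.2))   # a sparse code is recovered
```

In an unfolded network, `n_stages` becomes the network depth and `tau` (and the dictionary) become learned parameters.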

Keyword :

Dictionary learning; generalization performance; interpretable deep learning (DL); single image rain removal

Cite:


GB/T 7714 Wang, Hong , Xie, Qi , Zhao, Qian et al. RCDNet: An Interpretable Rain Convolutional Dictionary Network for Single Image Deraining [J]. | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS , 2023 .
MLA Wang, Hong et al. "RCDNet: An Interpretable Rain Convolutional Dictionary Network for Single Image Deraining" . | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS (2023) .
APA Wang, Hong , Xie, Qi , Zhao, Qian , Li, Yuexiang , Liang, Yong , Zheng, Yefeng et al. RCDNet: An Interpretable Rain Convolutional Dictionary Network for Single Image Deraining . | IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS , 2023 .
InDuDoNet+: A deep unfolding dual domain network for metal artifact reduction in CT images EI SCIE Scopus
Journal Article | 2023 , 85 | MEDICAL IMAGE ANALYSIS
SCOPUS Cited Count: 26

Abstract :

During the computed tomography (CT) imaging process, metallic implants within patients often cause harmful artifacts, which adversely degrade the visual quality of reconstructed CT images and negatively affect subsequent clinical diagnosis. For the metal artifact reduction (MAR) task, current deep learning based methods have achieved promising performance. However, most of them share two common limitations: (1) the CT physical imaging geometry constraint is not comprehensively incorporated into the deep network structures; (2) the entire framework has weak interpretability for the specific MAR task, so the role of each network module is difficult to evaluate. To alleviate these issues, in this paper, we construct a novel deep unfolding dual domain network, termed InDuDoNet+, into which the CT imaging process is finely embedded. Concretely, we derive a joint spatial and Radon domain reconstruction model and propose an optimization algorithm with only simple operators for solving it. By unfolding the iterative steps of the proposed algorithm into corresponding network modules, we easily build InDuDoNet+ with clear interpretability. Furthermore, we analyze the CT values among different tissues and merge these prior observations into a prior network for InDuDoNet+, which significantly improves its generalization performance. Comprehensive experiments on synthesized and clinical data substantiate the superiority of the proposed method as well as its generalization performance beyond current state-of-the-art (SOTA) MAR methods. Code is available at https://github.com/hongwang01/InDuDoNet_plus.

Keyword :

CT imaging geometry; Generalization ability; Metal artifact reduction; Physical interpretability

Cite:


GB/T 7714 Wang, Hong , Li, Yuexiang , Zhang, Haimiao et al. InDuDoNet+: A deep unfolding dual domain network for metal artifact reduction in CT images [J]. | MEDICAL IMAGE ANALYSIS , 2023 , 85 .
MLA Wang, Hong et al. "InDuDoNet+: A deep unfolding dual domain network for metal artifact reduction in CT images" . | MEDICAL IMAGE ANALYSIS 85 (2023) .
APA Wang, Hong , Li, Yuexiang , Zhang, Haimiao , Meng, Deyu , Zheng, Yefeng . InDuDoNet+: A deep unfolding dual domain network for metal artifact reduction in CT images . | MEDICAL IMAGE ANALYSIS , 2023 , 85 .
Uncertainty-guided hierarchical frequency domain Transformer for image restoration EI SCIE Scopus
Journal Article | 2023 , 263 | KNOWLEDGE-BASED SYSTEMS
SCOPUS Cited Count: 16

Abstract :

Existing convolutional neural network (CNN)-based and vision Transformer (ViT)-based image restoration methods are usually explored in the spatial domain. However, we employ Fourier analysis to show that these spatial domain models cannot perceive the entire frequency spectrum of images, i.e., they mainly focus on either high-frequency (CNN-based models) or low-frequency components (ViT-based models). This intrinsic limitation results in the partial loss of semantic information and the appearance of artifacts. To address this limitation, we propose a novel uncertainty-guided hierarchical frequency domain Transformer, named HFDT, to effectively learn both high- and low-frequency information while perceiving local and global features. Specifically, to aggregate semantic information from various frequency levels, we propose a dual-domain feature interaction mechanism, in which global frequency information and local spatial features are extracted by corresponding branches. The frequency domain branch adopts the Fast Fourier Transform (FFT) to convert features from the spatial domain to the frequency domain, where the global low- and high-frequency components are learned with log-linear complexity. Complementarily, an efficient convolution group is employed in the spatial domain branch to capture local high-frequency details. Moreover, we introduce an uncertainty degradation-guided strategy to efficiently represent degraded prior information, rather than simply distinguishing degraded/non-degraded regions in binary form. Our approach achieves competitive results in several degraded scenarios, including rain streaks, raindrops, motion blur, and defocus blur.
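The frequency-split idea behind the two branches can be seen in a few lines of NumPy. This is only a schematic decomposition (a hard circular mask in the shifted spectrum, not the paper's learned Transformer branches): the low-pass part carries the global structure a ViT-style branch favors, and the residual carries the local high-frequency detail a convolutional branch favors.

```python
import numpy as np

def frequency_split(img, radius=8):
    """Split an image into low- and high-frequency parts via an FFT mask."""
    F = np.fft.fftshift(np.fft.fft2(img))          # spectrum, DC at center
    h, w = img.shape
    yy, xx = np.mgrid[:h, :w]
    mask = (yy - h // 2) ** 2 + (xx - w // 2) ** 2 <= radius ** 2
    low = np.fft.ifft2(np.fft.ifftshift(F * mask)).real   # global structure
    high = img - low                                       # local detail
    return low, high

img = np.add.outer(np.linspace(0, 1, 64), np.linspace(0, 1, 64))  # smooth ramp
img[32, :] += 1.0                                                 # one sharp edge
low, high = frequency_split(img)
print(np.allclose(low + high, img))   # exact decomposition: True
```

A learnable version would replace the binary `mask` with complex-valued weights applied to `F`, which is where the log-linear complexity of FFT-based mixing comes from.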

Keyword :

Frequency-domain Transformer; Image restoration; Log-linear complexity; Uncertainty-guided

Cite:


GB/T 7714 Shao, Mingwen , Qiao, Yuanjian , Meng, Deyu et al. Uncertainty-guided hierarchical frequency domain Transformer for image restoration [J]. | KNOWLEDGE-BASED SYSTEMS , 2023 , 263 .
MLA Shao, Mingwen et al. "Uncertainty-guided hierarchical frequency domain Transformer for image restoration" . | KNOWLEDGE-BASED SYSTEMS 263 (2023) .
APA Shao, Mingwen , Qiao, Yuanjian , Meng, Deyu , Zuo, Wangmeng . Uncertainty-guided hierarchical frequency domain Transformer for image restoration . | KNOWLEDGE-BASED SYSTEMS , 2023 , 263 .
Plenty is Plague: Fine-Grained Learning for Visual Question Answering SCIE
Journal Article | 2022 , 44 (2) , 697-709 | IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE

Abstract :

Visual Question Answering (VQA) has attracted extensive research focus recently. Along with the ever-increasing data scale and model complexity, the enormous training cost has become an emerging challenge for VQA. In this article, we show that such a massive training cost is indeed a plague. In contrast, a fine-grained design of the learning paradigm can be extremely beneficial in terms of both training efficiency and model accuracy. In particular, we argue that there exist two essential and unexplored issues in the existing VQA training paradigm that randomly samples data in each epoch, namely, "difficulty diversity" and "label redundancy". Concretely, "difficulty diversity" refers to the varying difficulty levels of different question types, while "label redundancy" refers to the redundant and noisy labels contained in an individual question type. To tackle these two issues, in this article we propose a fine-grained VQA learning paradigm with an actor-critic based learning agent, termed FG-A1C. Instead of using all training data from scratch, FG-A1C includes a learning agent that adaptively and intelligently schedules the most difficult question types in each training epoch. Subsequently, two curriculum learning based schemes are further designed to identify the most useful data to be learned within each individual question type. We conduct extensive experiments on the VQA2.0 and VQA-CP v2 datasets, which demonstrate the significant benefits of our approach. For instance, on VQA-CP v2, with less than 75 percent of the training data, our learning paradigm helps the model achieve better performance than using the whole dataset. Meanwhile, we also show the effectiveness of our method in guiding data labeling. Finally, the proposed paradigm can be seamlessly integrated with any cutting-edge VQA model, without modifying its structure.
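The scheduling idea, visit hard question types more often than easy ones, can be sketched without the paper's actor-critic machinery. Everything below is a hypothetical stand-in: `type_loss` is an invented table of per-type running losses, and `schedule_epoch` simply samples types with probability proportional to `exp(loss / T)` rather than learning a policy.

```python
import numpy as np

# Hypothetical per-question-type running losses; a higher loss marks a harder type.
type_loss = {"count": 2.1, "color": 0.4, "yes/no": 0.2, "where": 1.3}

def schedule_epoch(type_loss, n_samples=1000, temperature=1.0, seed=0):
    """Toy difficulty-aware scheduler: draw question types for one epoch with
    probability proportional to exp(loss / T), so harder types appear more often."""
    rng = np.random.default_rng(seed)
    types = list(type_loss)
    logits = np.array([type_loss[t] for t in types]) / temperature
    p = np.exp(logits - logits.max())
    p /= p.sum()                                   # softmax over type difficulties
    draws = rng.choice(len(types), size=n_samples, p=p)
    return {t: int((draws == i).sum()) for i, t in enumerate(types)}

counts = schedule_epoch(type_loss)
print(counts)   # "count" (hardest) dominates, "yes/no" (easiest) is rarest
```

Raising `temperature` flattens the schedule back toward uniform random sampling, the baseline the abstract argues against.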

Keyword :

Data models; Feature extraction; Fine-grained learning; Knowledge discovery; Redundancy; Training; Training data; Visualization; visual question answering

Cite:


GB/T 7714 Zhou, Yiyi , Ji, Rongrong , Sun, Xiaoshuai et al. Plenty is Plague: Fine-Grained Learning for Visual Question Answering [J]. | IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE , 2022 , 44 (2) : 697-709 .
MLA Zhou, Yiyi et al. "Plenty is Plague: Fine-Grained Learning for Visual Question Answering" . | IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 44 . 2 (2022) : 697-709 .
APA Zhou, Yiyi , Ji, Rongrong , Sun, Xiaoshuai , Su, Jinsong , Meng, Deyu , Gao, Yue et al. Plenty is Plague: Fine-Grained Learning for Visual Question Answering . | IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE , 2022 , 44 (2) , 697-709 .
Infrared Action Detection in the Dark via Cross-Stream Attention Mechanism EI SCIE Scopus
Journal Article | 2022 , 24 , 288-300 | IEEE TRANSACTIONS ON MULTIMEDIA
SCOPUS Cited Count: 17

Abstract :

Action detection plays an important role in video understanding and has attracted considerable attention over the last decade. However, current action detection methods are mainly based on visible videos, and few of them consider low-light scenes, where actions are difficult to detect by existing methods, or even by human eyes. Compared with visible videos, infrared videos are more suitable for dark environments and resistant to background clutter. In this paper, we investigate the temporal action detection problem in the dark using infrared videos, which is, to the best of our knowledge, the first attempt in the action detection community. Our model takes the whole video as input; a Flow Estimation Network (FEN) is employed to generate optical flow for the infrared data, and it is optimized with the whole network to obtain action-related motion representations. After feature extraction, the infrared stream and flow stream are fed into a Selective Cross-stream Attention (SCA) module to narrow the performance gap between infrared and visible videos. The SCA emphasizes informative snippets and automatically focuses on the more discriminative stream. We then adopt a snippet-level classifier to obtain action scores for all snippets and link consecutive snippets into final detections. All these modules are trained in an end-to-end manner. We collect an Infrared action Detection (InfDet) dataset obtained in the dark and conduct extensive experiments to verify the effectiveness of the proposed method. Experimental results show that our proposed method surpasses state-of-the-art temporal action detection methods designed for visible videos, and it also achieves the best performance compared with other infrared action recognition methods on both the InfAR and Infrared-Visible datasets.
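The "focus on the more discriminative stream per snippet" behavior can be sketched as a softmax gate over two feature streams. This is a deliberately crude stand-in for the SCA module: the per-snippet score here is just feature energy, whereas the real module learns its scoring, but the gating-and-fusing mechanics are the same shape.

```python
import numpy as np

def cross_stream_gate(infrared, flow):
    """Score each snippet's two stream features, softmax the two scores, and
    take a weighted sum, so the fused feature leans toward the stronger stream."""
    s_ir = np.linalg.norm(infrared, axis=1)             # (T,) score per snippet
    s_fl = np.linalg.norm(flow, axis=1)
    scores = np.stack([s_ir, s_fl], axis=1)             # (T, 2)
    e = np.exp(scores - scores.max(axis=1, keepdims=True))
    w = e / e.sum(axis=1, keepdims=True)                # (T, 2) gate weights
    fused = w[:, :1] * infrared + w[:, 1:] * flow       # (T, D) fused features
    return fused, w

rng = np.random.default_rng(1)
T, D = 5, 8
infrared = rng.standard_normal((T, D))
flow = 3.0 * rng.standard_normal((T, D))   # make flow the "stronger" stream here
fused, w = cross_stream_gate(infrared, flow)
print(w.sum(axis=1))                       # each snippet's two weights sum to 1
```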

Keyword :

Feature extraction; Image recognition; Infrared video; Optical imaging; Proposals; selective cross-stream attention; Streaming media; Task analysis; temporal action detection; Three-dimensional displays

Cite:


GB/T 7714 Chen, Xu , Gao, Chenqiang , Li, Chaoyu et al. Infrared Action Detection in the Dark via Cross-Stream Attention Mechanism [J]. | IEEE TRANSACTIONS ON MULTIMEDIA , 2022 , 24 : 288-300 .
MLA Chen, Xu et al. "Infrared Action Detection in the Dark via Cross-Stream Attention Mechanism" . | IEEE TRANSACTIONS ON MULTIMEDIA 24 (2022) : 288-300 .
APA Chen, Xu , Gao, Chenqiang , Li, Chaoyu , Yang, Yi , Meng, Deyu . Infrared Action Detection in the Dark via Cross-Stream Attention Mechanism . | IEEE TRANSACTIONS ON MULTIMEDIA , 2022 , 24 , 288-300 .
Orientation-Shared Convolution Representation for CT Metal Artifact Learning CPCI-S Scopus
Journal Article | 2022 , 13436 , 665-675 | MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VI
SCOPUS Cited Count: 8

Abstract :

During X-ray computed tomography (CT) scanning, metallic implants carried by patients often lead to adverse artifacts in the captured CT images and thereby impair clinical treatment. For this metal artifact reduction (MAR) task, existing deep-learning-based methods have attained promising reconstruction performance. Nevertheless, there is still room for further improvement in MAR performance and generalization ability, since some important prior knowledge underlying this specific task has not been fully exploited. Hence, in this paper, we carefully analyze the characteristics of metal artifacts and propose an orientation-shared convolution representation strategy to adapt to the physical prior structure of the artifacts, i.e., their rotationally symmetric streaking patterns. The proposed method adopts a Fourier-series-expansion-based filter parametrization in artifact modeling, which can better separate artifacts from anatomical tissues and boost model generalizability. Comprehensive experiments on synthesized and clinical datasets show the superiority of our method in detail preservation beyond current representative MAR methods. Code will be available at https://github.com/hongwang01/OSCNet.
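The "orientation-shared" idea, one set of filter parameters reused across rotations so that rotationally symmetric streaks need not be learned direction by direction, can be sketched crudely with 90-degree rotations. The paper itself uses a Fourier-series-based filter parametrization to get arbitrary-angle rotated copies; `np.rot90` here is only a stand-in that makes the weight sharing visible.

```python
import numpy as np

def orientation_bank(base, n_rot=4):
    """One base kernel shared across n_rot orientations: the bank produces
    n_rot responses but has only base.size free parameters."""
    return np.stack([np.rot90(base, k) for k in range(n_rot)])

base = np.zeros((5, 5))
base[2, :] = 1.0                  # a horizontal "streak" detector
bank = orientation_bank(base)
print(bank.shape)                 # (4, 5, 5): four orientations, one parameter set
```

The second kernel in the bank is the same detector rotated to respond to vertical streaks; none of the rotated copies add parameters.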

Keyword :

Fourier series expansion; Metal artifact reduction; Model generalizability; Orientation-shared convolution; Rotation prior

Cite:


GB/T 7714 Wang, Hong , Xie, Qi , Li, Yuexiang et al. Orientation-Shared Convolution Representation for CT Metal Artifact Learning [J]. | MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VI , 2022 , 13436 : 665-675 .
MLA Wang, Hong et al. "Orientation-Shared Convolution Representation for CT Metal Artifact Learning" . | MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VI 13436 (2022) : 665-675 .
APA Wang, Hong , Xie, Qi , Li, Yuexiang , Huang, Yawen , Meng, Deyu , Zheng, Yefeng . Orientation-Shared Convolution Representation for CT Metal Artifact Learning . | MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT VI , 2022 , 13436 , 665-675 .
Sparsity-Enhanced Convolutional Decomposition: A Novel Tensor-Based Paradigm for Blind Hyperspectral Unmixing EI SCIE Scopus
Journal Article | 2022 , 60 | IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING
SCOPUS Cited Count: 55

Abstract :

Blind hyperspectral unmixing (HU) has long been recognized as a crucial component in analyzing hyperspectral imagery (HSI) collected by airborne and spaceborne sensors. Due to the highly ill-posed nature of such a blind source separation scheme and the effects of spectral variability in hyperspectral imaging, the ability to accurately and effectively unmix complex HSI remains limited. To this end, this article presents a novel blind HU model, called sparsity-enhanced convolutional decomposition (SeCoDe), which jointly captures the spatial-spectral information of HSI in a tensor-based fashion. SeCoDe benefits from two perspectives. On the one hand, the convolutional operation is employed in SeCoDe to locally model the spatial relation between a target pixel and its neighbors, which can be well explained by spectral bundles that are capable of addressing spectral variability effectively. On the other hand, it maintains physically continuous spectral components by decomposing the HSI along the spectral domain. With sparsity-enhanced regularization, an alternating optimization strategy with an alternating direction method of multipliers (ADMM)-based algorithm is devised for efficient model inference. Extensive experiments conducted on three different datasets demonstrate the superiority of the proposed SeCoDe over previous state-of-the-art methods. We also release the code at https://github.com/danfenghong/IEEE_TGRS_SeCoDe to encourage reproduction of the given results.
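The ADMM machinery behind such sparsity-regularized decompositions is compact enough to show on the simplest member of the family. The sketch below solves an l1-regularized coding subproblem with a plain matrix dictionary, not the paper's convolutional tensor factors, so treat it as a generic ADMM template rather than SeCoDe itself.

```python
import numpy as np

def soft(x, t):
    # proximal operator of t * ||.||_1
    return np.sign(x) * np.maximum(np.abs(x) - t, 0.0)

def admm_sparse_code(D, y, tau=0.1, rho=1.0, iters=200):
    """ADMM for min_m 0.5*||D m - y||^2 + tau*||z||_1  s.t.  m = z."""
    n = D.shape[1]
    A = np.linalg.inv(D.T @ D + rho * np.eye(n))   # factor cached for all iterations
    Dty = D.T @ y
    z = np.zeros(n)
    u = np.zeros(n)
    for _ in range(iters):
        m = A @ (Dty + rho * (z - u))              # quadratic subproblem
        z = soft(m + u, tau / rho)                 # proximal (shrinkage) step
        u += m - z                                 # dual ascent on the constraint m = z
    return z

rng = np.random.default_rng(2)
D = rng.standard_normal((30, 60))
code_true = np.zeros(60)
code_true[[5, 22, 48]] = [1.0, -1.5, 2.0]
y = D @ code_true
z = admm_sparse_code(D, y)
print(np.flatnonzero(np.abs(z) > 0.3))   # indices of the dominant atoms
```

In an alternating scheme like the paper's, a step of this form is interleaved with updates of the dictionary/abundance factors.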

Keyword :

Blind hyperspectral unmixing (HU); Context modeling; Convolutional codes; convolutional sparse coding (CSC); Encoding; Hyperspectral imaging; Optimization; spectral bundles; spectral variability (SV); Task analysis; tensor decomposition; Tensors

Cite:


GB/T 7714 Yao, Jing , Hong, Danfeng , Xu, Lin et al. Sparsity-Enhanced Convolutional Decomposition: A Novel Tensor-Based Paradigm for Blind Hyperspectral Unmixing [J]. | IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING , 2022 , 60 .
MLA Yao, Jing et al. "Sparsity-Enhanced Convolutional Decomposition: A Novel Tensor-Based Paradigm for Blind Hyperspectral Unmixing" . | IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING 60 (2022) .
APA Yao, Jing , Hong, Danfeng , Xu, Lin , Meng, Deyu , Chanussot, Jocelyn , Xu, Zongben . Sparsity-Enhanced Convolutional Decomposition: A Novel Tensor-Based Paradigm for Blind Hyperspectral Unmixing . | IEEE TRANSACTIONS ON GEOSCIENCE AND REMOTE SENSING , 2022 , 60 .
STEIN VARIATIONAL GRADIENT DESCENT ON INFINITE-DIMENSIONAL SPACE AND APPLICATIONS TO STATISTICAL INVERSE PROBLEMS EI SCIE Scopus
Journal Article | 2022 , 60 (4) , 2225-2252 | SIAM JOURNAL ON NUMERICAL ANALYSIS
SCOPUS Cited Count: 3

Abstract :

In this paper, we propose an infinite-dimensional version of the Stein variational gradient descent (iSVGD) method for solving Bayesian inverse problems. The method can generate approximate samples from posteriors efficiently. Based on the concepts of operator-valued kernels and vector-valued reproducing kernel Hilbert spaces, a rigorous definition is given for the infinite-dimensional objects, e.g., the Stein operator, which are proved to be the limit of finite-dimensional ones. Moreover, a more efficient iSVGD with preconditioning operators is constructed by generalizing the change of variables formula and introducing a regularity parameter. The proposed algorithms are applied to an inverse problem of the steady state Darcy flow equation. Numerical results confirm our theoretical findings and demonstrate the potential applications of the proposed approach in the posterior sampling of large-scale nonlinear statistical inverse problems.
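The finite-dimensional SVGD update that the paper lifts to function space is compact enough to write out. The sketch below targets a standard 2-D Gaussian with the usual RBF kernel and median-heuristic bandwidth; the paper's actual contributions (operator-valued kernels, preconditioning operators, the infinite-dimensional limit) are not reproduced here.

```python
import numpy as np

def svgd_step(x, grad_logp):
    """phi(x_i) = (1/n) * sum_j [ k(x_j, x_i) grad log p(x_j) + grad_{x_j} k(x_j, x_i) ]
    with an RBF kernel k and the median-heuristic bandwidth."""
    n = x.shape[0]
    diff = x[:, None, :] - x[None, :, :]                  # diff[j, i] = x_j - x_i
    sq = (diff ** 2).sum(-1)
    h = np.median(sq) / (2.0 * np.log(n + 1)) + 1e-8      # bandwidth heuristic
    K = np.exp(-sq / (2.0 * h))
    drive = K @ grad_logp                                  # kernel-smoothed gradient
    repulse = -(K[:, :, None] * diff / h).sum(axis=0)      # keeps particles spread out
    return (drive + repulse) / n

rng = np.random.default_rng(0)
x = rng.standard_normal((50, 2)) + 5.0        # particles start far from the target
for _ in range(1000):
    x = x + 0.2 * svgd_step(x, -x)            # grad log p(x) = -x for N(0, I)
print(np.round(np.linalg.norm(x.mean(axis=0)), 1))   # particle mean moves near 0
```

The two terms make the trade-off explicit: `drive` transports particles toward high posterior density, while `repulse` prevents them from collapsing to the MAP point.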

Keyword :

Bayes' method; machine learning; statistical inverse problems; Stein variational gradient descent; variational inference method

Cite:


GB/T 7714 Jia, Junxiong , Li, Peijun , Meng, Deyu . STEIN VARIATIONAL GRADIENT DESCENT ON INFINITE-DIMENSIONAL SPACE AND APPLICATIONS TO STATISTICAL INVERSE PROBLEMS [J]. | SIAM JOURNAL ON NUMERICAL ANALYSIS , 2022 , 60 (4) : 2225-2252 .
MLA Jia, Junxiong et al. "STEIN VARIATIONAL GRADIENT DESCENT ON INFINITE-DIMENSIONAL SPACE AND APPLICATIONS TO STATISTICAL INVERSE PROBLEMS" . | SIAM JOURNAL ON NUMERICAL ANALYSIS 60 . 4 (2022) : 2225-2252 .
APA Jia, Junxiong , Li, Peijun , Meng, Deyu . STEIN VARIATIONAL GRADIENT DESCENT ON INFINITE-DIMENSIONAL SPACE AND APPLICATIONS TO STATISTICAL INVERSE PROBLEMS . | SIAM JOURNAL ON NUMERICAL ANALYSIS , 2022 , 60 (4) , 2225-2252 .
Two-Stream Graph Convolutional Network for Intra-Oral Scanner Image Segmentation EI SCIE Scopus
Journal Article | 2022 , 41 (4) , 826-835 | IEEE TRANSACTIONS ON MEDICAL IMAGING
SCOPUS Cited Count: 14

Abstract :

Precise segmentation of teeth from intra-oral scanner images is an essential task in computer-aided orthodontic surgical planning. The state-of-the-art deep learning-based methods often simply concatenate the raw geometric attributes (i.e., coordinates and normal vectors) of mesh cells to train a single-stream network for automatic intra-oral scanner image segmentation. However, since different raw attributes reveal completely different geometric information, the naive concatenation of different raw attributes at the (low-level) input stage may bring unnecessary confusion in describing and differentiating between mesh cells, thus hampering the learning of high-level geometric representations for the segmentation task. To address this issue, we design a two-stream graph convolutional network (i.e., TSGCN), which can effectively handle inter-view confusion between different raw attributes to more effectively fuse their complementary information and learn discriminative multi-view geometric representations. Specifically, our TSGCN adopts two input-specific graph-learning streams to extract complementary high-level geometric representations from coordinates and normal vectors, respectively. Then, these single-view representations are further fused by a self-attention module to adaptively balance the contributions of different views in learning more discriminative multi-view representations for accurate and fully automatic tooth segmentation. We have evaluated our TSGCN on a real-patient dataset of dental (mesh) models acquired by 3D intraoral scanners. Experimental results show that our TSGCN significantly outperforms state-of-the-art methods in 3D tooth (surface) segmentation.

Keyword :

Dentistry; Feature extraction; graph convolutional network; Image segmentation; Intra-oral scanner image segmentation; Shape; Task analysis; Teeth; Three-dimensional displays

Cite:


GB/T 7714 Zhao, Yue , Zhang, Lingming , Liu, Yang et al. Two-Stream Graph Convolutional Network for Intra-Oral Scanner Image Segmentation [J]. | IEEE TRANSACTIONS ON MEDICAL IMAGING , 2022 , 41 (4) : 826-835 .
MLA Zhao, Yue et al. "Two-Stream Graph Convolutional Network for Intra-Oral Scanner Image Segmentation" . | IEEE TRANSACTIONS ON MEDICAL IMAGING 41 . 4 (2022) : 826-835 .
APA Zhao, Yue , Zhang, Lingming , Liu, Yang , Meng, Deyu , Cui, Zhiming , Gao, Chenqiang et al. Two-Stream Graph Convolutional Network for Intra-Oral Scanner Image Segmentation . | IEEE TRANSACTIONS ON MEDICAL IMAGING , 2022 , 41 (4) , 826-835 .
DICDNet: Deep Interpretable Convolutional Dictionary Network for Metal Artifact Reduction in CT Images EI SCIE Scopus
Journal Article | 2022 , 41 (4) , 869-880 | IEEE TRANSACTIONS ON MEDICAL IMAGING
SCOPUS Cited Count: 23

Abstract :

Computed tomography (CT) images are often impaired by unfavorable artifacts caused by metallic implants within patients, which adversely affect subsequent clinical diagnosis and treatment. Although existing deep-learning-based approaches have achieved promising success in metal artifact reduction (MAR) for CT images, most of them treat the task as a general image restoration problem and utilize off-the-shelf network modules for image quality enhancement. Hence, such frameworks always suffer from a lack of model interpretability for the specific task. Besides, existing MAR techniques largely neglect the intrinsic prior knowledge underlying metal-corrupted CT images, which is beneficial for improving MAR performance. In this paper, we specifically propose a deep interpretable convolutional dictionary network (DICDNet) for the MAR task. In particular, we first observe that metal artifacts always present non-local streaking and star-shape patterns in CT images. Based on these observations, a convolutional dictionary model is deployed to encode the metal artifacts. To solve the model, we propose a novel optimization algorithm based on the proximal gradient technique. With only simple operators, the iterative steps of the proposed algorithm can be easily unfolded into corresponding network modules with specific physical meanings. Comprehensive experiments on synthesized and clinical datasets substantiate the effectiveness of the proposed DICDNet as well as its superior interpretability, compared to current state-of-the-art MAR methods. Code is available at https://github.com/hongwang01/DICDNet.

Keyword :

Computed tomography; CT metal artifact reduction; Dictionaries; generalization performance; Image reconstruction; interpretable dictionary learning; Mars; Metals; Optimization; Task analysis

Cite:


GB/T 7714 Wang, Hong , Li, Yuexiang , He, Nanjun et al. DICDNet: Deep Interpretable Convolutional Dictionary Network for Metal Artifact Reduction in CT Images [J]. | IEEE TRANSACTIONS ON MEDICAL IMAGING , 2022 , 41 (4) : 869-880 .
MLA Wang, Hong et al. "DICDNet: Deep Interpretable Convolutional Dictionary Network for Metal Artifact Reduction in CT Images" . | IEEE TRANSACTIONS ON MEDICAL IMAGING 41 . 4 (2022) : 869-880 .
APA Wang, Hong , Li, Yuexiang , He, Nanjun , Ma, Kai , Meng, Deyu , Zheng, Yefeng . DICDNet: Deep Interpretable Convolutional Dictionary Network for Metal Artifact Reduction in CT Images . | IEEE TRANSACTIONS ON MEDICAL IMAGING , 2022 , 41 (4) , 869-880 .
Address: XI'AN JIAOTONG UNIVERSITY LIBRARY (No. 28, Xianning West Road, Xi'an, Shaanxi; Post Code: 710049). Contact Us: 029-82667865
Copyright: XI'AN JIAOTONG UNIVERSITY LIBRARY. Technical Support: Beijing Aegean Software Co., Ltd.