• Complex
  • Title
  • Author
  • Keyword
  • Abstract
  • Scholars
Search

Author:

Huang, Zi (Huang, Zi.) | Ji, Shulei (Ji, Shulei.) | Hu, Zhilan (Hu, Zhilan.) | Cai, Chuangjian (Cai, Chuangjian.) | Luo, Jing (Luo, Jing.) | Yang, Xinyu (Yang, Xinyu.)

Indexed by:

Abstract:

Music emotion recognition (MER), a sub-task of music information retrieval (MIR), has developed rapidly in recent years. However, the learning of affect-salient features remains a challenge. In this paper, we propose an end-to-end attention-based deep feature fusion (ADFF) approach for MER. Only taking log Mel-spectrogram as input, this method uses adapted VGGNet as spatial feature learning module (SFLM) to obtain spatial features across different levels. Then, these features are fed into squeeze-and-excitation (SE) attention-based temporal feature learning module (TFLM) to get multi-level emotion-related spatial-temporal features (ESTFs), which can discriminate emotions well in the final emotion space. In addition, a novel data processing is devised to cut the single-channel input into multichannel to improve calculative efficiency while ensuring the quality of MER. Experiments show that our proposed method achieves 10.43% and 4.82% relative improvement of valence and arousal respectively on the R2 score compared to the state-of-the-art model, meanwhile, performs better on datasets with distinct scales and in multi-task learning. Copyright © 2022 ISCA.

Keyword:

Data handling Emotion Recognition Learning systems Music Speech communication Speech recognition

Author Community:

  • [ 1 ] [Huang, Zi]School of Computer Science and Technology, Xi'an Jiaotong University, China
  • [ 2 ] [Ji, Shulei]School of Computer Science and Technology, Xi'an Jiaotong University, China
  • [ 3 ] [Hu, Zhilan]Media Technology Institute, Huawei Technologies Co., Ltd.
  • [ 4 ] [Cai, Chuangjian]Media Technology Institute, Huawei Technologies Co., Ltd.
  • [ 5 ] [Luo, Jing]School of Computer Science and Technology, Xi'an Jiaotong University, China
  • [ 6 ] [Yang, Xinyu]School of Computer Science and Technology, Xi'an Jiaotong University, China

Reprint Author's Address:

  • X. Yang;;School of Computer Science and Technology, Xi'an Jiaotong University, China;;email: yxyphd@mail.xjtu.edu.cn;;

Email:

Show more details

Related Keywords:

Related Article:

Source :

ISSN: 2308-457X

Year: 2022

Volume: 2022-September

Page: 4152-4156

Language: English

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 5

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 9

FAQ| About| Online/Total:1161/203933240
Address:XI'AN JIAOTONG UNIVERSITY LIBRARY(No.28, Xianning West Road, Xi'an, Shaanxi Post Code:710049) Contact Us:029-82667865
Copyright:XI'AN JIAOTONG UNIVERSITY LIBRARY Technical Support:Beijing Aegean Software Co., Ltd.