Click here to flash read.
Target speaker information can be utilized in speech enhancement (SE) models
to more effectively extract the desired speech. Previous works introduce the
speaker embedding into speech enhancement models by means of concatenation or
affine transformation. In this paper, we propose a speaker attentive module to
calculate the attention scores between the speaker embedding and the
intermediate features, which are used to rescale the features. By merging this
module in the state-of-the-art SE model, we construct the personalized SE model
for ICASSP Signal Processing Grand Challenge: DNS Challenge 5 (2023). Our
system achieves a final score of 0.529 on the blind test set of track1 and
0.549 on track2.
Click here to read this post out
ID: 906; Unique Viewers: 0
Unique Voters: 0
Total Votes: 0
Votes:
Latest Change: March 17, 2023, 7:35 a.m.
Changes:
Dictionaries:
Words:
Spaces:
Views: 1074
CC:
No creative common's license
No creative common's license
Comments: