Direct Speech Synthesis from Non-Invasive, Neuromagnetic Signals
Jinuk Kwon1,2, David Harwath3, Debadatta Dash2, Paul Ferrari4, and Jun Wang1,2
1 Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin, USA
2 Department of Neurology, Dell Medical School, The University of Texas at Austin, USA
3 Department of Computer Science, The University of Texas at Austin, USA
4 Jack H. Miller Magnetoencephalography Center, Helen DeVos Children’s Medical Center, USA
Target: Audio synthesized using the target Mel spectrogram
Generated: Audio synthesized using the generated Mel spectrogram from MEG
Phrase 1: Do you understand me?
Participant | Sample 1: Target | Sample 1: Generated | Sample 2: Target | Sample 2: Generated | Sample 3: Target | Sample 3: Generated |
---|---|---|---|---|---|---|
A1 | ||||||
A2 | ||||||
A3 | ||||||
A4 |
Phrase 2: That's perfect
Participant | Sample 1: Target | Sample 1: Generated | Sample 2: Target | Sample 2: Generated | Sample 3: Target | Sample 3: Generated |
---|---|---|---|---|---|---|
A1 | ||||||
A2 | ||||||
A3 | ||||||
A4 |
Phrase 3: How are you?
Participant | Sample 1: Target | Sample 1: Generated | Sample 2: Target | Sample 2: Generated | Sample 3: Target | Sample 3: Generated |
---|---|---|---|---|---|---|
A1 | ||||||
A2 | ||||||
A3 | ||||||
A4 |
Phrase 4: Good-bye
Participant | Sample 1: Target | Sample 1: Generated | Sample 2: Target | Sample 2: Generated | Sample 3: Target | Sample 3: Generated |
---|---|---|---|---|---|---|
A1 | ||||||
A2 | ||||||
A3 | ||||||
A4 |
Phrase 5: I need help
Participant | Sample 1: Target | Sample 1: Generated | Sample 2: Target | Sample 2: Generated | Sample 3: Target | Sample 3: Generated |
---|---|---|---|---|---|---|
A1 | ||||||
A2 | ||||||
A3 | ||||||
A4 |