Direct Speech Synthesis from Non-Invasive, Neuromagnetic Signals


Jinuk Kwon1,2, David Harwath3, Debadatta Dash2, Paul Ferrari4, and Jun Wang1,2


1 Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin, USA
2 Department of Neurology, Dell Medical School, The University of Texas at Austin, USA
3 Department of Computer Science, The University of Texas at Austin, USA
4 Jack H. Miller Magnetoencephalography Center, Helen DeVos Children’s Medical Center, USA


 Target: Audio synthesized using the target Mel spectrogram     

Generated: Audio synthesized using the Generated Mel spectrogram from MEG     

      Phrase 1: Do you under stand me?

Participant Sample1 Sample2 Sample3
Target Generated Target Generated Target Generated
A1
A2
A3
A4

      Phrase 2: That's perfect

Participant Sample1 Sample2 Sample3
Target Generated Target Generated Target Generated
A1
A2
A3
A4

      Phrase 3: How are you?

Participant Sample1 Sample2 Sample3
Target Generated Target Generated Target Generated
A1
A2
A3
A4

      Phrase 4: Good-Bye

Participant Sample1 Sample2 Sample3
Target Generated Target Generated Target Generated
A1
A2
A3
A4

      Phrase 5: I need help

Participant Sample1 Sample2 Sample3
Target Generated Target Generated Target Generated
A1
A2
A3
A4