Direct Speech Synthesis from Non-Invasive, Neuromagnetic Signals
Jinuk Kwon1,2, David Harwath3, Debadatta Dash2, Paul Ferrari4, and Jun Wang1,2
1 Department of Speech, Language, and Hearing Sciences, The University of Texas at Austin, USA Target: Audio synthesized using the target Mel spectrogram
2 Department of Neurology, Dell Medical School, The University of Texas at Austin, USA
3 Department of Computer Science, The University of Texas at Austin, USA
4 Jack H. Miller Magnetoencephalography Center, Helen DeVos Children’s Medical Center, USA
Generated: Audio synthesized using the Generated Mel spectrogram from MEG
Phrase 1: Do you under stand me?
| Participant | Sample1 | Sample2 | Sample3 | |||
|---|---|---|---|---|---|---|
| Target | Generated | Target | Generated | Target | Generated | |
| A1 | ||||||
| A2 | ||||||
| A3 | ||||||
| A4 | ||||||
Phrase 2: That's perfect
| Participant | Sample1 | Sample2 | Sample3 | |||
|---|---|---|---|---|---|---|
| Target | Generated | Target | Generated | Target | Generated | |
| A1 | ||||||
| A2 | ||||||
| A3 | ||||||
| A4 | ||||||
Phrase 3: How are you?
| Participant | Sample1 | Sample2 | Sample3 | |||
|---|---|---|---|---|---|---|
| Target | Generated | Target | Generated | Target | Generated | |
| A1 | ||||||
| A2 | ||||||
| A3 | ||||||
| A4 | ||||||
Phrase 4: Good-Bye
| Participant | Sample1 | Sample2 | Sample3 | |||
|---|---|---|---|---|---|---|
| Target | Generated | Target | Generated | Target | Generated | |
| A1 | ||||||
| A2 | ||||||
| A3 | ||||||
| A4 | ||||||
Phrase 5: I need help
| Participant | Sample1 | Sample2 | Sample3 | |||
|---|---|---|---|---|---|---|
| Target | Generated | Target | Generated | Target | Generated | |
| A1 | ||||||
| A2 | ||||||
| A3 | ||||||
| A4 | ||||||