Abstract
Recently, sum-product networks (SPNs) showed convincing results on the ill-posed task of artificial bandwidth extension (ABE). However, SPNs are just one type of many architectures which can be summarized as representational models. In this paper, using ABE as benchmark task, we perform a comparative study of Gauss Bernoulli restricted Boltzmann machines, conditional restricted Boltzmann machines, higher order contractive autoencoders, SPNs and generative stochastic networks (GSNs). Especially the latter ones are promising architectures in terms of its reconstruction capabilities. Our experiments show impressive results of GSNs, achieving on average an improvement of 3.90dB and 4.08dB in segmental SNR on a speaker dependent (SD) and speaker independent (SI) scenario compared to SPNs, respectively.
| Original language | English |
|---|---|
| Title of host publication | Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH |
| Publisher | ISCA |
| Pages | 791-795 |
| Number of pages | 5 |
| Publication status | Published - 1 Jan 2015 |
| Externally published | Yes |
| Event | 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 - Dresden, Germany Duration: 6 Sept 2015 → 10 Sept 2015 |
Conference
| Conference | 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015 |
|---|---|
| Country/Territory | Germany |
| City | Dresden |
| Period | 6/09/15 → 10/09/15 |
Keywords
- Bandwidth extension
- General stochastic network
- Representation learning
- Sum-product network
Fingerprint
Dive into the research topics of 'On representation learning for artificial bandwidth extension'. Together they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver