MDCTNet: A Hybrid Approach to Neural Audio Coding

This is a demonstration page for the paper "MDCTNet: a Hybrid Approach to Neural Audio Coding"

Audio Samples

This page has 19 items to demonstrate the MDCTNet and its core codec encoded at 24kb/s mono VBR. The content used for this demo is from the "ODAQ: Open Dataset of Audio Quality"[1].

Item Source Core Codec 24kb/s MDCTNet 24kb/s
1

2

3

4

5

6

7

8

9

10

11

12

13

14

15

16

17

18

19

References

[1] Matteo Torcoli, Chih-Wei Wu, Sascha Dick, Phillip Williams, Mhd Modar Halimeh, William Wolcott, Emanuël Habets, "ODAQ: OPEN DATASET OF AUDIO QUALITY," 2024 IEEE International Conference on Speech and Signal Processing (ICASSP), 2024.