I should have tried to figure this out for long. It took a while but here is a nice solution. Basically just use onenote. And to record, I can use audacity and record from pulse. And can use pulse audio control (pavucontrol) to select only system sound.
Texttospeech with Windows 10
Posted on