It includes a Text-To-Phoneme converter called reciter and a Phoneme-To-Speech routine for the final output. It aims for low memory impact and file size which is the reason I want to avoid the ...