Generally speaking, converting between audio formats is easy. Doing text to speech in the first place is very hard . So focus on the hard part first, get that solved, then deal with the specific requirements of the output format at the end.
Lccyplu.
how to convert an mp3 to wav