Bad Speech Synthesis Made Simple
Speech synthesis is a core component of many modern products - turn by turn navigation, automatic personal assistants/search like Siri and Alexa, and even the voices of AI/robots/people in computer games and movies are (in some cases) generated by speech synthesis systems. But how are these systems built? How can we make computers reply to our requests (and demands) using speech?