net - UK (2020-04)

(Antfer) #1

PROJECTS
Text-to-Speech API


TEXT-TO-SPEECH API


Imagine building a web-based app that can not
only understand what a user is saying to it, but
also respond in a voice customised to their liking.
All in real time. Combining chatbot dialog models
with voice recognition and now voice synthesis,
this scenario has become a reality. You can develop
solutions for education, hands-free communications,
call-centre automation and engaging games and
web experiences.
In this tutorial, you are going to create a simple
app to enable you to return AI-powered, human-
sounding speech, based on values you choose.

S3((&+6<17+(6,6Ơ7(;77263((&+ơ
Speech synthesis, or text-to-speech, is the
conversion of text input into human-like speech.
Although on the surface the concept may seem

TURN TEXT INTO SPEECH


WITH GOOGLE’S API


Richard Mattka lqwurgxfhv#|rx#wr#wkh#Ľhog#ri#DL#vshhfk#v|qwkhvlv#xvlqj#


Jrrjohġv#qhz#qhxudo-qhwzrun#srzhuhg#Wh{w0wr0Vshhfk#DSL


Artificial intelligence has become part of nearly
every aspect of our lives, from content-aware
fills for video and photos, facial recognition to
unlock your phone and even recommendations for
your mobile coffee order. The field is growing so
rapidly, it’s becoming increasingly difficult to nail
down a definitive definition. Machine learning, deep
learning, natural language processing (NLP),
computer vision, voice recognition and speech
synthesis... all these and many more fall under the
umbrella of artificial intelligence.
IBM, Google, Amazon and many others have
created API endpoints for developers to integrate and
start leveraging AI in their own projects. AI trained
on millions of data sets and models are at your
fingertips. Hooking into machine learning power
has never been easier.

ABOUT THE AUTHOR
RICHARD MATTKA
w: richardmattka.com
t: @synergyseeker
job: Creative director,
designer, developer
areas of expertise:
Shaders, VFX, WebGL
Free download pdf