ibm bootcamp - text to speech api lab

22
1 Colin McCabe @beekman33 IBM North America Cloud Technical Evangelist IBM Bootcamp Building an app with a Watson API: Text to Speech Deep Dive

Upload: colin-mccabe

Post on 21-Feb-2017

887 views

Category:

Engineering


4 download

TRANSCRIPT

Page 1: IBM Bootcamp - Text to Speech API Lab

1

Colin McCabe@beekman33IBM North America Cloud Technical Evangelist

IBM BootcampBuilding an app with a Watson API: Text to Speech Deep Dive

Page 2: IBM Bootcamp - Text to Speech API Lab

2

Introduction1) Today we will take an up close look at the Text-to-Speech Watson API2) Build your own Speech Synthesis App in Bluemix

Page 3: IBM Bootcamp - Text to Speech API Lab

3

IntroductionThe Science Behind Text to SpeechThe Text to Speech service relies on acoustic models based on decision trees. The models generate acoustic candidates to perform searches based on contextual features of the text. The system uses time-domain Pitch Synchronous Overlap and Add (PSOLA) - It can modify the pitch and duration of a speech signal. The service uses a text-processing front-end to parse the text before synthesizing it into audio form.

Based on research papers:•Database Mining for Flexible Concatenative Text-to-Speech•Automatic Exploration of Corpus-Specific Properties •Evaluation of the IBM Expressive Text-to-Speech Synthesis System •The IBM Submission to the 2008 Text-to-Speech Blizzard Challenge •The IBM expressive text-to-speech synthesis system for American English• Frequency warping based on mapping formant parameters.

Page 4: IBM Bootcamp - Text to Speech API Lab

4

Let’s CreateSign-up For Bluemix

API Access

Build anApp

Edit andControl Launch

1 2 3 4 5

Page 5: IBM Bootcamp - Text to Speech API Lab

5

Sign-up for BluemixSign-up For Bluemix

1. Bluemix.netTop Right > Sign Up

Page 6: IBM Bootcamp - Text to Speech API Lab

6

Dashboard – Select API2. Click Catalog at the Top

Page 7: IBM Bootcamp - Text to Speech API Lab

7

3. Select Text to Speech

Sign-up For Bluemix

API Access

API Access

Page 8: IBM Bootcamp - Text to Speech API Lab

8

4. Create The API Service (and your credentials)

Page 9: IBM Bootcamp - Text to Speech API Lab

95. Get your Service Credentials

Page 10: IBM Bootcamp - Text to Speech API Lab

106. Copy and Paste username and password (Sticky!)

Page 11: IBM Bootcamp - Text to Speech API Lab

11

CURL CHEATSHEEThttps://goo.gl/or7e4S

7. Test out some calls

Page 12: IBM Bootcamp - Text to Speech API Lab

12

HURL.IT

8. Windows users can try Hurl.it to test API

Page 13: IBM Bootcamp - Text to Speech API Lab

13

Starter App – GitHubSign-up For Bluemix

API Access

Build anApp

https://github.com/watson-developer-cloud/text-to-speech-nodejs

9. Google watson developer text to speech nodejs

Page 14: IBM Bootcamp - Text to Speech API Lab

14

Deploy to Bluemix

10. Name your app

Page 15: IBM Bootcamp - Text to Speech API Lab

15

Add Git11. Add GIT to prepare you DevOps tools

Page 16: IBM Bootcamp - Text to Speech API Lab

16

Using BluemixSign-up For Bluemix

API Access

Build anApp

Edit andControl

12. Click Edit Code

Page 17: IBM Bootcamp - Text to Speech API Lab

17

WEB IDE

13. Update your API credentials (app.js) and modify HTML (index.html)

Page 18: IBM Bootcamp - Text to Speech API Lab

18

DOMAINS / MOBILESign-up For Bluemix

API Access

Build anApp

Edit andControl Launch

14. Manage Organization / Domains15. Try it on Mobile / Download VLC to play OGG

Page 19: IBM Bootcamp - Text to Speech API Lab

19

Done!Congratulations, you’ve used the Watson API: Text to Speech, and built your own app.

Page 20: IBM Bootcamp - Text to Speech API Lab

20

Wrap-upSign-up For Bluemix

API Access

Build anApp

Edit andControl Launch

• Registration• Dashboard &

Catalog

• Select Text toSpeech API

• Create API Service

• Get ServiceCredentials

• Test API withCURL

• GitHub Starter• Name App• Add Git

(DevOps)

• Edit Code• Add API

Credentials• Manipulate

HTML

• Mobile• Domain

Name

Page 21: IBM Bootcamp - Text to Speech API Lab

21

Edit the manifest.yml file and change the <application-name> to something unique.applications:- services:- text-to-speech-servicename: <application-name>command: node app.jspath: .memory: 256M

The name you use will determinate your application url initially, e.g. <application-name>.mybluemix.net.Connect to Bluemix in the command line tool.$ cf api https://api.ng.bluemix.net$ cf login -u <your user ID>Create the Text to Speech service in Bluemix.$ cf create-service text_to_speech standard text-to-speech-servicePush it live!$ cf push

Command Line Option

Page 22: IBM Bootcamp - Text to Speech API Lab

22

Thanks

Colin McCabe@beekman33IBM North America Cloud Technical Evangelist