Bringing Characters to Life with Amazon Polly Text-to-Speech - AWS Online Tech Talks
Post on 21-Jan-2018
© 2017, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Robin Dautricourt, Sr. Product Manager
Felix Duchesneau, Software Development Manager,
11/14/2017
Bringing Characters to Life with
Amazon Polly Text-to-Speech
Agenda for Today
Amazon Polly
• What is Amazon Polly?
• A short overview of how Text-to-Speech is produced.
• Text-to-Speech features.
• Game development use case.
Amazon Lumberyard
• What is Lumberyard?
• A short overview of how video games are made.
• Cloud assisted production tools.
• New experiences made possible.
What is Amazon Polly?
• AWS service that converts text into lifelike speech
• 50 voices across 24 languages
• Low latency responses allow developers to build real-time systems
• Developers can store, replay, and distribute generated speech
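As a rough illustration of the bullets above, the following Python sketch requests speech from Polly and stores the result for later replay. It assumes the boto3 SDK is installed and AWS credentials are configured; the helper names `build_polly_request` and `synthesize_to_file` and the default voice "Joanna" are illustrative choices, not something shown in the talk.

```python
from typing import Dict


def build_polly_request(text: str, voice_id: str = "Joanna",
                        output_format: str = "mp3") -> Dict[str, str]:
    """Assemble keyword arguments for Polly's SynthesizeSpeech operation."""
    if not text:
        raise ValueError("text must not be empty")
    return {
        "Text": text,
        "TextType": "text",
        "VoiceId": voice_id,          # assumed voice, pick any available one
        "OutputFormat": output_format,
    }


def synthesize_to_file(text: str, path: str, voice_id: str = "Joanna") -> None:
    """Call Polly and store the audio so it can be replayed or distributed."""
    # Imported lazily so the request builder can be used without AWS access.
    import boto3

    polly = boto3.client("polly")
    response = polly.synthesize_speech(**build_polly_request(text, voice_id))
    with open(path, "wb") as audio_file:
        audio_file.write(response["AudioStream"].read())
```

Calling `synthesize_to_file("Who's there?", "line.mp3")` would write an MP3 file, provided the account has access to Polly.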
Text-to-Speech Development
Text → Text normalization → Grapheme-to-phoneme conversion → Waveform generation → Speech
Text: She has $20 in her pocket.
Text normalization: she has twenty dollars in her pocket
Grapheme-to-phoneme conversion: ˈ ʃ i ˈ h æ z ˈ t w ɛ n . t i ˈ d ɑ . ɫ ə ɹ z ˈ ɪ n ˈ h ɝ ɹ ˈ p ɑ . k ə t
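The text normalization step can be sketched with a toy example. The Python below is not how Polly works internally; it is a deliberately small normalizer, written under the assumption that only whole-dollar amounts below 100 need expanding, and it reproduces the slide's example sentence.

```python
import re

_ONES = ["zero", "one", "two", "three", "four", "five", "six", "seven",
         "eight", "nine", "ten", "eleven", "twelve", "thirteen", "fourteen",
         "fifteen", "sixteen", "seventeen", "eighteen", "nineteen"]
_TENS = ["", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy",
         "eighty", "ninety"]


def number_to_words(n: int) -> str:
    """Spell out an integer from 0 to 99 (toy range, chosen for brevity)."""
    if not 0 <= n < 100:
        raise ValueError("only 0..99 is supported in this sketch")
    if n < 20:
        return _ONES[n]
    tens, ones = divmod(n, 10)
    return _TENS[tens] if ones == 0 else f"{_TENS[tens]}-{_ONES[ones]}"


def normalize(text: str) -> str:
    """Expand '$N' amounts, drop punctuation, and lowercase the text."""
    def expand_dollars(match: "re.Match[str]") -> str:
        amount = int(match.group(1))
        unit = "dollar" if amount == 1 else "dollars"
        return f"{number_to_words(amount)} {unit}"

    text = re.sub(r"\$(\d{1,2})\b", expand_dollars, text)
    text = re.sub(r"[^\w\s'-]", "", text)  # keep letters, digits, spaces, ' and -
    return text.lower().strip()
```

A production system also has to handle dates, abbreviations, ordinals, and language-specific rules, which this sketch leaves out.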
Speech Synthesis Markup Language (SSML) Features
<speak>I can speak normally,
<prosody volume="x-loud">or I can speak in a louder voice.</prosody>
</speak>
<speak>When I get excited,
<prosody rate="x-fast">I start to speak very fast.</prosody>
</speak>
<speak>Sometimes when I get nervous,
<prosody pitch="high">my pitch goes way up.</prosody></speak>
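When dialogue lines come from a script or a game tool, SSML like the examples above is usually generated rather than typed by hand. The following Python sketch builds such strings and escapes reserved XML characters; the helper name `directed_line` is hypothetical, and only the `volume`, `rate`, and `pitch` attributes shown on the slide are accepted.

```python
from xml.sax.saxutils import escape, quoteattr

_ALLOWED = {"volume", "rate", "pitch"}


def directed_line(lead_in: str, styled: str, **prosody_attrs: str) -> str:
    """Return SSML with plain lead-in text followed by a <prosody> span."""
    unknown = set(prosody_attrs) - _ALLOWED
    if unknown:
        raise ValueError(f"unsupported prosody attributes: {sorted(unknown)}")
    # Sort attributes so the generated markup is deterministic.
    attrs = "".join(
        f" {name}={quoteattr(value)}"
        for name, value in sorted(prosody_attrs.items())
    )
    return (
        f"<speak>{escape(lead_in)} "
        f"<prosody{attrs}>{escape(styled)}</prosody></speak>"
    )
```

The resulting string would be sent to Polly with the text type set to SSML instead of plain text.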
Speech Marks Feature
Synchronize Speech for an Enhanced Visual Experience
• Speech Marks: additional metadata stream
• Timing information for visemes, words, sentences, SSML
Sample use cases:
• Karaoke-style text highlighting
• Facial animation (lip-syncing)
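Karaoke-style highlighting can be sketched as follows. Polly returns speech marks as one JSON object per line, with fields such as `time` (milliseconds from the start of the audio), `type`, and `value`. The sample data below is hand-written for illustration and is not real Polly output; the function names are hypothetical, and the lookup assumes the marks are sorted by time.

```python
import json
from typing import Dict, List, Optional

# Hand-written sample in the line-delimited JSON shape of speech marks.
SAMPLE_MARKS = "\n".join([
    '{"time": 0, "type": "sentence", "start": 0, "end": 15, "value": "Who goes there?"}',
    '{"time": 6, "type": "word", "start": 0, "end": 3, "value": "Who"}',
    '{"time": 180, "type": "word", "start": 4, "end": 8, "value": "goes"}',
    '{"time": 420, "type": "word", "start": 9, "end": 14, "value": "there"}',
])


def parse_speech_marks(raw: str) -> List[Dict]:
    """Parse one JSON object per non-empty line."""
    return [json.loads(line) for line in raw.splitlines() if line.strip()]


def word_at(marks: List[Dict], time_ms: int) -> Optional[str]:
    """Return the word being spoken at time_ms, or None before the first word."""
    current = None
    for mark in marks:
        if mark["type"] != "word":
            continue
        if mark["time"] > time_ms:
            break
        current = mark["value"]
    return current
```

The same lookup applied to marks of type `viseme` would drive mouth shapes for lip-syncing instead of text highlighting.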
Amazon Polly Use Cases
Telephony / IVR
Special Needs
AI Assistant
AI Announcer
Video Voice-Over
Navigation
Content Creation
Education Technology
Podcasts
Game Development
• Pre-Production Speech
• Character Casting
• Directing (with SSML)
• Real-time Speech
• Store and Replay
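The "Store and Replay" idea can be sketched as a small on-disk cache keyed by voice and text, so that a given line is synthesized only once. This is an assumed design rather than anything shown in the talk: the `SpeechCache` class is hypothetical, and the `synthesize` callable stands in for whatever function actually calls Polly.

```python
import hashlib
from pathlib import Path
from typing import Callable


class SpeechCache:
    """Store generated audio on disk and replay it on later requests."""

    def __init__(self, directory: Path,
                 synthesize: Callable[[str, str], bytes]) -> None:
        self.directory = Path(directory)
        self.directory.mkdir(parents=True, exist_ok=True)
        self.synthesize = synthesize  # (voice, text) -> audio bytes

    def _path_for(self, voice: str, text: str) -> Path:
        # Hash voice and text together so each casting choice gets its own file.
        digest = hashlib.sha256(f"{voice}\n{text}".encode("utf-8")).hexdigest()
        return self.directory / f"{digest}.mp3"

    def get(self, voice: str, text: str) -> bytes:
        path = self._path_for(voice, text)
        if path.exists():
            return path.read_bytes()           # replay the stored audio
        audio = self.synthesize(voice, text)   # generate once
        path.write_bytes(audio)
        return audio
```

Because the key includes the voice, recasting a character produces new files without invalidating lines already generated for other voices.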
Amazon Polly for Game Development - Amazon Lumberyard Use Case
What is Lumberyard?
Amazon Lumberyard is a free AAA game engine deeply
integrated with AWS and Twitch – with full source.
Video game development in a nutshell
Producing a video game
Time: Conception → Pre-production → Production → Game launches → Post-launch …
Producing a video game
The environment will be under construction for most of the production.
The player, the NPCs, the gameplay mechanics, and the storytelling need to be perfectly adapted to the environment.
The assets (characters, animations, props, audio) will also be under development for most of the production.
Producing a video game
Time
Produce 100s of characters
Put assets and characters in world
Build the world with tools in development
Produce 1000s of assets
Implement & improve gameplay mechanics
Playtest, iterate, find and improve the fun
Storytelling through cinematics and scripted events
Signs and feedback
Fix 1000s of bugs
Missions and level design
Assemble everything
Hope for the best
R&D and Experimentation
Develop new tools to build faster
Optimize
Prepare day-one patch
Placeholders will be used until the very end
- Reserve “physical” space
- Reserve memory budget
- Reserve time (timing-wise)
- Iterate and playtest
- Test if there is enough variety
- Test the localization
- …
Storytelling - consistent story in a consistent world.
The story will be told by:
- Cinematics
- Scripted events
- Ambient dialogues and audio
- Props, accessories
- Lighting
- Weather conditions
Game ≠ Story
Find the fun through play tests and iteration
Game mechanics
Player must understand what is going on.
Player must understand gameplay mechanics
Gameplay mechanics == computer algorithms
Player understands computer algorithms
How?
Signs and feedback
Signs and feedback
- UI elements
- Sounds
- Ambient dialogues
- NPC dialogues
- FX
Gameplay mechanics – An example
A stealth game with a gameplay loop of 4 states
Relaxed, Alert, Search, Combat
Relaxed state
Wait
Patrol
Enemy seen
Enemy heard
Player has been momentarily seen or heard
Alert
“Did you hear something?”
“What was that?”
“What was that noise?”
“Huh?”
Alert state
Unseen (--)
Seen/heard (++)
Detected!
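The alert-state slide suggests a meter that rises while the player is seen or heard (++), falls while the player is unseen (--), and triggers detection when full. The Python sketch below is one way to model that. The class name, the fill and drain rates, and the 0 to 1 range are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class DetectionMeter:
    """Rises while the player is perceived, drains otherwise."""

    fill_rate: float = 0.5    # assumed: meter units per second while seen/heard
    drain_rate: float = 0.25  # assumed: meter units per second while unseen
    level: float = 0.0        # 0.0 = unaware, 1.0 = detected

    def update(self, perceived: bool, dt: float) -> bool:
        """Advance the meter by dt seconds; return True once detected."""
        delta = self.fill_rate if perceived else -self.drain_rate
        self.level = min(1.0, max(0.0, self.level + delta * dt))
        return self.level >= 1.0
```

Calling `update` once per frame with that frame's duration lets short glimpses decay, while sustained exposure ends in "Detected!".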
Player did not hide soon enough
Search
“Show yourself!”
“Over there! I think I saw something.”
“Who’s there?”
Search
“Come out, we don’t have all night.”
“Come out, come out, wherever you are!”
“Hope you have good life insurance, dude.”
“It takes some guts to come in here uninvited.”
Search state
Investigate last known position
Search around last known position
Spread
Enemy seen
Enemy heard
Enemies are attacking – Time to run or fight
Combat
“Get him!”
“There he is!”
“Attaaaaaaack!”
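The four-state loop and its spoken feedback can be sketched as a small state machine. The dialogue lines below are taken from the slides, but the event names and several of the transitions (in particular every way back to a calmer state) are assumptions, since the slides do not spell them out.

```python
import random
from enum import Enum
from typing import Optional


class State(Enum):
    RELAXED = "relaxed"
    ALERT = "alert"
    SEARCH = "search"
    COMBAT = "combat"


# Lines an NPC may say when entering a state (taken from the slides).
BARKS = {
    State.ALERT: ["Did you hear something?", "What was that?",
                  "What was that noise?", "Huh?"],
    State.SEARCH: ["Show yourself!", "Over there! I think I saw something.",
                   "Who’s there?"],
    State.COMBAT: ["Get him!", "There he is!", "Attaaaaaaack!"],
}

# (current state, event) -> next state. Event names are hypothetical.
TRANSITIONS = {
    (State.RELAXED, "glimpsed"): State.ALERT,   # momentarily seen or heard
    (State.ALERT, "detected"): State.SEARCH,    # did not hide soon enough
    (State.ALERT, "lost"): State.RELAXED,       # assumption
    (State.SEARCH, "found"): State.COMBAT,      # enemies attack
    (State.SEARCH, "lost"): State.RELAXED,      # assumption
    (State.COMBAT, "lost"): State.SEARCH,       # assumption
}


class Guard:
    """An NPC that changes state on events and barks on each change."""

    def __init__(self, rng: Optional[random.Random] = None) -> None:
        self.state = State.RELAXED
        self.rng = rng or random.Random()

    def handle(self, event: str) -> Optional[str]:
        """Apply an event; return a line to speak, or None if nothing changes."""
        next_state = TRANSITIONS.get((self.state, event))
        if next_state is None:
            return None
        self.state = next_state
        lines = BARKS.get(next_state)
        return self.rng.choice(lines) if lines else None
```

During production, the line returned by `handle` could be sent to a text-to-speech service as placeholder audio, so the feedback loop can be playtested before final voice recordings exist.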
Cloud assisted production tools
Three Parts of a Cloud Gem
Backend infrastructure
Client Component
Cloud Gem Portal Plugin
New runtime game experiences made possible