
[Figure: server-side processing flow. Labels: HTTP Request (speech file, PCM, 16-bit, 16 kHz; device ID); speech recognizer (Julius); N-best ASR hypotheses; task classification; response generator; dialog response; text-to-speech (hts_engine) with cache check; synthesized speech file; recorded speech log; synthesized speech log; HTTP Response (speech file's URL, dialog response).]
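The "cache check" in front of text-to-speech can be illustrated with a minimal sketch. It assumes that synthesized speech is cached per response text, so a repeated response reuses the stored file. The poster does not describe how the cache is keyed or stored; the hash-based file name, the `voice` parameter, and the `fake_tts` stand-in for the synthesizer are all assumptions made for illustration.

```python
import hashlib
import tempfile
from pathlib import Path
from typing import Callable


def cached_speech_path(cache_dir: Path, voice: str, text: str) -> Path:
    """Map (voice, response text) to a stable file name inside cache_dir."""
    key = hashlib.sha256(f"{voice}\n{text}".encode("utf-8")).hexdigest()
    return cache_dir / f"{key}.wav"


def get_or_synthesize(cache_dir: Path, voice: str, text: str,
                      synthesize: Callable[[str], bytes]) -> Path:
    """Return the speech file for `text`, synthesizing only on a cache miss."""
    path = cached_speech_path(cache_dir, voice, text)
    if not path.exists():  # cache check
        cache_dir.mkdir(parents=True, exist_ok=True)
        path.write_bytes(synthesize(text))
    return path


def demo() -> int:
    """Request the same response twice and report how often synthesis ran."""
    calls = []

    def fake_tts(text: str) -> bytes:  # hypothetical stand-in for a real synthesizer
        calls.append(text)
        return b"fake-audio:" + text.encode("utf-8")

    with tempfile.TemporaryDirectory() as tmp:
        first = get_or_synthesize(Path(tmp), "takemaru", "Hello.", fake_tts)
        second = get_or_synthesize(Path(tmp), "takemaru", "Hello.", fake_tts)
        assert first == second  # the second request is served from the cache
    return len(calls)
```

In this sketch the server would return a URL for the cached file in the HTTP response rather than the audio itself, matching the "speech file's URL" label in the figure.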

System=“Takemaru-kun” (jp… User=1D5C0439-81E2-44EF-B715-4…

User> Hello.
System> Hello.

User> Your name?
System> My name is Takema...

User> Where is the nearest station?
System> The nearest station…

System=“Kita-chan” (jp.naist.spalab.Kitachan) User=1D5C0439-81E2-44EF-B715-4CC0CFF0171A

Kita-chan> Are you finding “Gakken-Kita-Ikoma” station?
Takemaru-kun> The nearest station is “Gakken-Kita-Ikoma” station.

Conclusion / Future works

Sunao Hara Okayama University, JAPAN

Hiromichi Kawanami Hiroshi Saruwatari Kiyohiro Shikano Nara Institute of Science and Technology, JAPAN

Development of a toolkit handling multiple speech-oriented guidance agents for mobile applications

[Figure: communication between the client, the main agent, and a sub agent. Labels: Main Agent (dialog management, including task classification); Sub Agent (dialog management, including task classification); HTTP Request (speech file, device ID, delegated agent's URI); HTTP Response (dialog response, delegated agent's URIs / sub-agent URIs).]

Selection of sub agents

Location-specific service using GPS (or A-GPS)

Experiments/evaluations under a real environment

A Toolkit for developing a Spoken Dialog System (SDS)

Server-and-client architecture. Each server can communicate with the other servers.
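As an illustration of the client side of this architecture, the sketch below checks that a recording is 16-bit PCM at 16 kHz (the format named in the poster's diagram) and packages it with a device ID. The WAV container, the mono channel count, the `X-Device-ID` header name, and the request layout are assumptions made for this example; the poster does not specify the wire format.

```python
import io
import wave

EXPECTED_SAMPLE_WIDTH = 2     # bytes per sample, i.e. 16-bit PCM
EXPECTED_SAMPLE_RATE = 16000  # Hz


def is_expected_format(wav_bytes: bytes) -> bool:
    """Check that a WAV payload is 16-bit PCM sampled at 16 kHz."""
    with wave.open(io.BytesIO(wav_bytes), "rb") as w:
        return (w.getsampwidth() == EXPECTED_SAMPLE_WIDTH
                and w.getframerate() == EXPECTED_SAMPLE_RATE)


def build_request(device_id: str, wav_bytes: bytes) -> dict:
    """Package speech and device ID as a request description (not sent anywhere)."""
    if not is_expected_format(wav_bytes):
        raise ValueError("speech must be 16-bit PCM sampled at 16 kHz")
    return {
        # "X-Device-ID" is a hypothetical header name chosen for this sketch.
        "headers": {"Content-Type": "audio/wav", "X-Device-ID": device_id},
        "body": wav_bytes,
    }


def make_silence(seconds: float, rate: int = EXPECTED_SAMPLE_RATE) -> bytes:
    """Generate a silent WAV clip, used here only as test input."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as w:
        w.setnchannels(1)  # mono is an assumption
        w.setsampwidth(EXPECTED_SAMPLE_WIDTH)
        w.setframerate(rate)
        w.writeframes(b"\x00\x00" * int(rate * seconds))
    return buf.getvalue()
```

Validating the format on the client keeps malformed audio from reaching the recognizer; a real client would then send the request over HTTP.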

Aim of this study: an SDS acting as a “generalist” is constructed from a large number of SDSs acting as “specialists”. Specialist: task-dependent dialog, location-dependent dialog, etc.

Approach: a single main agent (“generalist”) and multiple sub agents (“specialists”). Client system: iTakemaru (Takemaru-kun for iPhone).
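The main-agent and sub-agent approach can be sketched as follows. The main agent answers and, when its task classifier matches a specialist, also returns that sub agent's URI; the client then queries each delegated URI and collects the responses. This is a simplified illustration, not the toolkit's implementation: the keyword classifier, the example URIs, the reply texts, and the in-process `local_send` function (standing in for an HTTP round trip with speech recognition on the server) are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

MAIN_AGENT_URI = "http://example.org/agents/main"  # placeholder URI
# Hypothetical task-to-sub-agent table.
SUB_AGENTS: Dict[str, str] = {"station": "http://example.org/agents/sub"}


@dataclass
class AgentReply:
    dialog_response: str
    delegated_uris: List[str] = field(default_factory=list)


def classify_task(hypotheses: List[str]) -> str:
    """Toy keyword-based task classification over the N-best hypotheses."""
    for hyp in hypotheses:
        for task in SUB_AGENTS:
            if task in hyp.lower():
                return task
    return "general"


def main_agent(hypotheses: List[str]) -> AgentReply:
    """Generalist: reply, and delegate when a specialist covers the task."""
    task = classify_task(hypotheses)
    uris = [SUB_AGENTS[task]] if task in SUB_AGENTS else []
    return AgentReply(f"main agent reply (task: {task})", uris)


def sub_agent(hypotheses: List[str]) -> AgentReply:
    """Specialist: reply within its own task, without further delegation."""
    return AgentReply("sub agent reply (task: station)")


def local_send(uri: str, speech: str) -> AgentReply:
    """In-process stand-in for an HTTP round trip; `speech` is plain text here."""
    hypotheses = [speech]  # a real server would run ASR and produce an N-best list
    handler = main_agent if uri == MAIN_AGENT_URI else sub_agent
    return handler(hypotheses)


def client_turn(send: Callable[[str, str], AgentReply],
                main_uri: str, speech: str) -> List[str]:
    """Ask the main agent, then every delegated sub agent; collect the replies."""
    reply = send(main_uri, speech)
    responses = [reply.dialog_response]
    for uri in reply.delegated_uris:
        responses.append(send(uri, speech).dialog_response)
    return responses
```

Calling `client_turn(local_send, MAIN_AGENT_URI, "Where is the nearest station?")` yields two responses, one from each agent, while an utterance outside every specialist's task yields only the main agent's reply.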

Web 1.0: Information publishing (one-way). The URI is mainly used as a URL. Hyperlinking defines implicit relationships between resources. Bookmarks (Favorites) are tools for remembering a resource's URI.

Web 2.0: Information publishing (interactive). The concepts of usability and sharing became important. Examples: wikis, Web APIs, mash-up services, information tagging, etc.

Web 3.0: Information publishing (collaborative). Cloud: access to the user's data from any place. Social: information sharing with others. Examples: SNS, cloud computing, crowdsourcing, etc.

Web 4.0?: Information explosion: a rapid increase in the amount of published information, e.g. “Total Recall”. Information retrieval technology will become more important. A librarian (concierge) for each database or website may be promising. Examples: Lifelog, MyLifeBits (Microsoft), etc.


Single main agent and multiple sub agents


In this study, we propose a toolkit for handling multiple speech-oriented guidance agents for mobile applications. The toolkit uses a client-server architecture: we assume that the servers are located in a cloud-computing environment and that the clients are mobile phones such as the iPhone. It is difficult to develop an omnipotent spoken dialog system, but it is comparatively easy to develop a spoken dialog agent that has limited but deep knowledge. If such limited agents can communicate with each other, a spoken dialog system with wide-ranging knowledge can be created.
