SlideShare a Scribd company logo
Developing OpenHRI -Open Source
Software Components for Human
Robot Interaction-

              Yosuke Matsusaka

    National Institute of Advanced Industrial
         Science and Technology (AIST)
Part 1: Overview
Background
•   Industrial Robot
     –   Expert Operator
     –   Specified Action
•   Personal/Service Robot
     –   Naïve User=Operator
     –   Various Situations and Tasks




•   "Communication" is the key function.
Why Development of Communication
       Function is Difficult?
Reason: Requires Wide Range of Knowledge
  -Audio Signal Processing
  -Lexical Analysis
  -Speech Recognition / Synthesis
  -Dialogue, etc…
As a result…:
  High development effort.
      (When we develop from scratch…)
  Low learning curve.
      (Because we are not the professional in
       developing communication systems…)
OpenHRI

•Reference implementation of common interface
specification defined in NEDO Intelligent Robot
Technology Software Project (NEDO-IRTS).

•Integrate wide range of open source software
based on RT-Middleware specification.
NEDO IRTS common interface
specification and OpenHRI
  PortAudio, PulseAudio speex                                 Julius

               (Octet)                    (Octet)
Audio Device             Audio Filters              Speech Recognizer

                                     NLTK                     (XML)
                              (under development)
                                                    Lexical Processors
                                  (Any)                       SEAT
          OpenJTalk,
                                                         [matsusaka2009]
           Festival                                           (XML)
                                                               Soar

    Speech                                   Dialogue Manager
  Synthesizer             (XML)
                                                              (Any)
                                  (Any)
Application Examples




HRP-series:
Humanoid platform for general
tasks.                                                    RH-1:
                                TAIZO:                    Office service robot.
                                Rehabilitation exercise   Guide service, Control
                                demonstration robot.      TVs, etc...
                                17 exercise + greet.
                                                           OpenHRP:
                                                           Virtual simulator
                                SmartPalTM:
                                Object handling.
                                Navigaion.                           openhrp-demo.m4v
Comparison to Previous Researches
          Auditory            Complete      Complete       Support for
          Components          Communication Robotic System Dialog
                              System                       Manager

HARK             ◯                    △                 ×         ×
                               (Julius-HARK)

ROS+HA           ◯                    ◯                 ◯         ×
RK                             (Pocket Sphinx,        (100 >
                                    Flite)          components)

Galatea          ×                    ◯                 ×         ×
                                   (Julius,
                                 GalateaTalk)

OpenHRI          △                    ◯                 ◯         ◯
          (less support for   (Julius, OpenJTalk,     (200 >
             mic. array)            Festival)       components)
Feature: Very Easy to Install

On Ubuntu (3 commands):
$ sudo apt-add-repository ppa:openhri/ppa
$ sudo apt-get update
$ sudo apt-get install openhriaudio openhrivoice
  seatsat
-> Install all the dependencies.

On Windows (integrated installer):
http://openhri.net/getinstaller.php
-> Hit "Yes" on all the dialogs.
Feature: Standard Oriented
        Grammar and Tools
W3C-SRGS
<?xml version="1.0" encoding="UTF-8" ?>
<grammar>
 <rule id="greet">
   <one-of>
    <item>hello</item>
    <item>good afternoon</item>
    <item>good evening</item>
    <item>good bye</item>
    <item>bye</item>
   </one-of>
 </rule>
Feature: Standard Oriented
          Grammar and Tools
W3C-SRGS




$ srgstojulius sample.grxml | juliustographviz | dot
Feature: Multi-Lingual Voice
          Recognition and Synthesis
Currently Support:
- Japanese (OpenJTalk, Julius runkit)
- English (Festival, Julius acoustic model by
  voxforge)

Experimental Support:
- German (MARY, Julius acoustic model by voxforge)

Planned:
- Spanish, Korean, ...
Part 2:
Development Techniques
Multi-Lingual Testing

Normal system composition:
   AudioIn      VoiceRecog   DialogMgmt      VoiceSynth   AudioOut


- Require native speakers to test each language.

Make it inverse!
   Sentence                                                Result
                VoiceSynth   (AudioFilter)   VoiceRecog
   Producer                                               Validator

※ Thanks to component architecture. We only have
 to modify component connections do this.
Document Automation

We use SPHINX and rtdoc (included in rtshell) to
 automatically generate component documents.

       Component                   I18N Component
        Source                        Documents
  (Document Embedded)             (reStructuredText)
                          rtdoc
       gettext + poedit                 SPHINX

        Component
          Binary                   Integrated I18N
     (I18N Document                   Document
        Embedded)                     (html, pdf)
Document Automation
Deployment
For Ubuntu system:
- Heavily rely on service of launchpad.net.
 Upload the source package ->
 1) Compile the binaries for all the architectures.
 2) Construct and host the debian repository.

For Windows system:
- In-house package update script.
- Heavily rely on full time programmer working for
   this project.
- [Work-in-progress] Automatic compilation and
   packaging.
Summary
Overview:
- OpenHRI is a complete, easy to install components
  to construct human robot interaction system.
- Common interface is standard oriented and shared
  with commercial software companies.
Development Techniques:
- Test is done by inversing the perceptual system.
- Document automaton by SPHINX, rtdoc.
- Deployment by launchpad, update script.

Next challenge and open question:
"How can we make this sustainable?"
NEDO’s Strategy to Spread
              RT-Middleware Industry
     -2009                                                       2010-


More          Industrial Developers
performance (Commercial components)                     Switch




      High initial cost                                     Open Source
                          Co                                 Developers
                               mm
                                 on                     (FLOSS components)
                                      Inte
                                             rfa
                                                   ce
                                                           Less initial cost
                   Application
                   Developers

More Related Content

Viewers also liked (20)

PPTX
Negative Impacts of Nuclear Power
Shubha Brota Raha
 
PDF
Diversity Management in Pepsi Co
Shubha Brota Raha
 
DOCX
Cost of Quality in Apparel Industry
Shubha Brota Raha
 
DOCX
THE GROWTH OF INDIAN PRINT MEDIA IN THE LAST DECADE AND ITS FALLOUTS
Shubha Brota Raha
 
DOCX
Waste Management in Garment Industries
Shubha Brota Raha
 
PPTX
JVRC Tutorial
Yosuke Matsusaka
 
PPT
Nextage hackathon
Koji Terada
 
PPT
Listen to the natives power comvídeo
Marseadg
 
PPTX
Foreign market entry
Shubha Brota Raha
 
PPTX
E-Mail Campaign
aixwebmktg
 
DOC
Understanding Google Analytics
Shubha Brota Raha
 
DOC
ประวัติวันแม่
ruangkhow
 
PPT
Introduction to csharp
Satish Verma
 
PPTX
Palestinian Statehood
Saúl Vázquez
 
PPT
Visual studio.net
Satish Verma
 
PPTX
Google Presentation
Saúl Vázquez
 
PPT
Peace out Pam
misterjoshie
 
DOC
Google adwords bidding process
Shubha Brota Raha
 
PPT
Gu iintro(java)
Satish Verma
 
PPTX
T3 la fotografia3
apuntsclasse
 
Negative Impacts of Nuclear Power
Shubha Brota Raha
 
Diversity Management in Pepsi Co
Shubha Brota Raha
 
Cost of Quality in Apparel Industry
Shubha Brota Raha
 
THE GROWTH OF INDIAN PRINT MEDIA IN THE LAST DECADE AND ITS FALLOUTS
Shubha Brota Raha
 
Waste Management in Garment Industries
Shubha Brota Raha
 
JVRC Tutorial
Yosuke Matsusaka
 
Nextage hackathon
Koji Terada
 
Listen to the natives power comvídeo
Marseadg
 
Foreign market entry
Shubha Brota Raha
 
E-Mail Campaign
aixwebmktg
 
Understanding Google Analytics
Shubha Brota Raha
 
ประวัติวันแม่
ruangkhow
 
Introduction to csharp
Satish Verma
 
Palestinian Statehood
Saúl Vázquez
 
Visual studio.net
Satish Verma
 
Google Presentation
Saúl Vázquez
 
Peace out Pam
misterjoshie
 
Google adwords bidding process
Shubha Brota Raha
 
Gu iintro(java)
Satish Verma
 
T3 la fotografia3
apuntsclasse
 

Similar to RSJ2011 OSS Robotics and Tools OpenHRI Intro (20)

PDF
MORSE & HRI -- SIMPAR 2014 (Florian Lier)
Florian Lier
 
PPTX
Aplikace pro rozpoznávání řeči - Jan Šedivý
Asociace UX (Prague ACM SIGCHI)
 
PDF
Erlang Embedded — Concurrent Blinkenlights and More!
Omer Kilic
 
PDF
IRJET- Virtual Vision for Blinds
IRJET Journal
 
PDF
OSWALD: Lessons from and for the Open Hardware Movement
OSU Open Source Lab
 
DOC
Eric Theis resume61.1
Eric Theis
 
PDF
Command, Goal Disambiguation, Introspection, and Instruction in Gesture-Free ...
Vladimir Kulyukin
 
PPTX
Velocity-EHF for Android
michaeljfawcett
 
PDF
Rapid prototyping with open source
Alison Chaiken
 
PDF
Open Accessibility EverywhereGroundwork, Infrastructure, Standards
AEGIS-ACCESSIBLE Projects
 
PDF
Ovp Introduction Presentation
simon56
 
PDF
Improve Android System Component Performance
National Cheng Kung University
 
PDF
Cooking an ontology-based spoken dialogue system
Joana Paulo Pardal
 
PDF
Aj31253258
IJERA Editor
 
PDF
A Graphical Language for Real-Time Critical Robot Commands
Serge Stinckwich
 
PDF
Offline Voice Control
Jishnu Rajeev
 
PPTX
robotics for embedded systems.pptx
VikasMahor3
 
PPTX
Language for Embedded System
vkrhanjeeth .
 
PPTX
Language for embedded system
vkrhanjeeth .
 
MORSE & HRI -- SIMPAR 2014 (Florian Lier)
Florian Lier
 
Aplikace pro rozpoznávání řeči - Jan Šedivý
Asociace UX (Prague ACM SIGCHI)
 
Erlang Embedded — Concurrent Blinkenlights and More!
Omer Kilic
 
IRJET- Virtual Vision for Blinds
IRJET Journal
 
OSWALD: Lessons from and for the Open Hardware Movement
OSU Open Source Lab
 
Eric Theis resume61.1
Eric Theis
 
Command, Goal Disambiguation, Introspection, and Instruction in Gesture-Free ...
Vladimir Kulyukin
 
Velocity-EHF for Android
michaeljfawcett
 
Rapid prototyping with open source
Alison Chaiken
 
Open Accessibility EverywhereGroundwork, Infrastructure, Standards
AEGIS-ACCESSIBLE Projects
 
Ovp Introduction Presentation
simon56
 
Improve Android System Component Performance
National Cheng Kung University
 
Cooking an ontology-based spoken dialogue system
Joana Paulo Pardal
 
Aj31253258
IJERA Editor
 
A Graphical Language for Real-Time Critical Robot Commands
Serge Stinckwich
 
Offline Voice Control
Jishnu Rajeev
 
robotics for embedded systems.pptx
VikasMahor3
 
Language for Embedded System
vkrhanjeeth .
 
Language for embedded system
vkrhanjeeth .
 
Ad

Recently uploaded (20)

PDF
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
PDF
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
PDF
Home Cleaning App Development Services.pdf
V3cube
 
PDF
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
PPTX
Digital Circuits, important subject in CS
contactparinay1
 
PDF
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
PPTX
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
PPTX
Manual Testing for Accessibility Enhancement
Julia Undeutsch
 
PDF
Software Development Company Keene Systems, Inc (1).pdf
Custom Software Development Company | Keene Systems, Inc.
 
PDF
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
PPTX
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
PDF
Evolution: How True AI is Redefining Safety in Industry 4.0
vikaassingh4433
 
PDF
Dev Dives: Accelerating agentic automation with Autopilot for Everyone
UiPathCommunity
 
PDF
🚀 Let’s Build Our First Slack Workflow! 🔧.pdf
SanjeetMishra29
 
PDF
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
PDF
Modern Decentralized Application Architectures.pdf
Kalema Edgar
 
PDF
Bitkom eIDAS Summit | European Business Wallet: Use Cases, Macroeconomics, an...
Carsten Stoecker
 
PPTX
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
PPTX
CapCut Pro PC Crack Latest Version Free Free
josanj305
 
PPTX
Role_of_Artificial_Intelligence_in_Livestock_Extension_Services.pptx
DrRajdeepMadavi
 
Transcript: Book industry state of the nation 2025 - Tech Forum 2025
BookNet Canada
 
CIFDAQ Market Wrap for the week of 4th July 2025
CIFDAQ
 
Home Cleaning App Development Services.pdf
V3cube
 
UPDF - AI PDF Editor & Converter Key Features
DealFuel
 
Digital Circuits, important subject in CS
contactparinay1
 
“Squinting Vision Pipelines: Detecting and Correcting Errors in Vision Models...
Edge AI and Vision Alliance
 
The Project Compass - GDG on Campus MSIT
dscmsitkol
 
Manual Testing for Accessibility Enhancement
Julia Undeutsch
 
Software Development Company Keene Systems, Inc (1).pdf
Custom Software Development Company | Keene Systems, Inc.
 
“Computer Vision at Sea: Automated Fish Tracking for Sustainable Fishing,” a ...
Edge AI and Vision Alliance
 
MuleSoft MCP Support (Model Context Protocol) and Use Case Demo
shyamraj55
 
Evolution: How True AI is Redefining Safety in Industry 4.0
vikaassingh4433
 
Dev Dives: Accelerating agentic automation with Autopilot for Everyone
UiPathCommunity
 
🚀 Let’s Build Our First Slack Workflow! 🔧.pdf
SanjeetMishra29
 
UiPath DevConnect 2025: Agentic Automation Community User Group Meeting
DianaGray10
 
Modern Decentralized Application Architectures.pdf
Kalema Edgar
 
Bitkom eIDAS Summit | European Business Wallet: Use Cases, Macroeconomics, an...
Carsten Stoecker
 
Mastering ODC + Okta Configuration - Chennai OSUG
HathiMaryA
 
CapCut Pro PC Crack Latest Version Free Free
josanj305
 
Role_of_Artificial_Intelligence_in_Livestock_Extension_Services.pptx
DrRajdeepMadavi
 
Ad

RSJ2011 OSS Robotics and Tools OpenHRI Intro

  • 1. Developing OpenHRI -Open Source Software Components for Human Robot Interaction- Yosuke Matsusaka National Institute of Advanced Industrial Science and Technology (AIST)
  • 3. Background • Industrial Robot – Expert Operator – Specified Action • Personal/Service Robot – Naïve User=Operator – Various Situations and Tasks • "Communication" is the key function.
  • 4. Why Development of Communication Function is Difficult? Reason: Requires Wide Range of Knowledge -Audio Signal Processing -Lexical Analysis -Speech Recognition / Synthesis -Dialogue, etc… As a result…: High development effort. (When we develop from scratch…) Low learning curve. (Because we are not the professional in developing communication systems…)
  • 5. OpenHRI •Reference implementation of common interface specification defined in NEDO Intelligent Robot Technology Software Project (NEDO-IRTS). •Integrate wide range of open source software based on RT-Middleware specification.
  • 6. NEDO IRTS common interface specification and OpenHRI PortAudio, PulseAudio speex Julius (Octet) (Octet) Audio Device Audio Filters Speech Recognizer NLTK (XML) (under development) Lexical Processors (Any) SEAT OpenJTalk, [matsusaka2009] Festival (XML) Soar Speech Dialogue Manager Synthesizer (XML) (Any) (Any)
  • 7. Application Examples HRP-series: Humanoid platform for general tasks. RH-1: TAIZO: Office service robot. Rehabilitation exercise Guide service, Control demonstration robot. TVs, etc... 17 exercise + greet. OpenHRP: Virtual simulator SmartPalTM: Object handling. Navigaion. openhrp-demo.m4v
  • 8. Comparison to Previous Researches Auditory Complete Complete Support for Components Communication Robotic System Dialog System Manager HARK ◯ △ × × (Julius-HARK) ROS+HA ◯ ◯ ◯ × RK (Pocket Sphinx, (100 > Flite) components) Galatea × ◯ × × (Julius, GalateaTalk) OpenHRI △ ◯ ◯ ◯ (less support for (Julius, OpenJTalk, (200 > mic. array) Festival) components)
  • 9. Feature: Very Easy to Install On Ubuntu (3 commands): $ sudo apt-add-repository ppa:openhri/ppa $ sudo apt-get update $ sudo apt-get install openhriaudio openhrivoice seatsat -> Install all the dependencies. On Windows (integrated installer): http://openhri.net/getinstaller.php -> Hit "Yes" on all the dialogs.
  • 10. Feature: Standard Oriented Grammar and Tools W3C-SRGS <?xml version="1.0" encoding="UTF-8" ?> <grammar> <rule id="greet"> <one-of> <item>hello</item> <item>good afternoon</item> <item>good evening</item> <item>good bye</item> <item>bye</item> </one-of> </rule>
  • 11. Feature: Standard Oriented Grammar and Tools W3C-SRGS $ srgstojulius sample.grxml | juliustographviz | dot
  • 12. Feature: Multi-Lingual Voice Recognition and Synthesis Currently Support: - Japanese (OpenJTalk, Julius runkit) - English (Festival, Julius acoustic model by voxforge) Experimental Support: - German (MARY, Julius acoustic model by voxforge) Planned: - Spanish, Korean, ...
  • 14. Multi-Lingual Testing Normal system composition: AudioIn VoiceRecog DialogMgmt VoiceSynth AudioOut - Require native speakers to test each language. Make it inverse! Sentence Result VoiceSynth (AudioFilter) VoiceRecog Producer Validator ※ Thanks to component architecture. We only have to modify component connections do this.
  • 15. Document Automation We use SPHINX and rtdoc (included in rtshell) to automatically generate component documents. Component I18N Component Source Documents (Document Embedded) (reStructuredText) rtdoc gettext + poedit SPHINX Component Binary Integrated I18N (I18N Document Document Embedded) (html, pdf)
  • 17. Deployment For Ubuntu system: - Heavily rely on service of launchpad.net. Upload the source package -> 1) Compile the binaries for all the architectures. 2) Construct and host the debian repository. For Windows system: - In-house package update script. - Heavily rely on full time programmer working for this project. - [Work-in-progress] Automatic compilation and packaging.
  • 18. Summary Overview: - OpenHRI is a complete, easy to install components to construct human robot interaction system. - Common interface is standard oriented and shared with commercial software companies. Development Techniques: - Test is done by inversing the perceptual system. - Document automaton by SPHINX, rtdoc. - Deployment by launchpad, update script. Next challenge and open question: "How can we make this sustainable?"
  • 19. NEDO’s Strategy to Spread RT-Middleware Industry -2009 2010- More Industrial Developers performance (Commercial components) Switch High initial cost Open Source Co Developers mm on (FLOSS components) Inte rfa ce Less initial cost Application Developers