SlideShare a Scribd company logo
Mohamed Amine Chatti Informatik 5, RWTH Aachen, Germany PROLEARN Network of Excellence ALOA  – A Web Services Driven Framework for  A utomatic  L earning  O bject  A nnotation
Agenda Why Automatic Metadata Generation? AMG v.1 AMG v.2 – SAmgI ALOA ALOA and AMG Conclusion and Future Work
Metadata Metadata is crucial for search, access, share, and reuse.  Dealing with metadata cannot be a human task (Duval and Hodgins, 2004)   Complex metadata standards (e.g. 9 LOM categories and  45 records of LOM level two) Benefit not immediately appreciated Metadata creators too expensive to be employed Tools not user friendly (“electronic forms must die”) Need for Automatic Metadata Generation
Automatic Approach Use information about the LO and its context to extract or generate its metadata. 4 aspects of AMG (Cardinaels et al., 2005) Content analysis (LO itself, e.g. keyword, language) Context analysis (environment the LO is stored or used in, e.g. LMS) Usage analysis (e.g. time spent reading a doc) Structure analysis (relationship amongst LOs)
AMG at KUL (Cardinaels et al., 2005; Ochoa et al., 2005) AMG v.1
It was an application (Java-based) No support for different languages Not possible to have a metadata subset as a result Not flexible and extensible Not really interoperable between platforms AMG v.1 Limitations
AMG v.2 Federated AMG Simple AMG Interface (SAmgI) (Meire et al., 2007) Main Design Goals: Extensibility – Pluggability Interoperability (Service oriented)
ObjectBasedGenerators based on the Factory design pattern Problem: checkout source code, recompile and rebuild the whole application AMG v.2 Extensibility
Federated AMG Engine - SAmgI installations / service endpoints Problem: some programming required (SAmgI WSDL specification, XML schemas, etc.) AMG v.2 Interoperability
ALOA A Framework for LOM-based Automatic LO Annotation Service Oriented Architecture (SOA) / Web Services Main focus on flexibility and extensibility
Indexer  performing these actions: read all configurations in the properties file (i.e. available extractors and generators, priority of each generator, maximum generated values) access the LO as an array of bytes detect the mime type of the LO look for the available extractor for this particular mime type extract the content and the embedded properties of the LO contact the available generators solve conflicts translate the generated metadata into the required languages return the generation result to the Web Service stub  ConflictResolver considers priorities of the generators Translator uses Google Translate as its translation service ALOA Core Engine
Extractors extract content information and embedded properties from LOs only one extractor for each LO mime type html extractor (Jericho library) pdf extractor (pdfBox library) word extractor (Apache POI library) ppt extractor (Apache POI library) Generators use the output of the extractors to generate one or parts of the metadata text/data mining libraries (e.g. Yahoo! Term Extraction, Tagthe, Topicalizer, LingPipe, Balie, Classifier4J) ALOA Components
Based on the ALOA Web Services API Automatically generate metadata from online LOs (html, plain text, word, ppt, pdf) Parameters URL location of the LO Target metadata languages (English, German, Arabic, French, Spanish, Korean) Subset of the generated metadata Output format (LOM XML, HTML, LOM Editor) ALOA User Interface
Enables to easily plug-in new components (extractors and generators), for instance: Extractor for multimedia LO (e.g. audio, video, image, flash) Generator for a specific context (e.g. LMS) The components can be deployed on different machines or on different application servers Once deployed, a component can be plugged into ALOA by just giving the address of the component service ALOA core engine validates and adds it to the component list in the properties file Dynamic addition in run time; no need to recompile and rebuild the system ALOA CMI also enables to manage the priorities of the generators and to define the maximum generated values (used by ALOA core engine) ALOA Configuration  Management Interface
ALOA adopts a slightly modified version of SAmgI WSDL specification New methods:  getLanguages ,  setLanguages Modified method:  getMetadata Web Services-based interactions between ALOA and AMG possible ALOA as a new SAmgI installation used by the federated AMG engine AMG as a new component (i.e. extractor or generator) of ALOA  ALOA and AMG
ALOA – A framework for LOM-based automatic metadata generation ALOA already implements different components (i.e. extractors and generators) ALOA already generates LOM from different types of LOs (html, plain text, pdf, ppt, word) Primary focus on flexibility and extensibility of the framework SOA-based architecture enabling new components to be easily plugged into the basic system ALOA provides a public Web Services API for third party applications Conclusion
Interactions between ALOA and AMG Extension with more extractors and generators based on other text/data mining techniques Look at model transformation techniques to support other metadata schemas (e.g. DC, MPEG) Further research of the quality of automatically generated metadata Combination of automatic metadata generation with a bottom up approach (e.g. Web 2.0 social tagging) Future Work
Thank You!
Ad

More Related Content

What's hot (12)

Object-Relational Mapping and Dependency Injection
Object-Relational Mapping and Dependency InjectionObject-Relational Mapping and Dependency Injection
Object-Relational Mapping and Dependency Injection
Shane Church
 
Mule soft meetup_4_mty_online_oct_2020
Mule soft meetup_4_mty_online_oct_2020Mule soft meetup_4_mty_online_oct_2020
Mule soft meetup_4_mty_online_oct_2020
Veyra Celina
 
OPC UA - Information Models & Companion Specifications
OPC UA - Information Models & Companion SpecificationsOPC UA - Information Models & Companion Specifications
OPC UA - Information Models & Companion Specifications
Stephane Potier
 
owb-platform-adapter-cookbook-177344
owb-platform-adapter-cookbook-177344owb-platform-adapter-cookbook-177344
owb-platform-adapter-cookbook-177344
Carnot Antonio Romero
 
Oracle9i Introduction To Sql Version 2
Oracle9i Introduction To Sql Version 2Oracle9i Introduction To Sql Version 2
Oracle9i Introduction To Sql Version 2
Thuan Nguyen
 
Graham r cit_presentation
Graham r cit_presentationGraham r cit_presentation
Graham r cit_presentation
rodger.graham
 
Object Relational Mapping In Real World Applications
Object Relational Mapping In Real World ApplicationsObject Relational Mapping In Real World Applications
Object Relational Mapping In Real World Applications
PhilWinstanley
 
J2EE PPT --CINTHIYA.M Krishnammal college for women
J2EE PPT --CINTHIYA.M Krishnammal college for womenJ2EE PPT --CINTHIYA.M Krishnammal college for women
J2EE PPT --CINTHIYA.M Krishnammal college for women
lissa cidhi
 
Les01
Les01Les01
Les01
Sudharsan S
 
Flex and PHP For the Flash Folks
Flex and PHP For the Flash FolksFlex and PHP For the Flash Folks
Flex and PHP For the Flash Folks
10n Software, LLC
 
Oracle forms les02
Oracle forms  les02Oracle forms  les02
Oracle forms les02
Abed Othman
 
Les24
Les24Les24
Les24
Sudharsan S
 
Object-Relational Mapping and Dependency Injection
Object-Relational Mapping and Dependency InjectionObject-Relational Mapping and Dependency Injection
Object-Relational Mapping and Dependency Injection
Shane Church
 
Mule soft meetup_4_mty_online_oct_2020
Mule soft meetup_4_mty_online_oct_2020Mule soft meetup_4_mty_online_oct_2020
Mule soft meetup_4_mty_online_oct_2020
Veyra Celina
 
OPC UA - Information Models & Companion Specifications
OPC UA - Information Models & Companion SpecificationsOPC UA - Information Models & Companion Specifications
OPC UA - Information Models & Companion Specifications
Stephane Potier
 
owb-platform-adapter-cookbook-177344
owb-platform-adapter-cookbook-177344owb-platform-adapter-cookbook-177344
owb-platform-adapter-cookbook-177344
Carnot Antonio Romero
 
Oracle9i Introduction To Sql Version 2
Oracle9i Introduction To Sql Version 2Oracle9i Introduction To Sql Version 2
Oracle9i Introduction To Sql Version 2
Thuan Nguyen
 
Graham r cit_presentation
Graham r cit_presentationGraham r cit_presentation
Graham r cit_presentation
rodger.graham
 
Object Relational Mapping In Real World Applications
Object Relational Mapping In Real World ApplicationsObject Relational Mapping In Real World Applications
Object Relational Mapping In Real World Applications
PhilWinstanley
 
J2EE PPT --CINTHIYA.M Krishnammal college for women
J2EE PPT --CINTHIYA.M Krishnammal college for womenJ2EE PPT --CINTHIYA.M Krishnammal college for women
J2EE PPT --CINTHIYA.M Krishnammal college for women
lissa cidhi
 
Flex and PHP For the Flash Folks
Flex and PHP For the Flash FolksFlex and PHP For the Flash Folks
Flex and PHP For the Flash Folks
10n Software, LLC
 
Oracle forms les02
Oracle forms  les02Oracle forms  les02
Oracle forms les02
Abed Othman
 

Viewers also liked (20)

Alg1 Ch9 Review Pg1
Alg1 Ch9 Review Pg1Alg1 Ch9 Review Pg1
Alg1 Ch9 Review Pg1
HeatherHunt
 
<meta />
<meta /><meta />
<meta />
Daniel
 
Elenco Candidati elezioni regionali 2010 Puglia - Provincia di Brindisi
Elenco Candidati elezioni regionali 2010 Puglia - Provincia di BrindisiElenco Candidati elezioni regionali 2010 Puglia - Provincia di Brindisi
Elenco Candidati elezioni regionali 2010 Puglia - Provincia di Brindisi
www.oria.info
 
Final Illustration3
Final Illustration3Final Illustration3
Final Illustration3
klsw4
 
Tecnologia
TecnologiaTecnologia
Tecnologia
Jeison Hurtado
 
Presentation1
Presentation1Presentation1
Presentation1
tbjerkvik
 
¿Como poner un reproductor de musica?
¿Como poner un reproductor de musica?¿Como poner un reproductor de musica?
¿Como poner un reproductor de musica?
minigolete
 
OB_Manager_Barometer_2016-17
OB_Manager_Barometer_2016-17OB_Manager_Barometer_2016-17
OB_Manager_Barometer_2016-17
Kristin Brodel
 
Andrews 1977 Parasitological techniques
Andrews 1977 Parasitological techniquesAndrews 1977 Parasitological techniques
Andrews 1977 Parasitological techniques
Chris Andrews
 
Class 23 1 a
Class 23 1 aClass 23 1 a
Class 23 1 a
jordanlachance
 
Nicola Minervini na Bahia - Out 2013 - Inscreva-se e veja como as PMEs italia...
Nicola Minervini na Bahia - Out 2013 - Inscreva-se e veja como as PMEs italia...Nicola Minervini na Bahia - Out 2013 - Inscreva-se e veja como as PMEs italia...
Nicola Minervini na Bahia - Out 2013 - Inscreva-se e veja como as PMEs italia...
EXPEDITO SILVA
 
www.posjeti/server/hvala.comNew Text Document
www.posjeti/server/hvala.comNew Text Documentwww.posjeti/server/hvala.comNew Text Document
www.posjeti/server/hvala.comNew Text Document
guestb7102a
 
Mapa Distrito Centro Historico Ord 1071
Mapa Distrito Centro Historico Ord 1071Mapa Distrito Centro Historico Ord 1071
Mapa Distrito Centro Historico Ord 1071
EvelinRomina
 
Live2give
Live2giveLive2give
Live2give
careercontrol
 
Fashion Technology Portfolio Pratigya Bhatnagar B.Sc Fashion Technology,Dezy...
Fashion Technology Portfolio Pratigya  Bhatnagar B.Sc Fashion Technology,Dezy...Fashion Technology Portfolio Pratigya  Bhatnagar B.Sc Fashion Technology,Dezy...
Fashion Technology Portfolio Pratigya Bhatnagar B.Sc Fashion Technology,Dezy...
dezyneecole
 
Fashion Design Portfolio By Tanu Chorasiya, B.Sc. Fashion Design
Fashion Design Portfolio By Tanu Chorasiya, B.Sc. Fashion Design Fashion Design Portfolio By Tanu Chorasiya, B.Sc. Fashion Design
Fashion Design Portfolio By Tanu Chorasiya, B.Sc. Fashion Design
dezyneecole
 
Alg1 Ch9 Review Pg1
Alg1 Ch9 Review Pg1Alg1 Ch9 Review Pg1
Alg1 Ch9 Review Pg1
HeatherHunt
 
<meta />
<meta /><meta />
<meta />
Daniel
 
Elenco Candidati elezioni regionali 2010 Puglia - Provincia di Brindisi
Elenco Candidati elezioni regionali 2010 Puglia - Provincia di BrindisiElenco Candidati elezioni regionali 2010 Puglia - Provincia di Brindisi
Elenco Candidati elezioni regionali 2010 Puglia - Provincia di Brindisi
www.oria.info
 
Final Illustration3
Final Illustration3Final Illustration3
Final Illustration3
klsw4
 
Presentation1
Presentation1Presentation1
Presentation1
tbjerkvik
 
¿Como poner un reproductor de musica?
¿Como poner un reproductor de musica?¿Como poner un reproductor de musica?
¿Como poner un reproductor de musica?
minigolete
 
OB_Manager_Barometer_2016-17
OB_Manager_Barometer_2016-17OB_Manager_Barometer_2016-17
OB_Manager_Barometer_2016-17
Kristin Brodel
 
Andrews 1977 Parasitological techniques
Andrews 1977 Parasitological techniquesAndrews 1977 Parasitological techniques
Andrews 1977 Parasitological techniques
Chris Andrews
 
Nicola Minervini na Bahia - Out 2013 - Inscreva-se e veja como as PMEs italia...
Nicola Minervini na Bahia - Out 2013 - Inscreva-se e veja como as PMEs italia...Nicola Minervini na Bahia - Out 2013 - Inscreva-se e veja como as PMEs italia...
Nicola Minervini na Bahia - Out 2013 - Inscreva-se e veja como as PMEs italia...
EXPEDITO SILVA
 
www.posjeti/server/hvala.comNew Text Document
www.posjeti/server/hvala.comNew Text Documentwww.posjeti/server/hvala.comNew Text Document
www.posjeti/server/hvala.comNew Text Document
guestb7102a
 
Mapa Distrito Centro Historico Ord 1071
Mapa Distrito Centro Historico Ord 1071Mapa Distrito Centro Historico Ord 1071
Mapa Distrito Centro Historico Ord 1071
EvelinRomina
 
Fashion Technology Portfolio Pratigya Bhatnagar B.Sc Fashion Technology,Dezy...
Fashion Technology Portfolio Pratigya  Bhatnagar B.Sc Fashion Technology,Dezy...Fashion Technology Portfolio Pratigya  Bhatnagar B.Sc Fashion Technology,Dezy...
Fashion Technology Portfolio Pratigya Bhatnagar B.Sc Fashion Technology,Dezy...
dezyneecole
 
Fashion Design Portfolio By Tanu Chorasiya, B.Sc. Fashion Design
Fashion Design Portfolio By Tanu Chorasiya, B.Sc. Fashion Design Fashion Design Portfolio By Tanu Chorasiya, B.Sc. Fashion Design
Fashion Design Portfolio By Tanu Chorasiya, B.Sc. Fashion Design
dezyneecole
 
Ad

Similar to Aloa - A Web Services Driven Framework for Automatic Learning Objcet Annotation (20)

Towards a Web Search Service for Minority Language Communities
Towards a Web Search Service for Minority Language CommunitiesTowards a Web Search Service for Minority Language Communities
Towards a Web Search Service for Minority Language Communities
Baden Hughes
 
Genre discovery in corpus management systems (2004)
Genre discovery in corpus management systems (2004)Genre discovery in corpus management systems (2004)
Genre discovery in corpus management systems (2004)
Joseba Abaitua
 
Managing the Digitization of Large Press Archives
Managing the Digitization of Large Press ArchivesManaging the Digitization of Large Press Archives
Managing the Digitization of Large Press Archives
DLFCLIR
 
The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with EuropeanaThe Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
Vangelis Banos
 
Niatalk24jan10
Niatalk24jan10Niatalk24jan10
Niatalk24jan10
Sunita Barve
 
Open Source ILS Add-Ons
Open Source ILS Add-OnsOpen Source ILS Add-Ons
Open Source ILS Add-Ons
loriayre
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
Jack Eapen
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
Jack Eapen
 
Interoperability issues between learning object repositories and metadata har...
Interoperability issues between learning object repositories and metadata har...Interoperability issues between learning object repositories and metadata har...
Interoperability issues between learning object repositories and metadata har...
Ricard de la Vega
 
Accessibility, Automation and Metadata
Accessibility, Automation and MetadataAccessibility, Automation and Metadata
Accessibility, Automation and Metadata
lisbk
 
Automatic metadata generation
Automatic metadata generationAutomatic metadata generation
Automatic metadata generation
hachilde
 
Reorienting open repositories to the challenges of the Semantic Web: Experien...
Reorienting open repositories to the challenges of the Semantic Web: Experien...Reorienting open repositories to the challenges of the Semantic Web: Experien...
Reorienting open repositories to the challenges of the Semantic Web: Experien...
AIMS (Agricultural Information Management Standards)
 
Greenstone aib 16_feb12_casarosa
Greenstone aib 16_feb12_casarosaGreenstone aib 16_feb12_casarosa
Greenstone aib 16_feb12_casarosa
accessoinformazione
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
Jon Voss
 
Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817
Figoblog
 
Linked Open Data for Libraries, Archives, and Museums: An Aggregators View
Linked Open Data for Libraries, Archives, and Museums: An Aggregators ViewLinked Open Data for Libraries, Archives, and Museums: An Aggregators View
Linked Open Data for Libraries, Archives, and Museums: An Aggregators View
Richard Urban
 
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...Metadata Quality Evaluation: Experience from the Open Language Archives Commu...
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...
Baden Hughes
 
Lit mtap
Lit mtapLit mtap
Lit mtap
Andrea Ferracani
 
SAmgI: Automatic Metadata Generation v.2
SAmgI: Automatic Metadata Generation v.2SAmgI: Automatic Metadata Generation v.2
SAmgI: Automatic Metadata Generation v.2
Xavier Ochoa
 
BASE : a powerful search engine for Open Access documents
BASE : a powerful search engine for Open Access documents BASE : a powerful search engine for Open Access documents
BASE : a powerful search engine for Open Access documents
AIMS (Agricultural Information Management Standards)
 
Towards a Web Search Service for Minority Language Communities
Towards a Web Search Service for Minority Language CommunitiesTowards a Web Search Service for Minority Language Communities
Towards a Web Search Service for Minority Language Communities
Baden Hughes
 
Genre discovery in corpus management systems (2004)
Genre discovery in corpus management systems (2004)Genre discovery in corpus management systems (2004)
Genre discovery in corpus management systems (2004)
Joseba Abaitua
 
Managing the Digitization of Large Press Archives
Managing the Digitization of Large Press ArchivesManaging the Digitization of Large Press Archives
Managing the Digitization of Large Press Archives
DLFCLIR
 
The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with EuropeanaThe Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
The Hellenic Aggregator - Overview, procedures & the cooperation with Europeana
Vangelis Banos
 
Open Source ILS Add-Ons
Open Source ILS Add-OnsOpen Source ILS Add-Ons
Open Source ILS Add-Ons
loriayre
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
Jack Eapen
 
Digital Libraries
Digital LibrariesDigital Libraries
Digital Libraries
Jack Eapen
 
Interoperability issues between learning object repositories and metadata har...
Interoperability issues between learning object repositories and metadata har...Interoperability issues between learning object repositories and metadata har...
Interoperability issues between learning object repositories and metadata har...
Ricard de la Vega
 
Accessibility, Automation and Metadata
Accessibility, Automation and MetadataAccessibility, Automation and Metadata
Accessibility, Automation and Metadata
lisbk
 
Automatic metadata generation
Automatic metadata generationAutomatic metadata generation
Automatic metadata generation
hachilde
 
Greenstone aib 16_feb12_casarosa
Greenstone aib 16_feb12_casarosaGreenstone aib 16_feb12_casarosa
Greenstone aib 16_feb12_casarosa
accessoinformazione
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
Jon Voss
 
Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817Ifla swsig meeting - Puerto Rico - 20110817
Ifla swsig meeting - Puerto Rico - 20110817
Figoblog
 
Linked Open Data for Libraries, Archives, and Museums: An Aggregators View
Linked Open Data for Libraries, Archives, and Museums: An Aggregators ViewLinked Open Data for Libraries, Archives, and Museums: An Aggregators View
Linked Open Data for Libraries, Archives, and Museums: An Aggregators View
Richard Urban
 
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...Metadata Quality Evaluation: Experience from the Open Language Archives Commu...
Metadata Quality Evaluation: Experience from the Open Language Archives Commu...
Baden Hughes
 
SAmgI: Automatic Metadata Generation v.2
SAmgI: Automatic Metadata Generation v.2SAmgI: Automatic Metadata Generation v.2
SAmgI: Automatic Metadata Generation v.2
Xavier Ochoa
 
Ad

Recently uploaded (20)

Risk Analysis 101: Using a Risk Analyst to Fortify Your IT Strategy
Risk Analysis 101: Using a Risk Analyst to Fortify Your IT StrategyRisk Analysis 101: Using a Risk Analyst to Fortify Your IT Strategy
Risk Analysis 101: Using a Risk Analyst to Fortify Your IT Strategy
john823664
 
Scientific Large Language Models in Multi-Modal Domains
Scientific Large Language Models in Multi-Modal DomainsScientific Large Language Models in Multi-Modal Domains
Scientific Large Language Models in Multi-Modal Domains
syedanidakhader1
 
Longitudinal Benchmark: A Real-World UX Case Study in Onboarding by Linda Bor...
Longitudinal Benchmark: A Real-World UX Case Study in Onboarding by Linda Bor...Longitudinal Benchmark: A Real-World UX Case Study in Onboarding by Linda Bor...
Longitudinal Benchmark: A Real-World UX Case Study in Onboarding by Linda Bor...
UXPA Boston
 
Building the Customer Identity Community, Together.pdf
Building the Customer Identity Community, Together.pdfBuilding the Customer Identity Community, Together.pdf
Building the Customer Identity Community, Together.pdf
Cheryl Hung
 
Computer Systems Quiz Presentation in Purple Bold Style (4).pdf
Computer Systems Quiz Presentation in Purple Bold Style (4).pdfComputer Systems Quiz Presentation in Purple Bold Style (4).pdf
Computer Systems Quiz Presentation in Purple Bold Style (4).pdf
fizarcse
 
AI needs Hybrid Cloud - TEC conference 2025.pptx
AI needs Hybrid Cloud - TEC conference 2025.pptxAI needs Hybrid Cloud - TEC conference 2025.pptx
AI needs Hybrid Cloud - TEC conference 2025.pptx
Shikha Srivastava
 
AI and Meaningful Work by Pablo Fernández Vallejo
AI and Meaningful Work by Pablo Fernández VallejoAI and Meaningful Work by Pablo Fernández Vallejo
AI and Meaningful Work by Pablo Fernández Vallejo
UXPA Boston
 
Middle East and Africa Cybersecurity Market Trends and Growth Analysis
Middle East and Africa Cybersecurity Market Trends and Growth Analysis Middle East and Africa Cybersecurity Market Trends and Growth Analysis
Middle East and Africa Cybersecurity Market Trends and Growth Analysis
Preeti Jha
 
Right to liberty and security of a person.pdf
Right to liberty and security of a person.pdfRight to liberty and security of a person.pdf
Right to liberty and security of a person.pdf
danielbraico197
 
Cybersecurity Tools and Technologies - Microsoft Certificate
Cybersecurity Tools and Technologies - Microsoft CertificateCybersecurity Tools and Technologies - Microsoft Certificate
Cybersecurity Tools and Technologies - Microsoft Certificate
VICTOR MAESTRE RAMIREZ
 
Dark Dynamism: drones, dark factories and deurbanization
Dark Dynamism: drones, dark factories and deurbanizationDark Dynamism: drones, dark factories and deurbanization
Dark Dynamism: drones, dark factories and deurbanization
Jakub Šimek
 
Shoehorning dependency injection into a FP language, what does it take?
Shoehorning dependency injection into a FP language, what does it take?Shoehorning dependency injection into a FP language, what does it take?
Shoehorning dependency injection into a FP language, what does it take?
Eric Torreborre
 
Understanding SEO in the Age of AI.pdf
Understanding SEO in the Age of AI.pdfUnderstanding SEO in the Age of AI.pdf
Understanding SEO in the Age of AI.pdf
Fulcrum Concepts, LLC
 
Google DeepMind’s New AI Coding Agent AlphaEvolve.pdf
Google DeepMind’s New AI Coding Agent AlphaEvolve.pdfGoogle DeepMind’s New AI Coding Agent AlphaEvolve.pdf
Google DeepMind’s New AI Coding Agent AlphaEvolve.pdf
derrickjswork
 
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More MachinesRefactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Leon Anavi
 
Developing Product-Behavior Fit: UX Research in Product Development by Krysta...
Developing Product-Behavior Fit: UX Research in Product Development by Krysta...Developing Product-Behavior Fit: UX Research in Product Development by Krysta...
Developing Product-Behavior Fit: UX Research in Product Development by Krysta...
UXPA Boston
 
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
Toru Tamaki
 
May Patch Tuesday
May Patch TuesdayMay Patch Tuesday
May Patch Tuesday
Ivanti
 
In-App Guidance_ Save Enterprises Millions in Training & IT Costs.pptx
In-App Guidance_ Save Enterprises Millions in Training & IT Costs.pptxIn-App Guidance_ Save Enterprises Millions in Training & IT Costs.pptx
In-App Guidance_ Save Enterprises Millions in Training & IT Costs.pptx
aptyai
 
machines-for-woodworking-shops-en-compressed.pdf
machines-for-woodworking-shops-en-compressed.pdfmachines-for-woodworking-shops-en-compressed.pdf
machines-for-woodworking-shops-en-compressed.pdf
AmirStern2
 
Risk Analysis 101: Using a Risk Analyst to Fortify Your IT Strategy
Risk Analysis 101: Using a Risk Analyst to Fortify Your IT StrategyRisk Analysis 101: Using a Risk Analyst to Fortify Your IT Strategy
Risk Analysis 101: Using a Risk Analyst to Fortify Your IT Strategy
john823664
 
Scientific Large Language Models in Multi-Modal Domains
Scientific Large Language Models in Multi-Modal DomainsScientific Large Language Models in Multi-Modal Domains
Scientific Large Language Models in Multi-Modal Domains
syedanidakhader1
 
Longitudinal Benchmark: A Real-World UX Case Study in Onboarding by Linda Bor...
Longitudinal Benchmark: A Real-World UX Case Study in Onboarding by Linda Bor...Longitudinal Benchmark: A Real-World UX Case Study in Onboarding by Linda Bor...
Longitudinal Benchmark: A Real-World UX Case Study in Onboarding by Linda Bor...
UXPA Boston
 
Building the Customer Identity Community, Together.pdf
Building the Customer Identity Community, Together.pdfBuilding the Customer Identity Community, Together.pdf
Building the Customer Identity Community, Together.pdf
Cheryl Hung
 
Computer Systems Quiz Presentation in Purple Bold Style (4).pdf
Computer Systems Quiz Presentation in Purple Bold Style (4).pdfComputer Systems Quiz Presentation in Purple Bold Style (4).pdf
Computer Systems Quiz Presentation in Purple Bold Style (4).pdf
fizarcse
 
AI needs Hybrid Cloud - TEC conference 2025.pptx
AI needs Hybrid Cloud - TEC conference 2025.pptxAI needs Hybrid Cloud - TEC conference 2025.pptx
AI needs Hybrid Cloud - TEC conference 2025.pptx
Shikha Srivastava
 
AI and Meaningful Work by Pablo Fernández Vallejo
AI and Meaningful Work by Pablo Fernández VallejoAI and Meaningful Work by Pablo Fernández Vallejo
AI and Meaningful Work by Pablo Fernández Vallejo
UXPA Boston
 
Middle East and Africa Cybersecurity Market Trends and Growth Analysis
Middle East and Africa Cybersecurity Market Trends and Growth Analysis Middle East and Africa Cybersecurity Market Trends and Growth Analysis
Middle East and Africa Cybersecurity Market Trends and Growth Analysis
Preeti Jha
 
Right to liberty and security of a person.pdf
Right to liberty and security of a person.pdfRight to liberty and security of a person.pdf
Right to liberty and security of a person.pdf
danielbraico197
 
Cybersecurity Tools and Technologies - Microsoft Certificate
Cybersecurity Tools and Technologies - Microsoft CertificateCybersecurity Tools and Technologies - Microsoft Certificate
Cybersecurity Tools and Technologies - Microsoft Certificate
VICTOR MAESTRE RAMIREZ
 
Dark Dynamism: drones, dark factories and deurbanization
Dark Dynamism: drones, dark factories and deurbanizationDark Dynamism: drones, dark factories and deurbanization
Dark Dynamism: drones, dark factories and deurbanization
Jakub Šimek
 
Shoehorning dependency injection into a FP language, what does it take?
Shoehorning dependency injection into a FP language, what does it take?Shoehorning dependency injection into a FP language, what does it take?
Shoehorning dependency injection into a FP language, what does it take?
Eric Torreborre
 
Understanding SEO in the Age of AI.pdf
Understanding SEO in the Age of AI.pdfUnderstanding SEO in the Age of AI.pdf
Understanding SEO in the Age of AI.pdf
Fulcrum Concepts, LLC
 
Google DeepMind’s New AI Coding Agent AlphaEvolve.pdf
Google DeepMind’s New AI Coding Agent AlphaEvolve.pdfGoogle DeepMind’s New AI Coding Agent AlphaEvolve.pdf
Google DeepMind’s New AI Coding Agent AlphaEvolve.pdf
derrickjswork
 
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More MachinesRefactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Refactoring meta-rauc-community: Cleaner Code, Better Maintenance, More Machines
Leon Anavi
 
Developing Product-Behavior Fit: UX Research in Product Development by Krysta...
Developing Product-Behavior Fit: UX Research in Product Development by Krysta...Developing Product-Behavior Fit: UX Research in Product Development by Krysta...
Developing Product-Behavior Fit: UX Research in Product Development by Krysta...
UXPA Boston
 
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
論文紹介:"InfLoRA: Interference-Free Low-Rank Adaptation for Continual Learning" ...
Toru Tamaki
 
May Patch Tuesday
May Patch TuesdayMay Patch Tuesday
May Patch Tuesday
Ivanti
 
In-App Guidance_ Save Enterprises Millions in Training & IT Costs.pptx
In-App Guidance_ Save Enterprises Millions in Training & IT Costs.pptxIn-App Guidance_ Save Enterprises Millions in Training & IT Costs.pptx
In-App Guidance_ Save Enterprises Millions in Training & IT Costs.pptx
aptyai
 
machines-for-woodworking-shops-en-compressed.pdf
machines-for-woodworking-shops-en-compressed.pdfmachines-for-woodworking-shops-en-compressed.pdf
machines-for-woodworking-shops-en-compressed.pdf
AmirStern2
 

Aloa - A Web Services Driven Framework for Automatic Learning Objcet Annotation

  • 1. Mohamed Amine Chatti Informatik 5, RWTH Aachen, Germany PROLEARN Network of Excellence ALOA – A Web Services Driven Framework for A utomatic L earning O bject A nnotation
  • 2. Agenda Why Automatic Metadata Generation? AMG v.1 AMG v.2 – SAmgI ALOA ALOA and AMG Conclusion and Future Work
  • 3. Metadata Metadata is crucial for search, access, share, and reuse. Dealing with metadata cannot be a human task (Duval and Hodgins, 2004) Complex metadata standards (e.g. 9 LOM categories and 45 records of LOM level two) Benefit not immediately appreciated Metadata creators too expensive to be employed Tools not user friendly (“electronic forms must die”) Need for Automatic Metadata Generation
  • 4. Automatic Approach Use information about the LO and its context to extract or generate its metadata. 4 aspects of AMG (Cardinaels et al., 2005) Content analysis (LO itself, e.g. keyword, language) Context analysis (environment the LO is stored or used in, e.g. LMS) Usage analysis (e.g. time spent reading a doc) Structure analysis (relationship amongst LOs)
  • 5. AMG at KUL (Cardinaels et al., 2005; Ochoa et al., 2005) AMG v.1
  • 6. It was an application (Java-based) No support for different languages Not possible to have a metadata subset as a result Not flexible and extensible Not really interoperable between platforms AMG v.1 Limitations
  • 7. AMG v.2 Federated AMG Simple AMG Interface (SAmgI) (Meire et al., 2007) Main Design Goals: Extensibility – Pluggability Interoperability (Service oriented)
  • 8. ObjectBasedGenerators based on the Factory design pattern Problem: checkout source code, recompile and rebuild the whole application AMG v.2 Extensibility
  • 9. Federated AMG Engine - SAmgI installations / service endpoints Problem: some programming required (SAmgI WSDL specification, XML schemas, etc.) AMG v.2 Interoperability
  • 10. ALOA A Framework for LOM-based Automatic LO Annotation Service Oriented Architecture (SOA) / Web Services Main focus on flexibility and extensibility
  • 11. Indexer performing these actions: read all configurations in the properties file (i.e. available extractors and generators, priority of each generator, maximum generated values) access the LO as an array of bytes detect the mime type of the LO look for the available extractor for this particular mime type extract the content and the embedded properties of the LO contact the available generators solve conflicts translate the generated metadata into the required languages return the generation result to the Web Service stub ConflictResolver considers priorities of the generators Translator uses Google Translate as its translation service ALOA Core Engine
  • 12. Extractors extract content information and embedded properties from LOs only one extractor for each LO mime type html extractor (Jericho library) pdf extractor (pdfBox library) word extractor (Apache POI library) ppt extractor (Apache POI library) Generators use the output of the extractors to generate one or parts of the metadata text/data mining libraries (e.g. Yahoo! Term Extraction, Tagthe, Topicalizer, LingPipe, Balie, Classifier4J) ALOA Components
  • 13. Based on the ALOA Web Services API Automatically generate metadata from online LOs (html, plain text, word, ppt, pdf) Parameters URL location of the LO Target metadata languages (English, German, Arabic, French, Spanish, Korean) Subset of the generated metadata Output format (LOM XML, HTML, LOM Editor) ALOA User Interface
  • 14. Enables to easily plug-in new components (extractors and generators), for instance: Extractor for multimedia LO (e.g. audio, video, image, flash) Generator for a specific context (e.g. LMS) The components can be deployed on different machines or on different application servers Once deployed, a component can be plugged into ALOA by just giving the address of the component service ALOA core engine validates and adds it to the component list in the properties file Dynamic addition in run time; no need to recompile and rebuild the system ALOA CMI also enables to manage the priorities of the generators and to define the maximum generated values (used by ALOA core engine) ALOA Configuration Management Interface
  • 15. ALOA adopts a slightly modified version of SAmgI WSDL specification New methods: getLanguages , setLanguages Modified method: getMetadata Web Services-based interactions between ALOA and AMG possible ALOA as a new SAmgI installation used by the federated AMG engine AMG as a new component (i.e. extractor or generator) of ALOA ALOA and AMG
  • 16. ALOA – A framework for LOM-based automatic metadata generation ALOA already implements different components (i.e. extractors and generators) ALOA already generates LOM from different types of LOs (html, plain text, pdf, ppt, word) Primary focus on flexibility and extensibility of the framework SOA-based architecture enabling new components to be easily plugged into the basic system ALOA provides a public Web Services API for third party applications Conclusion
  • 17. Interactions between ALOA and AMG Extension with more extractors and generators based on other text/data mining techniques Look at model transformation techniques to support other metadata schemas (e.g. DC, MPEG) Further research of the quality of automatically generated metadata Combination of automatic metadata generation with a bottom up approach (e.g. Web 2.0 social tagging) Future Work