JIST2019: The 9th Joint International Semantic Technology Conference
The premium Asian forum on Semantic Web, Knowledge Graph, Linked Data and AI on the Web. Nov. 25-27, 2019, Hangzhou, China.
http://jist2019.openkg.cn/
AAAI2023「Are Transformers Effective for Time Series Forecasting?」と、HuggingFace「Yes, Transformers are Effective for Time Series Forecasting (+ Autoformer)」の紹介です。
Learning Analytics for the Evaluation of Competencies and Behaviors in Seriou...MIT
This document discusses learning analytics for evaluating competencies and behaviors in serious games. It begins by introducing the presenters and their affiliations. It then discusses motivations for using games for learning and assessment, noting that games can assess complex skills and be engaging for learners. The document outlines the design, development, and evaluation process for game-based assessment, including gathering data during design and implementing assessment models. It provides an example game called Shadowspect and describes how evidence from the game informs constructs and algorithms to measure skills like efficiency. The document notes future work could include evaluating models with external measures and ensuring generalizability.
Los sistemas educacionales actuales aun se centran fuertemente en la evaluación de contenidos a pesar de que vivimos en la era de la información y mediante métodos tradicionales como evaluaciones estandarizadas que estresan a los estudiantes. Los juegos serios representan oportunidades educacionales de gran impacto al representar entornos más realistas, que capturan gran cantidad de datos sobre el proceso que siguen los estudiantes y que son más disfrutables y relajados que los exámenes. Estos datos, combinados con técnicas de analítica de aprendizaje, representan gran potencial para construir modelos que permitan la evaluación de competencias claves para la sociedad del siglo 21 a través de juegos serios, estas evaluaciones se implementan de forma indirecta en lo que se conoce como Stealth Assessment (evaluación fantasma), para evitar interrumpir el flujo de juego. En este charla veremos una metodología que se basa en tres etapas – Diseño, Implementación y Evaluación – para la implementación de sistemas de evaluación a través de juegos.
Learning Analytics for the Evaluation of Competencies and Behaviors in Seriou...MIT
This document discusses learning analytics for evaluating competencies and behaviors in serious games. It begins by introducing the presenters and their affiliations. It then discusses motivations for using games for learning and assessment, noting that games can assess complex skills and be engaging for learners. The document outlines the design, development, and evaluation process for game-based assessment, including gathering data during design and implementing assessment models. It provides an example game called Shadowspect and describes how evidence from the game informs constructs and algorithms to measure skills like efficiency. The document notes future work could include evaluating models with external measures and ensuring generalizability.
Los sistemas educacionales actuales aun se centran fuertemente en la evaluación de contenidos a pesar de que vivimos en la era de la información y mediante métodos tradicionales como evaluaciones estandarizadas que estresan a los estudiantes. Los juegos serios representan oportunidades educacionales de gran impacto al representar entornos más realistas, que capturan gran cantidad de datos sobre el proceso que siguen los estudiantes y que son más disfrutables y relajados que los exámenes. Estos datos, combinados con técnicas de analítica de aprendizaje, representan gran potencial para construir modelos que permitan la evaluación de competencias claves para la sociedad del siglo 21 a través de juegos serios, estas evaluaciones se implementan de forma indirecta en lo que se conoce como Stealth Assessment (evaluación fantasma), para evitar interrumpir el flujo de juego. En este charla veremos una metodología que se basa en tres etapas – Diseño, Implementación y Evaluación – para la implementación de sistemas de evaluación a través de juegos.
INT428_Zero Lecture Artificial Intelligence in L.P.UAyushSingh695401
Zero lecture of Artificial Intelligence (Subject - INT-428 ), In this book using in L.P.U. Any lpu student using the ppt and known to syllabus of Artificial Intelligence
Statistical Analysis of Results in Music Information Retrieval: Why and HowJulián Urbano
This document summarizes an introduction given by Julián Urbano and Arthur Flexer on statistical analysis in music information retrieval. It discusses why evaluation is important, including addressing questions about how good a system is and comparing different systems. It describes how the Cranfield paradigm addresses these questions by using fixed test collections with documents, topics and relevance judgments to simulate users and allow reproducible experiments. Finally, it outlines different types of music information retrieval tasks like retrieval, annotation and extraction and how the evaluation approach may differ depending on the specific task or use case.
This document discusses different methods for analyzing qualitative and quantitative user research data. It describes various techniques for simple quantitative analysis including calculating percentages, averages, and identifying patterns through graphical representations. The document also outlines three major theoretical frameworks that can be applied to qualitative data analysis: grounded theory, distributed cognition, and activity theory. Presenting findings may involve graphical representations, rigorous notations, stories, or summaries highlighting key results and statistics.
This document discusses different methods for analyzing qualitative and quantitative user research data. It describes various techniques for simple quantitative analysis including calculating percentages, averages, and identifying patterns through graphical representations. The document also outlines three major theoretical frameworks that can be applied to qualitative data analysis: grounded theory, distributed cognition, and activity theory. Presenting findings may involve graphical representations, rigorous notations, storytelling, or summarizing key highlights and statistics.
This document outlines the key steps in the scientific research process:
1) Selecting a research topic and ensuring it is narrow enough.
2) Determining if the topic is relevant by asking questions like if the problem can be investigated and if data can be analyzed.
3) Conducting a literature review to learn from past research and help define a research question.
4) Formulating the research problem as a clear hypothesis or question.
5) Designing the research methodology.
6) Analyzing and interpreting the collected data.
7) Presenting the results in a clear and concise manner, noting that one study provides indications rather than proven facts.
Recognizing and Organizing Opinions Expressed in the World ...butest
The document summarizes the MPQA project which investigated recognizing and organizing opinions expressed in text. The project developed a framework for annotating perspectives in documents, training machine learning models to identify perspectives, and using perspective information to cluster passages for question answering applications. Initial experiments found annotator agreement of 85% for direct opinions and 50% for indirect opinions. A simple classifier achieved 66.4% accuracy in identifying direct opinions, outperforming the baseline. Clustering results using perspective information were mixed, helping organize answers for some topics but not others.
Learning Analytics and Serious Games: Trends and ConsiderationsLaila Shoukry
Presentation for Serious Games Workshop SG'14 in ACM Multimedia 2014, Orlando, Florida
Paper: http://dl.acm.org/citation.cfm?id=2656729
Knowledge Graph Reasoning Techniques through Studies on Mystery Stories - Rep...KnowledgeGraph
1) The document summarizes the Knowledge Graph Reasoning Challenge (KGRC) held from 2018 to 2020.
2) The challenge task involved developing AI systems that can reason about and solve mysteries presented as open knowledge graphs based on Sherlock Holmes stories, providing reasonable explanations.
3) Over the three years of the challenge, 24 systems were submitted using various approaches like knowledge processing, machine learning, or combinations, and making use of different external knowledge resources. The challenge aims to promote techniques for explainable AI using knowledge graph reasoning.
Education must capitalize on the trend within technology toward big data. New types of data are becoming available. From evidence approaches to xAPI and the whole Training and Learning Architecture(TLA) big data is the foundation of all.
This document provides an agenda and overview for a deep learning course. The agenda includes an introduction to program and course learning outcomes, the syllabus, class management tools, and an introduction to week 1 of deep learning. The syllabus outlines 15 weekly topics on deep learning concepts and algorithms. Example student projects are provided showing applications of deep learning to areas like computer vision, natural language processing, and games. The introduction to week 1 discusses artificial intelligence, machine learning, and deep learning definitions and provides an overview of programming assignments and deep learning in action.
This presentation has slides from a talk that I gave at the annual Experimental Biology meeting, 2015, on our curriculum for Big Data Analytics in the Inland Empire.
A Space X Industry Day Briefing 7 Jul08 Jgm R4jmorriso
Stephen Rocci, Deputy Chief, Contracting Division, AFRL/PK
– AFRL/PK will serve as the contracting authority for A-SpaceX
– AFRL/PK will provide Contracting Officer (CO) and Contracting Officer
Technical Representative (COTR) support
• Technical Oversight:
– AFRL/RI will provide technical oversight and support to the program
– AFRL/RI POCs: Peter Rocci, Peter LaMonica, John Spina
7 July 2008 UNCLASSIFIED 33
Learning Analytics Design in Game-based LearningMIT
Summary: The workshop will deal with the problematic of designing learning analytics in games for learning, it makes special emphasis on the process and the design side, and will prepare assistants to start facing this or similar analytical challenges in the future.
- Methodology: It will be an active workshop where the instructor will do short introductions, present step-by-step examples and then participants will work in their own designs in groups, with the support of the instructor. We finalize by sharing with the rest of the class to see different designs for different games and constructs.
- Intended audience: Will definitely be interesting for anyone working around learning analytics, games for learning and alternative assessment methods. But anyone can enjoy this workshop as it will be dynamic and scaffolded. No requisites needed.
Systematic Literature Reviews and Systematic Mapping Studiesalessio_ferrari
Lecture slides on Systematic Literature Reviews and Systematic Mapping Studies in software engineering. It describes the different steps, discusses differences between the two methods, and gives guidelines on how to conduct these types of study.
Make a query regarding a topic of interest and come to know the sentiment for the day in pie-chart or for the week in form of line-chart for the tweets gathered from twitter.com
Linking Heterogeneous Scholarly Data Sources in an Interoperable Setting: the...Platforma Otwartej Nauki
“Open Research Data: Implications for Science and Society”, Warsaw, Poland, May 28–29, 2015, conference organized by the Open Science Platform — an initiative of the Interdisciplinary Centre for Mathematical and Computational Modelling at the University of Warsaw. pon.edu.pl @OpenSciPlatform #ORD2015
LODチャレンジ2022授賞式シンポジウムでの紹介スライドです。
受賞作品:https://github.com/KnowledgeGraphJapan/KGRC-RDF/blob/kgrc4si/extended_readme.md
受賞情報:https://2022.lodc.jp/awardPressRelease2022.html
引用:
江上周作,鵜飼孝典,窪田文也,大野美喜子,北村光司,福田賢一郎: 家庭内の事故予防に向けた合成ナレッジグラフの構築と推論,第56回人工知能学会セマンティックウェブとオントロジー研究会, SIG-SWO-056-14 (2022) DOI: https://doi.org/10.11517/jsaisigtwo.2022.SWO-056_14
Egami, S., Nishimura, S., Fukuda, K.: A Framework for Constructing and Augmenting Knowledge Graphs using Virtual Space: Towards Analysis of Daily Activities. Proceedings of the 33rd IEEE International Conference on Tools with Artificial Intelligence. pp.1226-1230 (2021) DOI: https://doi.org/10.1109/ICTAI52525.2021.00194
Egami, S., Nishimura, S., Fukuda, K.: VirtualHome2KG: Constructing and Augmenting Knowledge Graphs of Daily Activities Using Virtual Space. Proceedings of the ISWC 2021 Posters, Demos and Industry Tracks: From Novel Ideas to Industrial Practice, co-located with 20th International Semantic Web Conference. CEUR, Vol.2980 (2021) https://ceur-ws.org/Vol-2980/paper381.pdf
Linked Open Data勉強会2020 後編:SPARQLの簡単な使い方、SPARQLを使った簡単なアプリ開発KnowledgeGraph
Linked Open Data勉強会2020
後編:SPARQLの簡単な使い方、SPARQLを使った簡単なアプリ開発
前編:https://www.slideshare.net/KnowledgeGraph/linked-open-data2020-lod
This is the keynote of the Into the Box conference, highlighting the release of the BoxLang JVM language, its key enhancements, and its vision for the future.
Big Data Analytics Quick Research Guide by Arthur MorganArthur Morgan
This is a Quick Research Guide (QRG).
QRGs include the following:
- A brief, high-level overview of the QRG topic.
- A milestone timeline for the QRG topic.
- Links to various free online resource materials to provide a deeper dive into the QRG topic.
- Conclusion and a recommendation for at least two books available in the SJPL system on the QRG topic.
QRGs planned for the series:
- Artificial Intelligence QRG
- Quantum Computing QRG
- Big Data Analytics QRG
- Spacecraft Guidance, Navigation & Control QRG (coming 2026)
- UK Home Computing & The Birth of ARM QRG (coming 2027)
Any questions or comments?
- Please contact Arthur Morgan at [email protected].
100% human made.
Book industry standards are evolving rapidly. In the first part of this session, we’ll share an overview of key developments from 2024 and the early months of 2025. Then, BookNet’s resident standards expert, Tom Richardson, and CEO, Lauren Stewart, have a forward-looking conversation about what’s next.
Link to recording, presentation slides, and accompanying resource: https://bnctechforum.ca/sessions/standardsgoals-for-2025-standards-certification-roundup/
Presented by BookNet Canada on May 6, 2025 with support from the Department of Canadian Heritage.
Enhancing ICU Intelligence: How Our Functional Testing Enabled a Healthcare I...Impelsys Inc.
Impelsys provided a robust testing solution, leveraging a risk-based and requirement-mapped approach to validate ICU Connect and CritiXpert. A well-defined test suite was developed to assess data communication, clinical data collection, transformation, and visualization across integrated devices.
AI Changes Everything – Talk at Cardiff Metropolitan University, 29th April 2...Alan Dix
Talk at the final event of Data Fusion Dynamics: A Collaborative UK-Saudi Initiative in Cybersecurity and Artificial Intelligence funded by the British Council UK-Saudi Challenge Fund 2024, Cardiff Metropolitan University, 29th April 2025
https://alandix.com/academic/talks/CMet2025-AI-Changes-Everything/
Is AI just another technology, or does it fundamentally change the way we live and think?
Every technology has a direct impact with micro-ethical consequences, some good, some bad. However more profound are the ways in which some technologies reshape the very fabric of society with macro-ethical impacts. The invention of the stirrup revolutionised mounted combat, but as a side effect gave rise to the feudal system, which still shapes politics today. The internal combustion engine offers personal freedom and creates pollution, but has also transformed the nature of urban planning and international trade. When we look at AI the micro-ethical issues, such as bias, are most obvious, but the macro-ethical challenges may be greater.
At a micro-ethical level AI has the potential to deepen social, ethnic and gender bias, issues I have warned about since the early 1990s! It is also being used increasingly on the battlefield. However, it also offers amazing opportunities in health and educations, as the recent Nobel prizes for the developers of AlphaFold illustrate. More radically, the need to encode ethics acts as a mirror to surface essential ethical problems and conflicts.
At the macro-ethical level, by the early 2000s digital technology had already begun to undermine sovereignty (e.g. gambling), market economics (through network effects and emergent monopolies), and the very meaning of money. Modern AI is the child of big data, big computation and ultimately big business, intensifying the inherent tendency of digital technology to concentrate power. AI is already unravelling the fundamentals of the social, political and economic world around us, but this is a world that needs radical reimagining to overcome the global environmental and human challenges that confront us. Our challenge is whether to let the threads fall as they may, or to use them to weave a better future.
Massive Power Outage Hits Spain, Portugal, and France: Causes, Impact, and On...Aqusag Technologies
In late April 2025, a significant portion of Europe, particularly Spain, Portugal, and parts of southern France, experienced widespread, rolling power outages that continue to affect millions of residents, businesses, and infrastructure systems.
TrustArc Webinar: Consumer Expectations vs Corporate Realities on Data Broker...TrustArc
Most consumers believe they’re making informed decisions about their personal data—adjusting privacy settings, blocking trackers, and opting out where they can. However, our new research reveals that while awareness is high, taking meaningful action is still lacking. On the corporate side, many organizations report strong policies for managing third-party data and consumer consent yet fall short when it comes to consistency, accountability and transparency.
This session will explore the research findings from TrustArc’s Privacy Pulse Survey, examining consumer attitudes toward personal data collection and practical suggestions for corporate practices around purchasing third-party data.
Attendees will learn:
- Consumer awareness around data brokers and what consumers are doing to limit data collection
- How businesses assess third-party vendors and their consent management operations
- Where business preparedness needs improvement
- What these trends mean for the future of privacy governance and public trust
This discussion is essential for privacy, risk, and compliance professionals who want to ground their strategies in current data and prepare for what’s next in the privacy landscape.
Designing Low-Latency Systems with Rust and ScyllaDB: An Architectural Deep DiveScyllaDB
Want to learn practical tips for designing systems that can scale efficiently without compromising speed?
Join us for a workshop where we’ll address these challenges head-on and explore how to architect low-latency systems using Rust. During this free interactive workshop oriented for developers, engineers, and architects, we’ll cover how Rust’s unique language features and the Tokio async runtime enable high-performance application development.
As you explore key principles of designing low-latency systems with Rust, you will learn how to:
- Create and compile a real-world app with Rust
- Connect the application to ScyllaDB (NoSQL data store)
- Negotiate tradeoffs related to data modeling and querying
- Manage and monitor the database for consistently low latencies
TrsLabs - Fintech Product & Business ConsultingTrs Labs
Hybrid Growth Mandate Model with TrsLabs
Strategic Investments, Inorganic Growth, Business Model Pivoting are critical activities that business don't do/change everyday. In cases like this, it may benefit your business to choose a temporary external consultant.
An unbiased plan driven by clearcut deliverables, market dynamics and without the influence of your internal office equations empower business leaders to make right choices.
Getting things done within a budget within a timeframe is key to Growing Business - No matter whether you are a start-up or a big company
Talk to us & Unlock the competitive advantage
Dev Dives: Automate and orchestrate your processes with UiPath MaestroUiPathCommunity
This session is designed to equip developers with the skills needed to build mission-critical, end-to-end processes that seamlessly orchestrate agents, people, and robots.
📕 Here's what you can expect:
- Modeling: Build end-to-end processes using BPMN.
- Implementing: Integrate agentic tasks, RPA, APIs, and advanced decisioning into processes.
- Operating: Control process instances with rewind, replay, pause, and stop functions.
- Monitoring: Use dashboards and embedded analytics for real-time insights into process instances.
This webinar is a must-attend for developers looking to enhance their agentic automation skills and orchestrate robust, mission-critical processes.
👨🏫 Speaker:
Andrei Vintila, Principal Product Manager @UiPath
This session streamed live on April 29, 2025, 16:00 CET.
Check out all our upcoming Dev Dives sessions at https://community.uipath.com/dev-dives-automation-developer-2025/.
Artificial Intelligence is providing benefits in many areas of work within the heritage sector, from image analysis, to ideas generation, and new research tools. However, it is more critical than ever for people, with analogue intelligence, to ensure the integrity and ethical use of AI. Including real people can improve the use of AI by identifying potential biases, cross-checking results, refining workflows, and providing contextual relevance to AI-driven results.
News about the impact of AI often paints a rosy picture. In practice, there are many potential pitfalls. This presentation discusses these issues and looks at the role of analogue intelligence and analogue interfaces in providing the best results to our audiences. How do we deal with factually incorrect results? How do we get content generated that better reflects the diversity of our communities? What roles are there for physical, in-person experiences in the digital world?
DevOpsDays Atlanta 2025 - Building 10x Development Organizations.pptxJustin Reock
Building 10x Organizations with Modern Productivity Metrics
10x developers may be a myth, but 10x organizations are very real, as proven by the influential study performed in the 1980s, ‘The Coding War Games.’
Right now, here in early 2025, we seem to be experiencing YAPP (Yet Another Productivity Philosophy), and that philosophy is converging on developer experience. It seems that with every new method we invent for the delivery of products, whether physical or virtual, we reinvent productivity philosophies to go alongside them.
But which of these approaches actually work? DORA? SPACE? DevEx? What should we invest in and create urgency behind today, so that we don’t find ourselves having the same discussion again in a decade?
AI and Data Privacy in 2025: Global TrendsInData Labs
In this infographic, we explore how businesses can implement effective governance frameworks to address AI data privacy. Understanding it is crucial for developing effective strategies that ensure compliance, safeguard customer trust, and leverage AI responsibly. Equip yourself with insights that can drive informed decision-making and position your organization for success in the future of data privacy.
This infographic contains:
-AI and data privacy: Key findings
-Statistics on AI data privacy in the today’s world
-Tips on how to overcome data privacy challenges
-Benefits of AI data security investments.
Keep up-to-date on how AI is reshaping privacy standards and what this entails for both individuals and organizations.
AI and Data Privacy in 2025: Global TrendsInData Labs
Report on the First Knowledge Graph Reasoning Challenge 2018 -Toward the eXplainable AI System-
1. Report on the First Knowledge Graph
Reasoning Challenge 2018
– Toward the eXplainable AI System –
Takahiro Kawamura*1, Shusaku Egami*2, Koutarou Tamura*3,
Yasunori Hokazono*4, Takanori Ugai*5, Yusuke Koyanagi*5,
Fumihito Nishino*5, Seiji Okajima*5, Katsuhiko Murakami*5,
Kunihiko Takamatsu*6, Aoi Sugiura*7, Shun Shiramatsu*8,
Shawn Zhang*8, Kouji Kozaki*9
1.National Agriculture and Food Research Organization, Japan
2.National Institute of Maritime, Port and Aviation Technology, Japan
3. NRI digital, Ltd. 4.Nomura Research Institute, Ltd.
5. Fujitsu Laboratories Ltd. 6. Kobe Tokiwa University 7. Kobe City Nishi-Kobe Medical Center
8.Nagoya Institute of Technology 9. Osaka Electro-Communication University
2. Investig
ation
strategy
Criminal
motive ….
Summary of Knowledge Graph Reasoning Challenge
A Contest to develop AI systems which have
abilities for “Reasoning” and “Explanation”
such like Sherlock Holmes.
Sherlock
Holmes
mystery story
Open Knowledge
Graph(OKG) AI system that estimate criminals
with reasonable explanations using
the OKG and other knowledge
The motive is …
Trick is …
The criminal is
XX Because …
3. Agenda of Talk
• Summary of Knowledge Graph Reasoning Challenge
by K. Kozaki
• Knowledge Graph Construction
by S. Egami
• Approach for estimation and reasoning techniques
by T. Ugai
• Evaluation / Conclusion and Current Work
by T. Kawamra
3
5. Knowledge Graph construction process
Extract scenes
(Manual)
Sentence
Simplification
(Manual)
Semantic Role
Annotating
(Manual)
Translation
(Auto)
RDF
construction
(Auto)
Add object types
(Manual)
Add absolute
time
(Manual)
Scene Linking
(Manual)
• Discussion about schema design and methodology
• We first held five open workshops from November 2017 to April 2018.
• The total number of participants in the workshops was 110.
• After the preliminary experiment of knowledge graph construction cooperating with the participants, we
finally adopted the below process.
5
6. • Basic policy
– Focus on scenes in a novel and the relationship of those scenes, including the
characters, objects, places, etc., with related scenes
• A scene ID (IRI) has subjects, verbs, objects, etc.
• Edges mainly represent five Ws (When, Where, Who, What, and Why).
Architecture of the Knowledge Graph
Scene1 Scene2 Scene3
Scene4
Scene5
Resource
Literal
type subject
source
source
subject subject
hasPredicate
source
hasPredicate
subject
hasPredicate
then
therefore because
then
6
7. Scene ID
Source text
Subject
Predicate
Object
subject
hasPredicate
source
5W1H
Scene ID
Relatio
n
• Properties representing scenes
– subject (who): A person or object representing the subject of the scene
– hasPredicate: A predicate representing the content of the scene
– 5W1H: what, where, when, whom, why, how.
– Relation among scenes: then, if, because, …etc.
– time: Absolute time(xsd:DateTime)
– source: Source text(EN/JP literal)
subject object
predicate
Scene
subject objectpredicate
Schema (Scene)
7
8. Original Sentence (EN|JA)
Absolute Time
Property values are defined as
resources to be referred in the
other scene
Predicate
Subject
Relationship to other Scene ID
Scene Type
- Situation: Fact
- Statement:Remark by A
- Talk: Remark by A to B
- Thought: Idea of A
Schema (Scene): Example
Unique ID (IRI)
of Each scene
8
17. Evaluation
• For evaluating estimation and reasoning techniques with explainability,
– metrics design for explainability, utility, novelty, and performance is required,
– but evaluation is also based on a qualitative comparison through discussion and
peer reviews.
• DARPA XAI states that
– the current AI techniques have a trade-off between accuracy and explainability,
so both properties should be measured.
– In particular, to measure the effectiveness of the explainability,
DARPA XAI rates user satisfaction regarding its clarity and utility.
• We first share the basic information of the proposed approaches, and then discuss the
evaluation of experts and of the general public.
17
18. Sharing Basic Information for Preparation
• The basic info. Was investigated and shared with experts in advance,
who were 7 board members of the SIG on Semantic Web and Ontology in JSAI
– Correctness of the answer
Check if the resulting criminal was correct or not, regardless of the approach.
The criminal, in this case, is the one designated in the novel or story.
– Feasibility of the program
Check if the submitted program correctly worked and the results were reproduced
(excluding idea-only submissions).
– Performance of the program
Referential information on the system environment and performance of the submitted
program, except for the idea only.
– Amount of data/knowledge to be used
How much did the approach use the knowledge graph (the total num. of scene IDs used)?
If the approach used external knowledge and data, we noted information about them.
18
19. Expert Evaluation
• Over more than a week, the experts evaluated the following aspects
according to five grades (1–5).
• For Estimation and/or reasoning methods,
– Significance
Novelty and technical improvement of the method.
– Applicability
Is the approach applicable to the other problems?
3 : applicable to the other novels and stories
5 : applicable to other domains.
– Extensibility
Is the approach expected to have a further technical extension?
19
20. Expert Evaluation cont'd
• For Knowledge and data,
– Originality of knowledge/data construction
E.g., how much external knowledge and data were prepared?
– Originality of knowledge/data use
E.g., was a small set of knowledge used efficiently, or
was a large set of knowledge used to simplify the process.
• and…
– Feasibility of idea (for idea only)
Feasibility of idea including algorithms and data/knowledge construction.
– Logical explainability
Is an explanation logically persuadable?
1 : no explanation and evidence
3 : some evidence in any form is provided
5 : there is an explanation that is consistent with the estimation and reasoning process.
– Effort
Amount of effort required for the submission (knowledge/data/system).
20
21. Public Evaluation
• Although the experts determine if a logical explanation could be held,
the public eval. focused on the psychological aspect of the explanations, that is,
the satisfaction with the explanation.
• The 45 applicants of SIGSWO meeting, Nov. 2018 answered to
after 15 min presentation
– Total score
– Explainability
• We added the total score to include psychological impressions other than explainability,
– such as presentation quality and entertainment aspects.
– If we only had a score for explainability, such aspects could be mixed in the explainability.
21
22. Public evaluation results
• In the public eval.,
ave., med., and sd
of the total scores and
the explainability
were compared.
• Comparing 1st and 2nd prizes, we found
– Ave. of both the total score and the explainability were higher for 1st prize.
– Med. of the total score of 1st was higher than 2nd prize, but
Med. of the explainability of 1st was the same as 2nd prize.
– Sd of both the total score and the explainability were also bigger for 1st prize
than for 2nd prize. (smaller the better)
• The paired t-test (α = 0.05) indicated that
– diff. in total score had a statistically significant difference bet. 1st and 2nd., but
– diff. in the explainability was not significantly different.
>
>
>
>
=
>
22
23. Expert evaluation results
• In the expert evaluation,
– Ave. of each metric in 1st prize
were higher than 2nd prize, but
the explainability was statistically
significantly higher for 2nd prize.
– We should note that sd of ave.
for each metric were less than 0.1
• Therefore, the final decision was left
to the expert peer review.
• As a result, the prize order was
determined, since
– the metrics other than the explainability
of 1st prize were > or = 2nd
• The eval. including explainability was left
to the future challenge…
2nd 1st
>
<
23
24. Conclusion and Current Work
• The 1st knowledge graph reasoning challenge in 2018 was summarized
for the development of AI techniques integrating ML and reasoning.
• The 2nd challenge started in June 2019 !
– 4 knowledge graphs from 4 mystery novels were added and published on
https://github.com/KnowledgeGraphJapan/KGRC-RDF/tree/master/2019
(Speckled Band + A Case Of Identity / Crooked Man / Dancing Men / Devils Foot)
– In addition to guessing criminal with explanation,
– it is better to be commonly applied to as many novels as possible.
– New tool/utility creation task is also added.
• The 3rd international challenge will open call for application in 2020!
• The 4th challenge will have knowledge graphs of real social problems, e.g., books listing
best practices of social problem solving, etc. 24