SlideShare a Scribd company logo
What is to be done? The Future of Database Research Le Gruenwald National Science Foundation Presented to 2008 Database Self-Assessment Submit May 29-30, 2008
Topics to Consider Formal Data Semantics Graph Database Human-Centered Database Computing Multi-disciplinary Database Research Mobile Database Database Performance Evaluation
Formal Data Semantics For most of the past 40 years DB community has largely ignored most issues concerning data semantics, even such basic matters as measurement units. Nearly all DB systems today lack formal specification of data semantics. This issue is of increasing importance as we attempt to integrate more diverse databases. Need to provide formal data semantics (metadata), e.g., logic – but which logic?  DL, FOL, sorted, ... Need ability to integrate and query data semantics. Increasing demands for integrated DB retrieval and inference.
Graph Database Big demand: transportation, bio, social networks E.g. perform disjunctive queries over different relationship types E.g. find the shortest path from point A to point B Need a flexible data model and query language
Human-Centered Database Computing Need to accommodate different types of users Usability studies Visualization
Multi-disciplinary DB Research Need to reach out to other disciplines: What are the innovative uses of existing DB research results that enable transformative research in other disciplines? What transformative DB research would be derived from the needs of other disciplines? Major DB conferences and journals need to embrace multi-disciplinary DB research
Mobile Database Increasing demand for mobile applications (including mobile sensor applications) Issues: mobility, disconnection, energy limitation, etc. More activities in this area in Europe and Japan than in the U.S. Major DB conferences need to embrace mobile database research Can energy-aware mobile DB research be extended to achieve GREEN DB for static environments?
Database Performance Evaluation Many of current DB research evaluation plans include:  Performing simulation experiments using Synthetic datasets Real-life datasets Benchmark datasets (not always available) Making some generalized conclusions without regards to statistical relevance Too ad-hoc, lack of science -> Need a more credible evaluation approach
THANK YOU!
Extra Slides for additional topics
Data Models for Vector Fields Vector fields occur in many scientific, engineering applications: Computational fluid dynamics:  weather, climate, oceanography, airplane design, wind turbine design and placement, finite element modeling, .... Relational model is largely useless Attempts:  Fiber Bundle Data Model (lloyd Treinish, ibm walson david butler, limit point), Vector Bundle  Data Model (eddie saek, richard Muntz, ucla ...)‏ /*restricive fiber with map from mesh to vector space from one end to another end */ Need data models, query languages, ... Need interpolation
Shape Based Retrieval Applications: Part retrieval, protein docking, protein-ligand binding, drug design, archeology, airplane crash reconstruction, ... Need invariant shape descriptions w.r.t. translation and rotation Need efficient representations and query processing Need methods for “compliant” shape matching (docking)‏ /* mating */
Impedance Mismatch Between Programming Languages and DBMSs Longstanding problem of integration of queries into programs Generally poor support by programming languages OODBMSs failed Latest effort:  Microsoft Linq Remains, open, difficult problem  See related work on XDUCE, CDUCE
Very Large Data Integration Data Integration / DB Federation over large numbers of DB (100's or 1000's) remains unsolved problem Increasing important for bioinformatics, intelligence, e-commerce, ... Need better metadata, better tools, new approaches ??

More Related Content

PPT
Quality, Relevance and Importance in Information Retrieval with Fuzzy Semanti...
PPTX
Data model
PPTX
Data librarymarch2011
PDF
Erwin Folmer - Congres 'Data gedreven Beleidsontwikkeling'
PPTX
Adopting a situated learning framework for (big) data projects
PDF
Semantically Enhanced Interactions between Heterogeneous Data Life-Cycles - A...
PPT
Le Flow Proposal Planning Rwth
PDF
Cultural Heritage: when data are much worst than one can believe
Quality, Relevance and Importance in Information Retrieval with Fuzzy Semanti...
Data model
Data librarymarch2011
Erwin Folmer - Congres 'Data gedreven Beleidsontwikkeling'
Adopting a situated learning framework for (big) data projects
Semantically Enhanced Interactions between Heterogeneous Data Life-Cycles - A...
Le Flow Proposal Planning Rwth
Cultural Heritage: when data are much worst than one can believe

Similar to Claremont Report on Database Research: Research Directions (Le Gruenwald) (20)

PDF
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
PDF
High Performance Data Analytics and a Java Grande Run Time
PDF
Where Does Big Data Meet Big Database - QCon 2012
PDF
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
PDF
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
PPT
SciDB : Open Source Data Management System for Data-Intensive Scientific Anal...
PDF
Big Data & the Enterprise
PDF
Bulk ieee projects 2012 2013
PDF
The LDBC Social Network Benchmark Interactive Workload - SIGMOD 2015
PDF
Big Data Fundamentals
PDF
Python's Role in the Future of Data Analysis
PDF
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
PDF
Big Data is changing abruptly, and where it is likely heading
PDF
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
PDF
STI Summit 2011 - Digital Worlds
PDF
Rise of the scientific database
PPT
Database Research Principles Revealed
PDF
A look under the hood at Apache Spark's API and engine evolutions
PDF
The return of big iron?
PDF
Software Engineering Research: Leading a Double-Agent Life.
Ted Willke, Senior Principal Engineer & GM, Datacenter Group, Intel at MLconf SF
High Performance Data Analytics and a Java Grande Run Time
Where Does Big Data Meet Big Database - QCon 2012
Debunking "Purpose-Built Data Systems:": Enter the Universal Database
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
SciDB : Open Source Data Management System for Data-Intensive Scientific Anal...
Big Data & the Enterprise
Bulk ieee projects 2012 2013
The LDBC Social Network Benchmark Interactive Workload - SIGMOD 2015
Big Data Fundamentals
Python's Role in the Future of Data Analysis
09-03-2024_UnstructuredDataAndAIDiscussion.pdf
Big Data is changing abruptly, and where it is likely heading
IRJET- Towards Efficient Framework for Semantic Query Search Engine in Large-...
STI Summit 2011 - Digital Worlds
Rise of the scientific database
Database Research Principles Revealed
A look under the hood at Apache Spark's API and engine evolutions
The return of big iron?
Software Engineering Research: Leading a Double-Agent Life.
Ad

More from infoblog (13)

PDF
CIDR 2009: James Hamilton Keynote
PDF
CIDR 2009: Jeff Heer Keynote
PPT
Claremont Report on Database Research: Research Directions (Eric A. Brewer)
PPT
Claremont Report on Database Research: Research Directions (Rakesh Agrawal)
PPT
Claremont Report on Database Research: Research Directions (Gerhard Weikum)
PPT
Claremont Report on Database Research: Research Directions (Beng Chin Ooi)
PPT
Claremont Report on Database Research: Research Directions (Yannis E. Ioannidis)
PPT
Claremont Report on Database Research: Research Directions (Donald Kossmann)
PDF
Claremont Report on Database Research: Research Directions (Johannes Gehrke)
PPT
Claremont Report on Database Research: Research Directions (Alon Y. Halevy)
PPT
Claremont Report on Database Research: Research Directions (Anastasia Ailamaki)
PPT
Spot Sigs
PDF
Database Research Principles Revealed (Small Size)
CIDR 2009: James Hamilton Keynote
CIDR 2009: Jeff Heer Keynote
Claremont Report on Database Research: Research Directions (Eric A. Brewer)
Claremont Report on Database Research: Research Directions (Rakesh Agrawal)
Claremont Report on Database Research: Research Directions (Gerhard Weikum)
Claremont Report on Database Research: Research Directions (Beng Chin Ooi)
Claremont Report on Database Research: Research Directions (Yannis E. Ioannidis)
Claremont Report on Database Research: Research Directions (Donald Kossmann)
Claremont Report on Database Research: Research Directions (Johannes Gehrke)
Claremont Report on Database Research: Research Directions (Alon Y. Halevy)
Claremont Report on Database Research: Research Directions (Anastasia Ailamaki)
Spot Sigs
Database Research Principles Revealed (Small Size)
Ad

Recently uploaded (20)

PDF
Enable Enterprise-Ready Security on IBM i Systems.pdf
PDF
REPORT: Heating appliances market in Poland 2024
PDF
Chapter 2 Digital Image Fundamentals.pdf
PDF
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
PDF
Sensors and Actuators in IoT Systems using pdf
PDF
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
PPTX
Telecom Fraud Prevention Guide | Hyperlink InfoSystem
PPTX
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
PDF
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
PDF
Transforming Manufacturing operations through Intelligent Integrations
PDF
Top Generative AI Tools for Patent Drafting in 2025.pdf
PDF
Event Presentation Google Cloud Next Extended 2025
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
CIFDAQ's Market Insight: SEC Turns Pro Crypto
PPTX
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
PPTX
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
PPTX
CroxyProxy Instagram Access id login.pptx
PDF
ai-archetype-understanding-the-personality-of-agentic-ai.pdf
PDF
Modernizing your data center with Dell and AMD
PPTX
Web Security: Login Bypass, SQLi, CSRF & XSS.pptx
Enable Enterprise-Ready Security on IBM i Systems.pdf
REPORT: Heating appliances market in Poland 2024
Chapter 2 Digital Image Fundamentals.pdf
HCSP-Presales-Campus Network Planning and Design V1.0 Training Material-Witho...
Sensors and Actuators in IoT Systems using pdf
BLW VOCATIONAL TRAINING SUMMER INTERNSHIP REPORT
Telecom Fraud Prevention Guide | Hyperlink InfoSystem
Detection-First SIEM: Rule Types, Dashboards, and Threat-Informed Strategy
solutions_manual_-_materials___processing_in_manufacturing__demargo_.pdf
Transforming Manufacturing operations through Intelligent Integrations
Top Generative AI Tools for Patent Drafting in 2025.pdf
Event Presentation Google Cloud Next Extended 2025
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
CIFDAQ's Market Insight: SEC Turns Pro Crypto
Effective Security Operations Center (SOC) A Modern, Strategic, and Threat-In...
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
CroxyProxy Instagram Access id login.pptx
ai-archetype-understanding-the-personality-of-agentic-ai.pdf
Modernizing your data center with Dell and AMD
Web Security: Login Bypass, SQLi, CSRF & XSS.pptx

Claremont Report on Database Research: Research Directions (Le Gruenwald)

  • 1. What is to be done? The Future of Database Research Le Gruenwald National Science Foundation Presented to 2008 Database Self-Assessment Submit May 29-30, 2008
  • 2. Topics to Consider Formal Data Semantics Graph Database Human-Centered Database Computing Multi-disciplinary Database Research Mobile Database Database Performance Evaluation
  • 3. Formal Data Semantics For most of the past 40 years DB community has largely ignored most issues concerning data semantics, even such basic matters as measurement units. Nearly all DB systems today lack formal specification of data semantics. This issue is of increasing importance as we attempt to integrate more diverse databases. Need to provide formal data semantics (metadata), e.g., logic – but which logic? DL, FOL, sorted, ... Need ability to integrate and query data semantics. Increasing demands for integrated DB retrieval and inference.
  • 4. Graph Database Big demand: transportation, bio, social networks E.g. perform disjunctive queries over different relationship types E.g. find the shortest path from point A to point B Need a flexible data model and query language
  • 5. Human-Centered Database Computing Need to accommodate different types of users Usability studies Visualization
  • 6. Multi-disciplinary DB Research Need to reach out to other disciplines: What are the innovative uses of existing DB research results that enable transformative research in other disciplines? What transformative DB research would be derived from the needs of other disciplines? Major DB conferences and journals need to embrace multi-disciplinary DB research
  • 7. Mobile Database Increasing demand for mobile applications (including mobile sensor applications) Issues: mobility, disconnection, energy limitation, etc. More activities in this area in Europe and Japan than in the U.S. Major DB conferences need to embrace mobile database research Can energy-aware mobile DB research be extended to achieve GREEN DB for static environments?
  • 8. Database Performance Evaluation Many of current DB research evaluation plans include: Performing simulation experiments using Synthetic datasets Real-life datasets Benchmark datasets (not always available) Making some generalized conclusions without regards to statistical relevance Too ad-hoc, lack of science -> Need a more credible evaluation approach
  • 10. Extra Slides for additional topics
  • 11. Data Models for Vector Fields Vector fields occur in many scientific, engineering applications: Computational fluid dynamics: weather, climate, oceanography, airplane design, wind turbine design and placement, finite element modeling, .... Relational model is largely useless Attempts: Fiber Bundle Data Model (lloyd Treinish, ibm walson david butler, limit point), Vector Bundle Data Model (eddie saek, richard Muntz, ucla ...)‏ /*restricive fiber with map from mesh to vector space from one end to another end */ Need data models, query languages, ... Need interpolation
  • 12. Shape Based Retrieval Applications: Part retrieval, protein docking, protein-ligand binding, drug design, archeology, airplane crash reconstruction, ... Need invariant shape descriptions w.r.t. translation and rotation Need efficient representations and query processing Need methods for “compliant” shape matching (docking)‏ /* mating */
  • 13. Impedance Mismatch Between Programming Languages and DBMSs Longstanding problem of integration of queries into programs Generally poor support by programming languages OODBMSs failed Latest effort: Microsoft Linq Remains, open, difficult problem See related work on XDUCE, CDUCE
  • 14. Very Large Data Integration Data Integration / DB Federation over large numbers of DB (100's or 1000's) remains unsolved problem Increasing important for bioinformatics, intelligence, e-commerce, ... Need better metadata, better tools, new approaches ??