SlideShare a Scribd company logo
Data Science and What It
Means to Library and
Information Science
Jian Qin
School of Information Studies
Syracuse University
iSpeaker Series at Sungkyunkwan University
Seoul, Korea, December 8, 2015
Agenda
• What is data science?
• What is a data scientist?
• What areas of library work can benefit from data
science?
212/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
3
•
What is data science?
“An emerging area of work
concerned with the collection,
presentation, analysis,
visualization, management, and
preservation of large collections
of information.”
Stanton, J. (2012). Introduction to Data Science.
http://ischool.syr.edu/media/documents/2012/3/DataScienc
eBook1_1.pdf
The whole lifecycle of data from collection to analysis
to preservation
LCAS DM workshop, Beijing, 201512/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
“We’re increasingly
finding data in the wild,
and data scientists are
involved with gathering
data, massaging it into a
tractable form, making it
tell its story, and
presenting that story to
others.”
Loukides, M. (2011). What is data science? Sebastopol, CA:
O’Reilly.
What is data science?
4
Gathering and massaging data to tell its story
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
5
A systematic enterprise that builds and
organizes knowledge in the form of
testable explanations and predictions.
The study of the generalizable extraction of knowledge
from data, which involves data and statistics or the
systematic study of the organization, properties, and
analysis of data and its role in inference, including our
confidence in the inference.
Dhar, V. (2013). Data science and prediction. Communications of the ACM, 56(12): 64-73.
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Why is data science different from
statistics and other existing disciplines?
• Raw material, the “data” part of data science, is
increasingly heterogeneous and unstructured and often
emanating from networks with complex relationships
between the entities.
• Analysis of data requires integration, interpretation, and
sense making that is increasingly derived through tools
from computer science, linguistics, econometrics,
sociology, and other disciplines.
• Data are increasingly generated by computer and for
computer consumption, that is, computers increasingly
do background work for each other and make decisions
automatically
612/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
7
Dhar, V. (2013). Data science and prediction. Communications of the ACM,
56(12): 64-73, p. 64.
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
8
Main fields in data science
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
What is a data scientist?
• Math skills: Statistics and linear algebra
• Computing skills: programming and infrastructure design
• Able to communicate: ability to create narratives around
their work
• Ask the right questions: involves domain knowledge and
expertise, coupled with a keen ability to see the problem,
see the available data, and match up the two.
912/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Analysis of data problems: Story 1
• Domain: Global migration studies
• What’s involved: migrants, refuges, detention centers, refuge
camps, Asylums, …
• Data types: interview audio recordings, photos, articles, clippings,
written notes, …
• Analysis software: Atlas.ti, SPSS
• Bottleneck problem:
• difficulty in finding the data by person, interview, and related artifacts and in
transforming the data into analysis software
1012/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
We’ve got
a problem
Researcher:
How to use
Atlas.ti?
Data scientist:
What data do
you have?
Data scientist:
How do you
collect them?
Data scientist:
What do you do
with the data?
Analysis of data problems: story 2
• Domain: Thermochronology and tectonics
• Data types: Excel data files (lots of them), spectrum and microscopic images,
annotations
• Analysis: modeling by combining data from multiple data files with specialized
software
• Bottleneck problem:
• manually matching/merging/filtering data is extremely cumbersome and the problem is
compounded by the difficulty finding the right data files
1112/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
What is involved: workflows in a
research lifecycle
Analysis of data problem: story 3
• Domain: collaboration networks in a data repository
• What’s involved: metadata describing DNA sequences
• Data types: semi-structured data in plain text format
• Analysis: identify entities and relationships, build the
data into a database for querying and extraction
• Bottleneck problems:
• Extremely large data sets with multiple entities, which makes
manual processing impossible
• Disambiguation of author names and correctly linking between
entities
1212/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Analysis of data problems
Analysis of
domain data
Requirement analysis
Workflow analysis
Data modeling
Data transformation
needs analysis
Data provenance
needs analysis
Analysis of data problems is an
analysis of domain data,
requirements, and workflows
that will lead to the
development of solutions.
1312/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Skills required to perform
analysis of domain data problems
Requirement
analysis
Workflow
analysis
Data modeling
Data
transformation
needs analysis
Data
provenance
needs analysis
Interview skills,
analysis and
generalization skills
Ability to capture
components and
sequences in workflows
Ability to translate
domain analysis into
data models
Ability to envision the data
model within the larger
system architecture
1412/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Example 1: modeling research data for
gravitational wave research
15
1. Understand research lifecycle
2. Workflows: steps and relationships
3. Data flows: what goes in and out at
which step
4. Entities and attributes, relationships
5. Researcher’s practice and habits in
documenting and managing data
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Example 2: asking the right question in
mining metadata
16
Metadata describing
datasets is big data that can
used to study:
• Collaboration networks
• Scholarly
communication patterns
• Research frontiers and
trends
• Knowledge transfer
• Research impact
assessment
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
What areas of library work can
benefit from data science?
1712/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Data services and data-driven services
18
Library
Data services that
support research,
learning, and policy
making (external)
Data-driven services
that support library
planning, management,
and evaluation
(internal)
Data literacy
training
Data
discovery
Data
consulting
Data
mining
Data
collection Data
integration
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Data-drive organization
• Consumer internet companies
• Google, Amazon, Facebook, LinkedIn
• Brick-mortar companies:
• Walmart, UPS, FedEx, GE
• “A data-driven organization
acquires, processes, and
leverage data in a timely fashion
to create efficiencies, iterate on
and develop new products, and
navigate the competitive
landscape...”
19
Is your library
(company, research
center, etc.) a data-
driven organization?
Patil, D.J. & Mason, H. (2015). Data Driven: Creating a Data
Culture. Sebastopol, CA: O’Reilly Media, p. 6.
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Data curation
20
“the active and ongoing management of
data through its life cycle of interest and
usefulness to scholarship, science, and
education. Data curation activities enable
data discovery and retrieval, maintain its
quality, add value, and provide for reuse
over time, and this new field includes
authentication, archiving, management,
preservation, retrieval, and representation.”
–UIUC GSLIS
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Data collection
• Build data collections through
• Institutional repositories
• Community repositories
• Developing tools for researchers to submit,
manage, preserve, and discover data
• Develop data collections
• Specialized
• Analysis-ready
• Reusable
• Actionable
21
• For library service planning, decision
making, and evaluation
• To support policy making, research, and
learning
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Data discovery
• Complex data landscape:
• International, national, regional
• Disciplinary, community
• Open access vs. closed access
• Data sources for various purposes:
• Utility data sources: open, reusable
• Census data: open, but need additional
processing/meshing to reach the analysis-
ready state
• Government data: open, reusable, but require
additional processing
• Disciplinary research data: access varies,
require special knowledge to access and use
22
Data involving human
subjects are under
strict control by law
and often follow
additional compliance
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Data consulting
• Search, locate, and verify data for
particular research purposes
• Plan, design, and implement data
curation and/or data analysis
projects
• Provide training and consulting for
statistical methods and tools
2312/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Data mining
• Using internal data:
• Users, uses, expenses, collections, staff
• Goal: improve efficiencies and service
quality
• Using external data:
• Trends and indicators in scholarly
communication, technology, economy, and
culture
• Goal: adjust current services and plan for
new services
2412/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Data integration
Data integration is the combination of technical
and business processes used to combine data
from disparate sources into meaningful and
valuable information.
--IBM, http://www.ibm.com/analytics/us/en/technology/data-
integration/
25
A process of understanding, cleansing,
monitoring, transforming, and delivering data,
which offers opportunities to develop data
products as an infrastructure for research,
learning, policymaking, and decision making.
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
A home buyer’s information integration
26
What houses for sale under $250K have at least 2 bathrooms, 2
bedrooms, a nearby school ranking in the upper third, in a
neighborhood with below-average crime rate and diverse population?
Information
integration
Realtor School rankings Crime rate Demographics
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Research data
integration
Diabetes data and
trends—Country
level estimates:
http://apps.nccd.cdc.gov/D
DT_STRS2/NationalDiabet
esPrevalenceEstimates.aspx
?mode=PHY ;
Diabetes Data &
Trends home page:
http://apps.nccd.cdc.gov/dd
tstrs/default.aspx
12/8/2015 27iSpeaker Series at Sungkyunkwan University, Seoul, Korea
Summary
• Data science is not a new discipline, but rather, a new way of
utilizing data, methods, and tools to ask the right questions in
solving problems.
• Practicing data science requires strong skills in math,
computing, interpersonal communication, and asking the right
questions
• Libraries are at a strategic position in practicing data science.
How to leverage this position relies on the
• vision
• courage of risk taking
• knowledge of data science and related topics
• careful planning
• collaboration
2812/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea 29
Thank you!
Questions?
Ad

More Related Content

What's hot (20)

5013 Indexing Presentation
5013 Indexing Presentation5013 Indexing Presentation
5013 Indexing Presentation
lmartin8
 
Introduction to Computational Social Science
Introduction to Computational Social ScienceIntroduction to Computational Social Science
Introduction to Computational Social Science
Premsankar Chakkingal
 
Power bi
Power biPower bi
Power bi
jainema23
 
Dashboard - definition, examples
Dashboard - definition, examplesDashboard - definition, examples
Dashboard - definition, examples
Matthieu Aubry
 
Graph database
Graph database Graph database
Graph database
Shruti Arya
 
What is a survey paper
What is a survey paperWhat is a survey paper
What is a survey paper
Aasheesh Tandon
 
How to Use Bibliometric Study for Writing a Paper: A Starter Guide
How to Use Bibliometric Study for Writing a Paper: A Starter GuideHow to Use Bibliometric Study for Writing a Paper: A Starter Guide
How to Use Bibliometric Study for Writing a Paper: A Starter Guide
Nader Ale Ebrahim
 
Information Management
Information ManagementInformation Management
Information Management
Nadeem Raza
 
Introduction to Data Visualization
Introduction to Data Visualization Introduction to Data Visualization
Introduction to Data Visualization
Ana Jofre
 
Information retrieval introduction
Information retrieval introductionInformation retrieval introduction
Information retrieval introduction
nimmyjans4
 
Data Visualization Techniques in Power BI
Data Visualization Techniques in Power BIData Visualization Techniques in Power BI
Data Visualization Techniques in Power BI
Angel Abundez
 
Data Vault Vs Data Lake
Data Vault Vs Data LakeData Vault Vs Data Lake
Data Vault Vs Data Lake
Calum Miller
 
Information Seeking Behaviour Models
Information Seeking Behaviour ModelsInformation Seeking Behaviour Models
Information Seeking Behaviour Models
2548233
 
What exactly is Business Intelligence?
What exactly is Business Intelligence?What exactly is Business Intelligence?
What exactly is Business Intelligence?
James Serra
 
Hadoop Family and Ecosystem
Hadoop Family and EcosystemHadoop Family and Ecosystem
Hadoop Family and Ecosystem
tcloudcomputing-tw
 
Overview of ICT for Development (ICT4D)
Overview of  ICT for Development (ICT4D)Overview of  ICT for Development (ICT4D)
Overview of ICT for Development (ICT4D)
Deo Shao
 
Introduction to Hadoop and Hadoop component
Introduction to Hadoop and Hadoop component Introduction to Hadoop and Hadoop component
Introduction to Hadoop and Hadoop component
rebeccatho
 
Information policy ppt
Information policy pptInformation policy ppt
Information policy ppt
Kabir Khan
 
Hadoop technology
Hadoop technologyHadoop technology
Hadoop technology
tipanagiriharika
 
Power BI Overview presentation.pptx
Power BI Overview presentation.pptxPower BI Overview presentation.pptx
Power BI Overview presentation.pptx
HungPham381
 
5013 Indexing Presentation
5013 Indexing Presentation5013 Indexing Presentation
5013 Indexing Presentation
lmartin8
 
Introduction to Computational Social Science
Introduction to Computational Social ScienceIntroduction to Computational Social Science
Introduction to Computational Social Science
Premsankar Chakkingal
 
Dashboard - definition, examples
Dashboard - definition, examplesDashboard - definition, examples
Dashboard - definition, examples
Matthieu Aubry
 
How to Use Bibliometric Study for Writing a Paper: A Starter Guide
How to Use Bibliometric Study for Writing a Paper: A Starter GuideHow to Use Bibliometric Study for Writing a Paper: A Starter Guide
How to Use Bibliometric Study for Writing a Paper: A Starter Guide
Nader Ale Ebrahim
 
Information Management
Information ManagementInformation Management
Information Management
Nadeem Raza
 
Introduction to Data Visualization
Introduction to Data Visualization Introduction to Data Visualization
Introduction to Data Visualization
Ana Jofre
 
Information retrieval introduction
Information retrieval introductionInformation retrieval introduction
Information retrieval introduction
nimmyjans4
 
Data Visualization Techniques in Power BI
Data Visualization Techniques in Power BIData Visualization Techniques in Power BI
Data Visualization Techniques in Power BI
Angel Abundez
 
Data Vault Vs Data Lake
Data Vault Vs Data LakeData Vault Vs Data Lake
Data Vault Vs Data Lake
Calum Miller
 
Information Seeking Behaviour Models
Information Seeking Behaviour ModelsInformation Seeking Behaviour Models
Information Seeking Behaviour Models
2548233
 
What exactly is Business Intelligence?
What exactly is Business Intelligence?What exactly is Business Intelligence?
What exactly is Business Intelligence?
James Serra
 
Overview of ICT for Development (ICT4D)
Overview of  ICT for Development (ICT4D)Overview of  ICT for Development (ICT4D)
Overview of ICT for Development (ICT4D)
Deo Shao
 
Introduction to Hadoop and Hadoop component
Introduction to Hadoop and Hadoop component Introduction to Hadoop and Hadoop component
Introduction to Hadoop and Hadoop component
rebeccatho
 
Information policy ppt
Information policy pptInformation policy ppt
Information policy ppt
Kabir Khan
 
Power BI Overview presentation.pptx
Power BI Overview presentation.pptxPower BI Overview presentation.pptx
Power BI Overview presentation.pptx
HungPham381
 

Viewers also liked (20)

J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaning
J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaningJ.M. Díaz Nafría: Science of Information: Emergence and evolution of meaning
J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaning
José Nafría
 
Conceptions of information science
Conceptions of information scienceConceptions of information science
Conceptions of information science
Jorge Prado
 
Share: Science Information Life Cycle
Share: Science Information Life CycleShare: Science Information Life Cycle
Share: Science Information Life Cycle
kauberry
 
Information, Science, and Society
Information, Science, and SocietyInformation, Science, and Society
Information, Science, and Society
Melanie Swan
 
INFORMATION SCIENCE
INFORMATION SCIENCEINFORMATION SCIENCE
INFORMATION SCIENCE
harshaec
 
Towards Neuro–Information Science
Towards Neuro–Information ScienceTowards Neuro–Information Science
Towards Neuro–Information Science
jacekg
 
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY- SCOPE,THEORIES AND...
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY-  SCOPE,THEORIES AND...KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY-  SCOPE,THEORIES AND...
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY- SCOPE,THEORIES AND...
Dr. Raju M. Mathew
 
Big Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use casesBig Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use cases
Jeff Kelly
 
Big data + data science startup focus points
Big data + data science startup focus pointsBig data + data science startup focus points
Big data + data science startup focus points
Tom Zorde
 
Sharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem DataSharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem Data
TERN Australia
 
Semiotics and Information Science
Semiotics and Information ScienceSemiotics and Information Science
Semiotics and Information Science
Florence Paisey
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Perficient, Inc.
 
Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystem
SlideCentral
 
Real time data services
Real time data servicesReal time data services
Real time data services
Relevate
 
Real Time Big Data
Real Time Big DataReal Time Big Data
Real Time Big Data
InfoFarm
 
Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystem
magda3695
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Caserta
 
Big Data Ecosystem
Big Data EcosystemBig Data Ecosystem
Big Data Ecosystem
Ivo Vachkov
 
Earley Executive Roundtable - Building a Digital Transformation Roadmap
Earley Executive Roundtable - Building a Digital Transformation RoadmapEarley Executive Roundtable - Building a Digital Transformation Roadmap
Earley Executive Roundtable - Building a Digital Transformation Roadmap
Earley Information Science
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Caserta
 
J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaning
J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaningJ.M. Díaz Nafría: Science of Information: Emergence and evolution of meaning
J.M. Díaz Nafría: Science of Information: Emergence and evolution of meaning
José Nafría
 
Conceptions of information science
Conceptions of information scienceConceptions of information science
Conceptions of information science
Jorge Prado
 
Share: Science Information Life Cycle
Share: Science Information Life CycleShare: Science Information Life Cycle
Share: Science Information Life Cycle
kauberry
 
Information, Science, and Society
Information, Science, and SocietyInformation, Science, and Society
Information, Science, and Society
Melanie Swan
 
INFORMATION SCIENCE
INFORMATION SCIENCEINFORMATION SCIENCE
INFORMATION SCIENCE
harshaec
 
Towards Neuro–Information Science
Towards Neuro–Information ScienceTowards Neuro–Information Science
Towards Neuro–Information Science
jacekg
 
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY- SCOPE,THEORIES AND...
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY-  SCOPE,THEORIES AND...KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY-  SCOPE,THEORIES AND...
KNOWLEDGE SCIENCE; NOT INFORMATION SCIENCE OR TECHNOLOGY- SCOPE,THEORIES AND...
Dr. Raju M. Mathew
 
Big Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use casesBig Data and Hadoop - key drivers, ecosystem and use cases
Big Data and Hadoop - key drivers, ecosystem and use cases
Jeff Kelly
 
Big data + data science startup focus points
Big data + data science startup focus pointsBig data + data science startup focus points
Big data + data science startup focus points
Tom Zorde
 
Sharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem DataSharing & Sustaining Ecosystem Data
Sharing & Sustaining Ecosystem Data
TERN Australia
 
Semiotics and Information Science
Semiotics and Information ScienceSemiotics and Information Science
Semiotics and Information Science
Florence Paisey
 
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Big Data Open Source Tools and Trends: Enable Real-Time Business Intelligence...
Perficient, Inc.
 
Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystem
SlideCentral
 
Real time data services
Real time data servicesReal time data services
Real time data services
Relevate
 
Real Time Big Data
Real Time Big DataReal Time Big Data
Real Time Big Data
InfoFarm
 
Big data ecosystem
Big data ecosystemBig data ecosystem
Big data ecosystem
magda3695
 
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & TalendIntroducing the Big Data Ecosystem with Caserta Concepts & Talend
Introducing the Big Data Ecosystem with Caserta Concepts & Talend
Caserta
 
Big Data Ecosystem
Big Data EcosystemBig Data Ecosystem
Big Data Ecosystem
Ivo Vachkov
 
Earley Executive Roundtable - Building a Digital Transformation Roadmap
Earley Executive Roundtable - Building a Digital Transformation RoadmapEarley Executive Roundtable - Building a Digital Transformation Roadmap
Earley Executive Roundtable - Building a Digital Transformation Roadmap
Earley Information Science
 
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Building New Data Ecosystem for Customer Analytics, Strata + Hadoop World, 2016
Caserta
 
Ad

Similar to Data Science and What It Means to Library and Information Science (20)

Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
Library_Connect
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
Colleen DeLory
 
2-6-14 ESI Supplemental Webinar: The Data Information Literacy Project
2-6-14 ESI Supplemental Webinar: The Data Information  Literacy Project2-6-14 ESI Supplemental Webinar: The Data Information  Literacy Project
2-6-14 ESI Supplemental Webinar: The Data Information Literacy Project
DuraSpace
 
Realizing the Potential of Research Data by Carole L. Palmer
Realizing the Potential of Research Data by Carole L. Palmer Realizing the Potential of Research Data by Carole L. Palmer
Realizing the Potential of Research Data by Carole L. Palmer
carolelynnpalmer
 
Research Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the ChallengeResearch Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the Challenge
Spencer Keralis
 
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
CILIP MDG
 
Immersive informatics - research data management at Pitt iSchool and Carnegie...
Immersive informatics - research data management at Pitt iSchool and Carnegie...Immersive informatics - research data management at Pitt iSchool and Carnegie...
Immersive informatics - research data management at Pitt iSchool and Carnegie...
Keith Webster
 
Data Services at a Liberal Arts College Library
Data Services at a Liberal Arts College LibraryData Services at a Liberal Arts College Library
Data Services at a Liberal Arts College Library
Julie Judkins
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
Robin Rice
 
Organizational Implications of Data Science Environments in Education, Resear...
Organizational Implications of Data Science Environments in Education, Resear...Organizational Implications of Data Science Environments in Education, Resear...
Organizational Implications of Data Science Environments in Education, Resear...
Victoria Steeves
 
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
eMadrid network
 
User engagement in research data curation
User engagement in research data curationUser engagement in research data curation
User engagement in research data curation
EDINA, University of Edinburgh
 
Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?
Robin Rice
 
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
Sept 18 NISO Webinar: Research Data Curation, Part 2: Libraries and Big Data ...
National Information Standards Organization (NISO)
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
hsuleslie
 
IFLA ARL Webinar Series: Research Ethics in an Open Research Environment
IFLA ARL Webinar Series: Research Ethics in an Open Research EnvironmentIFLA ARL Webinar Series: Research Ethics in an Open Research Environment
IFLA ARL Webinar Series: Research Ethics in an Open Research Environment
IFLAAcademicandResea
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
EDINA, University of Edinburgh
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott Library
Rebekah Cummings
 
New Data, Same Skills: Applying Core Principles to New Needs in Data Curation
New Data, Same Skills: Applying Core Principles to New Needs in Data CurationNew Data, Same Skills: Applying Core Principles to New Needs in Data Curation
New Data, Same Skills: Applying Core Principles to New Needs in Data Curation
Lynn Connaway
 
New Data, Same Skills: Applying Core Principles to New Needs in Data Curation
New Data, Same Skills: Applying Core Principles to New Needs in Data CurationNew Data, Same Skills: Applying Core Principles to New Needs in Data Curation
New Data, Same Skills: Applying Core Principles to New Needs in Data Curation
OCLC
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
Library_Connect
 
Slides | Research data literacy and the library
Slides | Research data literacy and the librarySlides | Research data literacy and the library
Slides | Research data literacy and the library
Colleen DeLory
 
2-6-14 ESI Supplemental Webinar: The Data Information Literacy Project
2-6-14 ESI Supplemental Webinar: The Data Information  Literacy Project2-6-14 ESI Supplemental Webinar: The Data Information  Literacy Project
2-6-14 ESI Supplemental Webinar: The Data Information Literacy Project
DuraSpace
 
Realizing the Potential of Research Data by Carole L. Palmer
Realizing the Potential of Research Data by Carole L. Palmer Realizing the Potential of Research Data by Carole L. Palmer
Realizing the Potential of Research Data by Carole L. Palmer
carolelynnpalmer
 
Research Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the ChallengeResearch Data Management in Academic Libraries: Meeting the Challenge
Research Data Management in Academic Libraries: Meeting the Challenge
Spencer Keralis
 
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...Managing 'Big Data' in the social sciences: the contribution of an analytico-...
Managing 'Big Data' in the social sciences: the contribution of an analytico-...
CILIP MDG
 
Immersive informatics - research data management at Pitt iSchool and Carnegie...
Immersive informatics - research data management at Pitt iSchool and Carnegie...Immersive informatics - research data management at Pitt iSchool and Carnegie...
Immersive informatics - research data management at Pitt iSchool and Carnegie...
Keith Webster
 
Data Services at a Liberal Arts College Library
Data Services at a Liberal Arts College LibraryData Services at a Liberal Arts College Library
Data Services at a Liberal Arts College Library
Julie Judkins
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
Robin Rice
 
Organizational Implications of Data Science Environments in Education, Resear...
Organizational Implications of Data Science Environments in Education, Resear...Organizational Implications of Data Science Environments in Education, Resear...
Organizational Implications of Data Science Environments in Education, Resear...
Victoria Steeves
 
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
2015 03 19 (EDUCON2015) eMadrid UPM Towards a Learning Analytics Approach for...
eMadrid network
 
Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?Research data support: a growth area for academic libraries?
Research data support: a growth area for academic libraries?
Robin Rice
 
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
Sediment Experimentalist Network (SEN): Sharing and reusing methods and data ...
hsuleslie
 
IFLA ARL Webinar Series: Research Ethics in an Open Research Environment
IFLA ARL Webinar Series: Research Ethics in an Open Research EnvironmentIFLA ARL Webinar Series: Research Ethics in an Open Research Environment
IFLA ARL Webinar Series: Research Ethics in an Open Research Environment
IFLAAcademicandResea
 
Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...Services, policy, guidance and training: Improving research data management a...
Services, policy, guidance and training: Improving research data management a...
EDINA, University of Edinburgh
 
Next generation data services at the Marriott Library
Next generation data services at the Marriott LibraryNext generation data services at the Marriott Library
Next generation data services at the Marriott Library
Rebekah Cummings
 
New Data, Same Skills: Applying Core Principles to New Needs in Data Curation
New Data, Same Skills: Applying Core Principles to New Needs in Data CurationNew Data, Same Skills: Applying Core Principles to New Needs in Data Curation
New Data, Same Skills: Applying Core Principles to New Needs in Data Curation
Lynn Connaway
 
New Data, Same Skills: Applying Core Principles to New Needs in Data Curation
New Data, Same Skills: Applying Core Principles to New Needs in Data CurationNew Data, Same Skills: Applying Core Principles to New Needs in Data Curation
New Data, Same Skills: Applying Core Principles to New Needs in Data Curation
OCLC
 
Ad

More from Jian Qin (12)

How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?
Jian Qin
 
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Jian Qin
 
Survey research
Survey research Survey research
Survey research
Jian Qin
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08
Jian Qin
 
Developing Data Services to Support Scientific Data Management (v3)
Developing Data Services to Support Scientific Data Management (v3)Developing Data Services to Support Scientific Data Management (v3)
Developing Data Services to Support Scientific Data Management (v3)
Jian Qin
 
Preparing eScience librarians -- RDAP 2012
Preparing eScience librarians -- RDAP 2012 Preparing eScience librarians -- RDAP 2012
Preparing eScience librarians -- RDAP 2012
Jian Qin
 
Developing Data Services to Support eScience/eResearch
Developing Data Services to Support eScience/eResearchDeveloping Data Services to Support eScience/eResearch
Developing Data Services to Support eScience/eResearch
Jian Qin
 
Scientific data management (v2)
Scientific data management (v2)Scientific data management (v2)
Scientific data management (v2)
Jian Qin
 
Scientific Data Management
Scientific Data ManagementScientific Data Management
Scientific Data Management
Jian Qin
 
Research literature review
Research literature reviewResearch literature review
Research literature review
Jian Qin
 
Scholarly communication
Scholarly communicationScholarly communication
Scholarly communication
Jian Qin
 
Linking Scientific Metadata (presented at DC2010)
Linking Scientific Metadata (presented at DC2010)Linking Scientific Metadata (presented at DC2010)
Linking Scientific Metadata (presented at DC2010)
Jian Qin
 
How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?How Portable Are the Metadata Standards for Scientific Data?
How Portable Are the Metadata Standards for Scientific Data?
Jian Qin
 
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...Functional and Architectural Requirements for Metadata: Supporting Discovery...
Functional and Architectural Requirements for Metadata: Supporting Discovery...
Jian Qin
 
Survey research
Survey research Survey research
Survey research
Jian Qin
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08
Jian Qin
 
Developing Data Services to Support Scientific Data Management (v3)
Developing Data Services to Support Scientific Data Management (v3)Developing Data Services to Support Scientific Data Management (v3)
Developing Data Services to Support Scientific Data Management (v3)
Jian Qin
 
Preparing eScience librarians -- RDAP 2012
Preparing eScience librarians -- RDAP 2012 Preparing eScience librarians -- RDAP 2012
Preparing eScience librarians -- RDAP 2012
Jian Qin
 
Developing Data Services to Support eScience/eResearch
Developing Data Services to Support eScience/eResearchDeveloping Data Services to Support eScience/eResearch
Developing Data Services to Support eScience/eResearch
Jian Qin
 
Scientific data management (v2)
Scientific data management (v2)Scientific data management (v2)
Scientific data management (v2)
Jian Qin
 
Scientific Data Management
Scientific Data ManagementScientific Data Management
Scientific Data Management
Jian Qin
 
Research literature review
Research literature reviewResearch literature review
Research literature review
Jian Qin
 
Scholarly communication
Scholarly communicationScholarly communication
Scholarly communication
Jian Qin
 
Linking Scientific Metadata (presented at DC2010)
Linking Scientific Metadata (presented at DC2010)Linking Scientific Metadata (presented at DC2010)
Linking Scientific Metadata (presented at DC2010)
Jian Qin
 

Recently uploaded (20)

SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptxSCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
Ronisha Das
 
2541William_McCollough_DigitalDetox.docx
2541William_McCollough_DigitalDetox.docx2541William_McCollough_DigitalDetox.docx
2541William_McCollough_DigitalDetox.docx
contactwilliamm2546
 
Ultimate VMware 2V0-11.25 Exam Dumps for Exam Success
Ultimate VMware 2V0-11.25 Exam Dumps for Exam SuccessUltimate VMware 2V0-11.25 Exam Dumps for Exam Success
Ultimate VMware 2V0-11.25 Exam Dumps for Exam Success
Mark Soia
 
Biophysics Chapter 3 Methods of Studying Macromolecules.pdf
Biophysics Chapter 3 Methods of Studying Macromolecules.pdfBiophysics Chapter 3 Methods of Studying Macromolecules.pdf
Biophysics Chapter 3 Methods of Studying Macromolecules.pdf
PKLI-Institute of Nursing and Allied Health Sciences Lahore , Pakistan.
 
To study Digestive system of insect.pptx
To study Digestive system of insect.pptxTo study Digestive system of insect.pptx
To study Digestive system of insect.pptx
Arshad Shaikh
 
P-glycoprotein pamphlet: iteration 4 of 4 final
P-glycoprotein pamphlet: iteration 4 of 4 finalP-glycoprotein pamphlet: iteration 4 of 4 final
P-glycoprotein pamphlet: iteration 4 of 4 final
bs22n2s
 
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public SchoolsK12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
dogden2
 
Unit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdfUnit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdf
KanchanPatil34
 
Operations Management (Dr. Abdulfatah Salem).pdf
Operations Management (Dr. Abdulfatah Salem).pdfOperations Management (Dr. Abdulfatah Salem).pdf
Operations Management (Dr. Abdulfatah Salem).pdf
Arab Academy for Science, Technology and Maritime Transport
 
Handling Multiple Choice Responses: Fortune Effiong.pptx
Handling Multiple Choice Responses: Fortune Effiong.pptxHandling Multiple Choice Responses: Fortune Effiong.pptx
Handling Multiple Choice Responses: Fortune Effiong.pptx
AuthorAIDNationalRes
 
apa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdfapa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdf
Ishika Ghosh
 
How to track Cost and Revenue using Analytic Accounts in odoo Accounting, App...
How to track Cost and Revenue using Analytic Accounts in odoo Accounting, App...How to track Cost and Revenue using Analytic Accounts in odoo Accounting, App...
How to track Cost and Revenue using Analytic Accounts in odoo Accounting, App...
Celine George
 
YSPH VMOC Special Report - Measles Outbreak Southwest US 5-3-2025.pptx
YSPH VMOC Special Report - Measles Outbreak  Southwest US 5-3-2025.pptxYSPH VMOC Special Report - Measles Outbreak  Southwest US 5-3-2025.pptx
YSPH VMOC Special Report - Measles Outbreak Southwest US 5-3-2025.pptx
Yale School of Public Health - The Virtual Medical Operations Center (VMOC)
 
One Hot encoding a revolution in Machine learning
One Hot encoding a revolution in Machine learningOne Hot encoding a revolution in Machine learning
One Hot encoding a revolution in Machine learning
momer9505
 
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
larencebapu132
 
Odoo Inventory Rules and Routes v17 - Odoo Slides
Odoo Inventory Rules and Routes v17 - Odoo SlidesOdoo Inventory Rules and Routes v17 - Odoo Slides
Odoo Inventory Rules and Routes v17 - Odoo Slides
Celine George
 
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulsepulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
sushreesangita003
 
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar RabbiPresentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Md Shaifullar Rabbi
 
To study the nervous system of insect.pptx
To study the nervous system of insect.pptxTo study the nervous system of insect.pptx
To study the nervous system of insect.pptx
Arshad Shaikh
 
How to manage Multiple Warehouses for multiple floors in odoo point of sale
How to manage Multiple Warehouses for multiple floors in odoo point of saleHow to manage Multiple Warehouses for multiple floors in odoo point of sale
How to manage Multiple Warehouses for multiple floors in odoo point of sale
Celine George
 
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptxSCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
SCI BIZ TECH QUIZ (OPEN) PRELIMS XTASY 2025.pptx
Ronisha Das
 
2541William_McCollough_DigitalDetox.docx
2541William_McCollough_DigitalDetox.docx2541William_McCollough_DigitalDetox.docx
2541William_McCollough_DigitalDetox.docx
contactwilliamm2546
 
Ultimate VMware 2V0-11.25 Exam Dumps for Exam Success
Ultimate VMware 2V0-11.25 Exam Dumps for Exam SuccessUltimate VMware 2V0-11.25 Exam Dumps for Exam Success
Ultimate VMware 2V0-11.25 Exam Dumps for Exam Success
Mark Soia
 
To study Digestive system of insect.pptx
To study Digestive system of insect.pptxTo study Digestive system of insect.pptx
To study Digestive system of insect.pptx
Arshad Shaikh
 
P-glycoprotein pamphlet: iteration 4 of 4 final
P-glycoprotein pamphlet: iteration 4 of 4 finalP-glycoprotein pamphlet: iteration 4 of 4 final
P-glycoprotein pamphlet: iteration 4 of 4 final
bs22n2s
 
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public SchoolsK12 Tableau Tuesday  - Algebra Equity and Access in Atlanta Public Schools
K12 Tableau Tuesday - Algebra Equity and Access in Atlanta Public Schools
dogden2
 
Unit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdfUnit 6_Introduction_Phishing_Password Cracking.pdf
Unit 6_Introduction_Phishing_Password Cracking.pdf
KanchanPatil34
 
Handling Multiple Choice Responses: Fortune Effiong.pptx
Handling Multiple Choice Responses: Fortune Effiong.pptxHandling Multiple Choice Responses: Fortune Effiong.pptx
Handling Multiple Choice Responses: Fortune Effiong.pptx
AuthorAIDNationalRes
 
apa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdfapa-style-referencing-visual-guide-2025.pdf
apa-style-referencing-visual-guide-2025.pdf
Ishika Ghosh
 
How to track Cost and Revenue using Analytic Accounts in odoo Accounting, App...
How to track Cost and Revenue using Analytic Accounts in odoo Accounting, App...How to track Cost and Revenue using Analytic Accounts in odoo Accounting, App...
How to track Cost and Revenue using Analytic Accounts in odoo Accounting, App...
Celine George
 
One Hot encoding a revolution in Machine learning
One Hot encoding a revolution in Machine learningOne Hot encoding a revolution in Machine learning
One Hot encoding a revolution in Machine learning
momer9505
 
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
World war-1(Causes & impacts at a glance) PPT by Simanchala Sarab(BABed,sem-4...
larencebapu132
 
Odoo Inventory Rules and Routes v17 - Odoo Slides
Odoo Inventory Rules and Routes v17 - Odoo SlidesOdoo Inventory Rules and Routes v17 - Odoo Slides
Odoo Inventory Rules and Routes v17 - Odoo Slides
Celine George
 
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulsepulse  ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
pulse ppt.pptx Types of pulse , characteristics of pulse , Alteration of pulse
sushreesangita003
 
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar RabbiPresentation on Tourism Product Development By Md Shaifullar Rabbi
Presentation on Tourism Product Development By Md Shaifullar Rabbi
Md Shaifullar Rabbi
 
To study the nervous system of insect.pptx
To study the nervous system of insect.pptxTo study the nervous system of insect.pptx
To study the nervous system of insect.pptx
Arshad Shaikh
 
How to manage Multiple Warehouses for multiple floors in odoo point of sale
How to manage Multiple Warehouses for multiple floors in odoo point of saleHow to manage Multiple Warehouses for multiple floors in odoo point of sale
How to manage Multiple Warehouses for multiple floors in odoo point of sale
Celine George
 

Data Science and What It Means to Library and Information Science

  • 1. Data Science and What It Means to Library and Information Science Jian Qin School of Information Studies Syracuse University iSpeaker Series at Sungkyunkwan University Seoul, Korea, December 8, 2015
  • 2. Agenda • What is data science? • What is a data scientist? • What areas of library work can benefit from data science? 212/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 3. 3 • What is data science? “An emerging area of work concerned with the collection, presentation, analysis, visualization, management, and preservation of large collections of information.” Stanton, J. (2012). Introduction to Data Science. http://ischool.syr.edu/media/documents/2012/3/DataScienc eBook1_1.pdf The whole lifecycle of data from collection to analysis to preservation LCAS DM workshop, Beijing, 201512/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 4. “We’re increasingly finding data in the wild, and data scientists are involved with gathering data, massaging it into a tractable form, making it tell its story, and presenting that story to others.” Loukides, M. (2011). What is data science? Sebastopol, CA: O’Reilly. What is data science? 4 Gathering and massaging data to tell its story 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 5. 5 A systematic enterprise that builds and organizes knowledge in the form of testable explanations and predictions. The study of the generalizable extraction of knowledge from data, which involves data and statistics or the systematic study of the organization, properties, and analysis of data and its role in inference, including our confidence in the inference. Dhar, V. (2013). Data science and prediction. Communications of the ACM, 56(12): 64-73. 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 6. Why is data science different from statistics and other existing disciplines? • Raw material, the “data” part of data science, is increasingly heterogeneous and unstructured and often emanating from networks with complex relationships between the entities. • Analysis of data requires integration, interpretation, and sense making that is increasingly derived through tools from computer science, linguistics, econometrics, sociology, and other disciplines. • Data are increasingly generated by computer and for computer consumption, that is, computers increasingly do background work for each other and make decisions automatically 612/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 7. 7 Dhar, V. (2013). Data science and prediction. Communications of the ACM, 56(12): 64-73, p. 64. 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 8. 8 Main fields in data science 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 9. What is a data scientist? • Math skills: Statistics and linear algebra • Computing skills: programming and infrastructure design • Able to communicate: ability to create narratives around their work • Ask the right questions: involves domain knowledge and expertise, coupled with a keen ability to see the problem, see the available data, and match up the two. 912/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 10. Analysis of data problems: Story 1 • Domain: Global migration studies • What’s involved: migrants, refuges, detention centers, refuge camps, Asylums, … • Data types: interview audio recordings, photos, articles, clippings, written notes, … • Analysis software: Atlas.ti, SPSS • Bottleneck problem: • difficulty in finding the data by person, interview, and related artifacts and in transforming the data into analysis software 1012/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea We’ve got a problem Researcher: How to use Atlas.ti? Data scientist: What data do you have? Data scientist: How do you collect them? Data scientist: What do you do with the data?
  • 11. Analysis of data problems: story 2 • Domain: Thermochronology and tectonics • Data types: Excel data files (lots of them), spectrum and microscopic images, annotations • Analysis: modeling by combining data from multiple data files with specialized software • Bottleneck problem: • manually matching/merging/filtering data is extremely cumbersome and the problem is compounded by the difficulty finding the right data files 1112/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea What is involved: workflows in a research lifecycle
  • 12. Analysis of data problem: story 3 • Domain: collaboration networks in a data repository • What’s involved: metadata describing DNA sequences • Data types: semi-structured data in plain text format • Analysis: identify entities and relationships, build the data into a database for querying and extraction • Bottleneck problems: • Extremely large data sets with multiple entities, which makes manual processing impossible • Disambiguation of author names and correctly linking between entities 1212/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 13. Analysis of data problems Analysis of domain data Requirement analysis Workflow analysis Data modeling Data transformation needs analysis Data provenance needs analysis Analysis of data problems is an analysis of domain data, requirements, and workflows that will lead to the development of solutions. 1312/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 14. Skills required to perform analysis of domain data problems Requirement analysis Workflow analysis Data modeling Data transformation needs analysis Data provenance needs analysis Interview skills, analysis and generalization skills Ability to capture components and sequences in workflows Ability to translate domain analysis into data models Ability to envision the data model within the larger system architecture 1412/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 15. Example 1: modeling research data for gravitational wave research 15 1. Understand research lifecycle 2. Workflows: steps and relationships 3. Data flows: what goes in and out at which step 4. Entities and attributes, relationships 5. Researcher’s practice and habits in documenting and managing data 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 16. Example 2: asking the right question in mining metadata 16 Metadata describing datasets is big data that can used to study: • Collaboration networks • Scholarly communication patterns • Research frontiers and trends • Knowledge transfer • Research impact assessment 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 17. What areas of library work can benefit from data science? 1712/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 18. Data services and data-driven services 18 Library Data services that support research, learning, and policy making (external) Data-driven services that support library planning, management, and evaluation (internal) Data literacy training Data discovery Data consulting Data mining Data collection Data integration 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 19. Data-drive organization • Consumer internet companies • Google, Amazon, Facebook, LinkedIn • Brick-mortar companies: • Walmart, UPS, FedEx, GE • “A data-driven organization acquires, processes, and leverage data in a timely fashion to create efficiencies, iterate on and develop new products, and navigate the competitive landscape...” 19 Is your library (company, research center, etc.) a data- driven organization? Patil, D.J. & Mason, H. (2015). Data Driven: Creating a Data Culture. Sebastopol, CA: O’Reilly Media, p. 6. 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 20. Data curation 20 “the active and ongoing management of data through its life cycle of interest and usefulness to scholarship, science, and education. Data curation activities enable data discovery and retrieval, maintain its quality, add value, and provide for reuse over time, and this new field includes authentication, archiving, management, preservation, retrieval, and representation.” –UIUC GSLIS 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 21. Data collection • Build data collections through • Institutional repositories • Community repositories • Developing tools for researchers to submit, manage, preserve, and discover data • Develop data collections • Specialized • Analysis-ready • Reusable • Actionable 21 • For library service planning, decision making, and evaluation • To support policy making, research, and learning 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 22. Data discovery • Complex data landscape: • International, national, regional • Disciplinary, community • Open access vs. closed access • Data sources for various purposes: • Utility data sources: open, reusable • Census data: open, but need additional processing/meshing to reach the analysis- ready state • Government data: open, reusable, but require additional processing • Disciplinary research data: access varies, require special knowledge to access and use 22 Data involving human subjects are under strict control by law and often follow additional compliance 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 23. Data consulting • Search, locate, and verify data for particular research purposes • Plan, design, and implement data curation and/or data analysis projects • Provide training and consulting for statistical methods and tools 2312/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 24. Data mining • Using internal data: • Users, uses, expenses, collections, staff • Goal: improve efficiencies and service quality • Using external data: • Trends and indicators in scholarly communication, technology, economy, and culture • Goal: adjust current services and plan for new services 2412/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 25. Data integration Data integration is the combination of technical and business processes used to combine data from disparate sources into meaningful and valuable information. --IBM, http://www.ibm.com/analytics/us/en/technology/data- integration/ 25 A process of understanding, cleansing, monitoring, transforming, and delivering data, which offers opportunities to develop data products as an infrastructure for research, learning, policymaking, and decision making. 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 26. A home buyer’s information integration 26 What houses for sale under $250K have at least 2 bathrooms, 2 bedrooms, a nearby school ranking in the upper third, in a neighborhood with below-average crime rate and diverse population? Information integration Realtor School rankings Crime rate Demographics 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 27. Research data integration Diabetes data and trends—Country level estimates: http://apps.nccd.cdc.gov/D DT_STRS2/NationalDiabet esPrevalenceEstimates.aspx ?mode=PHY ; Diabetes Data & Trends home page: http://apps.nccd.cdc.gov/dd tstrs/default.aspx 12/8/2015 27iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 28. Summary • Data science is not a new discipline, but rather, a new way of utilizing data, methods, and tools to ask the right questions in solving problems. • Practicing data science requires strong skills in math, computing, interpersonal communication, and asking the right questions • Libraries are at a strategic position in practicing data science. How to leverage this position relies on the • vision • courage of risk taking • knowledge of data science and related topics • careful planning • collaboration 2812/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea
  • 29. 12/8/2015 iSpeaker Series at Sungkyunkwan University, Seoul, Korea 29 Thank you! Questions?