SlideShare a Scribd company logo
Big Data for the Social Sciences
David De Roure, Strategic Adviser for Data Resources @dder
Big Data doesn’t respect
disciplinary boundaries
Digital Social Research
theODI.org
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Mandy Chessell
The Big Picture
More people
Moremachines
Big Data
Big Compute
Conventional
Computation
“Big Social”
Social Networks
e-infrastructure
online
R&D
Big Data
Production
& Analytics
deeply
about
society
RCUK and Big Data
▶ ‘Big data is a term for a collection of datasets so large
and complex that it is beyond the ability of typical
database software tools to capture, store, manage, and
analyse them. ‘Big’ is not defined as being larger than a
certain number of ‘bytes’ because as technology advances
over time, the size of datasets that qualify as big data will
also increase’ (RCUK)
▶ But why do we want it?
New forms of data enable us to
1. Answer existing research questions in new ways
2. Ask entirely new research questions
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
NERC Big Data
...as diverse as our science
• From micro- to macro-scale
• Many sources:
• Monitoring campaigns
• Field sites & sensors
• State-of-the-art laboratories
• Ships & aircraft
• Remote Sensing & EO
• Regulator networks
• Volunteers/citizen science
• Model output
• Long-term and unique!
10µm
100 TB
Big data: time-based media including film, tv, cctv footage -
retail data - geospatial data - email and social media - images
and associated metadata - performance data including raw
data of recordings, choreography, performance structure -
open government data - music - large-scale digital scans -
library, museum & gallery archives and metadata
Research benefits of new data
▶ Undertaking research on pressing policy-related issues
without the need for new data collection
• Food consumption, social background and obesity
• Energy consumption, housing type and climatic conditions
• Rural location, private/public transport alternatives and
incomes
• School attainment, higher education participation, subject
choices, student debt and later incomes
▶ New data such as social media enable us to ask big questions,
about big populations, and in real time – this is
transformative
Big Data Network
Phase 1 and 2
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
E-infrastructureLeadershipCouncil
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
Mandy Chessell
Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014
F i r s t
Interdisciplinary and “in the wild” *
“in it” versus “on it”
Nigel Shadbolt et al
Real life is and must be full of all kinds of social
constraint – the very processes from which society
arises. Computers can help if we use them to create
abstract social machines on the Web: processes in
which the people do the creative work and the
machine does the administration...The stage is set for
an evolutionary growth of new social engines.The
ability to create new forms of social process would be
given to the world at large, and development would be
rapid.
Berners-Lee, Weaving the Web, 1999 (pp. 172–175)
The Order of Social Machines
Some Social Machines
SOCIAM:TheTheory and Practice of Social Machines is funded by the UK Engineering and Physical Sciences Research Council (EPSRC) under grant
number EPJ017728/1 and comprises the Universities of Southampton, Oxford and Edinburgh. See sociam.org
Edwards, P. N., et al. (2013) Knowledge Infrastructures: Intellectual Frameworks and Research
Challenges.Ann Arbor: Deep Blue. http://hdl.handle.net/2027.42/97552
Web as
lens
Web as
artefact
Web Observatories
http://www.w3.org/community/webobservatory/
Big data elephant versus sense-making network?
The challenge is to foster the co-constituted socio-technical
system on the right i.e. a computationally-enabled sense-making
network of expertise, data, models and narratives.
Iain Buchan
Join the W3C Community Group www.w3.org/community/rosc
Jun Zhao
www.researchobject.org
PipWillcox
Take homes
▶ New forms of data enable us answer old questions in
new ways and to answer entirely new questions
▶ There are multiple shifts occurring:
– Volumes of data
– Realtime analytics
– Computational infrastructure
– Dataflows vs datasets (and curation infrastructure)
– Correlation vs causation
– Increasing automation
– Machine-to-Machine in Internet of Things
david.deroure@oerc.ox.ac.uk
www.oerc.ox.ac.uk/people/dder
@dder
Slide and image credits: Fiona Armstrong, Christine Borgman, Iain
Buchan, Mandy Chessell, Neil Chue Hong, Nigel Shadbolt, Pip
Willcox, Jun Zhao, Guardian newspaper
www.oerc.ox.ac.uk
david.deroure@oerc.ox.ac.uk
@dder

More Related Content

PPTX
Workshop at Oxford on publishing for early career researchers - April 2011
Jisc
 
PPTX
The wider environment of open scholarship – Jisc and CNI conference 10 July ...
Jisc
 
PPTX
Digital Humanities and the First World War
Adrian Stevenson
 
PPTX
Frictionless Sharing - The New Normal?
Martin Hamilton
 
PPTX
Stronger together: community initiatives in journal management
Jisc
 
PPTX
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Jisc
 
PPTX
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Jisc
 
PPTX
Building an international infrastructure for research data - Jisc Digital Fes...
Jisc
 
Workshop at Oxford on publishing for early career researchers - April 2011
Jisc
 
The wider environment of open scholarship – Jisc and CNI conference 10 July ...
Jisc
 
Digital Humanities and the First World War
Adrian Stevenson
 
Frictionless Sharing - The New Normal?
Martin Hamilton
 
Stronger together: community initiatives in journal management
Jisc
 
Keynote speech - Carole Goble - Jisc Digital Festival 2015
Jisc
 
Meeting the Research Data Management Challenge - Rachel Bruce, Kevin Ashley, ...
Jisc
 
Building an international infrastructure for research data - Jisc Digital Fes...
Jisc
 

What's hot (20)

PPTX
Opening up data: a UK perspective – Jisc and CNI conference 10 July 2014
Jisc
 
PPT
Harnessing Collective Intelligence for Sustainable Development
EDINA, University of Edinburgh
 
PPTX
Jisc's new shared data centre
Jisc
 
PPTX
Why science needs open data – Jisc and CNI conference 10 July 2014
Jisc
 
PDF
Digital literacy: key issues
Jisc
 
PPTX
Agile resources on the open web …. a global digital library
Jisc
 
PPTX
SPARC Repositories conference in Baltimore - Nov 2010
Jisc
 
PDF
Research data spring: streamlining deposit
Jisc RDM
 
PPTX
SafeShare - Networkshop44
Jisc
 
PPTX
The user -driven evolution of Janet - Jisc Digifest 2016
Jisc
 
PPTX
Big data and the dark arts - Jisc Digital Media 2015
Jisc
 
PPT
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
GigaScience, BGI Hong Kong
 
PPTX
Student expectations of entering higher education - Jisc Digital Festival 2015
Jisc
 
PPT
Agile Data Access Initiative
EDINA, University of Edinburgh
 
PPTX
Parallel session: international
Jisc
 
PPTX
Application of Assent in the safe - Networkshop44
Jisc
 
PPT
Open Book Publishers, Rupert Gatti
OAbooks
 
PPTX
Implementing Open Access: Effective Management of Your Research Data
Martin Hamilton
 
PPTX
How you can enhance the efficiency and effectiveness of teaching and learning...
Jisc
 
PPTX
Immersive informatics - research data management at Pitt iSchool and Carnegie...
Keith Webster
 
Opening up data: a UK perspective – Jisc and CNI conference 10 July 2014
Jisc
 
Harnessing Collective Intelligence for Sustainable Development
EDINA, University of Edinburgh
 
Jisc's new shared data centre
Jisc
 
Why science needs open data – Jisc and CNI conference 10 July 2014
Jisc
 
Digital literacy: key issues
Jisc
 
Agile resources on the open web …. a global digital library
Jisc
 
SPARC Repositories conference in Baltimore - Nov 2010
Jisc
 
Research data spring: streamlining deposit
Jisc RDM
 
SafeShare - Networkshop44
Jisc
 
The user -driven evolution of Janet - Jisc Digifest 2016
Jisc
 
Big data and the dark arts - Jisc Digital Media 2015
Jisc
 
Scott Edmunds at OASP Asia: Open (and Big) Data – the next challenge
GigaScience, BGI Hong Kong
 
Student expectations of entering higher education - Jisc Digital Festival 2015
Jisc
 
Agile Data Access Initiative
EDINA, University of Edinburgh
 
Parallel session: international
Jisc
 
Application of Assent in the safe - Networkshop44
Jisc
 
Open Book Publishers, Rupert Gatti
OAbooks
 
Implementing Open Access: Effective Management of Your Research Data
Martin Hamilton
 
How you can enhance the efficiency and effectiveness of teaching and learning...
Jisc
 
Immersive informatics - research data management at Pitt iSchool and Carnegie...
Keith Webster
 
Ad

Viewers also liked (7)

PPTX
Accessing and Using Big Data to Advance Social Science Knowledge
Josh Cowls
 
PDF
Big Data and Social Science Theory: Leveraging Large Scale Data to Discover N...
Han Woo PARK
 
PDF
Internet Archives and Social Science Research - Yeungnam University
mwe400
 
PPT
Moving from small science to big science: Social and organizational impedimen...
Eric Meyer
 
PDF
Search Tasks, Proactive Search & Digital Assistants
Rishabh Mehrotra
 
PDF
Designing the search experience by tyler tate : twigkit
Tyler Tate
 
PPTX
Social impacts information technology
Rimple Darra
 
Accessing and Using Big Data to Advance Social Science Knowledge
Josh Cowls
 
Big Data and Social Science Theory: Leveraging Large Scale Data to Discover N...
Han Woo PARK
 
Internet Archives and Social Science Research - Yeungnam University
mwe400
 
Moving from small science to big science: Social and organizational impedimen...
Eric Meyer
 
Search Tasks, Proactive Search & Digital Assistants
Rishabh Mehrotra
 
Designing the search experience by tyler tate : twigkit
Tyler Tate
 
Social impacts information technology
Rimple Darra
 
Ad

Similar to Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014 (20)

PPTX
Big Data for the Social Sciences
David De Roure
 
PPTX
Social Science Landscape for Web Observatories
David De Roure
 
PPTX
Big Data and Social Machines
David De Roure
 
PPTX
Web Observatories and e-Research
David De Roure
 
PDF
Big Data Challenges for the Social Sciences
David De Roure
 
PDF
New Data `New Computation
David De Roure
 
PPTX
Social Machines - A Disruptive Technology?
David De Roure
 
PPTX
Social Machines Paradigm
David De Roure
 
PDF
Emerging Forms of Data and Analytics
David De Roure
 
PDF
Big Data and Social Sciences
David De Roure
 
PPTX
Future of Scholarly Communications
David De Roure
 
PPTX
Social Machines IIIT
David De Roure
 
PDF
Taking IT for Granted - David De Roure
IT as a Utility Network+ (ITaaU)
 
PDF
New and Emerging Forms of Data
David De Roure
 
PPTX
Social Machines GSS
David De Roure
 
PPTX
Social Machines in Practice: Solutions, Stakeholders and Scopes
Clare Hooper
 
PDF
Ethics of Automation
David De Roure
 
PDF
Shared data and the future of libraries
Regan Harper
 
PPTX
New Forms of Data and Scientific Research
David De Roure
 
PPTX
Taking IT for Granted
David De Roure
 
Big Data for the Social Sciences
David De Roure
 
Social Science Landscape for Web Observatories
David De Roure
 
Big Data and Social Machines
David De Roure
 
Web Observatories and e-Research
David De Roure
 
Big Data Challenges for the Social Sciences
David De Roure
 
New Data `New Computation
David De Roure
 
Social Machines - A Disruptive Technology?
David De Roure
 
Social Machines Paradigm
David De Roure
 
Emerging Forms of Data and Analytics
David De Roure
 
Big Data and Social Sciences
David De Roure
 
Future of Scholarly Communications
David De Roure
 
Social Machines IIIT
David De Roure
 
Taking IT for Granted - David De Roure
IT as a Utility Network+ (ITaaU)
 
New and Emerging Forms of Data
David De Roure
 
Social Machines GSS
David De Roure
 
Social Machines in Practice: Solutions, Stakeholders and Scopes
Clare Hooper
 
Ethics of Automation
David De Roure
 
Shared data and the future of libraries
Regan Harper
 
New Forms of Data and Scientific Research
David De Roure
 
Taking IT for Granted
David De Roure
 

More from Jisc (20)

PPTX
Strengthening open access through collaboration: building connections with OP...
Jisc
 
PPTX
Andrew-Brown-JUSP-showcase-20240730.pptx
Jisc
 
PPTX
JUSP Showcase - Rebuilding Data presentation
Jisc
 
PPTX
Adobe Express Engagement Webinar (Delegate).pptx
Jisc
 
PPTX
FE Accessibility training matrix partnership - information session
Jisc
 
PPTX
Procuring a research management system: why is it so hard?
Jisc
 
PPTX
Adobe Express Engagement Webinar (Delegate).pptx
Jisc
 
PPTX
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
PPTX
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
PPTX
The approach at University of Liverpool.pptx
Jisc
 
PPTX
Jisc's value to HE: the University of Sheffield
Jisc
 
PPTX
Towards a code of practice for AI in AT.pptx
Jisc
 
PPTX
Jamworks pilot and AI at Jisc (20/03/2024)
Jisc
 
PPTX
Wellbeing inclusion and digital dystopias.pptx
Jisc
 
PPTX
Accessible Digital Futures project (20/03/2024)
Jisc
 
PPTX
Procuring digital preservation CAN be quick and painless with our new dynamic...
Jisc
 
PPTX
International students’ digital experience: understanding and mitigating the ...
Jisc
 
PPTX
Digital Storytelling Community Launch!.pptx
Jisc
 
PPTX
Open Access book publishing understanding your options (1).pptx
Jisc
 
PPTX
Scottish Universities Press supporting authors with requirements for open acc...
Jisc
 
Strengthening open access through collaboration: building connections with OP...
Jisc
 
Andrew-Brown-JUSP-showcase-20240730.pptx
Jisc
 
JUSP Showcase - Rebuilding Data presentation
Jisc
 
Adobe Express Engagement Webinar (Delegate).pptx
Jisc
 
FE Accessibility training matrix partnership - information session
Jisc
 
Procuring a research management system: why is it so hard?
Jisc
 
Adobe Express Engagement Webinar (Delegate).pptx
Jisc
 
How libraries can support authors with open access requirements for UKRI fund...
Jisc
 
Supporting (UKRI) OA monographs at Salford.pptx
Jisc
 
The approach at University of Liverpool.pptx
Jisc
 
Jisc's value to HE: the University of Sheffield
Jisc
 
Towards a code of practice for AI in AT.pptx
Jisc
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jisc
 
Wellbeing inclusion and digital dystopias.pptx
Jisc
 
Accessible Digital Futures project (20/03/2024)
Jisc
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Jisc
 
International students’ digital experience: understanding and mitigating the ...
Jisc
 
Digital Storytelling Community Launch!.pptx
Jisc
 
Open Access book publishing understanding your options (1).pptx
Jisc
 
Scottish Universities Press supporting authors with requirements for open acc...
Jisc
 

Recently uploaded (20)

PPTX
Understanding operators in c language.pptx
auteharshil95
 
PDF
UTS Health Student Promotional Representative_Position Description.pdf
Faculty of Health, University of Technology Sydney
 
PDF
Mga Unang Hakbang Tungo Sa Tao by Joe Vibar Nero.pdf
MariellaTBesana
 
PPTX
IMMUNIZATION PROGRAMME pptx
AneetaSharma15
 
PPTX
Week 4 Term 3 Study Techniques revisited.pptx
mansk2
 
PPTX
Open Quiz Monsoon Mind Game Final Set.pptx
Sourav Kr Podder
 
PDF
High Ground Student Revision Booklet Preview
jpinnuck
 
PDF
Introducing Procurement and Supply L2M1.pdf
labyankof
 
PPTX
Introduction and Scope of Bichemistry.pptx
shantiyogi
 
PPTX
How to Manage Leads in Odoo 18 CRM - Odoo Slides
Celine George
 
PPTX
HISTORY COLLECTION FOR PSYCHIATRIC PATIENTS.pptx
PoojaSen20
 
PPTX
Presentation on Janskhiya sthirata kosh.
Ms Usha Vadhel
 
PPTX
ACUTE NASOPHARYNGITIS. pptx
AneetaSharma15
 
PDF
Arihant Class 10 All in One Maths full pdf
sajal kumar
 
PPTX
Dakar Framework Education For All- 2000(Act)
santoshmohalik1
 
PPTX
Odoo 18 Sales_ Managing Quotation Validity
Celine George
 
PDF
The Final Stretch: How to Release a Game and Not Die in the Process.
Marta Fijak
 
PDF
Phylum Arthropoda: Characteristics and Classification, Entomology Lecture
Miraj Khan
 
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
Mithil Fal Desai
 
PPTX
Skill Development Program For Physiotherapy Students by SRY.pptx
Prof.Dr.Y.SHANTHOSHRAJA MPT Orthopedic., MSc Microbiology
 
Understanding operators in c language.pptx
auteharshil95
 
UTS Health Student Promotional Representative_Position Description.pdf
Faculty of Health, University of Technology Sydney
 
Mga Unang Hakbang Tungo Sa Tao by Joe Vibar Nero.pdf
MariellaTBesana
 
IMMUNIZATION PROGRAMME pptx
AneetaSharma15
 
Week 4 Term 3 Study Techniques revisited.pptx
mansk2
 
Open Quiz Monsoon Mind Game Final Set.pptx
Sourav Kr Podder
 
High Ground Student Revision Booklet Preview
jpinnuck
 
Introducing Procurement and Supply L2M1.pdf
labyankof
 
Introduction and Scope of Bichemistry.pptx
shantiyogi
 
How to Manage Leads in Odoo 18 CRM - Odoo Slides
Celine George
 
HISTORY COLLECTION FOR PSYCHIATRIC PATIENTS.pptx
PoojaSen20
 
Presentation on Janskhiya sthirata kosh.
Ms Usha Vadhel
 
ACUTE NASOPHARYNGITIS. pptx
AneetaSharma15
 
Arihant Class 10 All in One Maths full pdf
sajal kumar
 
Dakar Framework Education For All- 2000(Act)
santoshmohalik1
 
Odoo 18 Sales_ Managing Quotation Validity
Celine George
 
The Final Stretch: How to Release a Game and Not Die in the Process.
Marta Fijak
 
Phylum Arthropoda: Characteristics and Classification, Entomology Lecture
Miraj Khan
 
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
Mithil Fal Desai
 
Skill Development Program For Physiotherapy Students by SRY.pptx
Prof.Dr.Y.SHANTHOSHRAJA MPT Orthopedic., MSc Microbiology
 

Big Data for the Social Sciences - David De Roure - Jisc Digital Festival 2014

  • 1. Big Data for the Social Sciences David De Roure, Strategic Adviser for Data Resources @dder
  • 2. Big Data doesn’t respect disciplinary boundaries Digital Social Research
  • 8. The Big Picture More people Moremachines Big Data Big Compute Conventional Computation “Big Social” Social Networks e-infrastructure online R&D Big Data Production & Analytics deeply about society
  • 9. RCUK and Big Data ▶ ‘Big data is a term for a collection of datasets so large and complex that it is beyond the ability of typical database software tools to capture, store, manage, and analyse them. ‘Big’ is not defined as being larger than a certain number of ‘bytes’ because as technology advances over time, the size of datasets that qualify as big data will also increase’ (RCUK) ▶ But why do we want it? New forms of data enable us to 1. Answer existing research questions in new ways 2. Ask entirely new research questions
  • 11. NERC Big Data ...as diverse as our science • From micro- to macro-scale • Many sources: • Monitoring campaigns • Field sites & sensors • State-of-the-art laboratories • Ships & aircraft • Remote Sensing & EO • Regulator networks • Volunteers/citizen science • Model output • Long-term and unique! 10µm
  • 12. 100 TB Big data: time-based media including film, tv, cctv footage - retail data - geospatial data - email and social media - images and associated metadata - performance data including raw data of recordings, choreography, performance structure - open government data - music - large-scale digital scans - library, museum & gallery archives and metadata
  • 13. Research benefits of new data ▶ Undertaking research on pressing policy-related issues without the need for new data collection • Food consumption, social background and obesity • Energy consumption, housing type and climatic conditions • Rural location, private/public transport alternatives and incomes • School attainment, higher education participation, subject choices, student debt and later incomes ▶ New data such as social media enable us to ask big questions, about big populations, and in real time – this is transformative
  • 22. F i r s t
  • 23. Interdisciplinary and “in the wild” * “in it” versus “on it”
  • 25. Real life is and must be full of all kinds of social constraint – the very processes from which society arises. Computers can help if we use them to create abstract social machines on the Web: processes in which the people do the creative work and the machine does the administration...The stage is set for an evolutionary growth of new social engines.The ability to create new forms of social process would be given to the world at large, and development would be rapid. Berners-Lee, Weaving the Web, 1999 (pp. 172–175) The Order of Social Machines
  • 26. Some Social Machines SOCIAM:TheTheory and Practice of Social Machines is funded by the UK Engineering and Physical Sciences Research Council (EPSRC) under grant number EPJ017728/1 and comprises the Universities of Southampton, Oxford and Edinburgh. See sociam.org
  • 27. Edwards, P. N., et al. (2013) Knowledge Infrastructures: Intellectual Frameworks and Research Challenges.Ann Arbor: Deep Blue. http://hdl.handle.net/2027.42/97552
  • 28. Web as lens Web as artefact Web Observatories http://www.w3.org/community/webobservatory/
  • 29. Big data elephant versus sense-making network? The challenge is to foster the co-constituted socio-technical system on the right i.e. a computationally-enabled sense-making network of expertise, data, models and narratives. Iain Buchan
  • 30. Join the W3C Community Group www.w3.org/community/rosc Jun Zhao www.researchobject.org
  • 32. Take homes ▶ New forms of data enable us answer old questions in new ways and to answer entirely new questions ▶ There are multiple shifts occurring: – Volumes of data – Realtime analytics – Computational infrastructure – Dataflows vs datasets (and curation infrastructure) – Correlation vs causation – Increasing automation – Machine-to-Machine in Internet of Things
  • 33. [email protected] www.oerc.ox.ac.uk/people/dder @dder Slide and image credits: Fiona Armstrong, Christine Borgman, Iain Buchan, Mandy Chessell, Neil Chue Hong, Nigel Shadbolt, Pip Willcox, Jun Zhao, Guardian newspaper