0% found this document useful (0 votes)

8 views

Big_Data_Visualization_Tools

Uploaded by

rohaabbas658

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

8 views

Big_Data_Visualization_Tools

Uploaded by

rohaabbas658

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

Big Data Visualization Tools *

Nikos Bikakis
ATHENA Research Center, Athens, Greece

Data visualization and analytics are nowadays one of the corner-stones of Data
Science, turning the abundance of Big Data being produced through modern systems
into actionable knowledge. Indeed, the Big Data era has realized the availability of
voluminous datasets that are dynamic, noisy and heterogeneous in nature.
Transforming a data-curious user into someone who can access and analyze that
data is even more burdensome now for a great number of users with little or no
support and expertise on the data processing part. Thus, the area of data
visualization and analysis has gained great attention recently, calling for joint action
from different research areas and communities such as information visualization,
data management and mining, human-computer interaction, and computer graphics.
This article presents the limitations of traditional visualization systems in the Big
Data era. Additionally, it discusses the major prerequisites and challenges that
should be addressed by modern visualization systems. Finally, the state-of-the-art
methods that have been developed in the context of the Big Data visualization and
analytics are presented, considering methods from the Data Management and
Mining, Information Visualization and Human-Computer Interaction communities.

Synonyms
Exploratory data analysis; Information visualization; Interactive visualization;
Visual analytics; Visual exploration

Definition
Data visualization is the presentation of data in a pictorial or graphical format, and a data
visualization tool is the software that generates this presentation. Data visualization offers
intuitive ways for information perception and manipulation that essentially amplify the
overall cognitive performance of information processing, enabling users to effectively
identify interesting patterns, infer correlations and causalities, and support sense-making
activities.

*
This article appears in: Encyclopedia of Big Data Technologies, 2nd Edition,
Springer, 2022
https://doi.org/10.1007/978-3-319-63962-8_109-2
Overview
Data visualization provides users with intuitive means to interactively explore and analyze
data, enabling them to identify interesting patterns, discover correlations and causalities, and
support sense-making activities (Throughout the article, terms visualization and visual
exploration, as well as terms tool and system, are used interchangeably.). This is of great
importance, especially given the massive volumes of digital information concerning nearly
every aspect of human activity that are currently being produced and collected.
Data visualization and analytics are nowadays one of the cornerstones of Data Science,
turning the abundance of Big Data being produced through modern systems into actionable
knowledge. Indeed, the Big Data era has realized the availability of voluminous datasets that
are dynamic, noisy, and heterogeneous in nature. Transforming a data-curious user into
someone who can access and analyze that data is even more burdensome now for a great
number of users with little or no support and expertise on the data processing part. Thus, the
area of data visualization and analysis has gained great attention recently, calling for joint
action from different research areas and communities such as information visualization, data
management and mining, human-computer interaction, and computer graphics.
Several traditional problems from those communities, such as efficient data storage,
querying and indexing for enabling visual analytics, ways for visual presentation of massive
data, efficient interaction, and personalization techniques that can fit to different user needs,
are revisited with Big Data in mind (Andrienko et al. 2020; Qin et al. 2020; Idreos et al.
2015; Behrisch et al. 2019; Godfrey et al. 2016; Shneiderman 2008).
Given the above, modern visualization systems should effectively and efficiently handle the
following aspects:
• Real-Time Interaction. Efficient and scalable techniques should support the
interaction with billion-objects datasets while maintaining an acceptable system
response in in less than a second.
• On-the-Fly Visualization. Support of on-the-fly visualizations over large and
dynamic sets of volatile raw (i.e., not preprocessed) data is required. In several
cases, a preprocessing phased is not an option.
• Visual Scalability. Provision of effective data abstraction mechanisms is necessary
for addressing problems related to visual information overloading (aka
overplotting).
• User Assistance and Personalization. Encouraging user comprehension and
offering customization capabilities to different user-defined exploration scenarios
and preferences according to the analysis needs are important features.
The literature on visualization is extensive, covering a large range of fields and many
decades (Rees and Laramee 2019; McNabb and Laramee 2017). Data visualization is
discussed in a great number of recent introductory-level textbooks, such as Ward et al.
(2015), Keim et al. (2010). Further, surveys of Big Data visualization systems can be found
at (Qin et al. 2020; Po et al. 2020; Godfrey et al. 2016; Behrisch et al. 2019; Bikakis and
Sellis 2016; Idreos et al. 2015).
Finally, there is a great deal of information regarding visualization tools available in the
Web. We mention dataviz.tools (http://dataviz.tools) and datavizcatalogue
(www.datavizcatalogue. com) which are catalogs containing a large number of visualization
tools, libraries, and resources.

Visualization in Big Data Era

This section discusses the basic concepts related to Big Data visualization. First, the
limitations of traditional visualization systems are outlined. Then, the basic characteristics of
data visualization in the context of Big Data era are presented. Finally, the major
prerequisites and challenges that should be addressed by modern visualization systems are
discussed.

Traditional Visualization Systems. Most traditional visualization systems perform well

for ad hoc visualizations of small data files (e.g., showing a trend line or a bar chart) or over
aggregated data (e.g., summaries of data points, into which user can zoom in), which can fit
in main memory. Hence, they restrict themselves to dealing with small datasets, which can
be easily handled and analyzed with conventional data management and visual explorations
techniques. For larger data files, the conventional systems usually require a preprocessing
phase, such as loading in a data management system. As a result, they are limited to
accessing preprocessed sets of static data.

Big Data Era. On the other hand, nowadays, the Big Data era has made available large
numbers of very big datasets that are often dynamic and characterized by high variety and
volatility. For example, in several cases (e.g., scientific databases), new data constantly
arrive on an hourly basis; in other cases, data sources offer query or API endpoints for online
access and updating. Further, nowadays, an increasingly large number of diverse users (i.e.,
users with different preferences or skills) explore and analyze data in a plethora of different
scenarios and tasks.

Visualization Systems in Big Data Era. Modern systems should be able to efficiently
handle big dynamic datasets, operating on machines with limited computational and memory
resources (e.g., laptops). The dynamic nature of nowadays data (e.g., stream data), hinders
the application of a preprocessing phase, such as traditional database loading and indexing.
Hence, systems should provide on-the-fly processing and visualization over large sets of
data.
Further, in conjunction with performance issues, modern systems have to address challenges
related to visual presentation. Visualizing a large number of data objects is a challenging
task; modern systems have to “squeeze a billion records into a million pixels” (Shneiderman
2008). Even in small datasets, offering a dataset overview may be extremely difficult; in
both cases, information overloading (aka overplotting) is a common issue. Consequently,
visual scalability is a basic requirement of modern systems, which have to effectively
support data reduction/abstraction (e.g., sampling, aggregation) over enormous numbers of
data objects.
Apart from the aforementioned requirements, modern systems must also satisfy the diversity
of preferences and requirements posed by different users and tasks. Modern systems should
provide the user with the ability to customize the exploration experience based on her
preferences and the individual requirements of each examined task. Additionally, systems
should automatically adjust their parameters by taking into account the environment setting
and available resources, e.g., screen resolution/size, available memory.

Key Research Findings

This section presents how state-of-the-art approaches from data management and mining,
information visualization, and human-computer interaction communities attempt to handle
the challenges that arise in the Big Data era.

Data Reduction. In order to handle and visualize large datasets, modern systems have to
deal with information overloading issues. Offering visual scalability is crucial in Big Data
visualization. Systems should provide efficient and effective abstraction and summarization
mechanisms. In this direction, a large number of systems use approximation techniques (aka
data reduction techniques), in which abstract sets of data are computed. Considering the
existing approaches, most of them are based on (1) sampling and filtering (Fisher et al. 2012;
Park et al. 2016; Agarwal et al. 2013; Battle et al. 2013) and/or (2) aggregation (e.g.,
binning, clustering) (Elmqvist and Fekete 2010; Bikakis et al. 2017; Jugel et al. 2015; Liu et
al. 2013).

Hierarchical Data Exploration. Data reduction techniques are often defined hierarchically
(Elmqvist and Fekete 2010), allowing users to explore data in multiple “level of detail” by,
e.g., hierarchical aggregation.
Hierarchical approaches (aka multilevel) allow the visual exploration of very large datasets
in multiple levels (with different “level of detail”), offering both an overview and an
intuitive and effective way for finding specific parts within a dataset.
Particularly, in hierarchical approaches, the user first obtains an overview of the dataset
before proceeding to data exploration operations (e.g., Roll-Up, Drill-Down, Zoom, Filter)
and finally retrieving details about the data. A significant challenge, in large data
visualization, is the problem of overplotting. It can effectively be addressed in hierarchical
approaches, in which, in each level, the number of the presented visual elements is
controlled by data reduction methods.
Hierarchical techniques have been extensively used in large graphs/network visualization, in
order to handle the common problem of overloading, in winch the graph is presented as
“hairball.” In these techniques the graph is recursively decomposed into smaller subgraphs
that form a hierarchy of abstraction layers. In most cases, the hierarchy is constructed by
exploiting clustering and partitioning (Rodrigues Jr. et al. 2013; Tominski et al. 2009),
sampling (Sundara et al. 2010), and edge bundling (Gansner et al. 2011) techniques.

Progressive Data Visualization. Visual data exploration requires real-time system’s

response. However, computing complete results over large (unprocessed) datasets may be
extremely costly and in several cases unnecessary. Modern systems should progressively
return partial and preferably representative results, as soon as possible (Angelini et al. 2018;
Zgraggen et al. 2017).
Progressiveness can significantly improve efficiency in exploration scenarios, where it is
common that users attempt to find something interesting without knowing what exactly they
are searching for beforehand. In this case, users perform a sequence of operations (e.g.,
queries), where the result of each operation determines the formulation of the next operation.
Recently, many systems adopt the progressive paradigm attempting to reduce the response
time (Moritz et al. 2017; Rahman et al. 2017; Angelini et al. 2018; Zgraggen et al. 2017;
Fisher et al. 2012; Agarwal et al. 2013). Progressive approaches, instead of performing all
the computations in one step (that can take a long time to complete), split them in a series of
short chunks of approximate computations that improved with time. Therefore, instead of
waiting for an unbounded amount of time, users can see the results unfolding progressively.
This way the users are able to interrupt the execution and define the next operation, without
waiting the exact result to be computed.

Adaptive Indexing and In-situ Data Management. Several approaches like database
cracking and adaptive indexing have been adopted in data exploration scenarios. The basic
idea of these is to incrementally adapt the indexes and/or refine the physical order of data,
during query processing, following the characteristics of the workload (Pedro et al. 2019;
Vikram et al. 2020; Matheus et al. 2021; Stratos et al. 2007).
In-situ paradigm (Idreos et al. 2011; Alagiannis et al. 2012; Bikakis et al. 2021; Maroulis et
al. 2022; Olma et al. 2017) is a recent trend that aims at enabling the on-the-fly querying
over large sets of raw data, by avoiding the (pre)processing (e.g., loading and indexing)
overhead of traditional DBMS techniques. In-situ query processing aims at avoiding data
loading in a DBMS by accessing and operating directly over raw data files. In these systems,
in situ incremental and adaptive processing and indexing techniques are used, in which
small parts of raw data are processed incrementally “following” users’ interactions.
Furthermore, several well-known DBMS support in situ SQL querying over CSV files.
Particularly, MySQL provides the CSV Storage Engine, Oracle offers the External Tables
and Postgres has the Foreign Data.

Visual-Oriented Data Structures. Numerous data visualization systems have been

developed on-top of data structures and indexes which are designed in the context of visual
exploration, to improve efficiency and scalability. VisTrees (El-Hindi et al. 2016) and
HETree (Bikakis et al. 2017) are tree-based main-memory indexes that address visual
exploration use cases; i.e., they offer exploration-oriented features such as incremental index
construction and adaptation.
Nanocubes (Lins et al. 2013), Hashedcubes (de Lara Pahins et al. 2017), SmartCube (Liu et
al. 2020), Gaussian Cubes (Wang et al. 2017), and TopKubes (Miranda et al. 2017) are
main-memory data structures supporting interactive visualization. They are based on main-
memory variations of a data cube in order to reduce the time needed to generate the
visualization.
Spatial indexes have been used in graphVizdb (Bikakis et al. 2016) is a graph-based
visualization tool, which employs a 2D spatial index (e.g., R-tree) and maps user interactions
into window 2D queries. Spatial 2D indexing is also adopted in Kyrix (Wenbo et al. 2019),
enabling efficient Zoom and Pan operations over arbitrary data types. Finally, tile-based
structures are used in several visualization systems, such as RawVis (Bikakis et al. 2021),
AID (Saheli et al. 2019), ForeCache (Battle et al. 2016).

Caching and Prefetching. Recall that, in exploration scenarios, a sequence of operations is

performed and, in most cases, each operation is driven by the previous one. In this setting,
caching and/or prefetching the sets of data that are likely to be accessed by the user in the
near future can significantly reduce the response time (Battle et al. 2016; Tauheed et al.
2012). Most of these approaches use prediction techniques which exploit several factors
(e.g., user behavior, user profile, use case) in order to determine the upcoming user
interactions.

User Assistance. The huge amount of available information makes it difficult for users to
manually explore and analyze data. Modern systems should provide mechanisms that assist
the user and reduce the effort needed on their part, considering the diversity of preferences
and requirements posed by different users and tasks.
Recently, several approaches have been developed in the context of visualization
recommendation (Vartak et al. 2016). These approaches recommend the most suitable
visualizations in order to assist users throughout the analysis process. Usually, the
recommendations take into account several factors, such as data characteristics, environment
setting and available resources (e.g., screen resolution/size, available memory), examined
task, user preferences and behavior, etc.
Considering data characteristics, there are several systems that recommend the most suitable
visualization technique (and parameters) based on the type, attributes, distribution, or
cardinality of the input data (Key et al. 2012; Ehsan et al. 2016). Other approaches provide
visualization recommendations based on user behavior and preferences (Mutlu et al. 2016),
using machine learning (Hu et al. 2019) or similarity-based techniques (Kim et al. 2017). In
a similar context, some systems assist users by recommending certain visualizations that
reveal surprising, interesting data or outliers (Vartak et al. 2014; Wongsuphasawat et al.
2016).

Examples of Applications
Visualization techniques are of great importance in a wide range of application areas in the
Big Data era. The volume, velocity, heterogeneity, and complexity of available data make it
extremely difficult for humans to explore and analyze data. Data visualization enables users
to perform a series of analysis tasks that are not always possible with common data analysis
techniques (Keim et al. 2010).
Major application domains for data visualization and analytics are Physics and Astronomy.
Satellites and telescopes collect daily massive and dynamic streams of data. Using traditional
analysis techniques, astronomers are able to identify noise, patterns, and similarities. On the
other hand, visual analytics can enable astronomers to identify unexpected phenomena and
perform several complex operations, which are not are feasible by traditional analysis
approaches.
Another application domain is atmospheric sciences like Meteorology and Climatology. In
this domain high volumes of data are collected from sensors and satellites on a daily basis.
Storing these data over the years results in massive amounts of data that have to be analyzed.
Visual analytics can assist scientists to perform core tasks, such as climate factors correlation
analysis, event prediction, etc. Further, in this domain, visualization systems are used in
several scenarios in order capture real-time phenomena, such as hurricanes, fires, floods, and
tsunamis.
In the domain of Bioinformatics, visualization techniques are exploited in numerous tasks.
For example, analyzing the large amounts of biological data produced by DNA sequencers is
extremely challenging. Visual techniques can help biologist to gain insight and identify
interesting “areas” of genes on which to perform their experiments.
In the Big Data era, visualization techniques are extensively used in the business intelligence
domain. Finance markets is one application area, where visual analytics allow to monitor
markets, identify trends, and perform predictions. Besides, market research is also an
application area. Marketing agencies and in-house marketing departments analyze a plethora
of diverse sources (e.g., finance data, customer behavior, social media). Visual techniques
are exploited to realize task such as identifying trends, finding emerging market
opportunities, finding influential users and communities, and optimizing operations (e.g.,
troubleshooting of products and services), business analysis, and development (e.g., churn
rate prediction, marketing optimization).

Future Directions for Research

From a community perspective, the challenges related to data visualization and analysis
involve different communities and research areas such as Data management & Mining,
Information Visualization, Human-Computer Interaction, and Computer Graphics. In what
follows we summarize some of the basic challenges indicated in a recent report (Andrienko
et al. 2020).

Understand needs, personalize, and guide. Modern systems need to handle several major
user-centric challenges. Systems should understand what the users need to solve their
problems and offer guidance (“Show the Data not Seen by Humans”). In this context, the
following basic challenge can be considered: (a) recommend views of the data that the users
might want to analyze; (b) find what parts of data will be useful for each task; (c) provide
insights recommendations; (d) produce data stories and explanations; (e) develop novel
interfaces that assist users to understand data types and properties of the data; (f) integrated
human factors related to human vision and perception to analysis pipeline, so users
supervise, or provide feedback to systems.

Scalability and efficiency. Another great challenge is related to the systems’ scalability and
efficiency. This is to enable visualization systems to efficiently handle billion objects
datasets, while limiting the response to a few milliseconds. In that direction, the challenges
involve how to build tools that can perform interactive operations and complex analytics
over massive sets of data. In that respect, there is the need for novel approaches (e.g.,
progressive data processing) that can handle large streaming, sampled, uncertain, high-
dimensional, and noisy data.
Data-intensive applications. Classical data management problems, such as data storage,
querying, and indexing, are highly related to efficiency and scalability of the modern
visualization systems. However, in the context of visual analysis, solving such problems
reveals several “new” challenges. Such challenges are considered the following: define
visualization-centric algebras, design visualization operators, implement operation
optimization techniques, define effective storage and indexing scheme.

Interactive machine learning. Building interactive tools and enabling visual analysis to
Machine Learning (ML) applications is a great challenge. For example, develop visual
methods for interpreting and techniques for interacting with ML models; implement
visualization systems that enable models’ troubleshooting, debugging, and comparison.

Cross-References
- Visualization
- Visualization Techniques
- Visualizing Semantic Data
- Graph Exploration and Search

References
Agarwal S, Mozafari B, Panda A, Milner H, Madden S, Stoica I (2013) Blinkdb:
Queries with bounded errors and bounded response times on very large data. In:
European Conference on Computer Systems (EuroSys)
Alagiannis I, Borovica R, Branco M, Idreos S, Ailamaki A (2012) Nodb: Efficient
query execution on raw data files. In: ACM conference on management of data
(SIGMOD)
Andrienko GL, Andrienko NV, Drucker SM, Fekete J, Fisher D, Idreos S, Kraska T,
Li G, Ma K, Mackinlay JD, Oulasvirta A, Schreck T, Schumann H, Stonebraker
M, Auber D, Bikakis N, Chrysanthis PK, Papastefanatos G, Sharaf MA (2020) Big
data visualization and analytics: Future research challenges and emerging
applications. In: Proceedings of the international workshop on big data visual
exploration and analytics (BigVis)
Angelini M, Santucci G, Schumann H, Schulz H (2018) A review and
characterization of progressive visual analytics. Informatics 5(3)
Battle L, Stonebraker M, Chang R (2013) Dynamic reduction of query result sets for
interactive visualizaton. In: IEEE Conf. on dig data (BigData)
Battle L, Chang R, Stonebraker M (2016) Dynamic prefetching of data tiles for
interactive visualization. In: ACM conference on management of data (SIGMOD)
Behrisch M, Streeb D, Stoffel F, Seebacher D, Matejek B, Weber SH, Mittelstaedt S,
Pfister H, Keim D (2019) Commercial visual analytics systems-advances in the
big data analytics field. IEEE Trans Vis Comput Graph (TVCG) 25(10)
Bikakis N, Sellis T (2016) Exploration and visualization in the web of big linked
data: A survey of the state of the art. In: 6th intl. workshop on linked web data
management (LWDM)
Bikakis N, Liagouris J, Krommyda M, Papastefanatos G, Sellis T (2016) graphVizdb:
A scalable platform for interactive large graph visualization. In: IEEE intl. conf.
on data engineering (ICDE)
Bikakis N, Papastefanatos G, Skourla M, Sellis T (2017) A hierarchical aggregation
framework for efficient multilevel visual exploration and analysis. Semantic Web
J 8(1)
Bikakis N, Maroulis S, Papastefanatos G, Vassiliadis P (2021) In-situ visual
exploration over big raw data. Information Systems, Elsevier 95
de Lara Pahins CA, Stephens SA, Scheidegger C, Comba JLD (2017) Hashedcubes:
Simple, low memory, real-time visual exploration of big data. IEEE Trans Vis
Comput Graph (TVCG) 23(1)
Ehsan H, Sharaf MA, Chrysanthis PK (2016) Muve: Efficient multi-objective view
recommendation for visual data exploration. In: IEEE intl. conf. on data
engineering (ICDE)
El-Hindi M, Zhao Z, Binnig C, Kraska T (2016) Vistrees: Fast indexes for interactive
data exploration. In: HILDA
Elmqvist N, Fekete J (2010) Hierarchical aggregation for information visualization:
overview, techniques, and design guidelines. IEEE Trans Vis Comput Graph
(TVCG) 16(3)
Fisher D, Popov IO, Drucker SM, Schraefel MC (2012) Trust me, I’m partially right:
Incremental visualization lets analysts explore large datasets faster. In: Conference
on human factors in computing systems (CHI)
Gansner ER, Hu Y, North SC, Scheidegger CE (2011) Multilevel agglomerative edge
bundling for visualizing large graphs. In: IEEE pacific visualization symposium
(PacificVis)
Godfrey P, Gryz J, Lasek P (2016) Interactive visualization of large data sets. IEEE
Trans Knowl Data Eng (TKDE) 28(8)
Hu KZ, Bakker MA, Li S, Kraska T, Hidalgo CA (2019) VizML: A machine learning
approach to visualization recommendation. In: Conference on human factors in
computing systems (CHI), p 128
Idreos S, Alagiannis I, Johnson R, Ailamaki A (2011) Here are my data files. Here
are my queries. Where are my results? In: Conf. on innovative data systems
research (CIDR)
Idreos S, Papaemmanouil O, Chaudhuri S (2015) Overview of data exploration
techniques. In: ACM conference on management of data (SIGMOD)
Jugel U, Jerzak Z, Hackenbroich G, Markl V (2015) VDDa: Automatic visualization-
driven data aggregation in relational databases. J Very Large Data Bases (VLDBJ)
Keim DA, Kohlhammer J, Ellis GP, Mansmann F (2010) Mastering the information
age - solving problems with visual analytics. Eurographics Association
Key A, Howe B, Perry D, Aragon CR (2012) Vizdeck: Self-organizing dashboards
for visual analytics. In: ACM conference on management of data (SIGMOD)
Kim Y, Wongsuphasawat K, Hullman J, Heer J (2017) Graphscape: A model for
automated reasoning about visualization similarity and sequencing. In: Conference
on human factors in computing systems (CHI)
Lins LD, Klosowski JT, Scheidegger CE (2013) Nanocubes for real-time exploration
of spatiotemporal datasets. IEEE Trans Vis Comput Graph (TVCG) 19:2456–2465
Liu Z, Jiang B, Heer J (2013) imMens: Real-time visual querying of big data. Comput
Graph Forum (CGF) 32(3):421–430
Liu C, Wu C, Shao H, Yuan X (2020) Smartcube: An adaptive data management
architecture for the real-time visualization of spatiotemporal datasets. IEEE Trans
Vis Comput Graph (TVCG) 26(1)
Maroulis S, Bikakis N, Papastefanatos G et al (2022) Resource-aware adaptive
indexing for in situ visual exploration and analytics. VLDB J.
https://doi.org/10.1007/s00778-022-00739-z
Matheus AN, Pedro H, Eduardo C de Almeida, Stefan M (2021) Multidimensional
adaptive & progressive indexes. In IEEE Conference on Data Engineering (ICDE),
pp 624–635
McNabb L, Laramee RS (2017) Survey of surveys (sos) - mapping the landscape of
survey papers in information visualization. Comput Graph Forum 36(3)
Miranda F, Lins L, Klosowski JT, Silva CT (2017) Topkube: A rank-aware data cube
for real-time exploration of spatiotemporal data. IEEE TVCG 24
Moritz D, Fisher D, Ding B, Wang C (2017) Trust, but verify: Optimistic
visualizations of approximate queries for exploring big data. In: Conference on
human factors in computing systems (CHI)
Mutlu B, Veas EE, Trattner C (2016) Vizrec: Recommending personalized
visualizations. ACM Trans Interact Intell Syst (TIIS) 6(4)
Olma M, Karpathiotakis M, Alagiannis I, Athanassoulis M, Ailamaki A (2017)
Slalom: Coasting through raw data via adaptive partitioning and indexing. VLDB
Endow 10(10)
Park Y, Cafarella MJ, Mozafari B (2016) Visualization-aware sampling for very large
databases. In: IEEE Intl. Conf. on Data Engineering (ICDE)
Pedro H, Stefan M, Hannes M, Mark R (2019) Progressive indexes: Indexing for
interactive data analysis. In Proc VLDB Endow 12(13):2366–2378
Po L, Bikakis N, Desimoni F, Papastefanatos G (2020) Linked data visualization:
Techniques, tools, and big data. Synthesis lectures on the data, semantics, and
knowledge, morgan and claypool
Qin X, Luo Y, Tang N, Li G (2020) Making data visualization more efficient and
effective: A survey. J Very Large Data Bases (VLDBJ) 29(1)
Rahman S, Aliakbarpour M, Kong H, Blais E, Karahalios K, Parameswaran AG,
Rubinfeld R (2017) I’ve Seen “enough”: Incrementally improving visualizations to
support rapid decision making. VLDB Endowment (PVLDB) 10(11)
Rees D, Laramee RS (2019) A survey of information visualization books. Comput
Graph Forum 38(1)
Rodrigues Jr. JFR, Tong H, Pan J, Traina AJM, Traina Jr. C, Faloutsos C (2013)
Large graph analysis in the GMine system. IEEE Trans Knowl Data Eng (TKDE)
25(1)
Saheli G, Ahmed E, Shipra J (2019) AID: An adaptive image data index for
interactive multilevel visualization. In IEEE International Conference on Data
Engineering (ICDE), 42:1594–1597. https://doi.org/10.1109/icde.2019.00150
Shneiderman B (2008) Extreme visualization: Squeezing a billion records into a
million pixels. In: ACM conference on management of data (SIGMOD)
Stratos I, Martin LK, Stefan M (2007) Database cracking. In Conference on
Innovative Data Systems Research (CIDR), pp 68–78
Sundara S, Atre M, Kolovski V, Das S, Wu Z, Chong EI, Srinivasan J (2010)
Visualizing large-scale RDF data using subsets, summaries, and sampling in
Oracle. In: IEEE intl. conf. on data engineering (ICDE), pp 1048–1059
Tauheed F, Heinis T, Schürmann F, Markram H, Ailamaki A (2012) SCOUT:
Prefetching for latent feature following queries. VLDB Endowment (PVLDB)
5(11)
Tominski C, Abello J, Schumann H (2009) Cgv - An interactive graph visualization
system. Comput Graph 33(6)
Vartak M, Madden S, Parameswaran AG, Polyzotis N (2014) SEEDB: Automatically
generating query visualizations. VLDB Endowment (PVLDB) 7(13)
Vartak M, Huang S, Siddiqui T, Madden S, Parameswaran AG (2016) Towards
visualization recommendation systems. SIGMOD Record 45(4)
Vikram N, Jialin D, Mohammad A, Tim K (2020) Learning multi-dimensional
indexes. SIGMOD Conference, pp 985–100
Wang Z, Ferreira N, Wei Y, Bhaskar AS, Scheidegger C (2017) Gaussian cubes:
Real-time modeling for visual exploration of large multidimensional datasets.
IEEE Trans Vis Comput Graph 23(1)
Ward MO, Grinstein G, Keim D (2015) Interactive data visualization: Foundations,
techniques, and applications, 2nd edn. A. K. Peters, Ltd.
Wenbo T, Xiaoyu L, Yedi W, Leilani B, Çagatay D, Remco C, Michael S (2019)
Kyrix: Interactive pan/zoom visualizations at scale. Comput Graph Forum 38(3):
529–540
Wongsuphasawat K, Moritz D, Anand A, Mackinlay JD, Howe B, Heer J (2016)
Voyager: Exploratory analysis via faceted browsing of visualization
recommendations. IEEE Trans Vis Comput Graph (TVCG) 22(1)
Zgraggen E, Galakatos A, Crotty A, Fekete J, Kraska T (2017) How progressive
visualizations affect exploratory analysis. IEEE Trans Vis Comput Graph
(TVCG) 23(8)

Scrubber 2
No ratings yet
Scrubber 2
18 pages
Big_Data_Visualization_Tools-Encyclopedia_of_Big_Data_Technologies-2nd_Edition-2021
No ratings yet
Big_Data_Visualization_Tools-Encyclopedia_of_Big_Data_Technologies-2nd_Edition-2021
13 pages
Ls 5 Big Data Visualization
No ratings yet
Ls 5 Big Data Visualization
7 pages
Ls 5 - IMP
No ratings yet
Ls 5 - IMP
23 pages
Data Visualization1
No ratings yet
Data Visualization1
5 pages
NewPaper_DataVisualization
No ratings yet
NewPaper_DataVisualization
6 pages
Unit V-Data Visualization
No ratings yet
Unit V-Data Visualization
5 pages
Big Data Research
No ratings yet
Big Data Research
2 pages
Unit-6: Data Visualization and Hadoop
No ratings yet
Unit-6: Data Visualization and Hadoop
96 pages
Data Viz Tools
No ratings yet
Data Viz Tools
6 pages
IMTC634_Data Science_Chapter 8
No ratings yet
IMTC634_Data Science_Chapter 8
24 pages
DATA VISUALIZATION
No ratings yet
DATA VISUALIZATION
17 pages
EIT Project
No ratings yet
EIT Project
16 pages
Vol11Iss1_P4
No ratings yet
Vol11Iss1_P4
7 pages
The Power of Big Data: Transforming Industries and Shaping the Future
From Everand
The Power of Big Data: Transforming Industries and Shaping the Future
Tom Henricksen
No ratings yet
Unit v Da Online.pptx
No ratings yet
Unit v Da Online.pptx
66 pages
Data Visualization Techniques Traditional Data To Big Data
No ratings yet
Data Visualization Techniques Traditional Data To Big Data
23 pages
Data Visualization in The Age of Big Data
No ratings yet
Data Visualization in The Age of Big Data
7 pages
Analysis
No ratings yet
Analysis
53 pages
BDA - UNIT 5
No ratings yet
BDA - UNIT 5
24 pages
Data Visualization
No ratings yet
Data Visualization
5 pages
UNIT 5 BDT.pptx
No ratings yet
UNIT 5 BDT.pptx
132 pages
Does design matter when visualizing Big Data?
No ratings yet
Does design matter when visualizing Big Data?
41 pages
Data Visualization-1
No ratings yet
Data Visualization-1
29 pages
Big Data Visualization
No ratings yet
Big Data Visualization
7 pages
Building and Operating Data Hubs: Using a practical Framework as Toolset
From Everand
Building and Operating Data Hubs: Using a practical Framework as Toolset
Georg Graner
No ratings yet
Cda-u2-Visualization
No ratings yet
Cda-u2-Visualization
39 pages
Data Analysis and Visualization of Sales Data
No ratings yet
Data Analysis and Visualization of Sales Data
6 pages
Business Analytics Notes
No ratings yet
Business Analytics Notes
6 pages
UNIT 5 Data Analytics
No ratings yet
UNIT 5 Data Analytics
20 pages
Visualization
No ratings yet
Visualization
15 pages
Advanced Visualizations
No ratings yet
Advanced Visualizations
32 pages
August 2001/vol. 44, No. 8 COMMUNICATIONS OF THE ACM
No ratings yet
August 2001/vol. 44, No. 8 COMMUNICATIONS OF THE ACM
7 pages
Data Visualization and Hadoop
No ratings yet
Data Visualization and Hadoop
34 pages
Data Visualization
No ratings yet
Data Visualization
23 pages
Degusami
No ratings yet
Degusami
5 pages
Data Visualization Notes
No ratings yet
Data Visualization Notes
4 pages
Big Data: the Revolution That Is Transforming Our Work, Market and World
From Everand
Big Data: the Revolution That Is Transforming Our Work, Market and World
PAT NAKAMOTO
No ratings yet
Big Data Ethics in Research
From Everand
Big Data Ethics in Research
Nicolae Sfetcu
No ratings yet
Big Data Visualization and Analytics: Future Research Challenges and Emerging Applications
No ratings yet
Big Data Visualization and Analytics: Future Research Challenges and Emerging Applications
9 pages
Data Visualization and Statistical Graphics in Big Data Analysis
No ratings yet
Data Visualization and Statistical Graphics in Big Data Analysis
30 pages
Qin (2020). Making Data Visualization More Efficient and Effective
No ratings yet
Qin (2020). Making Data Visualization More Efficient and Effective
25 pages
Data Visualization: Methods, Types, Benefits, and Checklist: March 2019
No ratings yet
Data Visualization: Methods, Types, Benefits, and Checklist: March 2019
10 pages
Cda U2 Visualization
No ratings yet
Cda U2 Visualization
38 pages
Data Visualization Techniques: Dr. D. Koteswara Rao
No ratings yet
Data Visualization Techniques: Dr. D. Koteswara Rao
41 pages
Unit-5 BDA - Data Visualization
No ratings yet
Unit-5 BDA - Data Visualization
19 pages
BDT UNIT - 4 Text Note
No ratings yet
BDT UNIT - 4 Text Note
63 pages
Data Visualization
No ratings yet
Data Visualization
24 pages
20200321152821_DSS_Chapter_SEVEN
No ratings yet
20200321152821_DSS_Chapter_SEVEN
12 pages
Visualization Pipeline
No ratings yet
Visualization Pipeline
41 pages
The Future of Database Management Technologies: Harnessing the Power of Data: Insights and Strategies in Database Management
From Everand
The Future of Database Management Technologies: Harnessing the Power of Data: Insights and Strategies in Database Management
Robert Lewis
No ratings yet
Exploration and Visualization in The Web of Big Linked
No ratings yet
Exploration and Visualization in The Web of Big Linked
8 pages
Data Visualization
No ratings yet
Data Visualization
8 pages
Eds Unit 3
No ratings yet
Eds Unit 3
22 pages
NewPaper DataVisualization
No ratings yet
NewPaper DataVisualization
7 pages
Bda - Rahul Parida
No ratings yet
Bda - Rahul Parida
15 pages
DV Notes Diskha
No ratings yet
DV Notes Diskha
15 pages
Application of Data Visualization in Enterprise Da
No ratings yet
Application of Data Visualization in Enterprise Da
5 pages
Reading and Writing Set 2 Assgn
No ratings yet
Reading and Writing Set 2 Assgn
16 pages
Enterprise Data Science: Smarter Decisions with Big Data
From Everand
Enterprise Data Science: Smarter Decisions with Big Data
Vidhur Gupta
No ratings yet
Data Visualization - Chapter1
No ratings yet
Data Visualization - Chapter1
66 pages
TUSK Interface Documentation-Lothar4.10_HTTP-20240618
No ratings yet
TUSK Interface Documentation-Lothar4.10_HTTP-20240618
31 pages
0300057EN
No ratings yet
0300057EN
64 pages
Amazon API
No ratings yet
Amazon API
9 pages
H13 711 Simulacro
No ratings yet
H13 711 Simulacro
41 pages
Functional Dependency
No ratings yet
Functional Dependency
33 pages
TPS51123RGER Datasheet PDF
No ratings yet
TPS51123RGER Datasheet PDF
33 pages
HW1 Solution
No ratings yet
HW1 Solution
3 pages
Upwork - Resume - Dwi Febrianto
No ratings yet
Upwork - Resume - Dwi Febrianto
2 pages
Chapter 1 Introduction to Styles Class 10 IT 402-WITH NCERT
No ratings yet
Chapter 1 Introduction to Styles Class 10 IT 402-WITH NCERT
3 pages
VPN - NAT (Medjili Mouhamed Naime)
No ratings yet
VPN - NAT (Medjili Mouhamed Naime)
15 pages
Salesforce Marketing Cloud & Email Automation
No ratings yet
Salesforce Marketing Cloud & Email Automation
12 pages
Asus-Product-Guide-2013-08 09
No ratings yet
Asus-Product-Guide-2013-08 09
23 pages
Freshers Booklet 21 - 22
No ratings yet
Freshers Booklet 21 - 22
109 pages
Rpu-2100R Instructions For Use: Version 1.3 MRN-184-EN
No ratings yet
Rpu-2100R Instructions For Use: Version 1.3 MRN-184-EN
60 pages
Practical Implementation of A Blockchain-Enabled SDN For Large-Scale Infrastructure Networks
No ratings yet
Practical Implementation of A Blockchain-Enabled SDN For Large-Scale Infrastructure Networks
18 pages
Dell 27 Gaming Monitor - S2721Dgf: For More Information, Visit
No ratings yet
Dell 27 Gaming Monitor - S2721Dgf: For More Information, Visit
2 pages
ITAB Lab - MS DOS
No ratings yet
ITAB Lab - MS DOS
69 pages
Redemption Form - Latest PDF
No ratings yet
Redemption Form - Latest PDF
1 page
Computer Applications - Em - Key Answer
No ratings yet
Computer Applications - Em - Key Answer
4 pages
IT-PR-010 Change Management Process F
No ratings yet
IT-PR-010 Change Management Process F
13 pages
Help - EFIIQ - Nov22 1 PDF (Single Chapter) En-Us
No ratings yet
Help - EFIIQ - Nov22 1 PDF (Single Chapter) En-Us
62 pages
BGP-4 Case Studies: Nenad Krajnovic
No ratings yet
BGP-4 Case Studies: Nenad Krajnovic
60 pages
New Thesis Ref 3
No ratings yet
New Thesis Ref 3
6 pages
Especificaciones Tecnicas Transponder 260SCX2
No ratings yet
Especificaciones Tecnicas Transponder 260SCX2
7 pages
Uipath - Uipath-Ardv1.V2021-01-22.Q52: Leave A Reply
No ratings yet
Uipath - Uipath-Ardv1.V2021-01-22.Q52: Leave A Reply
15 pages
Kaspersky Virus Removal Tool 2010
No ratings yet
Kaspersky Virus Removal Tool 2010
9 pages
Creating Modern-Looking Userform Controls in VBA
No ratings yet
Creating Modern-Looking Userform Controls in VBA
10 pages
EE421/Lab 03 Visualization and Programming: - O + X S D V P N
No ratings yet
EE421/Lab 03 Visualization and Programming: - O + X S D V P N
4 pages
Hazem Mohamed Abdallah CV
No ratings yet
Hazem Mohamed Abdallah CV
5 pages

Big_Data_Visualization_Tools

Uploaded by

Big_Data_Visualization_Tools

Uploaded by

Big Data Visualization Tools *

Visualization in Big Data Era

Traditional Visualization Systems. Most traditional visualization systems perform well

Key Research Findings

Progressive Data Visualization. Visual data exploration requires real-time system’s

Visual-Oriented Data Structures. Numerous data visualization systems have been

Caching and Prefetching. Recall that, in exploration scenarios, a sequence of operations is

Future Directions for Research

You might also like