A Review Report on Big Data Analytics
A Review Report on Big Data Analytics
Abstract—The emergence of modern information systems, The first step towards developing a big data software
social media platforms, Internet of Things (IoT) devices, web system is to understand the big data, and the technologies and
and mobile applications, and other new technologies have terminologies associated with it. Hence, this paper first
resulted in the generation of huge amounts of data. Big data presents an overview of different published literature and
refers to extensive and diverse datasets that are generated from surveys related to big data technologies and terminologies [2],
a wide range of sources and pose challenges in processing and and [6]. Once a foundational understanding of big data is
analyzing using traditional data tools. Software systems that are established, further review is done on published literature
designed to handle large and complex datasets are known as big
related to big data software engineering and architecture for
data software systems. These are a collection of tools,
developing big data systems [1], [3], [5], [8], [9], and [12].
technologies, and platforms designed to handle, process,
analyze, and extract insights from big data. This research paper Big data processing, analysis, and utilization are a few of
presents a comprehensive review of various aspects of big data the most important aspects of big data research. Published
software engineering, development tools, literature reviews, literature related to this area of big data research has also been
techniques, and terminologies. It offers valuable insights into the reviewed in this research paper [4], [7], [10], and [11].
significant areas of big data that have undergone extensive
research in recent years. By providing a thorough This paper is assembled as follows: Section II presents the
understanding of big data software systems, the paper serves as research work that was collected and analyzed for the
a valuable resource for researchers venturing into the realm of literature review. Section III provides the findings from
big data and seeking potential avenues for future research. numerous research papers and discusses them in the context
of big data software engineering. Finally, section IV presents
Keywords—Big data, Big data software engineering, Big data the conclusions, the limitations of the current research work,
analytics, Big data software development. and ideas for future research.
I. INTRODUCTION II. RELATED WORK
The term big data refers to large, complex, diverse, and To conduct a comprehensive literature review, a diverse
difficult-to-process datasets that are generated from a wide collection of research publications covering various aspects of
range of sources including social media platforms, Internet of big data, including analytics, software engineering,
Things (IoT) devices, web and mobile applications, machine development tools, techniques, terminologies, and literature
and sensor data, transactional data, government and public reviews, was gathered. These publications were sourced from
data, scientific and research data, multimedia data and other reputable online research databases, with a specific focus on
heterogeneous data sources [2]. Because of its complexity, selecting journals indexed in either Science Citation Index
this type of data is difficult to process with traditional data Expanded (SCIE) or Emerging Sources Citation Index (ESCI)
management tools or data processing applications [4]. for their rigorous peer-review process and academic authority.
Today’s digital world is also called the era of big data [5]. The
emergence of new technologies and advancements in data Table I shows the list of 12 papers that were collected and
collection methods have generated a lot of data from a wide presents them in ascending order by the year of publication.
range of data sources. All of this data needs to be stored, For each paper, the area of research work is presented along
processed, and analyzed for various reasons. with a brief description of the research area and the proposed
approach provided in the paper.
Big data stores a lot of valuable information. From this
data, valuable insights and patterns can be generated. This TABLE I. RELATED WORK IN BIG DATA
information is helpful in business and its careful examination
is a crucial factor for staying ahead of the competition. Paper Year Research Area
Proposed
However, data management and analysis of big data present Approach
serious challenges and require significant resources, new
Gorton and Klein Big data software
methods, and powerful technologies [2]. Likewise, [8]
2015
development
Case study
developing big data software systems requires scalable Big data
architecture, and hence software requirement engineering for Acharjya and challenges, open Narrative literature
2016
developing big data systems must consider pervasive Ahmed [4] research issues, review
distribution, variable request loads, computation-intensive and tools
analytics, high availability, and sustainability [9]. Big data software Narrative literature
Gorton et al. [9] 2016
engineering review
In this paper, several publications are reviewed to provide Big data Fuzzy set
Wang et al. [7] 2017
an overview of the research work done in the field of big data processing techniques
Big data software
analysis and software engineering. The surveyed publications Osvaldo et al. [12] 2017
development
MapReduce model
include published literature from 2015 to 2023. Big data Systematic
Heidari et al. [11] 2018
processing literature review
Proposed
Paper Year Research Area
Approach
Structured
Oussous et al. [2] 2018
Big data Narrative literature Variety
technologies review
Latent Dirichlet
Unstructured
Gurcan and Big data software Allocation (LDA)