Poster Garciacarrasco

Uploaded by

zakari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views1 page

Poster Garciacarrasco

Uploaded by

zakari

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 1

A Requirements-driven methodology aligned with the Model-Driven Architecture (MDA) for Data Analytics

over Big Data sources using AI

Jorge García Carrasco, Alejandro Maté, Juan Carlos Trujillo
University of Alicante
Department of Software and Computing Systems
Lucentia Research

Background On the other hand, an additional study regarding image generation via DL models
was also performed. Specifically, the study focused on the use of two types of
With the emergence of Big Data [2], the amount of data available for companies techniques, namely Transfer Learning (TL)[6] and Data Augmentation (DA)[3], and
and individuals has been dramatically increased. Extracting value from such large its effect on the task of image generation via Generative Adversarial Networks
and complex amount of data is not an easy task. This situation has led to an in- (GAN)[1] when training such GAN with an extremely small dataset. Several
creasing interest on the use of Machine Learning (ML) and Deep Learning (DL) examples of images synthesized by the GAN are shown in Fig. 1. The study led to
techniques [5]. However, despite the interest of industry and academia for the use the following conclusions:
of such techniques, there is a lack of methods that ease the capture of require- The use of DA enabled us to train GANs with an extremely low number of
ments and allow to efficiently address the development of an AI-based project. samples (∼ 103 samples) compared to the typical required samples (∼ 105 − 106
samples).
Objectives The use of TL allowed the network to converge much faster, as well to slightly
improve the quality of the results.
The main objective of this thesis is to propose a methodology for developing
AI-based solutions over Big Data sources, that helps on the capture of require- This study shows the potential of GANs when combined with DA and TL for
ments and, parting from these, derives a semi-automatic implementation, thus image generation with very small training datasets. Therefore, the use of these
reducing the cost and error rate of AI-based projects. techniques can be extremely useful in areas of application where the availability
of data is limited, such as in the medical field. The paper has been sent to a
conference, and the complete results will be soon available.
Preliminary Results

When developing an AI-based project, one of the most crucial stages, and where
most of the time and effort is spent, is the preprocessing stage. Depending on the
preprocessing of the data, the performance of an AI model can drastically change
[4]. Therefore, the first part of the thesis will be focused on the preprocessing
part of the methodology, specifically, on the use of Feature Engineering (FE) tech-
niques for the diagnosis and prognosis of mental diseases via the recording of EEG
brain signals. The use of ML and DL models has become really popular in this field, Figure 1. Four samples synthesized by a GAN trained to generate images of glass façades.
however, it is essential to apply previous preprocessing FE techniques to the data,
as EEG are noisy and non-stationary signals. In other words, a proper choice of
FE techniques could greatly improve the performance depending on the algorithm
and the mental disorder. Conclusions and future work
This motivated us to perform a Systematic Mapping Study (SMS), where more than
900 articles were covered, with the objective of showing a clear overview of which The work done up to now acted as a preliminary step which allowed us to gain
FE and AI techniques have been applied to each mental disorder. This paper is in knowledge related to different techniques which are essential when developing
the revision stage and will be available in the future, but partial results are shown in AI-based projects. Therefore, the next step of the thesis will be to apply the gained
Fig. 2 via a bubble plot. This type of plots provide a clear overview of the amount knowledge into implementing the actual methodology, specifically, the part related
of work that has been performed regarding a combination of mental disorder, and with the processing of data before feeding it to the AI model.
feature transformation techniques, for example.

Figure 2. Bubble plot which shows the number of works related to each combination of feature transformation and mental disease. Note that the plot is cropped in order to fit in the poster.

References
[1] Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, and Yoshua Bengio. Generative adversarial nets. Advances in neural information processing systems, 27, 2014.
[2] J Hurwitz, Alan Nugent, Fern Halper, and Marcia Kaufman. Big data. New York, 2013.
[3] Tero Karras, Miika Aittala, Janne Hellsten, Samuli Laine, Jaakko Lehtinen, and Timo Aila. Training generative adversarial networks with limited data. Advances in Neural Information Processing Systems, 33:12104–12114, 2020.
[4] Andreas Vogelsang and Markus Borg. Requirements engineering for machine learning: Perspectives from data scientists. In 2019 IEEE 27th International Requirements Engineering Conference Workshops (REW), pages 245–251. IEEE, 2019.
[5] Qingchen Zhang, Laurence T Yang, Zhikui Chen, and Peng Li. A survey on deep learning for big data. Information Fusion, 42:146–157, 2018.
[6] F Zhuang, Z Qi, K Duan, D Xi, Y Zhu, H Zhu, H Xiong, and Q He. A comprehensive survey on transfer learning. arxiv. arXiv preprint arXiv:1911.02685, 2020.

Acknowledgements

This work has been co-funded by the AETHER-UA project (PID2020-112540RB-C43), a smart data holistic approach for context-aware data analytics: smarter machine
learning for business modelling and analytics, funded by Spanish Ministry of Science and Innovation. And the BALLADEER (PROMETEO/2021/088) project, a Big Data
analytical platform for the diagnosis and treatment of Attention Deficit Hyperactivity Disorder (ADHD) featuring extended reality, funded by the Conselleria de Innovación,
Universidades, Ciencia y Sociedad Digital (Generalitat Valenciana).

Jornada de Doctorado en Informática (JDI) 2022 [email protected]

18 Image Generation Using Gan's
No ratings yet
18 Image Generation Using Gan's
5 pages
Lean101 Train The Trainer Slides tcm36-68577
100% (1)
Lean101 Train The Trainer Slides tcm36-68577
66 pages
Exploring The Role of Generative Adversarial Networks (Gans) and Generative Ai For Synthetic Data Generation and Augmentation in Machine Learning
No ratings yet
Exploring The Role of Generative Adversarial Networks (Gans) and Generative Ai For Synthetic Data Generation and Augmentation in Machine Learning
8 pages
Predictive Maintenance of Electromechanical Systems Based On Enhanced Generative Adversarial Neural Network With Convolutional Neural Network
No ratings yet
Predictive Maintenance of Electromechanical Systems Based On Enhanced Generative Adversarial Neural Network With Convolutional Neural Network
9 pages
Neural Differential Equations: A Comprehensive Review and Applications
No ratings yet
Neural Differential Equations: A Comprehensive Review and Applications
14 pages
A Simplified Generative Model Based On Gradient Descent and Mean Square Error
No ratings yet
A Simplified Generative Model Based On Gradient Descent and Mean Square Error
8 pages
A Review On Deep Learning Approaches To Image Classification and Object Segmentation 1
No ratings yet
A Review On Deep Learning Approaches To Image Classification and Object Segmentation 1
23 pages
Generative Adversarial Networks (Gans) : An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments
No ratings yet
Generative Adversarial Networks (Gans) : An Overview of Theoretical Model, Evaluation Metrics, and Recent Developments
17 pages
Deep Learning Architectures Enabling Sophisticated Feature Extraction and Representation For Complex Data Analysis
No ratings yet
Deep Learning Architectures Enabling Sophisticated Feature Extraction and Representation For Complex Data Analysis
11 pages
Deep Learning Applications and Image Processing
No ratings yet
Deep Learning Applications and Image Processing
5 pages
Examen
100% (1)
Examen
4 pages
CH 8
No ratings yet
CH 8
42 pages
Self-Supervised Pretext Tasks - AD
No ratings yet
Self-Supervised Pretext Tasks - AD
9 pages
Ai in Ee
No ratings yet
Ai in Ee
9 pages
Background and Literature Review
No ratings yet
Background and Literature Review
7 pages
Liu Hu Report
No ratings yet
Liu Hu Report
6 pages
Ai 05 00035
No ratings yet
Ai 05 00035
19 pages
Deep L Earning
No ratings yet
Deep L Earning
7 pages
Machine Learning Papers Report
No ratings yet
Machine Learning Papers Report
5 pages
Assignment 2 - Santosh Soni
No ratings yet
Assignment 2 - Santosh Soni
3 pages
Background and Literature Review
No ratings yet
Background and Literature Review
17 pages
Three Reasons That You Should NOT Use Deep Learning - by George Seif - Towards Data Science
No ratings yet
Three Reasons That You Should NOT Use Deep Learning - by George Seif - Towards Data Science
1 page
Mathematics: Survey On Synthetic Data Generation, Evaluation Methods and Gans
No ratings yet
Mathematics: Survey On Synthetic Data Generation, Evaluation Methods and Gans
41 pages
Application of Data Augmentation On Deep Learning
No ratings yet
Application of Data Augmentation On Deep Learning
13 pages
Berrahal 2020
No ratings yet
Berrahal 2020
8 pages
3 A Review On Machine Learning and It's Application
No ratings yet
3 A Review On Machine Learning and It's Application
5 pages
2022 - A Survey On Deep Learning For Software Engineering
No ratings yet
2022 - A Survey On Deep Learning For Software Engineering
73 pages
Data Augmentation Techniques in Time Series Domain: A Survey and Taxonomy
No ratings yet
Data Augmentation Techniques in Time Series Domain: A Survey and Taxonomy
25 pages
A Comprehensive Survey of Image Generation Models Based On Deep Learning
No ratings yet
A Comprehensive Survey of Image Generation Models Based On Deep Learning
30 pages
Controllable Data Generation by Deep Learning: A Review
No ratings yet
Controllable Data Generation by Deep Learning: A Review
55 pages
A Review of Generative Adversarial Networks For Computer Vision TasksElectronics Switzerland
No ratings yet
A Review of Generative Adversarial Networks For Computer Vision TasksElectronics Switzerland
17 pages
Neural Networks
No ratings yet
Neural Networks
23 pages
New Smart Face Generation
No ratings yet
New Smart Face Generation
9 pages
Controllable Data Generation by Deep Learning: A Review: Shiyu Wang Yuanqi Du Xiaojie Guo Bo Pan Zhaohui Qin Liang Zhao
No ratings yet
Controllable Data Generation by Deep Learning: A Review: Shiyu Wang Yuanqi Du Xiaojie Guo Bo Pan Zhaohui Qin Liang Zhao
38 pages
Wang 18 Domain Adaptation
No ratings yet
Wang 18 Domain Adaptation
20 pages
Deep Learning
No ratings yet
Deep Learning
7 pages
Literature Review Draft 1
No ratings yet
Literature Review Draft 1
22 pages
IEEE Xplore Reference Download 2025.2.9.22.39.11
No ratings yet
IEEE Xplore Reference Download 2025.2.9.22.39.11
3 pages
Exploring The Various Machine Learning Models For Image Generation - A Comprehensive Survey Unlocking The Future of Digital Creativity
No ratings yet
Exploring The Various Machine Learning Models For Image Generation - A Comprehensive Survey Unlocking The Future of Digital Creativity
15 pages
Fault Diagnosis of Reducers Based On Digital Twins and Deep Learning
No ratings yet
Fault Diagnosis of Reducers Based On Digital Twins and Deep Learning
15 pages
Deep Generative Models For Synthetic Dat
No ratings yet
Deep Generative Models For Synthetic Dat
27 pages
"Transfer Learning" For Bridging The Gap Between Data Sciences and The Deep Learning
No ratings yet
"Transfer Learning" For Bridging The Gap Between Data Sciences and The Deep Learning
9 pages
2019 - Data Augmentation Using GANs - Fabio Henrique
No ratings yet
2019 - Data Augmentation Using GANs - Fabio Henrique
16 pages
A Study On Effects of Data Augmentation in Detection
No ratings yet
A Study On Effects of Data Augmentation in Detection
13 pages
Data Augmentation For Performance Prediction in VLSI Circuits
No ratings yet
Data Augmentation For Performance Prediction in VLSI Circuits
14 pages
Whisper PDF
No ratings yet
Whisper PDF
28 pages
Module 5
No ratings yet
Module 5
72 pages
Literature Review Draft 7
No ratings yet
Literature Review Draft 7
35 pages
Applsci 14 05975
No ratings yet
Applsci 14 05975
13 pages
Activation Functions Book
No ratings yet
Activation Functions Book
20 pages
IJRPR23960
No ratings yet
IJRPR23960
6 pages
Ijimai 9 1 16
No ratings yet
Ijimai 9 1 16
36 pages
Digital Twin - Old Wine in A New Bottle
No ratings yet
Digital Twin - Old Wine in A New Bottle
20 pages
‎⁨فصل ثاني اسراء⁩
No ratings yet
‎⁨فصل ثاني اسراء⁩
13 pages
Project Report Final
No ratings yet
Project Report Final
20 pages
Image Recognition and Processing Using Artificial Neural Network
No ratings yet
Image Recognition and Processing Using Artificial Neural Network
10 pages
Group 27 - Creating Art From Existing Images Using Deep Neural Network Models
No ratings yet
Group 27 - Creating Art From Existing Images Using Deep Neural Network Models
93 pages
Data Augmentation For Improving Deep Learning in Image Classification Problem
No ratings yet
Data Augmentation For Improving Deep Learning in Image Classification Problem
7 pages
Img 3
No ratings yet
Img 3
4 pages
EasyChair Preprint 15723
No ratings yet
EasyChair Preprint 15723
10 pages
Book On The Supply Chain Analytics Working
No ratings yet
Book On The Supply Chain Analytics Working
120 pages
Diabetes Data Analysis Using Python Report
No ratings yet
Diabetes Data Analysis Using Python Report
15 pages
65eeb59cc7566 Solutions Manual Digi 240808 034325
No ratings yet
65eeb59cc7566 Solutions Manual Digi 240808 034325
142 pages
Resume Parser Progress
No ratings yet
Resume Parser Progress
11 pages
Normalization and Tokenization in NLP
No ratings yet
Normalization and Tokenization in NLP
10 pages
Lean Manufacturing: Continuous Improvement Program
No ratings yet
Lean Manufacturing: Continuous Improvement Program
16 pages
Brain Tumor Detection
No ratings yet
Brain Tumor Detection
25 pages
Ai Phase 3 Project
No ratings yet
Ai Phase 3 Project
18 pages
Machine Learning With Unstructured Data
No ratings yet
Machine Learning With Unstructured Data
25 pages
Image Forgeryin
No ratings yet
Image Forgeryin
25 pages
Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks
No ratings yet
Speech Bandwidth Expansion Via High Fidelity Generative Adversarial Networks
10 pages
Denoising Convolutional Autoencoders For Noisy Speech Recognition
No ratings yet
Denoising Convolutional Autoencoders For Noisy Speech Recognition
6 pages
IEEE Usa
No ratings yet
IEEE Usa
7 pages
Data Mining Lifecycle
No ratings yet
Data Mining Lifecycle
2 pages
24msp3077 1st Rev
No ratings yet
24msp3077 1st Rev
20 pages
Contrastive and Consistency Learning For Neural Noisy-Channel Model in Spoken Language Understanding
No ratings yet
Contrastive and Consistency Learning For Neural Noisy-Channel Model in Spoken Language Understanding
14 pages
Report On Coral Leaf Stage - 1
No ratings yet
Report On Coral Leaf Stage - 1
25 pages
Big Data Analysis
No ratings yet
Big Data Analysis
33 pages
Article 7
No ratings yet
Article 7
5 pages
Mini Project Final
No ratings yet
Mini Project Final
12 pages
Subjective Answer Evaluation Using NLP
No ratings yet
Subjective Answer Evaluation Using NLP
12 pages
CIT - Sri Aravind - AI&DS
No ratings yet
CIT - Sri Aravind - AI&DS
1 page
Terminology
No ratings yet
Terminology
4 pages
Tailored Data Annotation Resume
No ratings yet
Tailored Data Annotation Resume
3 pages
Lesson 04 Fine-Tuning ChatGPT
No ratings yet
Lesson 04 Fine-Tuning ChatGPT
41 pages
CS322 Lec5 S25
No ratings yet
CS322 Lec5 S25
45 pages
SYNOPSIS
No ratings yet
SYNOPSIS
28 pages
Data Mining
No ratings yet
Data Mining
35 pages
Case Study DFA NFA Text Classification
No ratings yet
Case Study DFA NFA Text Classification
6 pages
Introduction To Brain Tumor Detection
No ratings yet
Introduction To Brain Tumor Detection
10 pages
Rahil Merged
No ratings yet
Rahil Merged
27 pages
Kec Ai Gryffindor Dravidianlangtech Naacl 2025
No ratings yet
Kec Ai Gryffindor Dravidianlangtech Naacl 2025
7 pages
Data Mining Unit-5
No ratings yet
Data Mining Unit-5
6 pages
A Sign Language Convention Using YOLOv5 Updatd
No ratings yet
A Sign Language Convention Using YOLOv5 Updatd
5 pages
Sarvagha K DS
No ratings yet
Sarvagha K DS
1 page
AI for Everyone: An Intermediate Guide to Artificial Intelligence
From Everand
AI for Everyone: An Intermediate Guide to Artificial Intelligence
Nova Clarke
No ratings yet
50 Breakthrough AI Concepts in 500 Words Each: In 500 words, #17
From Everand
50 Breakthrough AI Concepts in 500 Words Each: In 500 words, #17
Nietsnie Trebla
No ratings yet
Data Mining: Concepts, Fundamentals And Applications
From Everand
Data Mining: Concepts, Fundamentals And Applications
Enrico Guardelli
No ratings yet