0% found this document useful (0 votes)
9 views8 pages

Literaturesurvey

ghfdsjah

Uploaded by

deshmukhneha833
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views8 pages

Literaturesurvey

ghfdsjah

Uploaded by

deshmukhneha833
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

SL.

PROJECT TITLE AUTHOR PUBLISHED DATASETS


NO YEAR
1. Yoga Pose  Shruti Kothari May 2020 It consists of videos of 6 yoga poses
Classification Using performed by 15 different
Deep Learning individuals (5 females and 10
males). The 6 yoga poses namely are
– Bhujangasana (Cobra pose),
Padmasana (Lotus pose), Shavasana
(Corpse pose), Tadasana (Mountain
pose), Trikonasana (Triangle pose)
and Vrikshasana (Tree pose). The
total number of videos is 88 with a
duration of 1 hour 6 minutes and 5
seconds.
https://archive.org/details/YogaVidC
ollected.
2. YoNet: A Neural  Faisal Bin 8 February 2023 The Yoga-82 dataset was collected
Network for Yoga Ashraf from the web using the bing search
Pose Classification  Muhammad engine which contain 5 yoga poses
Usama Islam for classification– AdhoMukha,
 Md Rayhan Sukhasana, Tadasana,
Kabir Virabhadrasana i and
 Jasim Uddin. Virabhadrasana ii
3. Novel deep learning  Amira Samy 17 November 2023 The dataset used in this research
models for yoga Talaat paper is called “Yoga Pose
pose estimator Classification,“ sourced from
Kaggle. The dataset comprises of
839 images that have been classified
into the categories (Downdog,
Goddess, Plank, Tree, Warrior2)
with classes 0, 1, 2, 3, 4
correspondingly. The proposed
framework has 70% of the images as
training data, 20% as validation data,
and 10% as testing data.
4. Yoga pose  Shubham Garg 3 June 2022 The dataset contain images of 5
classification: a  Aman Saxena different yoga asanas performed by
CNN and MediaPipe  Richa Gupta distinct individuals. The five yoga
inspired deep postures are the down-dog pose (320
learning approach images), the goddess pose (260
for real‑world images), the plank pose (381
application. images), the warrior pose (361
images), and the tree pose (229
images).
5. Yoga Pose Detection  Deepak Kumar 22 November 2020 The dataset comprises of recordings
and Classification  Anurag Sinha of 6 yoga presents performed by 15
Using Deep distinct people (5 females and 10
Learning guys). The 6 yoga presents
specifically are – Bhujangasana
(Cobra present), Padmasana (Lotus
present), Shavasana (Corpse
present), Tadasana (Mountain
present), Trikonasana (Triangle
posture) and Vrikshasana (Tree
present). The all out number of
recordings is 88 with a term of 1
hour 6 minutes and 5 seconds.
6. Exploration of deep  Sumeet Saurav 08 March 2024 The dataset contain six yoga poses,
learning  Prashant Gidde namely Bhujangasana, Padmasana,
architectures for  Sanjay Singh Shavasana, Tadasana, Trikonasana,
real-time yoga and Vrikshasana
pose recognition
7. Real‑time yoga pose  Ratnesh Prasad 25 September The data is collected from a RGB
classification with Srivastava 2023 camera with each frame having a
3‑D pose estimation  Lokendra Singh minimum height of 360 px and
model with LSTM Umrao minimum width of 640 px for an
 Ramjeet Singh aspect ratio of 16:9. The dataset
Yadav consists of 10 different yoga poses
namely Mountain Pose (Tadasana),
Garland Pose (Malasana), Happy
baby pose (Ananda Balasana), Head-
to-knee forward bend (Janu
Sirsasana), Low lunge
(Anjaneyasana), Seated Forward
Fold (Paschimottanasana), Plank
pose (Kumbhakasana), Rasied arm
pose(Hasta Uttanasana), Staff pose
(Dandasana) and Standing forward
bend (Uttanasana) with 30 videos
each and having 30 sequences for
every video.
8. Three-dimensional  Shrajal Jain 9 October 2020 A Yoga pose dataset was created
CNN-inspired deep  Aditya Rustagi with the participation of 27
learning architecture  Sumeet Saurav individual (8 males and 19 females),
for Yoga pose  Ravi Saini which consists of ten Yoga poses,
recognition in the  Sanjay Singh namely Malasana, Ananda Balasana,
real-world Janu Sirsasana, Anjaneyasana,
environment Tadasana, Kumbhakasana, Hasta
Uttanasana, Paschimottanasana,
Uttanasana, and Dandasana.
9. Real-time Yoga  Santosh Kumar 20 May 2019 A dataset of six Yoga asanas (i.e.
recognition using Yadav Bhujangasana, Padmasana,
deep learning  Amitojdeep Shavasana, Tadasana, Trikonasana,
Singh and Vrikshasana) has been created
 Abhishek Gupta using 15 individuals (ten males and
 Jagdish Lal five females)
Raheja
10. YAP_LSTM: yoga  J. Palanimeera 28 August 2023 A dataset is collected from 10 people
asana prediction  K. Ponmozhi1 (5 men, 5 women) performing every
using pose one of these ten asanas.
estimation and long
short-term memory
11. Segmentation  Nagalakshmi 27 July 2023 Dataset-1: Yoga-82 dataset: This
quality assessment Vallabhaneni dataset (Yoga-82 2022; Yoga-82 rar
network-based  Panneer 2022) is formulated for recognizing
object detection Prabhavathy the yoga poses that consist of 82
and optimized CNN classes. This dataset consists of 82
with transfer categories and 9949 images.
learning for yoga Dataset-2: 107 Yoga asanas dataset:
pose classification This dataset is a yoga pose (Yoga
for health care Asanas 2022; Yogapose 2022)
image classification dataset, and it is
used for recognizing the yoga poses,
which consists of 107 classes with
5996 total images.
Dataset-3: Combined dataset: The
third dataset used here is the
combined one, which comprises the
above specified two datasets. In the
combined dataset, there are 189
classes, a total of 15,945 images.
12. Yoga pose  Utkarsh 12 December 2021 Open-source data containing 6
classification and Bahukhandi different yoga poses videos
detection using  Dr. Shikha performed by 15 different
machine learning Gupta volunteers.
technique.
Sl. No PREPROCESSING TECHNIQUE CLASSIFICATION ACCURACY
ALGORITHM
1. Extracting key points of poses in video CNN and LSTM model  Train accuracy:
frames using the OpenPose library. on OpenPose data 0.9992
 Validation accuracy:
0.9987
 Test accuracy: 0.9938
2. YoNet Architecture 94.91%
3. LGDeep Model 100%
4. the preprocessing of the dataset begins YogaConvo2d  Train accuracy:
with extracting the landmarks on the architecture 99.35%
human body in the frame using the  Validation accuracy:
MediaPipe library. 99.62%
 Test accuracy:
97.09%
5. key points of poses in video frames are CNN and LSTM  Train exactness:
extracted by using the OpenPose library. 0.9878
 Validation precision:
0.9921
 Test precision: 0.9858
6. CNN and LSTM model 99.65%
3DCNN
7. skeleton key points extraction LSTM classifier 92.34%
8. 3D CNN model  The in-house Yoga
pose dataset,-91.15%
 the publicly available
six-pose Yoga
dataset-99.39%

9. The positions of 18 key points tracked by CNN and LSTM Yoga poses in a video -
the Open- Pose, i.e. ears, eyes, nose, 99.04%
neck, shoulders, hips, knees, ankles, Real time-98.92%
elbows, and wrists
10. LSTM 99.2%
11. RLA-based CNN+TL Dataset 1- 0.993
Dataset 2- 0.944
Dataset 3- 0.928
12. Normalization Logistic Regression 94%
classifier
Sl. CONCLUSION FUTURE WORK
NO
1. Deep learning methods are promising because of the The dataset can be expanded my adding more
vast research being done in this field. The use of yoga poses performed by individuals not only
hybrid CNN and LSTM model on OpenPose data is in indoor setting but also outdoor. The
seen to be highly effective and classifies all the 6 yoga performance of the models depends upon the
poses perfectly. A basic CNN and SVM also perform quality of OpenPose pose estimation which
well beyond our expectations. Performance of SVM may not perform well in cases of overlap
proves that ML algorithms can also be used for pose between people or overlap between body parts.
estimation or activity recognition problems. Also, A portable device for self-training and real-
SVM is much lighter and less complex when time predictions can be implemented for this
compared to a neural network and requires less system. This work demonstrates activity
training time. recognition for practical applications. An
approach comparable to this can be utilized for
pose recognition in tasks such as sports,
surveillance, healthcare etc. Multi-person pose
estimation is a whole new problem in itself and
has a lot of scope for research. There are a lot
of scenarios where single person pose
estimation would not suffice, for example pose
estimation in crowded scenarios would have
multiple persons which will involve tracking
and identifying pose of each individual.
2. In this work, we have proposed a novel neural More poses can be considered even with our
network architecture, YoNet, to recognize five proposed architecture due to its strategy of
common yoga poses after having a thorough extracting features. Future research work also
discussion on current related works. The intuition of includes better performance through hyper-
our architecture is to extract the spatial and depth parameter tuning.
features from the image separately and use both types
of features for recognition. It gives our architecture an
advantage to differentiate better among the poses as
hypothesized in our methodology and proven through
result analysis and comparison carried out in our
research work.

3. The paper presented and compared four innovative Future research and development include may
yoga pose recognition models. LGDeep is the best include dataset expansion by extending the
yoga posture classification model, using deep transfer dataset to incorporate more postures, variants,
learning and ensemble techniques. LGDeep’s method and body types which can improve the model’s
achieved 100% classification accuracy, exceeding generalizability and robustness. To increase
previous similar studies and models. LGDeep’s accessibility, the Yoga posture recognition
specificity and sensitivity exceed those of other system might include a user-friendly interface.
techniques, proving its usefulness. The LGDeep Users may simply interact with the system,
model’s dependability and accuracy make it highly visualize their positions, and track their
suitable candidate for a yoga position recognition progress over time.
system. The recommended technique might improve
yoga practitioners’ health and safety due to its strong
classification capabilities.
4. The yoga posture evaluation system could help re- The proposed model can be modified to work
popularize asanas while also performing each asana for a video dataset or real-time feed. Three-
correctly. To accomplish this task, deep learning and dimensional convolutional neural networks can
AI techniques are promising and have a lot of also be explored for yoga asana detection and
potential. The employment of a convolutional neural help achieve even better results. Body posture
network on key points determined with MediaPipe tracking and classification can be used in
was found to be quite effective for this purpose, training robots, health care, surveillance,
accurately classifying all five yoga asanas. This work sports, motion capture, motion tracking,
also attempts to solve the many obstacles and consoles, augmented reality, etc. There is still a
restrictions in current state-of-the-art procedures. lot of untapped potential and research that can
be done in human posture detection.
5. Human posture assessment has been concentrated The proposed models right now characterize
widely over the previous years. When contrasted with just 6 yoga asanas. There are various yoga
other PC vision issues, human posture assessment is asanas, and subsequently making a posture
distinctive as it needs to limit and amass human body assessment model that can be effective for all
parts based on an effectively characterized structure of the asanas is a testing issue. The dataset can be
the human body. Use of posture assessment in extended my adding more yoga presents
wellness and sports can help forestall wounds and performed by people in indoor setting as well
improve the execution of individuals’ exercise. Yoga as open air.
self-guidance frameworks convey the potential to
make yoga famous alongside ensuring it is acted in the
correct way. Profound learning techniques are
promising a result of the huge exploration being done
in this field. The utilization of mixture CNN and
LSTM model on OpenPose information apparently is
profoundly successful and arranges all the 6 yoga
presents impeccably.
6 To provide a portable embedded solution for the real Our future research direction will be to develop
world deployment of the developed yoga pose deep learning-based models to identify the
recognition system, we optimized the designed abnormalities in the yoga poses and
lightweight 3DCNN Model3 using the TensorRT SDK incorporate feedback techniques for posture
and deployed it on the Nvidia Xavier embedded board correction. We also plan to explore the use of
for real-time inference. One can deploy such a system dual-stream neural networks combining body
in real-world conditions or integrate it with a self- pose (extracted from body keypoints) and
training yoga system to recognize different poses. spatial information (obtained from the RGB
frames) for the task of yoga pose recognition.
Finally, we intend to study skeleton-based
yoga pose recognition, combining skeleton
features and advanced neural network
techniques like graph convolutional networks
(GCNs), transformers, etc.
7. The model proposed was able to successfully classify In future, the skeleton points data and a CNN
the yoga poses with an average accuracy of 92.34%. can be combined with the image data to
This study aimed at making a model which can be overcome the problems like strong
used as a Yoga coach and is easily accessible to the articulations, small and barely visible joints,
common people for living a healthy and stress-free occlusions, and clothing with an ensemble
lifestyle. Real-time computing of features, as well as technique to make a more robust system. The
categorization, are key components of this proposed limitation of the proposed model can be
approach. computationally expensive to train and require
more computational resources compared to
other types of neural networks.
8. In this paper, we presented a vision-based system for In future work, more poses and video clips can
the recognition of Yoga poses in real time, which be added to the database to enhance its
intends to overcome the limitations of the existing usability. Additionally, the fusion of geometric
state-of-the-art technique. To this end, we first build a and spatial–temporal features from the Yoga
large-scale dataset of ten Yoga poses captured in pose sequences can be explored to enhance the
complex real-world environments so that our designed recognition accuracy of the designed system.
system could deliver better performance when The designed system can also be optimized and
deployed in real-world conditions. Secondly, we realized on portable embedded devices for the
proposed a lightweight 3D CNN architecture that recognition of the Yoga poses in real time.
exploits the inherent spatial–temporal relationship
among the Yoga poses for their recognition. On the
test set of the in-house Yoga pose dataset, the
proposed 3D CNN model achieved recognition
accuracy of 91.15% along with average precision,
average recall, and average F1-score of 0.91. Besides,
on the publicly available six-pose Yoga dataset, our
proposed model achieved competitive recognition
accuracy of 99.39% along with average precision,
average recall, and average F1-score of 0.99.
9. In this paper, we proposed a Yoga identification In future work, more asanas and a larger
system using a traditional RGB camera. The dataset is dataset comprising of both image and videos
collected using HD 1080p Logitech webcam for 15 can be included. Also, the system can be
individuals (ten males and five females) and made implemented on a portable device for real-time
publicly available. OpenPose is used to capture the predictions and self-training. This work serves
user and detect keypoints. The end-to-end deep as a demonstration of activity recognition
learning-based framework eliminates the need for systems for realistic applications. A similar
making handcrafted features allowing for the addition approach can be used for posture recognition in
of new asanas by just retraining the model with new various tasks like surveillance, sports,
data. We applied the time-distributed CNN layer to healthcare, image classification, etc.
detect patterns between keypoints in a single frame
and the LSTM to memorize the patterns found in the
recent frames. Using LSTM for the memory of
previous frames and polling for denoising, the results
make the system even more robust by minimizing the
error due to false keypoint detection.
10. This paper presents a yoga recognition method Extra yoga pose and large data with images
utilizing a conventional RGB camera in this study. and films may be integrated into destiny
The data were obtained for ten individuals (five men improvement and examine more models with
and five females) using an HD 1080p RGB camera yoga asana data.
and made publicly available. To find important
locations, pose estimation is employed. The end-to-
end deep-learning-based architecture stops the
requirement for hand-crafted functions, allowing the
model to be retrained with new data to include new
asana. The fundamental elements of the LSTM were
used in this study to help participants remember the
patterns seen in recent frames. The outcomes make the
device even more resilient with the aid of lowering the
mistake because the office key—factor identifies with
the aid of using LSTM for earlier body memory and
polling for denoising.
11. This article aims to establish a systematic model for The future evolvement would be the inclusion
yoga pose categorization by exploiting designed RLA- of some other hybrid optimization algorithms
based CNN with TL. Human object detection is to attain superior accuracy in terms of pose
implemented using SQA, where the network is trained classification.
to utilize RLA. The segmented result is applied to the
feature extraction module, where features including
SLBT, LTP, HLoG, and hierarchical skeleton features
are refined. Finally, yoga pose classification is
performed using CNN with TL.
12. In this study, a yoga pose classifier was successfully
developed which works perfectly on images, static
video, and live video of any user. The study starts
from environment creation and proceeds with data
collections from open data sources. Mediapipe pose
estimation library is used for human pose estimation
which returns body key points, these data points form
the basis of a new dataset. Then data preprocessing
takes place in which target variables are changed.
After this normalization of data occurs for better
performance of machine learning algorithms and
finally feature engineering of features starts where
various joint angles of the body are calculated using
the formula shown in figure 6. As the data is
completely preprocessed data is finally fed to machine
learning models. Evaluation of these models is done
on test data and is compared based on accuracy score.
Logistic regression classifier achieves a maximum
score of 94% among all classifiers. For classification a
threshold value is used which is set at 97% below
which no pose detected is given as output to the user.

You might also like