0% found this document useful (0 votes)

10 views20 pages

arjun1123 (3)

Uploaded by

Tushar Tyagi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

10 views20 pages

arjun1123 (3)

Uploaded by

Tushar Tyagi

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

CHAPTER 1

INTRODUCTION

This project focused on implementing an AI-based object detection system using the
TensorFlow Object Detection API. Object detection, a key area in computer vision, involves
identifying and localizing objects within images by drawing bounding boxes around them. This
capability has wide-ranging applications in industries like autonomous driving, surveillance,
and robotics.

To build this system, we used TensorFlow, an open-source deep learning framework, and its
Object Detection API, which provides pre-trained models and tools for training, evaluation,
and deployment. The process began with setting up the required environment, including
installing TensorFlow and the Object Detection API. For training the model, we used a custom
dataset containing images annotated with bounding boxes and class labels, which were
converted into the TensorFlow-compatible TFRecord format. We also created a label map to
associate class labels with integer IDs.

We selected a pre-trained model from TensorFlow's Model Zoo, specifically the SSD
MobileNet v2, which offers a good balance of speed and accuracy. The model was fine-tuned
on our custom dataset to adapt it to the specific objects we wanted to detect. The configuration
file was updated to reflect the dataset’s paths, class labels, and hyper parameters such as batch
size, learning rate, and number of steps. We used a GPU to speed up the training process and
monitored the model’s performance with TensorBoard.

During training, we used a validation dataset to evaluate the model and prevent overfitting.
Techniques like data augmentation (rotation, flipping, and scaling) helped improve
generalization. After several thousand steps of training, the model achieved a Mean Average
Precision (mAP) of around 70% on the validation set, indicating good accuracy in detecting
and localizing objects.

Once trained, the model was used to perform inference on new images. The inference pipeline
involved feeding an image into the trained model, which then returned the predicted bounding
boxes, class labels, and confidence scores. These results were visualized by drawing bounding
boxes around detected objects on the images.

1
The model performed well with an inference time of approximately 0.5 seconds per image,
making it suitable for real-time applications. While challenges like overfitting and dataset
annotation were addressed, the project successfully demonstrated how TensorFlow’s Object
Detection API can be used to create a high-performance object detection system. Future work
could involve optimizing the model further, deploying it on mobile or edge devices using
TensorFlow Lite, and expanding it for video stream analysis or more complex datasets.

1. Problem Statement

The problem at hand is the need for an efficient and accurate object detection system that can
identify and localize multiple objects within images or video streams. Object detection is
critical in applications such as surveillance, autonomous driving, industrial automation, and
robotics. Traditional computer vision methods rely on manually crafted features and complex
algorithms, which are often not scalable or effective for detecting diverse objects in varying
conditions.

The challenge is to develop a machine learning model that can accurately identify multiple
objects in a single image or video frame, localize them by drawing bounding boxes, and classify
them into predefined categories. The model must be robust across different environments,
lighting conditions, and object sizes, while maintaining high inference speed for real-time
applications. Additionally, the solution should be adaptable to custom datasets, as many
practical use cases require detecting objects that may not be covered by generic models.

This project aims to address these challenges by utilizing the TensorFlow Object Detection
API, fine-tuning pre-trained models on a custom dataset to create an efficient, scalable, and
accurate object detection system.

1.2 Objective

The objective of this project is to develop an AI-powered object detection system using the
TensorFlow Object Detection API, capable of accurately identifying and localizing multiple
objects in images or video frames. The project aims to implement an efficient object detection
pipeline using deep learning models, leveraging TensorFlow’s pre-trained models and the

Object Detection API. The focus is on fine-tuning a pre-trained model on a custom dataset to
detect specific objects, ensuring the model can adapt to unique scenarios and object classes.

2
The project also seeks to evaluate the model's performance using metrics like Mean Average
Precision (mAP), Precision, and Recall, ensuring high detection accuracy and localization
precision. Another key objective is to optimize inference speed for real-time object detection,
making the system suitable for applications such as surveillance, autonomous driving, and
robotics. Additionally, the project will involve visualizing the detection results, including
bounding boxes and class labels, on input images to demonstrate the model's effectiveness.
Finally, the project will explore methods to improve model accuracy, such as data augmentation
and hyperparameter tuning, to handle challenges like varying object sizes, occlusion, and
complex backgrounds. The ultimate goal is to create a scalable and robust object detection
system that can be applied to real-world scenarios and provide a foundation for further
development.

1.2.1 Scope

The scope of this project is focused on developing an object detection system using the
TensorFlow Object Detection API to identify and localize objects in images or video streams.
The project will involve preparing a custom dataset, which includes gathering and annotating
images with bounding boxes and object labels, and converting them into the TFRecord format
for training. A pre-trained model from TensorFlow's Model Zoo, such as SSD MobileNet v2,
will be selected and fine-tuned on the custom dataset to adapt it to the specific objects being
detected.

The model will be trained using TensorFlow, optimized for accuracy, and techniques like data
augmentation will be used to improve generalization and avoid overfitting. The performance
of the model will be evaluated using metrics like Mean Average Precision (mAP), Precision,
and Recall to assess both object detection accuracy and bounding box localization precision.

Real-time inference will be a key focus, optimizing the model for low-latency performance so
that it can detect and localize objects in new images or video streams quickly. Additionally, the
project will include visualizations that display bounding boxes and class labels on the input
images to demonstrate the model's practical capabilities.

While the project does not involve developing a new object detection architecture from scratch,
it will explore fine-tuning existing models and optimizing them for specific use cases. Future
work could extend the system for mobile or edge device deployment with TensorFlow Lite,
support real-time video stream analysis, and work towards improving detection accuracy and

3
speed. The system is designed to be scalable and adaptable for various real-world applications,
such as surveillance or autonomous systems.

1.3 Existing Software

Existing software solutions for object detection include several popular frameworks and
libraries designed for various use cases. The **TensorFlow Object Detection API** is widely
used for building custom models and offers pre-trained architectures like SSD and Faster R-
CNN. YOLO (You Only Look Once)is known for its speed and efficiency, making it ideal for
real-time applications. Detectron2, built on PyTorch, provides state-of-the-art detection models
like Faster R-CNN and Mask R-CNN and is favored for research.

Darknet, the framework behind YOLO, is lightweight and simple to use, while OpenCV
provides traditional computer vision tools and can be integrated with deep learning models for
hybrid solutions. Faster R-CNN offers high accuracy but is slower and more computationally
intensive than YOLO. MMDetection, also built on PyTorch, supports a variety of models and
is popular in research for its scalability and flexibility. Additionally, PyTorch’s Torchvision
and Keras are also used for object detection, with Keras offering high-level abstractions for
model development.

These tools offer different trade-offs in terms of speed, accuracy, and ease of use, and the choice
of framework depends on the specific needs of the project, such as real-time performance,
scalability, or research flexibility.

1.4. Motivation

The motivation for this project stems from the growing demand for intelligent systems capable
of understanding and interpreting visual data in real-time. Object detection is a critical
technology for various industries, including autonomous driving, surveillance, healthcare,
retail, and robotics. As the ability to detect and localize objects in images becomes more
refined, it opens up new possibilities for automation, security, and improved decision-making.

Traditional object detection methods often struggle with scalability, accuracy, and adaptability
to diverse environments. Deep learning-based methods, especially using frameworks like
TensorFlow and pre-trained models, offer significant improvements in terms of speed and
precision, making them suitable for a wide range of real-world applications. By leveraging the

4
TensorFlow Object Detection API, this project aims to harness the power of state-of-the-art
deep learning models, fine-tune them for custom use cases, and create an efficient system that
can be deployed in real-time applications.

The motivation is to explore how modern object detection techniques can be applied to real-
world problems, such as monitoring public spaces, automating vehicle navigation, or
improving product recognition in retail. The ability to automatically identify and track objects
with high accuracy not only reduces human labor but also enhances safety, security, and
efficiency. Moreover, the project provides an opportunity to understand the practical challenges
involved in training, optimizing, and deploying AI-based object detection systems, with
potential for future expansion into more advanced AI applications.

5
CHAPTER 2

BACKGROUND AND RELATED WORK

TABLE 2.1 Comparison of various methodology

Paper Title Author(s) Year Research Gap Limitation

"Faster R-
High computational cost
CNN: Towards Lack of efficiency for
Shaoqing and limited applicability
Real-Time 2021 real-time applications on
Ren et al. to resource-constrained
Object low-resource devices.
environments.
Detection"

Compromised accuracy
"SSD: Single Difficulty in detecting
Wei Liu et for small object detection
Shot Multibox 2023 small objects with high
al. due to reliance on lower
Detector" accuracy.
resolution feature maps.

Joseph Struggles with high- Trade-off between speed

"YOLOv3: An
Redmon, density object scenes and and accuracy, particularly
Incremental 2021
Ali fine-grained for smaller and
Improvement"
Farhadi classifications. overlapping objects.

"EfficientDet: Requires careful

Limited exploration of
Scalable and Mingxing hyperparameter tuning to
2022 scalability in highly
Efficient Object Tan et al. achieve optimal results
dynamic environments.
Detection" across diverse datasets.

Limited generalization to Computational overhead

Kaiming non-standard datasets for instance segmentation
"Mask R-CNN" 2022
He et al. with diverse object shapes reduces efficiency for
and textures. large-scale applications.

Struggles with handling Suboptimal performance

"RetinaNet:
extreme foreground- for objects with
Focal Loss for Tsung-Yi
2023 background class significant occlusion or
Dense Object Lin et al.
imbalances in complex low contrast relative to
Detection"
datasets. the background.

"CenterNet:
Limited evaluation on Reduced performance for
Keypoint
Kaiwen real-time scenarios with very small objects or
Triplets for 2021
Duan et al. highly dynamic object those at the edge of the
Object
movements. image frame.
Detection"

6
Faster R-CNN: Towards Real-Time Object Detection (Shaoqing Ren et al., 2021) Faster
R-CNN introduced a novel approach to object detection by integrating a region proposal
network (RPN) with a Fast R-CNN detector. This innovation significantly improved the speed
and accuracy of detection by eliminating the need for external region proposal methods. Faster
R-CNN achieved state-of-the-art performance on benchmark datasets at the time, marking a
milestone in the field of object detection. The system’s ability to propose regions and classify
objects in an end-to-end manner paved the way for more streamlined and efficient pipelines.
Despite its advancements, Faster R-CNN faces challenges when applied to real-time scenarios,
particularly on resource-constrained devices. Its high computational cost and reliance on
powerful hardware make it unsuitable for applications requiring low latency, such as mobile
devices or embedded systems. Additionally, the approach struggles with detecting small objects
in crowded scenes, limiting its applicability to tasks requiring fine-grained localization.

SSD: Single Shot Multibox Detector (Wei Liu et al., 2023) The SSD model revolutionized
object detection by proposing a single-shot framework that directly predicts object bounding
boxes and class labels without the need for a region proposal step. By using feature maps at
multiple resolutions, SSD is particularly fast and efficient, making it a strong candidate for real-
time applications. It performs well on large-scale datasets and is less computationally intensive
compared to two-stage detectors like Faster R-CNN.
However, SSD exhibits limitations in detecting small objects. The use of lower-resolution
feature maps for prediction reduces the model’s ability to localize and classify smaller objects
accurately. This trade-off between speed and accuracy is a significant drawback, particularly
for applications such as surveillance, where small object detection is crucial.

YOLOv3: An Incremental Improvement (Joseph Redmon, Ali Farhadi, 2021 YOLOv3 is

an incremental improvement to the original YOLO (You Only Look Once) architecture. It
introduced multi-scale predictions and utilized a feature pyramid network to improve the
detection of objects at various scales. YOLOv3 is highly optimized for speed, making it one of
the fastest object detection models available. Its simplicity and ease of deployment in real-
world scenarios have contributed to its popularity in various applications.
Despite its advantages, YOLOv3 struggles in scenes with high object density or overlapping
objects. It also faces challenges in detecting small objects and fine-grained details. While it

7
offers a balance between speed and accuracy, its detection capabilities for small, closely spaced
objects remain inferior to more complex models like Faster R-CNN.

EfficientDet: Scalable and Efficient Object Detection (Mingxing Tan et al., 2022)
EfficientDet employs a compound scaling method that balances resolution, depth, and width,
making it one of the most efficient models for object detection. It uses a bi-directional feature
pyramid network (BiFPN) to enhance feature fusion and improve accuracy across scales. The
model is designed for scalability, allowing it to adapt to a wide range of hardware capabilities
and datasets.
However, EfficientDet requires precise hyperparameter tuning to achieve optimal performance,
which can be challenging and time-consuming. The model's ability to handle dynamic
environments, where objects change rapidly, is not extensively explored. This limitation
restricts its use in highly dynamic scenarios such as autonomous driving.

Mask R-CNN (Kaiming He et al., 2022) Mask R-CNN extended Faster R-CNN by adding a
branch for predicting segmentation masks, enabling instance segmentation along with object
detection. This innovation allows Mask R-CNN to classify and localize objects while also
delineating their shapes, making it particularly useful for applications like medical imaging and
video analysis.
Despite its strengths, Mask R-CNN suffers from computational overhead, which makes it
unsuitable for large-scale or real-time applications. The complexity of the model increases
significantly with the inclusion of the segmentation mask prediction, leading to slower
inference times compared to other object detection models.

RetinaNet: Focal Loss for Dense Object Detection (Tsung-Yi Lin et al., 2021) RetinaNet
introduced the concept of focal loss to address the class imbalance problem in dense object
detection. By focusing on hard-to-classify examples, RetinaNet improved detection accuracy
for objects in challenging settings. It combines the advantages of single-stage detectors (speed)
with the accuracy of two-stage models, making it a versatile solution.
However, RetinaNet’s performance diminishes in scenarios where objects are heavily occluded
or have low contrast against their background. Additionally, its computational requirements are
higher than other single-stage detectors like YOLO, limiting its efficiency for real-time tasks.

8
CenterNet: Keypoint Triplets for Object Detection (Kaiwen Duan et al., 2021) Center Net
uses keypoint detection to predict object centers and bounding box dimensions. By treating
object detection as a keypoint estimation task, it simplifies the pipeline and reduces
computational complexity. CenterNet achieves competitive accuracy while maintaining a
relatively fast inference time, making it suitable for various applications.
However, the model struggles with very small objects or those located at the image's edges.
Additionally, its evaluation on real-time scenarios with highly dynamic object movements is
limited, which restricts its applicability to use cases requiring rapid detection in changing
environments.

9
CHAPTER 3

HARDWARE AND SOFTWARE REQUIREMENTS

3.1 Hardware Requirements

• Multi-core processor (Intel i5/i7 or AMD Ryzen) for data processing.
• NVIDIA GPU (GTX 1060 or higher) with at least 4GB VRAM for accelerated
training and inference.
• 8GB or more, preferably 16GB for handling larger datasets.
• Minimum 50GB of free space, SSD preferred for faster data access.

3.2 Software Requirements

• Operating System: Windows 10/11, macOS, or Linux (Ubuntu recommended).

• Python: Version 3.6 or higher.
• TensorFlow: Version 2.x with the Object Detection API.
• CUDA and cuDNN: For GPU acceleration (if using NVIDIA GPU).
• Libraries: OpenCV (for image processing), NumPy (for numerical operations),
Matplotlib (for visualization).
• Development Environment: Jupyter Notebook, PyCharm, or Visual Studio Code.

3.3 Testing Technology

• Accuracy Metrics: Including Mean Average Precision (mAP), Precision, Recall,

and Intersection over Union (IoU) to measure detection quality.
• Inference Speed: Testing the model's real-time performance by measuring inference
time per image or video frame.
• Cross-Validation: Assessing model generalization by testing on different subsets of
the dataset.
• Visual Testing: Verifying detection results visually by reviewing bounding boxes
and class labels.

10
CHAPTER 4

SDLC METHODOLOGIES

The agile methodology was used. This is because the agile methodology is more adaptable and
can accommodate changes more easily. It is also more user-centric, which is important in this
case because the system is being developed for the users. Agile is an iterative approach to
project management and software development that enables teams to deliver value to customers
faster and with fewer headaches. An agile team delivers work in small but consumable
increments rather than betting everything on a "big bang" launch. Continuous evaluation of
requirements, plans, and results provides teams with a natural mechanism for responding to
change quickly. The following SDLC models are proposed:

1. AVAILABALE SDLC Models

4.1.1 Waterfall Model

The waterfall is a widely used SDLC model. The waterfall model is a continuous software
development model in which development is seen as flowing steadily downwards (like a
waterfall) through the steps of requirements analysis, design, implementation, testing
(validation), integration, and maintenance. To begin, some certification techniques must be
used at the end of each step to identify the end of one phase and the start of the next. Some
verification and validation usually do this by ensuring that the stage's output is consistent with
its input (which is the output of the previous step) and that the stage's output is consistent with
the overall requirements of the system.

Figure 4.1. Waterfall Model

11
4.1.2. RAD Model

The Rapid Application Development (RAD) process is an adaptation of the waterfall model
that aims to develop software in a short period of time. The RAD model is based on the idea
that by using focus groups to gather system requirements, a better system can be developed in
less time.

o Business Modeling

o Data Modeling

o Process Modeling

o Application Generation

o Testing and Turnover

Figure 4.2. RAD Model

4.1.3 Spiral Model

The spiral model is a process model that is risk-driven. This SDLC model assists the group in
implementing elements of one or more process models such as waterfall, incremental, waterfall,
and so on. The spiral technique is a hybrid of rapid prototyping and concurrent design and
development. Each spiral cycle begins with the identification of the cycle's objectives, the
various alternatives for achieving the goals, and the constraints that exist. This is the cycle's
first quadrant (upper-left quadrant).

12
The cycle then proceeds to evaluate these various alternatives in light of the objectives and
constraints. The focus of evaluation in this step is on the project's risk perception.

This step may involve activities such as benchmarking, simulation, and prototyping.

Figure 4.3. Spiral Model

4.1.4 Incremental Model

The incremental model does not stand alone. It must be a series of waterfall cycles. At the start
of the project, the requirements are divided into groups. The SDLC model is used to develop
software for each group. The SDLC process is repeated, with each release introducing new
features until all requirements are met. Each cycle in this method serves as the maintenance
phase for the previous software release. The incremental model has been modified to allow
development cycles to overlap. The following cycle may begin before the previous cycle is
completed.

13
Figure 4.4. Incremental Model

2. Model used in project: Agile Model

Agile methodology is a practice that encourages continuous interaction between development

and testing throughout any project's SDLC process. The Agile method divides the entire project
into small incremental builds. All of these builds are delivered in iterations, with each iteration
lasting between one and three weeks. Any agile software phase is defined in such a

way that it addresses several key assumptions about the majority of software projects:

●It's difficult to predict which software requirements will remain constant and which will
change. It is also difficult to predict how user priorities will shift as the project
progresses.

●Many types of software require both design and development. That is, both activities
should be carried out concurrently so that design models can be validated as they
are developed. It's difficult to imagine how much design work is required before
construction is used to test the configuration.

●Analysis, design, development, and testing are not as predictable as we would like (from
a planning standpoint).

The agile software development emphasises on four core values:

14
1.Individual and team interactions over processes and tools

2.Working software over comprehensive documentation

3.Customer collaboration over contract negotiation

4.Responding to change over following a plan

Figure 4.5. Agile Model

15
CHAPTER 5

APPLICATION ARCHITECTURE
The application architecture of the AI-powered legal documentation assistant is designed to be
scalable, flexible, and secure, enabling efficient processing and management of legal
documents while providing intelligent features such as contract analysis, risk assessment,
compliance checking, and real-time collaboration. The architecture consists of several key
components that interact seamlessly to deliver a comprehensive solution for legal professionals
and organizations.

The user interface serves as the front-end of the system, allowing users to interact with the
assistant. It is designed to be intuitive and user-friendly, catering to both legal professionals
and non-technical users. This layer includes features such as document uploading, contract
editing, risk analysis feedback, and compliance checking. The UI can be accessed through a
web-based platform, ensuring accessibility across devices and locations.

Fig 5.1. Application Architecture

16
1. MVVM
Following are the components of MVVM −

Figure 5.2. MVVM Model

Model classes are non-visual classes that contain the data for the app. As a result, the model
can be thought of as representing the domain model of the app, which typically includes a data
model as well as business and validation logic. Data transfer objects (DTOs), Plain Old CLR
Objects (POCOs), and generated entity and proxy objects are examples of model objects.

Model classes are typically used in conjunction with data access and caching services or
repositories.

View
The view defines the structure, layout, and appearance of what the user sees on screen. Each
view should ideally be defined in XAML, with minimal code-behind that does not contain
business logic. However, the code-behind may contain UI logic that implements visual
behavior that is difficult to express in XAML, such as animations, in some cases.

View Model
The view model implements properties and commands to which the view can data bind, as well
as change notification events that notify the view of any state changes. The view model's
properties and commands define the functionality that the UI will provide, but the view
determines how that functionality will be displayed.

17
2. Phases of Project
The development of an AI-powered legal documentation assistant can be broken down into
several key phases to ensure systematic progress, effective management, and high-quality
results. Below are the phases typically involved in the project:

Phase 1: Gather input from stakeholders, including legal professionals, IT experts, and
potential users. Identify the key features of the system such as document uploading, AI-driven
analysis, clause extraction, compliance checking, and document management.

Phase 2: Create detailed system architecture diagrams that include front-end (UI), back-end
(server-side logic), AI components (NLP, ML models), and data storage mechanisms. Define
data models for storing legal documents, user data, analysis results, etc.

Phase 3: Gather datasets such as legal documents, contracts, clauses, and case laws. Clean and
preprocess the data, including removing noise, normalizing text, and tagging data appropriately
(e.g., annotating clauses or identifying key legal terms).

Phase 4: Choose and implement appropriate NLP and machine learning techniques such as
named entity recognition (NER), document classification, sentiment analysis, and entity
extraction. Train AI models on the prepared datasets to perform tasks like clause identification,
compliance validation, and document summarization.

3. Use case diagram

Figure 5.3. Use Case Diagram

18
4. Class diagram

Figure 5.4. Class Diagram

19
Reference

1. Pachipala, Y., Harika, M., Aakanksha, B., & Kavitha, M. (2022, March). Object detection
using tensorflow.

2. Sharma, A., Mishra, T., Kukade, J., Golwalkar, A., & Tomar, H. (2023, August). Object
Detection Using TensorFlow.

3. Sanchez, S. A., Romero, H. J., & Morales, A. D. (2020, May). A review: Comparison of
performance metrics of pretrained models for object detection using the TensorFlow
framework.

4. Guo, L. (2022). Application of TensorFlow AI Object Detection to Mobile App

Development.

5. Visalatchi, A. R., Navasri, T., Ranjanipriya, P., & Yogamathi, R. (2020, March). Intelligent
Vision with TensorFlow using Neural Network Algorithms.

6. Atienza, R. (2021). Advanced Deep Learning with TensorFlow 2 and Keras: Apply DL,
GANs, VAEs, deep RL, unsupervised learning, object detection and segmentation, and more.

7. Arun, V., Shashikala, B. M., Vani, H. Y., Tapkire, M., Anusuya, M. A., & Lavanya, M. S.
(2024).

8. Dharmadhikari, S. C., Rao, M. B. G., & Chhajed, R. R. (2021). Object detection and data
classification with deep learning.

9. Sudharshan, D. P., & Raj, S. (2022, January). Object recognition in images using
convolutional neural network.

10. Knez, S., & Šajn, L. (2021). Food object recognition using a mobile device: evaluation of
currently implemented systems.

MCQs of Communication skills..KV
88% (16)
MCQs of Communication skills..KV
10 pages
Hiligaynon MFAT PDF
100% (5)
Hiligaynon MFAT PDF
37 pages
Merleau Ponty - Basic Writings
No ratings yet
Merleau Ponty - Basic Writings
5 pages
Arjun Present
No ratings yet
Arjun Present
20 pages
pppr 2 final
No ratings yet
pppr 2 final
37 pages
Object Detection Using Tensorflow....
No ratings yet
Object Detection Using Tensorflow....
9 pages
PART B ETI-1
No ratings yet
PART B ETI-1
7 pages
major project Synopsis
No ratings yet
major project Synopsis
5 pages
index page arjun
No ratings yet
index page arjun
8 pages
ankit synopsis
No ratings yet
ankit synopsis
13 pages
Object Detection Withtensorflow: D. Hari Vamshi V. Raju U. Laxman
No ratings yet
Object Detection Withtensorflow: D. Hari Vamshi V. Raju U. Laxman
25 pages
Object Detection Using TensorFlow
No ratings yet
Object Detection Using TensorFlow
21 pages
Design_of_A_Real-Time_Object_Detection_Prototype_S
No ratings yet
Design_of_A_Real-Time_Object_Detection_Prototype_S
6 pages
Realtime Visual Recognition in Deep Convolutional Neural Networks
No ratings yet
Realtime Visual Recognition in Deep Convolutional Neural Networks
13 pages
Nivetha Me P2 PPT
No ratings yet
Nivetha Me P2 PPT
18 pages
2802 8020 1 PB
No ratings yet
2802 8020 1 PB
3 pages
Object Detection - Deep Learning: Jamia Hamdard
No ratings yet
Object Detection - Deep Learning: Jamia Hamdard
26 pages
Real Time Object Detection in Surveillance Cameras With 2xjeq74wam
No ratings yet
Real Time Object Detection in Surveillance Cameras With 2xjeq74wam
8 pages
Machine Learning - Advanced Concepts
From Everand
Machine Learning - Advanced Concepts
Derrick Mwiti
No ratings yet
Presentation1 FINAL 1
No ratings yet
Presentation1 FINAL 1
11 pages
Synopsisss
No ratings yet
Synopsisss
3 pages
Object Detection Using Deep Learning
No ratings yet
Object Detection Using Deep Learning
45 pages
Minor Project
No ratings yet
Minor Project
21 pages
Final Project Report
No ratings yet
Final Project Report
60 pages
SL. No Chapter Name Page No
No ratings yet
SL. No Chapter Name Page No
8 pages
ankit report
No ratings yet
ankit report
73 pages
Full
No ratings yet
Full
9 pages
Deep Learning for Object Detection - 131124
No ratings yet
Deep Learning for Object Detection - 131124
35 pages
Object Detection
No ratings yet
Object Detection
3 pages
Real-Time Object Detection Using SSD MobileNet Mod
No ratings yet
Real-Time Object Detection Using SSD MobileNet Mod
6 pages
Final Year PPT 1
No ratings yet
Final Year PPT 1
18 pages
Project Report
No ratings yet
Project Report
13 pages
Social Distancing Detection Using Tensorflow
No ratings yet
Social Distancing Detection Using Tensorflow
4 pages
real object detection system using yolov3 images
No ratings yet
real object detection system using yolov3 images
6 pages
final_report
No ratings yet
final_report
22 pages
21BCS1133 - Exp 2.3
No ratings yet
21BCS1133 - Exp 2.3
4 pages
Solar Battery Charger
No ratings yet
Solar Battery Charger
25 pages
Real Time Object Recognition and Classification
No ratings yet
Real Time Object Recognition and Classification
6 pages
AI for Everyone: An Intermediate Guide to Artificial Intelligence
From Everand
AI for Everyone: An Intermediate Guide to Artificial Intelligence
Nova Clarke
No ratings yet
19bce0014 VL2021220702099 Pe003
No ratings yet
19bce0014 VL2021220702099 Pe003
17 pages
A novel model to detect and categorize objects from images by using a hybrid machine learning model
No ratings yet
A novel model to detect and categorize objects from images by using a hybrid machine learning model
13 pages
Computer_Vision_Based_Object_Detection_and_Recognition_System_for_Image_Searching
No ratings yet
Computer_Vision_Based_Object_Detection_and_Recognition_System_for_Image_Searching
4 pages
8 ObectDectection
No ratings yet
8 ObectDectection
60 pages
M10 - Introduction To TensorFlow, Deep Learning and Application
No ratings yet
M10 - Introduction To TensorFlow, Deep Learning and Application
25 pages
Synopsis
No ratings yet
Synopsis
4 pages
conference paper
No ratings yet
conference paper
3 pages
Incremental Training For Image Classification of Unseen Objects
No ratings yet
Incremental Training For Image Classification of Unseen Objects
19 pages
Object Detection and Segmentation On Tensor Flow Using
No ratings yet
Object Detection and Segmentation On Tensor Flow Using
10 pages
Project Detecto!: A Real-Time Object Detection Model
No ratings yet
Project Detecto!: A Real-Time Object Detection Model
3 pages
Computer Vision Report
No ratings yet
Computer Vision Report
21 pages
Real Time Object Detection System
No ratings yet
Real Time Object Detection System
31 pages
Nivetha Me P2 Report
No ratings yet
Nivetha Me P2 Report
86 pages
Comparative analysis of feature descriptors and classifiers for real-time object detection
No ratings yet
Comparative analysis of feature descriptors and classifiers for real-time object detection
11 pages
The Code
No ratings yet
The Code
4 pages
MINI PROJECT SYNOPSIS
No ratings yet
MINI PROJECT SYNOPSIS
6 pages
Object Detection and Recognition Using TensorFlow for Blind People
No ratings yet
Object Detection and Recognition Using TensorFlow for Blind People
6 pages
Second Progress Report UID - 17BCS2127
No ratings yet
Second Progress Report UID - 17BCS2127
13 pages
Mini Project Report
No ratings yet
Mini Project Report
15 pages
MYPPTT
No ratings yet
MYPPTT
19 pages
[2025-AEJ]Object detection in real-time video surveillance using attention based transformer-YOLOv8 model
No ratings yet
[2025-AEJ]Object detection in real-time video surveillance using attention based transformer-YOLOv8 model
14 pages
CAPSTONE_PROJECT
No ratings yet
CAPSTONE_PROJECT
47 pages
Detectron2 in Practice: Definitive Reference for Developers and Engineers
From Everand
Detectron2 in Practice: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
revised
No ratings yet
revised
8 pages
The Big Awesome Book of PDF
No ratings yet
The Big Awesome Book of PDF
1 page
WINSEM2019-20 STS3006 SS VL2019205007330 Reference Material I 11-Jan-2020 Averages
No ratings yet
WINSEM2019-20 STS3006 SS VL2019205007330 Reference Material I 11-Jan-2020 Averages
28 pages
LESSON-PLAN-GRADE-8-DIGESTIVE-SYSTEM with comments from Maam Dejilla-Feb 7 2025
No ratings yet
LESSON-PLAN-GRADE-8-DIGESTIVE-SYSTEM with comments from Maam Dejilla-Feb 7 2025
8 pages
Plasma Cell Thesis
100% (3)
Plasma Cell Thesis
8 pages
Terry_Sejnowski
No ratings yet
Terry_Sejnowski
6 pages
Basic Research - Types, Methods and Examples
No ratings yet
Basic Research - Types, Methods and Examples
20 pages
"Liderando Procesos de Crecimiento Humano": Institución Educativa San Cristóbal
No ratings yet
"Liderando Procesos de Crecimiento Humano": Institución Educativa San Cristóbal
3 pages
Blue Brain Technology: An Overview
No ratings yet
Blue Brain Technology: An Overview
5 pages
Second English Evaluation Uac 2021
No ratings yet
Second English Evaluation Uac 2021
2 pages
Toaz - Info English Code 1 Phonics Book PR
No ratings yet
Toaz - Info English Code 1 Phonics Book PR
51 pages
60503672-MIT
No ratings yet
60503672-MIT
373 pages
Types of Family
No ratings yet
Types of Family
2 pages
Barista # Curriculum
No ratings yet
Barista # Curriculum
75 pages
PPT Global English VF1
No ratings yet
PPT Global English VF1
30 pages
Mayuri Bandsode - Helicopter Parenting
No ratings yet
Mayuri Bandsode - Helicopter Parenting
21 pages
Pooja's Resume
No ratings yet
Pooja's Resume
1 page
Sunshine Grace Madriaga Assessment 2 - Compress
No ratings yet
Sunshine Grace Madriaga Assessment 2 - Compress
26 pages
Gym Dataset
No ratings yet
Gym Dataset
63 pages
Thesis Statement Greek Myth
100% (2)
Thesis Statement Greek Myth
5 pages
Enclosure No. 7 Sample End-of-Day Evaluation Tool
No ratings yet
Enclosure No. 7 Sample End-of-Day Evaluation Tool
3 pages
Identifying Gifted Children and Dyslexia PDF
No ratings yet
Identifying Gifted Children and Dyslexia PDF
7 pages
WBL Bi-Weekly Reflection
No ratings yet
WBL Bi-Weekly Reflection
2 pages
Trocr: Transformer-Based Optical Character Recognition With Pre-Trained Models
No ratings yet
Trocr: Transformer-Based Optical Character Recognition With Pre-Trained Models
10 pages
Alisha Andrew Resume 2023
No ratings yet
Alisha Andrew Resume 2023
2 pages
Diversified Experience in Architecture DEA Logbook Filing
No ratings yet
Diversified Experience in Architecture DEA Logbook Filing
9 pages
2005 Us Army Heavy Construction Equipment Operator Course Phase 2 Scoop Loader Pmcs A01 16p
No ratings yet
2005 Us Army Heavy Construction Equipment Operator Course Phase 2 Scoop Loader Pmcs A01 16p
16 pages

arjun1123 (3)

Uploaded by

arjun1123 (3)

Uploaded by

CHAPTER 1

1.3 Existing Software

BACKGROUND AND RELATED WORK

TABLE 2.1 Comparison of various methodology

Paper Title Author(s) Year Research Gap Limitation

Joseph Struggles with high- Trade-off between speed

"EfficientDet: Requires careful

Limited generalization to Computational overhead

Struggles with handling Suboptimal performance

YOLOv3: An Incremental Improvement (Joseph Redmon, Ali Farhadi, 2021 YOLOv3 is

HARDWARE AND SOFTWARE REQUIREMENTS

3.1 Hardware Requirements

3.2 Software Requirements

• Operating System: Windows 10/11, macOS, or Linux (Ubuntu recommended).

3.3 Testing Technology

• Accuracy Metrics: Including Mean Average Precision (mAP), Precision, Recall,

1. AVAILABALE SDLC Models

4.1.1 Waterfall Model

Figure 4.1. Waterfall Model

o Testing and Turnover

Figure 4.2. RAD Model

4.1.3 Spiral Model

Figure 4.3. Spiral Model

4.1.4 Incremental Model

2. Model used in project: Agile Model

Agile methodology is a practice that encourages continuous interaction between development

The agile software development emphasises on four core values:

2.Working software over comprehensive documentation

3.Customer collaboration over contract negotiation

4.Responding to change over following a plan

Figure 4.5. Agile Model

Fig 5.1. Application Architecture

Figure 5.2. MVVM Model

3. Use case diagram

Figure 5.3. Use Case Diagram

Figure 5.4. Class Diagram

4. Guo, L. (2022). Application of TensorFlow AI Object Detection to Mobile App

You might also like