Report Internship Chapters
Shree Drashti Infotech LLP is a "StartUp Gujarat Awarded" company, established in 2019, that
provides embedded and software solutions.
The main purpose of Drashti Infotech is to provide an extensive range of services and solutions
in the field of Embedded System Design and Programming with Python, .NET, Android, etc.
Shree Drashti Infotech LLP also offers application-oriented training to students, thereby
bridging the gap between industry requirements and students' skill sets.
Python has emerged as a versatile programming language for Embedded Systems, Web
Application Development, Data Science and Artificial Intelligence. Shree Drashti Infotech LLP
offers a free online Python internship program for engineering students, which will enable you
to launch your career in any of the above-mentioned domains. Rather than focusing on teaching
Python syntax, this internship will make you a hands-on Python programmer. You will equip
yourself with strong foundational skills such as algorithms, problem solving, and
object-oriented programming by getting hands-on with Python.
GOALS:
• Get hands-on programming exposure in Python, the way industry works.
• Build a good project in Python by exploring all Python programming constructs.
• Obtain a Government of India (AICTE) approved internship.
• Lay foundations for a long-term career in Data Science / Machine Learning.
• Get exposure to the standard industry practices of a Data Analyst.
Chapter: 2 Major Component of Internship
<tab>output
Not equal
if (a != 10):
<tab>output
2.1.2 Loops
Python has two primitive loop commands:
while loops
for loops
while
while (condition):
<tab>output
while with else
while (condition):
<tab>output
else:
<tab>output
while with break
while (condition):
<tab>if (condition):
<tab><tab>break
<tab>output
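The while patterns above (plain, with else, and with break) can be tried together in one small sketch. The function name count_up and its arguments are illustrative assumptions, not part of the internship material.

```python
def count_up(limit, stop_at=None):
    """Collect 0..limit-1, optionally breaking early when i equals stop_at."""
    seen = []
    finished = False
    i = 0
    while i < limit:
        if i == stop_at:
            break              # leaves the loop; the else branch is skipped
        seen.append(i)
        i += 1
    else:
        finished = True        # runs only when the loop condition became false
    return seen, finished


print(count_up(3))             # ([0, 1, 2], True)
print(count_up(5, stop_at=2))  # ([0, 1], False)
```

The else branch of a while loop runs only when the loop ends without hitting break, which is why the second call reports False.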
for with range
for i in range(end):
<tab>output
for i in range(start,end):
<tab>output
for i in range(start,end,spacing):
<tab>output
for with custom list
items = ["item1", "item2", ...]
for i in items:
<tab>output
for with continue
items = ["item1", "item2", "item3"]
for x in items:
<tab>if x == "item2":
<tab><tab>continue
<tab>print(x)
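As a runnable illustration of the for patterns above, the following sketch shows the three forms of range and a loop that uses continue. The variable names are assumptions chosen for the example.

```python
# range(end), range(start, end) and range(start, end, step)
print(list(range(4)))         # [0, 1, 2, 3]
print(list(range(2, 6)))      # [2, 3, 4, 5]
print(list(range(1, 10, 3)))  # [1, 4, 7]

items = ["item1", "item2", "item3"]
kept = []
for x in items:
    if x == "item2":
        continue              # skip the rest of this iteration
    kept.append(x)
print(kept)                   # ['item1', 'item3']
```

The end value of range is exclusive, so range(2, 6) stops at 5.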
Operator   Meaning                                                      Example
+          Add two operands, or unary plus                              x + y, +2
-          Subtract right operand from the left, or unary minus         x - y, -2
*          Multiply two operands                                        x * y
/          Divide left operand by the right one (always gives a float)  x / y
%          Modulus: remainder of dividing left operand by the right     x % y (remainder of x / y)
//         Floor division: whole-number result, adjusted to the left    x // y
           on the number line
**         Exponent: left operand raised to the power of right          x ** y (x to the power y)
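The operators in the table can be checked directly in the interpreter. This is a small sketch with assumed values x = 7 and y = 2; the last entry shows what "adjusted to the left on the number line" means for a negative operand.

```python
x, y = 7, 2

results = {
    "x + y": x + y,      # 9
    "x - y": x - y,      # 5
    "x * y": x * y,      # 14
    "x / y": x / y,      # 3.5 (always a float)
    "x % y": x % y,      # 1
    "x // y": x // y,    # 3
    "x ** y": x ** y,    # 49
    "-x // y": -x // y,  # -4: floor division rounds toward negative infinity
}
for expr, value in results.items():
    print(expr, "=", value)
```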
• In NumPy dimensions are called axes. The number of axes is rank.
• NumPy’s array class is called ndarray. It is also known by the alias array.
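A short sketch of what axes mean in practice, assuming NumPy is installed; the array values are made up for illustration.

```python
import numpy as np

a = np.array([[1, 2, 3],
              [4, 5, 6]])

print(type(a).__name__)        # ndarray
print(a.ndim)                  # 2, the number of axes
print(a.shape)                 # (2, 3): 2 rows along axis 0, 3 columns along axis 1
print(a.sum(axis=0).tolist())  # [5, 7, 9], summing down each column
print(a.sum(axis=1).tolist())  # [6, 15], summing across each row
```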
import pandas as pd
Here "pd" is an alias (abbreviation) used as a shortcut to access pandas functions. To call a
function from the pandas library, you type pd.function instead of pandas.function each time
you need it.
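For example, a DataFrame can be created and queried through the alias. This is a minimal sketch assuming pandas is installed; the column names and values are invented for the example.

```python
import pandas as pd

# pd.DataFrame is the same object as pandas.DataFrame, reached through the alias
df = pd.DataFrame({"name": ["a", "b", "c"], "score": [10, 20, 30]})

print(df.shape)            # (3, 2)
print(df["score"].mean())  # 20.0
```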
Convert to an array.
Model selection
1). X_train - This contains the independent variables used to train the model. Because we have
specified test_size = 0.20, 80% of the observations in the complete data are used to train/fit
the model and the remaining 20% are used to test it.
2). X_test - This is the remaining 20% of the independent variables, which is not used in the
training phase and is used to make predictions to test the accuracy of the model.
3). y_train - This is the dependent variable that the model needs to predict. It contains the
category labels for the independent variables in X_train and must be supplied when
training/fitting the model.
4). y_test - This contains the category labels for the test data. These labels are compared with
the predicted categories to measure the accuracy of the model.
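The four pieces described above are what scikit-learn's train_test_split returns. The sketch below uses a toy array in place of the report's data, and random_state=42 is an assumption added only to make the split repeatable.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Toy data (assumption): 10 observations, 2 independent variables, binary labels
X = np.arange(20).reshape(10, 2)
y = np.array([0, 1] * 5)

X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.20, random_state=42
)

print(X_train.shape, X_test.shape)  # (8, 2) (2, 2)
print(y_train.shape, y_test.shape)  # (8,) (2,)
```

With 10 observations and test_size = 0.20, 8 rows go to training and 2 to testing.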
We will use Scikit-learn's support vector classifier (SVC) to train an SVM model on this data.
Support vector machines (SVMs) are powerful yet flexible supervised machine learning
algorithms that are used for both classification and regression, although they are generally
used for classification problems. SVMs have their own way of implementation compared to other
machine learning algorithms. They are popular because of their ability to handle multiple
continuous and categorical variables.
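A minimal sketch of training an SVC is given below. The report's own data is not reproduced here, so the Iris dataset bundled with scikit-learn is used as a stand-in; the kernel choice, random_state and stratify settings are assumptions.

```python
from sklearn.datasets import load_iris
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.svm import SVC

# Stand-in data (assumption): the Iris dataset shipped with scikit-learn
X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.20, random_state=0, stratify=y
)

model = SVC(kernel="rbf")       # "rbf" is the default kernel, stated explicitly
model.fit(X_train, y_train)     # train on the 80% portion
y_pred = model.predict(X_test)  # predict categories for the 20% portion

accuracy = accuracy_score(y_test, y_pred)
print(f"test accuracy: {accuracy:.2f}")
```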
Other Models
Chapter: 3 Methodology Adopted to carry out an internship
3.2 Regression
Chapter: 4 Tools and Technology Used
System Type: 64 bit
4.1.3. Tools
4.2 SOFTWARE STUDY:
4.2.1 Python 3.7
Python is a high-level, interpreted programming language. It was designed by Guido van Rossum,
first released in 1991, and is based on a design philosophy that emphasizes code readability
through the use of significant whitespace. Its language constructs and object-oriented approach
aim to help programmers write clear, logical code for small and large projects.
Python is a dynamically typed, garbage-collected programming language. It supports a variety of
programming paradigms, including procedural, object-oriented, and functional. Because of its
vast standard library, Python is sometimes referred to as a "batteries included" language.
Object-oriented and structured programming are fully supported, and many of Python's features,
including metaprogramming and metaobjects (magic methods), support functional and
aspect-oriented programming.
Python uses dynamic typing and combines reference counting with a cycle-detecting garbage
collector to manage memory. It also features late binding (dynamic name resolution), which
binds method and variable names while the program is running.
Python's design offers some support for functional programming in the Lisp tradition. It has
filter, map, and reduce functions, as well as list comprehensions, dictionaries, sets, and
generator expressions. The standard library has two modules, itertools and functools, that
implement functional tools borrowed from Haskell and Standard ML.
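The functional tools available in Python can be illustrated with the standard library alone. The sketch below uses filter, map, functools.reduce, itertools.accumulate, a list comprehension and a generator expression on a made-up list of numbers.

```python
from functools import reduce
from itertools import accumulate

numbers = [1, 2, 3, 4, 5, 6]

evens = list(filter(lambda n: n % 2 == 0, numbers))  # [2, 4, 6]
squares = list(map(lambda n: n * n, numbers))        # [1, 4, 9, 16, 25, 36]
total = reduce(lambda acc, n: acc + n, numbers, 0)   # 21

even_squares = [n * n for n in numbers if n % 2 == 0]  # list comprehension
lazy_squares = (n * n for n in numbers)                # generator expression, evaluated on demand
running = list(accumulate(numbers))                    # running totals: [1, 3, 6, 10, 15, 21]

print(evens, total, even_squares, next(lazy_squares), running)
```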
The Zen of Python (PEP 20) summarizes the philosophy of the language in aphorisms such as:
Beautiful is better than ugly.
Explicit is better than implicit.
Simple is better than complex.
Complex is better than complicated.
Readability counts.
Python was designed to be highly extensible, rather than having all of its capabilities built into
its core. This compact modularity has made it popular as a way of adding programmable
interfaces to existing applications. Van Rossum's idea of a compact core language with a large
standard library and an easily extensible interpreter came from his dissatisfaction with ABC,
which took the opposite approach.
4.3 SCIKIT-LEARN
Scikit-learn is a free machine learning library for Python. It provides a variety of
classification, regression, and clustering techniques, and is designed to interoperate with the
Python numerical and scientific libraries NumPy and SciPy.
4.4. PANDAS
Pandas is a free, open-source Python library for manipulating data. It is built on top of NumPy
and therefore needs NumPy to function. Pandas makes it easy to create, modify, and organize
data sets in a variety of formats, and it is also a good option for working with time series data.
Pandas is popular among data scientists because of the following advantages:
• It provides sophisticated time series tools for working with data.
• It uses Series for one-dimensional data structures and DataFrame for multi-dimensional data
structures.
• It provides an efficient way to slice the data.
• It provides various ways to merge, concatenate, or reshape the data.
Pandas is, in a nutshell, a convenient data analysis library for manipulating and analyzing
information. It provides data structures that are both powerful and easy to use, and it performs
operations on them quickly.
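The one-dimensional and multi-dimensional data structures in pandas, along with slicing and combining, can be sketched as follows. The item names, units and prices are invented for illustration and pandas is assumed to be installed.

```python
import pandas as pd

# One-dimensional structure: a Series with labelled entries
prices = pd.Series([10.0, 12.5, 11.0], index=["a", "b", "c"])

# Two-dimensional structures: DataFrames that share an "item" column
sales = pd.DataFrame({"item": ["a", "b", "c"], "units": [3, 1, 2]})
costs = pd.DataFrame({"item": ["a", "b", "c"], "price": [10.0, 12.5, 11.0]})

merged = sales.merge(costs, on="item")                 # combine on the shared column
merged["revenue"] = merged["units"] * merged["price"]  # column-wise arithmetic
top = merged[merged["units"] >= 2]                     # slice rows by a condition

print(prices["b"])              # 12.5
print(top["item"].tolist())     # ['a', 'c']
print(merged["revenue"].sum())  # 64.5
```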
Chapter: 5 Snapshots
Reading data:
Split data:
5.2 Heart Disease:
Reading data:
Split data:
5.3 Chronic Kidney Disease:
Reading data:
Split data:
5.4 Bank Additional:
Reading data:
Split data:
5.5 Bike Buyers:
Reading data:
Split data:
5.6 Crop Recommendation:
Reading data:
Split data:
Testing Classification Report SVM:
Reading data:
Split data:
Testing Classification Report SVM:
Reading data:
Split data:
Parameters LR:
5.9 Covid-19 Cases:
Reading data:
Split data:
Parameters LR:
Chapter: 6 Observation
KNN:
DT:
RF:
NB:
KNN:
DT:
RF:
NB:
KNN:
DT:
RF:
NB:
KNN:
DT:
RF:
NB:
KNN:
DT:
RF:
NB:
6.6 Crop Recommendation:
KNN:
DT:
RF:
NB:
6.7 Instagram Fake User:
KNN:
DT:
RF:
NB:
Model   Accuracy   Precision   Recall   F1-Score
SVM     0.74       0.76        0.67     0.71
KNN     0.70       0.68        0.66     0.70

Model   Accuracy   Precision   Recall   F1-Score
SVM     0.98       0.99        0.98     0.98
KNN     0.98       0.98        0.98     0.98
DT      0.98       0.98        0.98     0.98
RF      0.84       0.84        0.86     0.81
NB      0.99       0.99        0.99     0.99
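The four columns in these tables correspond to functions in sklearn.metrics. The sketch below shows how such values can be computed; the labels are toy values chosen for the example, not the report's predictions.

```python
from sklearn.metrics import accuracy_score, f1_score, precision_score, recall_score

# Toy labels (assumption): 3 true positives, 3 true negatives,
# 1 false positive and 1 false negative
y_true = [1, 0, 1, 1, 0, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

accuracy = accuracy_score(y_true, y_pred)    # (TP + TN) / total = 6 / 8
precision = precision_score(y_true, y_pred)  # TP / (TP + FP) = 3 / 4
recall = recall_score(y_true, y_pred)        # TP / (TP + FN) = 3 / 4
f1 = f1_score(y_true, y_pred)                # harmonic mean of precision and recall

print(accuracy, precision, recall, f1)
```

For multi-class datasets, precision_score, recall_score and f1_score need an average argument such as "macro" or "weighted".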
Chapter: 8 Conclusion and Future Scope
During the internship I learned about machine learning classification and regression models and
how they work on different datasets. The results show that Python ML models perform well on
classification datasets such as Iris, heart disease, and kidney disease. Similarly, on the
regression datasets ADANIPORTS and COVID-19, the RF model has a lower error than the other
models.
In future, models can be built on larger datasets, and AutoML approaches can be developed that
select the best ML model among all candidates from a single test.
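The idea of selecting the best model automatically can be sketched in a much simplified form: score several candidate classifiers with cross-validation and keep the one with the highest mean accuracy. This is not a full AutoML system. The Iris dataset, the default hyperparameters, n_estimators=50 and cv=5 are all assumptions made for the example.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import cross_val_score
from sklearn.naive_bayes import GaussianNB
from sklearn.neighbors import KNeighborsClassifier
from sklearn.svm import SVC
from sklearn.tree import DecisionTreeClassifier

# Stand-in data (assumption): the Iris dataset shipped with scikit-learn
X, y = load_iris(return_X_y=True)

candidates = {
    "SVM": SVC(),
    "KNN": KNeighborsClassifier(),
    "DT": DecisionTreeClassifier(random_state=0),
    "RF": RandomForestClassifier(n_estimators=50, random_state=0),
    "NB": GaussianNB(),
}

# Mean 5-fold cross-validated accuracy for every candidate
scores = {
    name: cross_val_score(model, X, y, cv=5).mean()
    for name, model in candidates.items()
}
best = max(scores, key=scores.get)
print(best, round(scores[best], 3))
```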
References
1. https://pypi.org/
2. https://data-flair.training/blogs/python-introduction/
3. https://scikit-learn.org/
4. https://www.w3schools.com/python/pandas/default.asp
5. https://www.geeksforgeeks.org/introduction-to-pandas-in-python/