0% found this document useful (0 votes)
21 views31 pages

R_Lab_rec_final

This document outlines the curriculum for the CMCA22ET3 - Data Analytics and R-Programming Laboratory for the academic year 2024-2025. It includes a bonafide certificate, a table of contents detailing various exercises related to matrix and vector operations, data frame creation, statistical calculations, and graphical representations using R programming. Each exercise provides an aim, algorithm, program code, and results indicating successful execution.

Uploaded by

hitlerkomanam
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
21 views31 pages

R_Lab_rec_final

This document outlines the curriculum for the CMCA22ET3 - Data Analytics and R-Programming Laboratory for the academic year 2024-2025. It includes a bonafide certificate, a table of contents detailing various exercises related to matrix and vector operations, data frame creation, statistical calculations, and graphical representations using R programming. Each exercise provides an aim, algorithm, program code, and results indicating successful execution.

Uploaded by

hitlerkomanam
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 31

DEPARTMENT OF COMPUTER APPLICATIONS

CMCA22ET3- DATA ANALYTICS


&
R-PROGRAMMING LABORATORY
2024 –2025
(ODD SEMESTER)

NAME :
REGISTER NO. :
COURSE : M.C.A
YEAR / SEM / SEC : II/ III /
DEPARTMENT OF COMPUTER APPLICATIONS

BONAFIDE CERTIFICATE

Certified that this is a Bonafide Record of work done by of

(Reg.NO: ) MCA II year in CMCA22ET3- DATA ANALYTICS


AND R-PROGRAMMING LABORATORY during the III semester in the year
2024-2025.

Submitted for the Practical Examination held on

Signature of Lab-in charge Signature of Dean

Internal Examiner External Examiner


TABLE OF CONTENT

PAGE
S.NO DATE CONTENTS NO. SIGNATURE

Operations On Matrices 4
1. (Addition, Subtraction, Multiplication)
Operations On Vectors 7
2.

Creation of data frames, Bar chart and 10


3. Histogram.
Random Number Generator In R. 13
4.

Calculating Mean, Median, Standard 15


5. Deviation, Variance For A Given Set Of
Data
Factorial Using Recursive Function 18
6.

Evaluating Linear model 20


7.

Plot the Graph for Normal Distribution 24


8.

9. Time Series Graph For A Given Set Of 26


Data

10. Hypothesis Testing Using R Programming 29


Exercise – 1 Operations On Matrices
(Addition, Subtraction, Multiplication)
DATE:

AIM:

To write a program to perform operation on matrices

ALGORITHM:

Step 1: Start the program.

Step 2: Create vectors of data that will populate the matrices.

Step 3: Use the matrix function to create matrices from the vectors.

Step 4: Ensure both matrices have the same dimensions, then use the + operator to add the
matrices.

Step 5: Ensure both matrices have the same dimensions, then use the - operator to subtract the
matrices.

Step 6: Ensure the columns of first matrix is same as rows of second matrix to multiply the
matrices.

Step 7: Print the results of the operations.

Step 8: Stop the program.


Concept : Basic operations on Matrices:

PROGRAM:

mat1.data <- c(1,2,3,4,5,6,7,8,9)


mat1.data <- c(1,2,3,4,5,6,7,8,9)
mat1 <- matrix(mat1.data,nrow=3,ncol=3,byrow=TRUE)
mat1
mat2.data <- c(10,11,12,13,14,15,16,17,18)
mat2 <- matrix(mat2.data,nrow=3)
mat2
mat2[1,2]
mat2[c(1,3),c(2,3)]
mat2[2, , drop=FALSE]
mat2[c(-1),c(-2,-3),drop=FALSE]
mat2[c(TRUE,FALSE),c(FALSE,TRUE,TRUE)]
mat1*5
mat1+2
mat1/2
mat_add <- mat1+mat2
mat_add
mat_sub <- mat1-mat2
mat_sub
mat_mul <- mat1*mat2
mat_mul
OUTPUT:

RESULT:

Thus the above code got executed successfully.


Exercise – 2 Operations On Vector
DATE:

AIM:

To write a program to perform operations on vector using R programming

ALGORITHM:

Step 1: Start the program.

Step 2: Define the vector using the c() function.

Step 3: Perform arithmetic operations (addition, subtraction, multiplication, division) on


vector elements.

Step 4: Perform logical operations, such as:


a) Comparison: Compare vector elements using operators >, <, ==.
b) Logical AND/OR: Use operators & and | for element-wise logical operations.

Step 5: Perform statistical operations like mean, median, sum, and product.

Step 6: Perform vector functions using length(), sort(), and rev().

Step 7: Display the analyzed results.

Step 8: Stop the program.


Concept:

• Multiplying a vector with a constant


• Adding two vectors
• Multiplying two vectors
• Subtraction of 2 vectors
• Division operations on vectors
• Creating Vectors using sequence function for a given range
• Repetition of vectors

PROGRAM:

vec<-c(1,2,3,4,5)
multivec <- vec*2
multivec

vec2<-c(6,7,8,9,10)
vector_add <- vec+vec2
vector_mul <- vec*vec2
vector_mul

vector_sub <- vec2-vec


vector_sub

vector_div <- multivec/vec


vec_seq <- seq(from=1,to=20,length=30)
vec_seq

vec_rep <- rep(c(2,3,4), times=3)


vec_rep

sum(vec_rep)
OUTPUT:

> vec<-c(1,2,3,4,5)
> multivec <- vec*2
> multivec
[1] 2 4 6 8 10
> vec2<-c(6,7,8,9,10)
> vector_add <- vec+vec2
> vector_mul <- vec*vec2
> vector_mul
[1] 6 14 24 36 50
> vector_sub <- vec2-vec
> vector_sub
[1] 5 5 5 5 5
> vector_div <- multivec/vec
> vec_seq <- seq(from=1,to=20,length=30)
> vec_seq

[1] 1.000000 1.655172 2.310345 2.965517 3.620690 4.275862 4.931034 5.586207


[9] 6.241379 6.896552 7.551724 8.206897 8.862069 9.517241 10.172414 10.827586
[17] 11.482759 12.137931 12.793103 13.448276 14.103448 14.758621 15.413793 16.068966
[25] 16.724138 17.379310 18.034483 18.689655 19.344828 20.000000

> vec_rep <- rep(c(2,3,4), times=3)


> vec_rep
[1] 2 3 4 2 3 4 2 3 4
> sum(vec_rep)
[1] 27

RESULT:

Thus the above code got executed successfully.


Exercise – 3 Creation of data frames, Bar chart and Histogram.
DATE:

AIM:

To write a program to create data frames, Bar- chart & histogram in R programming

ALGORITHM:

Step 1: Start the program.

Step 2: Use the data.frame() function to combine the vectors into a data frame.

Step 3: Identify the categorical data and corresponding frequencies that you want to visualize in
a bar chart.

Step 4: Use the barplot() function for basic bar charts or ggplot2 for more customization.

Step 5: Identify the numerical data that you want to visualize in a histogram.

Step 6: Use the hist() function for a basic histogram.

Step 7: Display the analyzed results.

Step 8: Stop the program.


PROGRAM:

In the drive E create a directory “Sample”


In the note pad type the data shown in k1.csv

setwd("E:\\sample")
getwd()
sample = read.csv("k1.csv")
data <- read.csv("k1.csv")
head(sample)

sal <- max(data$salary)


print(sal)
salaries <-(data$salary)
salaries
names<-(data$name)
names
summary(data$salary)

salarybar<-c(names,salaries, byrow=FALSE)
barplot(data$salary,main="SALARY", ylab='salary',
xlab ="name", col="blue",
horiz=FALSE,beside=TRUE)
hist(data$salary)
colnames<-c(data$name)
colnames
rowname<-c(data$sa)
OUTPUT

RESULT:

Thus the above code got executed successfully.


Exercise – 4 Random Number Generator In R.
DATE:

AIM:

To write program to create a Random Number Generator in R programming.

ALGORITHM:

Step 1: Start the program.

Step 2: Decide the type of random numbers you want to generate (uniform, normal, binomial).

Step 3: Define the parameters for the chosen distribution.

Step 4: Use the appropriate function to generate random numbers based on the chosen
distribution and specified parameters.

Step 5: Display the generated random numbers.

Step 6: Stop the program.


PROGRAM:

df <- data.frame("Serial_number" = 1:5, "Age" = c(20, 21, 17, 23, 19), "Name" = c("raja","ramu",
"ravi", "raghu", "rajesh"))
# Sort by age ascending order
newdataAsc <- df[order(df$Age),]
newdataAsc
# sorting is descending order
newdataDsc <- df[order(-df$Age),]
newdataDsc

OUTPUT:

RESULT:

Thus the above code got executed successfully.


Exercise – 5 Calculating Mean, Median, Standard Deviation, Variance
DATE: For A Given Set Of Data

AIM:

To write a program for calculating mean, median, standard deviation, and variance for a given set
of data.

ALGORITHM:

Step 1: Start the program.

Step 2: Start with a vector of numerical data.

Step 3: The mean is the average of all numbers. Use the mean function in R to generate mean
value.

Step 4: The median is the middle value of the sorted data. Use the median function in R to
calculate median value.

Step 5: The standard deviation measures the amount of variation or dispersion in a set of values.
Use the SD function in R to compute Standard deviation.

Step 6: The variance is the square of the standard deviation. Use the var function in R to
calculate Variance.

Step 7: Display the calculated mean, median, standard deviation, and variance.

Step 8: Stop the program.


PROGRAM:

setwd("E:\\Sample")
getwd()

sample2 = read.csv("k2.csv")
data <- read.csv("k2.csv")
data

mean(data$x)
median(data$x)
sd(data$x)
var(data$x)

#mediam absolute variance


mad(data$x)
max(data$x)
min(data$x)
sum(data$x)
length(data$x)
OUTPUT:

RESULT:

Thus the above code got executed successfully.


Exercise – 6 Factorial Using Recursive Function
DATE:

AIM:

To Calculate factorial of a number using recursive function.

ALGORITHM:

Step 1: Start the program.

Step 2: Create a function that takes a single argument ‘n’, which represents the number for
which we want to calculate the factorial.

Step 3: Inside the function, check if n is equal to 0 or 1.

Step 4: If n is either 0 or 1 then the function returns 1, as the result of 0! and 1! is 1.

Step 5: If n is greater than 1 then the function multiplies n and recursion of the function n-1
until n becomes 0 or 1.

Step 6: The function will eventually reach the base case after several calls, returning the final
computed factorial.

Step 8: Stop the program


PROGRAM:
rec_fac <- function(x)
{
if(x==0 || x==1)
{
return(1)
}
else
{
return(x*rec_fac(x-1))
}
}
var = 8;
var = as.integer(var)
print(var)
x<-var
f<-rec_fac(x)
t1<-c("factorial",x,"is",f )
t1

OUTPUT:

RESULT:

Thus the above code got executed successfully.


Exercise – 7 Evaluating Linear model
DATE:

AIM:

To plot a scatter graph and evaluating linear model to predict car sales using advertisement cost.

ALGORITHM:

Step 1: Start the program.

Step 2: Create a directory "sample", then set the working directory using setwd() in R.

Step 3: Ensure that you have k1.csv file in the E:/sample directory with the required data, then
read the CSV file into a data frame.

Step 4: Print the first few rows of the data frame.

Step 5: Extract columns from the original data frame and create a new data frame.

Step 6: Create a scatter plot to visualize the relationship between car sales and advertisement
cost.

Step 7: Create a linear model to predict car sales based on advertisement cost.

Step 8: Get a summary of the linear model to see the coefficients and statistical metrics.

Step 9: Stop the program.


PROGRAM:
In the drive E create a directory “Sample”
In the note pad type the data shown in k3.csv
(I am attaching the k3.csv file for your kind perusal)

setwd("E:\\Sample")
getwd()
sample1 = read.csv("k3.csv")
data <- read.csv("k3.csv")
head(sample1)
# Create a data frame.

# Get the max salary from data frame.


store_name<-data$store
store_name
car_sales <-data$sales
car_sales
store_advt<-data$advt_cost
store_advt
data<-data.frame(store_name,car_sales,store_advt)
data
scatter.smooth(x=data$car_sales,y=data$store_advt, xlab = "car sales",ylab = "advertisement cost",
main="car sales - advertisementcost")
Linear_model <- lm(car_sales~store_advt,data = sample1)
print(Linear_model)
summary(Linear_model)
OUTPUT:
RESULT:

Thus the above code got executed successfully.


Exercise – 8 Plot the Graph for Normal Distribution
DATE:

AIM:

To plot the graph for a normal distribution by using necessary functions.

ALGORITHM:

Step 1: Start the program.

Step 2: Load necessary libraries.

Step 3: Use dnorm to calculate the density function values over a range of x-values if we want
to plot the PDF (Probability Density Function).

Step 4: Use base R plotting functions like plot() and lines(), or advanced functions from
packages.

Step 5: Add titles, axis labels, and other enhancements for better visualization.

Step 6: Stop the program.


PROGRAM:

# Create a sequence of numbers between -5 and 5 incrementing it by 0.2.


x <- seq(-5, 5, by = .1)

# The mean here is 1.0 and standard deviation as 0.


y <- dnorm(x, mean =0, sd = 1)

#Plot the Graph


plot(x,y)

OUTPUT:

RESULT:

Thus the above code got executed successfully.


Exercise – 9 Time Series Graph For A Given Set Of Data
DATE:

AIM:

To generate Time Series Graph for A Given Set Of Data.

ALGORITHM:

Step 1: Start the program.

Step 2: Import the data into R from a file (such as Excel or other sources).

Step 3: Check the data structure for any missing values and ensure the data is in the correct
format for time series analysis.

Step 4: If the data is not already in time series format, convert it using the ts() function.

Step 5: Use the plot() function or other specialized plotting functions from relevant packages to
create a time series graph.

Step 6: Add a title, labels, and other annotations to make the plot more informative.

Step 7: Stop the program


PROGRAM:

sales_ice <- c(450, 613, 466, 205.7,571.0, 622.0, 851.4, 621.4,875.3,979.7, 927.5,14.45)
sales_cooldrink <- c(550, 713, 566, 687.2,110, 120, 72.4,814.4,423.5, 98.7, 741.4,345.3)

combined.sales <- matrix(c(sales_ice,sales_cooldrink),nrow = 12)


sales.timeseries <- ts(combined.sales,start = c(2021,1),frequency = 12)
plot(sales.timeseries, main = "Showing ice cream – cooldrink sales")

print(sales.timeseries)
OUTPUT:

RESULT:

Thus the above code got executed successfully.


Exercise – 10 Hypothesis Testing Using R Programming
DATE:

AIM:

To prove Hypothesis Testing Using R Programming.

ALGORITHM:

Step1: Start the Program

Step2: Define the Hypotheses


• Null Hypothesis (H₀)
• Alternative Hypothesis (H₁ or Ha)

Step3: Choose the Test


• Based on data type, sample size, and assumptions.

Step4: Set the Significance Level


• Common values: 0.05, 0.01, or 0.10.

Step5: Load Data and Pre process

Step6: Perform the Test


• Use appropriate R functions (e.g., t.test(), chisq. test()).

Step7: Calculate Test Statistic and P-value

Step8: Make a Decision and Conclusion


• Reject or fail to reject H₀ based on p-value.

Step9: Stop the Program


PROGRAM:

x <- sample(c(1:100),size=20,replace=TRUE)

y <- sample(c(1:100),size=20,replace=TRUE)

t1<- c("value of variable x")


t1
x
t2<- c("value of variable y") y
t3<-c("t test ") t3
t.test(x,y)
t4<-c("Correlation between x and y") t4
cor(x,y)
t5<-c("covarience") t5
cov(x,y)
OUTPUT:

RESULT:

Thus the above code got executed successfully.

You might also like