UNIT 1-Capstone Project Practice Questions
Uploaded by Rashmi Kaith

UNIT 1: CAPSTONE PROJECT

1. What is the main purpose of a Capstone Project?


o A) To demonstrate theoretical knowledge.
o B) To complete a thesis paper.
o C) To integrate all knowledge gained through a comprehensive project.
o D) To learn about different industries.

2. Which of the following is not an objective of a Capstone Project?


o A) Solving real-world problems.
o B) Expressing solutions in technical terms.
o C) Selecting appropriate algorithms for a problem.
o D) Learning teamwork.

3. Which AI project involves predicting stock prices?


o A) Movie Ticket Price Predictor
o B) Stock Prices Predictor
o C) Sentiment Analyzer
o D) Student Results Predictor

4. Which AI model is typically used for classification?


o A) Regression
o B) Clustering
o C) Classification
o D) Anomaly Detection

5. What is the first step in the AI project cycle?


o A) Model construction
o B) Data gathering
o C) Problem definition
o D) Evaluation & refinements

6. Which step is critical in determining whether AI techniques are applicable to a problem?


o A) Gathering data
o B) Identifying a pattern in data
o C) Deploying the model
o D) Selecting the right algorithm
7. Which of the following is not a stage of Design Thinking?
o A) Empathize
o B) Define
o C) Deploy
o D) Prototype

8. Which is an example of problem decomposition?


o A) Gathering data from sensors
o B) Breaking down app development into multiple tasks
o C) Running machine learning models
o D) Collecting user feedback

9. Which concept involves the breakdown of time series data into trend, seasonality, and noise
components?
• A) Time Series Forecasting
• B) Design Thinking
• C) Problem Decomposition
• D) Time Series Decomposition

10. Which is the first step in any AI or machine learning project?


• A) Data modeling
• B) Data collection
• C) Business understanding
• D) Cross-validation

11. Which is the foundational methodology for data science?


• A) Data mining
• B) CRISP-DM
• C) Agile
• D) SDLC

12. Which of these approaches would you use for showing relationships between variables?
• A) Predictive approach
• B) Descriptive approach
• C) Classification approach
• D) Regression

13. When might a predictive model be used?


• A) To explain historical data
• B) To show relationships between data
• C) To predict future outcomes
• D) To cluster similar data points
14. What question should be asked first in a data project?
• A) What is the business outcome?
• B) How will the data be collected?
• C) What data is needed?
• D) What algorithm will be used?

15. Which dataset is commonly used for predicting house prices?


• A) Airline Passenger Dataset
• B) Forestfires Dataset
• C) Housing Dataset
• D) MNIST Dataset

16. What is a descriptive model used for?


• A) Prediction of new data
• B) Describing relationships in historical data
• C) Anomaly detection
• D) Identifying missing data

17. Which concept refers to adjusting models using new data to improve their accuracy?
• A) Refinement
• B) Validation
• C) Cross-validation
• D) Feature selection

18. What does the train-test split method achieve?


• A) Collecting the data
• B) Evaluating model performance
• C) Data pre-processing
• D) Model deployment

19. What percentage is commonly used for training data in a train-test split?
• A) 80%
• B) 20%
• C) 67%
• D) 50%
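The split asked about in questions 18-19 can be sketched in plain Python. This is a minimal illustration with invented toy data; real projects typically use scikit-learn's train_test_split:

```python
import random

def train_test_split(rows, train_fraction=0.8, seed=42):
    """Shuffle the rows, then cut them into train and test portions."""
    shuffled = rows[:]                      # copy so the caller's list is untouched
    random.Random(seed).shuffle(shuffled)   # seeded shuffle for reproducibility
    cut = int(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]

data = list(range(100))                     # toy dataset of 100 rows
train, test = train_test_split(data)        # default 80/20 split
print(len(train), len(test))                # 80 20
```

Shuffling before splitting matters when the rows are ordered; otherwise the test set would come entirely from one end of the data.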

20. In a cross-validation process, how many subsets are generally created in a 5-fold cross-
validation?
• A) 2
• B) 5
• C) 10
• D) 3
21. When is cross-validation more beneficial than train-test split?
• A) For large datasets
• B) For datasets with limited rows
• C) When doing unsupervised learning
• D) For high computational costs
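The 5-fold scheme in questions 20-21 can be illustrated with a small index generator. This is a sketch only; scikit-learn's KFold is the usual tool:

```python
def kfold_indices(n, k=5):
    """Yield (train_idx, test_idx) pairs for k-fold cross-validation."""
    fold_sizes = [n // k + (1 if i < n % k else 0) for i in range(k)]
    start = 0
    for size in fold_sizes:
        test_idx = list(range(start, start + size))
        train_idx = list(range(0, start)) + list(range(start + size, n))
        yield train_idx, test_idx
        start += size

folds = list(kfold_indices(20, k=5))
print(len(folds))          # 5 folds
print(len(folds[0][1]))    # each test fold holds 20/5 = 4 rows
```

Every row lands in a test fold exactly once, which is why cross-validation suits small datasets — at the price of fitting k models instead of one.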

22. Which of the following is a commonly used metric for regression models?
• A) Accuracy
• B) Precision
• C) Recall
• D) Root Mean Squared Error (RMSE)

23. Which metric is most suitable for classification tasks?


• A) MSE
• B) Accuracy
• C) RMSE
• D) Noise ratio

24. Which of the following is used to calculate RMSE?


• A) Mean of residuals
• B) Sum of absolute errors
• C) Square root of the mean of squared errors
• D) Mean of absolute differences

25. What does a low RMSE indicate?


• A) Poor model performance
• B) High variance in predictions
• C) Accurate predictions
• D) Overfitting
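RMSE from questions 24-25 — the square root of the mean of squared errors — fits in a few lines (the sample values below are invented):

```python
import math

def rmse(actual, predicted):
    """Root Mean Squared Error: sqrt of the mean of squared residuals."""
    squared = [(a - p) ** 2 for a, p in zip(actual, predicted)]
    return math.sqrt(sum(squared) / len(squared))

actual    = [3.0, 5.0, 7.0, 9.0]
predicted = [2.5, 5.5, 7.0, 8.0]
print(round(rmse(actual, predicted), 3))   # 0.612
```

A perfect model would give RMSE = 0; the lower the value relative to the scale of the target, the better the predictions.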

26. Which algorithm is used in the example of the Airline Passenger Dataset?
• A) Decision Tree
• B) Random Forest
• C) Seasonal Decomposition
• D) Support Vector Machine
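Question 26 points at seasonal decomposition. A naive additive decomposition (trend + seasonality + residual) can be sketched in plain Python; statsmodels' seasonal_decompose does this properly, and the toy series here is invented, with an odd period to keep the centered moving average simple:

```python
def decompose_additive(series, period):
    """Naive additive decomposition into trend, seasonality, and residual.
    Assumes an odd period so the centered moving average is symmetric."""
    n, half = len(series), period // 2
    trend = [None] * n
    for i in range(half, n - half):
        trend[i] = sum(series[i - half:i + half + 1]) / period
    detrended = [s - t if t is not None else None for s, t in zip(series, trend)]
    seasonal = []
    for pos in range(period):
        vals = [d for i, d in enumerate(detrended) if d is not None and i % period == pos]
        seasonal.append(sum(vals) / len(vals))
    residual = [d - seasonal[i % period] if d is not None else None
                for i, d in enumerate(detrended)]
    return trend, seasonal, residual

# invented toy series: a rising trend plus a repeating 3-step seasonal pattern
series = [i + [0, 2, -2][i % 3] for i in range(12)]
trend, seasonal, residual = decompose_additive(series, period=3)
print(seasonal)   # the injected 3-step pattern is recovered
```

With no noise in the toy series, the residual component is zero everywhere it is defined.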

27. Which value represents the best prediction in MSE?


• A) The highest value
• B) The lowest value
• C) The mean of predictions
• D) The median of predictions
28. What type of learning involves algorithms like regression or classification?
• A) Supervised learning
• B) Unsupervised learning
• C) Reinforcement learning
• D) Semi-supervised learning

29. Which algorithm is most suitable for a regression problem?


• A) Decision tree
• B) Linear regression
• C) K-nearest neighbors
• D) Naive Bayes
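Linear regression from question 29 reduces, for a single feature, to two closed-form ordinary-least-squares formulas (the toy data is invented):

```python
def fit_line(xs, ys):
    """Ordinary least squares for one feature: y ≈ intercept + slope * x."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    slope = cov / var
    intercept = mean_y - slope * mean_x
    return intercept, slope

xs = [1, 2, 3, 4, 5]
ys = [2, 4, 6, 8, 10]          # exactly y = 2x
intercept, slope = fit_line(xs, ys)
print(intercept, slope)         # 0.0 2.0
```

With noisy data the fitted line will not pass through every point; the same formulas still minimize the sum of squared residuals.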

30. In a recommendation system, which method is typically used to suggest new items?
• A) Clustering
• B) Regression
• C) Collaborative filtering
• D) Anomaly detection
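Collaborative filtering from question 30 scores users by rating similarity and borrows recommendations from the closest match. A minimal user-based sketch with a hypothetical ratings table:

```python
def cosine(u, v):
    """Cosine similarity between two rating vectors (0 means unrated)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = lambda w: sum(a * a for a in w) ** 0.5
    return dot / (norm(u) * norm(v))

# hypothetical ratings: rows = users, columns = items, 0 = not rated
ratings = {
    "alice": [5, 4, 0, 1],
    "bob":   [4, 5, 0, 1],
    "carol": [1, 0, 5, 4],
}
# user-based collaborative filtering: find the user most similar to the target,
# then treat that user's highly rated items as recommendation candidates
target = "alice"
others = [u for u in ratings if u != target]
best = max(others, key=lambda u: cosine(ratings[target], ratings[u]))
print(best)   # bob rates most like alice
```

Production systems use item-based variants or matrix factorization, but the core idea — similar users like similar items — is the same.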

31. Which of the following would be considered a feature in a dataset?


• A) The target label
• B) An algorithm
• C) A variable used for prediction
• D) The test set

32. Which is the most reliable method to evaluate model performance on smaller datasets?
• A) Simple train-test split
• B) Leave-one-out cross-validation
• C) Randomized testing
• D) Bootstrap aggregation

33. Cross-validation is typically used to:


• A) Build the model
• B) Split data into train and test sets
• C) Test the model with multiple subsets
• D) Apply unsupervised learning

34. What is one major drawback of cross-validation compared to train-test split?


• A) It uses less data.
• B) It takes more time and computational resources.
• C) It produces less accurate results.
• D) It can only be applied to classification problems.
35. Which validation method involves using every data point for testing at least once?
• A) K-fold cross-validation
• B) Simple validation
• C) Random split
• D) Hold-out validation

36. What is the primary advantage of using cross-validation?


• A) Requires fewer computational resources
• B) More accurate representation of model performance
• C) Faster training of the model
• D) Higher accuracy for large datasets

37. What is the goal of hyperparameter tuning?


• A) Choosing the right model
• B) Optimizing algorithm performance
• C) Collecting more data
• D) Scaling the data

38. What does MAPE stand for?


• A) Mean Absolute Prediction Error
• B) Mean Absolute Percentage Error
• C) Mean Adjusted Prediction Error
• D) Minimum Absolute Prediction Estimate
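MAPE from question 38 expresses the average error as a percentage of the actual values. A sketch with invented numbers (note it assumes no actual value is zero, since each term divides by the actual):

```python
def mape(actual, predicted):
    """Mean Absolute Percentage Error, in percent."""
    terms = [abs((a - p) / a) for a, p in zip(actual, predicted)]
    return 100 * sum(terms) / len(terms)

# invented values: errors of 10%, 10%, and 0% average to about 6.67%
print(round(mape([100, 200, 400], [110, 180, 400]), 2))   # 6.67
```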

39. Which error metric penalizes large errors more than small errors?
• A) RMSE
• B) MSE
• C) Accuracy
• D) Precision

40. Which error metric would you use to compare different regression models?
• A) Classification accuracy
• B) RMSE
• C) ROC-AUC score
• D) F1-Score

41. Which of the following is most important when evaluating a model’s accuracy on unseen
data?
• A) Precision
• B) Validation data
• C) Recall
• D) Feature engineering
42. Which metric is used to evaluate classification tasks in binary classification?
• A) Precision and recall
• B) RMSE
• C) MSE
• D) MAE

43. Which evaluation metric balances precision and recall in a classification problem?
• A) F1-Score
• B) Accuracy
• C) RMSE
• D) Cross-validation
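Precision, recall, and the F1-score from questions 42-43 can be computed directly from counts of true positives, false positives, and false negatives (the toy labels are invented):

```python
def f1_score(y_true, y_pred):
    """Precision, recall, and their harmonic mean (F1) for binary labels (1 = positive)."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    precision = tp / (tp + fp)                      # of predicted positives, how many were right
    recall = tp / (tp + fn)                         # of actual positives, how many were found
    return precision, recall, 2 * precision * recall / (precision + recall)

y_true = [1, 1, 1, 0, 0, 0]
y_pred = [1, 1, 0, 1, 0, 0]
precision, recall, f1 = f1_score(y_true, y_pred)
print(round(f1, 3))   # 0.667
```

Because F1 is a harmonic mean, it stays low unless precision and recall are both reasonably high — which is why it suits imbalanced datasets better than plain accuracy.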

44. What does MAE stand for in machine learning?


• A) Model Accuracy Estimate
• B) Mean Absolute Error
• C) Maximum Accuracy Estimate
• D) Minimum Adjustment Error

45. Which error metric is less sensitive to outliers in regression problems?


• A) RMSE
• B) MAE
• C) MSE
• D) Cross-entropy
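Questions 39 and 45 contrast RMSE and MAE on outliers; a quick sketch with invented residuals makes the difference visible:

```python
import math

def mae(errors):
    """Mean Absolute Error over a list of residuals."""
    return sum(abs(e) for e in errors) / len(errors)

def rmse(errors):
    """Root Mean Squared Error over a list of residuals."""
    return math.sqrt(sum(e * e for e in errors) / len(errors))

clean   = [1, -1, 1, -1]        # residuals with no outlier
outlier = [1, -1, 1, -10]       # one large residual
# MAE grows modestly; RMSE jumps because the outlier gets squared
print(mae(clean), rmse(clean))                      # 1.0 1.0
print(mae(outlier), round(rmse(outlier), 2))        # 3.25 5.07
```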

46. Which evaluation metric is best for highly imbalanced classification datasets?
• A) Accuracy
• B) F1-Score
• C) RMSE
• D) MAE

47. Which AI project involves recognizing human activities using smartphone data?
• A) Stock Prices Predictor
• B) Human Activity Recognition
• C) Student Results Predictor
• D) Sentiment Analysis

48. Which of the following best describes anomaly detection?


• A) Grouping similar data points
• B) Identifying unusual patterns in data
• C) Predicting continuous outcomes
• D) Labeling data based on features
49. In AI, what is a common use of clustering algorithms?
• A) Predicting future outcomes
• B) Grouping similar data points without labels
• C) Detecting anomalies
• D) Improving model accuracy
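Clustering from question 49 groups points using no labels at all. A bare-bones 1-D k-means sketch (the points and starting centers are invented):

```python
def kmeans_1d(points, centers, iters=10):
    """Plain k-means on 1-D points: assign each point to its nearest center,
    then move each center to the mean of its assigned points."""
    groups = [[] for _ in centers]
    for _ in range(iters):
        groups = [[] for _ in centers]
        for p in points:
            nearest = min(range(len(centers)), key=lambda i: abs(p - centers[i]))
            groups[nearest].append(p)
        centers = [sum(g) / len(g) if g else c for g, c in zip(groups, centers)]
    return centers, groups

points = [1.0, 1.2, 0.8, 10.0, 10.5, 9.5]      # two obvious groups, no labels needed
centers, groups = kmeans_1d(points, centers=[0.0, 5.0])
print([round(c, 1) for c in centers])          # centers settle near the two groups
```

Real projects use scikit-learn's KMeans on multi-dimensional features, but the assign-then-update loop is identical.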

1. Assertion (A): The Capstone Project integrates all learning from an academic program.
Reason (R): It focuses solely on individual work rather than collaboration.

• A) Both A and R are true, and R is the correct explanation of A.
• B) Both A and R are true, but R is not the correct explanation of A.
• C) A is true, but R is false.
• D) A is false, but R is true.

2. Assertion (A): Data gathering is a critical step in an AI project cycle.


Reason (R): Without proper data, the AI model cannot be trained effectively.

• A) Both A and R are true, and R is the correct explanation of A.
• B) Both A and R are true, but R is not the correct explanation of A.
• C) A is true, but R is false.
• D) A is false, but R is true.

3. Assertion (A): AI development is always suitable for every type of problem.


Reason (R): AI techniques are applied when a pattern exists in the data.

• A) Both A and R are true, and R is the correct explanation of A.
• B) Both A and R are true, but R is not the correct explanation of A.
• C) A is true, but R is false.
• D) A is false, but R is true.

4. Assertion (A): Data scientists use training sets to evaluate model performance.
Reason (R): Test sets are used to adjust models after training is complete.

• A) Both A and R are true, and R is the correct explanation of A.
• B) Both A and R are true, but R is not the correct explanation of A.
• C) A is true, but R is false.
• D) A is false, but R is true.

5. Assertion (A): Cross-validation ensures more reliable model evaluation than a train-test
split.
Reason (R): Cross-validation evaluates models using different data folds, making the process
more computationally efficient.

• A) Both A and R are true, and R is the correct explanation of A.
• B) Both A and R are true, but R is not the correct explanation of A.
• C) A is true, but R is false.
• D) A is false, but R is true.

6. Assertion (A): RMSE (Root Mean Squared Error) is a commonly used metric for
evaluating regression models.
Reason (R): RMSE penalizes larger errors more significantly than smaller errors, making it
sensitive to outliers.

• A) Both A and R are true, and R is the correct explanation of A.
• B) Both A and R are true, but R is not the correct explanation of A.
• C) A is true, but R is false.
• D) A is false, but R is true.

7. Assertion (A): The final stage in model evaluation is deployment.


Reason (R): Deployment occurs after thorough testing, validation, and refinement of the model.

• A) Both A and R are true, and R is the correct explanation of A.
• B) Both A and R are true, but R is not the correct explanation of A.
• C) A is true, but R is false.
• D) A is false, but R is true.

8. Assertion (A): MSE (Mean Squared Error) penalizes large errors more severely than
RMSE.
Reason (R): MSE focuses on the average squared difference between predicted and actual
values.
• A) Both A and R are true, and R is the correct explanation of A.
• B) Both A and R are true, but R is not the correct explanation of A.
• C) A is true, but R is false.
• D) A is false, but R is true.

9. Assertion (A): Data preprocessing is not necessary if the dataset is large.
Reason (R): Large datasets inherently contain all necessary information and require no adjustments.

• A) Both A and R are true, and R is the correct explanation of A.
• B) Both A and R are true, but R is not the correct explanation of A.
• C) A is true, but R is false.
• D) A is false, but R is true.

CASE STUDY BASED QUESTIONS:


1. A team of students is working on a stock price prediction model as part of their Capstone
Project. They are facing issues because the stock prices show a lot of volatility, and the
patterns are not clear. The team is unsure about how to proceed with building their AI
model.
Question: What should the team do first before applying any AI model?

2. A student team is tasked with building a sentiment analyzer that classifies text as positive,
negative, or neutral. They collected a dataset of tweets but discovered that the data includes
irrelevant information like URLs and emojis.
Question: How should the team handle the data before building their AI model?

3. A team is working on a project to address the issue of crop yield prediction in agriculture.
They collected a large dataset but are unsure which AI model to use for this type of prediction.
Question: What type of model should the team consider for predicting crop yields, and why?

4. A group is predicting brain weights based on head size using linear regression. After running
their model, they calculated an RMSE of 73.
Question: How should the team interpret this RMSE value, and what should be their next step?
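A common way to judge the RMSE of 73 in case study 4 is against the spread of the target variable itself: brain weights run to well over a thousand grams, so an RMSE well below the target's standard deviation suggests the model adds value over simply predicting the mean. A sketch with invented weights:

```python
import statistics

# hypothetical interpretation aid: compare RMSE with the target's own spread
rmse = 73.0
brain_weights = [1200, 1350, 1100, 1450, 1300, 1250]   # invented values, in grams
spread = statistics.stdev(brain_weights)
print(rmse < spread)   # RMSE below the spread -> the model beats a mean-only guess
```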

5. A student team is developing a recommendation system for improving educational resources in schools. They want to recommend learning materials based on students’ learning habits.
Question: Which AI technique should the team use to build the recommendation system, and
why?

6. A team is working on predicting movie ticket prices based on factors such as location, movie
type, and time of day. They have used a dataset but are not sure if their model is performing well.
Question: What metrics should the team use to evaluate their model’s performance?

7. A team is tasked with using AI to predict patient recovery times based on medical history and
treatment data. However, the data has missing values.
Question: What steps should the team take to handle the missing data before applying their AI
model?

8. A group of students is working on a project to classify human activities using smartphone sensors (like accelerometers). The data includes features such as time, accelerometer readings, and gyroscope readings.
Question: What type of AI model should the team use for this classification task?

QUESTION-ANSWERS:
1. What is a Capstone Project in the context of AI education?

2. What are the key steps in the AI Project Cycle?

3. Why is “problem definition” important in an AI project?

4. What is Design Thinking in AI problem-solving?

5. What is time series decomposition?

6. What is the main advantage of problem decomposition in computational tasks?

7. Why is data gathering essential in an AI project?

8. What is RMSE, and why is it important in AI models?

9. What is cross-validation, and how does it improve model performance evaluation?

10. What is the purpose of a recommendation model in AI?


11. What are the key components of time series data?

12. What is the goal of AI model construction?

13. What is the difference between regression and classification in AI?

14. How is data preprocessed for AI projects?

15. What is MSE, and how is it different from RMSE?

16. Why is model validation important in AI?

17. What is the purpose of using a training dataset in AI?

18. What is an anomaly detection model used for in AI?

19. Why is the business understanding stage crucial in data science projects?

20. What is the significance of model deployment in an AI project?

21. How does the “empathize” stage in Design Thinking help in AI projects?

22. What is the purpose of using a prototype in the Design Thinking process?

23. How is the concept of clustering applied in AI?

24. What is the primary objective of feature engineering in AI?

25. Why is it important to avoid overfitting in AI models?

26. What is the role of gradient descent in machine learning?

QUESTION AND ANSWERS:


1. Define Capstone Project.

2. Give examples of Capstone Projects.

3. What are the different steps that the AI Project follows?

4. Which five types of questions should be answered during Understanding the AI problem?

5. Define Design Thinking.

6. What are the five stages of Design Thinking?

7. What are the steps of Problem Decomposition?
Or
How to break down a problem into smaller units before coding?

8. Imagine that you want to create your first app. This is a complex problem. How would
you decompose the task of creating an app?

9. Explain time series decomposition.

10. What are the components of Time series decomposition?

11. What are different Analytic approaches?

12. What type of Questions can be asked for the following approaches:

13. Explain Data Gathering Phase.


OR
How Data is collected for AI Projects?

14. Explain Data Modelling.


OR
Differentiate between Predictive and Descriptive model.

15. Define training set.

16. What are the necessary things to be done for the success of Data modelling stage?

17. Differentiate between Cross-Validation and Train-Test Split


18. Explain Train-Test Split Evaluation

19. How to Configure the Train-Test Split?


20. How to choose a split percentage in Train-Test Split?

21. What are the prerequisites for Train and Test Data?

22. Explain Cross validation

23. Explain loss functions.

24. What are the categories of Loss Functions?

25. Differentiate Classification Loss and Regression Loss.

26. Study the following graph and answer the given questions:

i. What do the red dots represent?
ii. What does the blue line represent?
iii. What does line X represent?
iv. How can RMSE be calculated using the graph?
v. What is a good model based on RMSE?

27. Explain MSE.


28. When should we use mean squared error?

29. Can MSE be a negative value? Give reasons.

30. Explain RMSE
