(241H0060131) Assignment 2
ROLL NO :- 241H0060131
ID NO :- WBBBA24148
COURSE :- BBA 1st year (batch - b)
SUBMITTED BY :- MOUNISHA JAIN
SUBMITTED TO :- POOJA GUPTA MAM
Data Science
1. Python Libraries:
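o Pandas: Provides DataFrame and Series structures for cleaning,
transforming, and analyzing tabular data.
o Scikit-learn: Widely used for machine learning tasks such as
classification, regression, and clustering.
o Seaborn: Built on Matplotlib; great for statistical data
visualization such as heatmaps and distribution plots.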
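As a small illustration of how two of these libraries can work
together, the sketch below loads a tiny table with Pandas and fits a
straight line to it with Scikit-learn. The column names (ad_spend,
sales) and all the numbers are made-up assumptions used only for
demonstration; they are not real data.

```python
import pandas as pd
from sklearn.linear_model import LinearRegression

# Hypothetical data (made-up numbers): advertising spend and sales.
df = pd.DataFrame({
    "ad_spend": [10, 20, 30, 40, 50],
    "sales": [25, 45, 65, 85, 105],
})

# Pandas: summarize a column of the table.
mean_sales = df["sales"].mean()

# Scikit-learn: fit sales = slope * ad_spend + intercept.
model = LinearRegression()
model.fit(df[["ad_spend"]], df["sales"])

slope = model.coef_[0]
intercept = model.intercept_

# Predict sales for a new, unseen spend value.
new_data = pd.DataFrame({"ad_spend": [60]})
predicted = model.predict(new_data)[0]

print("Average sales:", round(mean_sales, 2))
print("Slope:", round(slope, 2), "Intercept:", round(intercept, 2))
print("Predicted sales at spend 60:", round(predicted, 2))
```

Seaborn is left out of this sketch because it needs a plotting
backend; the same DataFrame could be passed to one of its plotting
functions to draw the relationship.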
2. Machine Learning Platforms:
o TensorFlow: Open-source framework developed by Google, popular for
building and deploying machine learning models.
o PyTorch: Open-source framework developed by Meta, known for its
flexibility and ease of use in research and production.
3. Data Processing and Analysis:
o Apache Spark: Open-source engine used for distributed big data
processing.
o Databricks: Provides a unified analytics platform built around
Apache Spark.
4. Data Visualization:
o Tableau: Renowned for its interactive dashboards and powerful data
visualization capabilities.
o Power BI: Microsoft's business intelligence tool; integrates well
with Microsoft products and offers robust visualization tools.
5. Cloud-Based Tools:
o Google BigQuery: Serverless data warehouse, excellent for
large-scale data analysis using SQL.
o Azure Synapse: Microsoft service that integrates big data analytics
and data warehousing.
Big Data: The proliferation of big data continues to shape the data
science landscape, with organizations leveraging large volumes of data
from diverse sources to gain insights and make informed decisions.
Continuous Learning and Upskilling: Given the rapid evolution of data science
technologies and methodologies, professionals in the field are
increasingly focused on continuous learning and upskilling to stay
abreast of the latest developments.
Skills needed in data science :-