5.2 Data Visualization II
5.2 Data Visualization II
a. A map of all 34
walking tracks
b. A color legend
for track length
Summary
• How to make a treemap
• How to create a map in Tableau
• How to make dashboard and story
• Flow maps & spider maps
• Customized maps
Summary
Text, CSV, JSON
Regular Expression
Managing NumPy
Data
Pandas
StatsModels
Tableau
Data
Visualization Matplotlib
More Powerful Libraries
• SciKit-Learn: a Python library for machine learning based on
NumPy, SciPy, and matplotlib.
http://scikit-learn.org
• Seaborn: a Python library for making statistical graphics, it is
based on matplotlib, works great with pandas’ data
structures.
https://seaborn.pydata.org/
• Natural Language Toolkit (NLTK): a Python library to process
natural language. Good for sentiment analysis.
https://www.nltk.org/
Field of application Library Name (ranked by Github) Type
Mathematics & Engineering 1. NumPy Data wrangling
Mathematics & Engineering 2. SciPy Data wrangling
Data Manipulation & Analysis 3. pandas Data wrangling
Mathematics & Engineering 4. StatsModels Statistics
Visualization 5. matplotlib Visualization
Visualization 6. seaborn Visualization
Visualization 7. plotly Visualization
Visualization 8. bokeh Visualization
Visualization 9. pydot Visualization
Machine Learning 10. scikit-learn Machine learning
Machine Learning 11. XGBoost / LightGBM / CatBoost Machine learning
Machine Learning 12. eli5 Machine learning
Machine Learning 13. TensorFlow Deep learning
Machine Learning 14. PyTorch Deep learning
Machine Learning 15. Keras Deep learning
Machine Learning 16. dist-keras / elephas / spark-deep-learning Distributed deep learning
Mathematics & Engineering 17. NLTK NLP
Mathematics & Engineering 18. gensim NLP
Mathematics & Engineering 19. spaCy NLP
Data Manipulation & Analysis 20. scrapy Data scraping
The end.