Machine Learning QnA Session @ DDU Jan 2025
Machine Learning QnA Session @ DDU Jan 2025
Vyom
Muktan
● Software intern @ Atliq
● Research work with BSB sir in Speech recognition
● Data science internship @ ISRO, worked on precipitation data analysis
● Kaggle Silver medal
● Open source contributions in Pandas, Pytorch Ignite and Hugging face
● MS in CS @ UTD
● ML internship @ Apple, worked on speech recognition
● Applied Scientist internship @ Amazon, worked on Reinforcement Learning based
recommender systems, published a paper at internal conference (AMLC '23)
● ML internship @ Finesse (fashion-tech startup), worked on scraping and ranking of
trending fashion outfit posts
● MLE @ Apple, working on Apple Speech Recognition team
Key Learning
Reachout:
(with questions in the linkedin requests so we can connect faster)
Vyom - https://www.linkedin.com/in/01-vyom/
Muktan - https://www.linkedin.com/in/muktan-patel/
Open discussion
Mods can add questions and answers here so we can have these qna session results for
posterity.
Answer:
1. start with ML course by Andrew NG and get basics of ML clear (take your time to complete it)
2. get hands-on experience by working on kaggle competitions, start with easier one
3. Select one modality (text, image, tabular data) and work on live kaggle competition to get
more experience
IMP: Try to work on you DSA basics in 1st and half of 2nd year of your university then
from 4th semester start with above steps
Q) How can I get started with computer vision after completing foundational courses in
Machine Learning and Deep Learning? Please share resources also if you know for it.
Answer)
● Exploring problems to solve in Computer Vision
○ https://www.kaggle.com/competitions/global-wheat-detection this can be a good
starting point to learn basics
○ Picking - Kaggle, Reading papers, talking to people working in CV research space
(reaching out to people)
○ https://www.coursera.org/specializations/deep-learning?utm_medium=sem&utm
_source=gg&utm_campaign=B2C_NAMER_deep-learning_deeplearning-ai_FTCOF_
specializations_pmax-nonNRL-within-14d&campaignid=20131140422&adgroupid
=6467332841&device=c&keyword=&matchtype=&network=x&devicemodel=&ad
position=&creativeid=6467332841&hide_mobile_promo&gad_source=1&gclid=Cj
0KCQiAst67BhCEARIsAKKdWOmLAnGsvlXuZd_qupqgBWGAhBXybxphibNxzOhP6
LY7iXqDOGuz1A0aAvZJEALw_wcB
○ Augmentation
○ Image types
○ Understanding the “Attention is all you need” but for Vision
■ Previous works from the below paper
■ [1512.03385] Deep Residual Learning for Image Recognition
○ Good ideas now -
■ Merging with Language Models - CLIP: Connecting text and images |
OpenAI
■ Image Generation
■ Image Understanding
■ Video Gen/Under - Veo 2 - Google DeepMind
● AI as a security breach for the data privacy issues
○ Opt Outs for all softwares
○ Opt Outs for your internet data (website, etc…) - robots.txt
○ https://www.lesswrong.com/posts/SGDjWC9NWxXWmkL86/keeping-content-out-
of-llm-training-datasets