Resume Sonaika Pati D (010824) - 4-1
Resume Sonaika Pati D (010824) - 4-1
PROJECTS
Customer Service Automation (Natural Language Processing)
• Used NLTK library with the core natural language processing to develop and implement novel techniques for sentiment analysis
on product reviews, dialogue state tracking in chatbots, customer service automation, etc.
• Categorized comments into positive & negative clusters from different social networking sites using Sentiment & Text Analytics
• Ensured model has low False Positive Rate & Text classification, sentiment analysis for unstructured and semi- structured data.
• Created and designed reports by using gathered metrics to infer and draw logical conclusions from past and future behavior.
Autonomous Tagging of Stack Overflow Questions
• ML SVM Classifier for predicting tags using Scikit-Learn classifier to correctly predict tags of Stack Overflow questions.
• Used token-based feature engineering techniques along with TF-IDF vectorizer with multi-label classification as each question
can have 1-5 tags.
Stock Price Prediction
• Leveraged ARIMA model to forecast the stock price trend with MAPE of around 2.5% in predicting the next 15 observations.
• Utilized cross-validation to avoid the look-ahead bias. Trained multiple machine learning models and then combined them
using ensemble learning to produce higher prediction accuracy.
COURSES/CERTIFICATIONS
• Machine Learning Fundamentals by Andrew Ng • Microsoft Certified: Analyzing Data with MS Power BI
• Data Science & Business Analytics • Probability & Statistics
• Deep Learning for computer vision with TensorFlow • PyTorch for Deep Learning
• Data Analyst Nanodegree by Udacity • Python for Data Science
• AWS Certified Machine Learning Specialty • Complete Neural Network Bootcamp
SKILLSETS
• Programming Languages: Python, SQL
• Tools: Tableau, Power BI, Jupyter Notebook, Visual Studio Code, AWS, GitHub, MS Office, Confluence, Jira, MS Excel
• Databases: SQL Server, MySQL, MongoDB, GCP Big Query
• Libraries: NumPy, Pandas, Matplotlib, Seaborn, Scikit-Learn, NLTK, Keras, ARIMA, SpaCy, TensorFlow, SciPy, OpenCV, PyTorch
• Skills: Big Data, EDA, Data Analysis, Data Visualization, Data Mining, Sklearn, Natural Language Processing, BERT, Transformer,
Time series, Deep Learning, MLOps, Business Analytics, Supervised or Unsupervised Machine Learning Algorithms, Regression,
Classification, One hot encoding, Clustering, Generative AI, LLM, LangChain, LlamaIndex, ChromaDB, GPT-3, vector database
Gemini Flash
• Deployment: Docker, AWS Sage maker, MLOps, AWS S3, EC2, Streamlit, Flask, Pickle
EDUCATION
Program Institution CGPA/% Completion Year
Bachelors (BBA) in Information Technology Utkal University 9.3 2022
Class XII [HSC Examination] KMBB Junior Science College 7.4 2018
Class X [SSC Examination] SSVM Keonjhar 8.6 2016