Data Science, CV and NLP ai
Data Science, CV and NLP ai
A multidisciplinary field combining statistics, mathematics, and computer science to extract insights from data.
Computer Vision
A subfield of AI focused on enabling computers to interpret and process visual data like images and videos.
- Key Techniques:
- Image Classification
- Object Detection
- Image Segmentation
2. Classification + Localization:
- Not only classifies the image but also provides the spatial location of the classified object using bounding boxes.
3. Object Detection:
- Provides bounding boxes for each detected object (e.g., cars, people).
4. Instance Segmentation:
- Goes beyond object detection by outlining the exact shape of each detected object.
- Learning Complex Patterns: ML models, especially deep learning, can detect intricate visual features.
- Continuous Improvement: With more data, models adapt to handle edge cases like overlapping objects or low-
light images.
NLP enables computers to understand, interpret, and respond to human language in text or speech.
- Key Processes: Sentence segmentation, tokenization, parsing, sentiment analysis, and machine translation.
Applications:
3. Stemming: Reducing words to their root forms by removing affixes (e.g., "running" → "run").
4. Lemmatization: Transforming words to their dictionary base form considering context (e.g., "better" → "good").
Challenges in NLP:
3. Language Variability: Handling diverse languages, dialects, and informal text formats.
- Specific Challenge:
- Contextual Understanding: Sarcasm or figurative speech, such as "That’s just great!" in a negative scenario, is
hard for NLP to process accurately, leading to incorrect sentiment analysis.
- Purpose: Converts images of printed or handwritten text into editable and searchable formats.
- Applications: Digitizing books, automating data entry, and improving accessibility for visually impaired users.
- Semantic Understanding: Word embeddings map words into a continuous vector space, capturing meanings and
relationships (e.g., "king" - "man" + "woman" ≈ "queen").
- Context Representation: Helps models understand similarity and nuance between words.
3. Bias: Datasets may reflect societal biases, leading to skewed model predictions.