Data and Its Types
1. Structured Data
• Definition: Data organized in a predefined format, such as tables with rows and
columns.
• Examples:
o Sales records in a database.
o Customer information in a CRM system.
• Characteristics:
o Easy to store, process, and analyze.
o Typically stored in relational databases (e.g., MySQL, PostgreSQL) and queried with SQL.
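To make the "rows and columns" idea concrete, here is a minimal sketch using Python's built-in sqlite3 module. The table name sales, its two columns, and the sample figures are assumptions made up for illustration; they do not come from any real system.

```python
import sqlite3


def total_sales_by_region(rows):
    """Load (region, amount) rows into an in-memory table and sum them per region."""
    conn = sqlite3.connect(":memory:")  # temporary database, nothing is written to disk
    try:
        # Structured data: every row must fit this fixed schema.
        conn.execute("CREATE TABLE sales (region TEXT NOT NULL, amount REAL NOT NULL)")
        conn.executemany("INSERT INTO sales (region, amount) VALUES (?, ?)", rows)
        cursor = conn.execute(
            "SELECT region, SUM(amount) FROM sales GROUP BY region ORDER BY region"
        )
        return cursor.fetchall()
    finally:
        conn.close()


# Hypothetical sales records used only for this example.
sample_rows = [("North", 1200.0), ("South", 800.0), ("North", 300.0)]
print(total_sales_by_region(sample_rows))
```

Because every record has the same fields, a single SQL query can summarize the whole table, which is why structured data is described as easy to process and analyze.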
2. Unstructured Data
3. Semi-Structured Data
• Definition: Data that has some structure but does not fit into a rigid schema.
• Examples:
o JSON or XML files.
o Emails with metadata (e.g., sender, subject, date).
• Characteristics:
o Combines elements of structured and unstructured data.
o Needs a parser (e.g., a JSON or XML library) to extract fields before it can be analyzed as a table.
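As a rough illustration of processing semi-structured data, the sketch below parses a small JSON document with Python's built-in json module and flattens it into rows with fixed columns. The customer records and the field names (id, name, contact, tags) are invented for this example.

```python
import json

# Hypothetical customer records: the second one has no "contact" field,
# and only the second one has "tags". This is typical of semi-structured data.
raw = """
[
  {"id": 1, "name": "Asha", "contact": {"email": "asha@example.com"}},
  {"id": 2, "name": "Ben", "tags": ["vip"]}
]
"""


def flatten_customers(json_text):
    """Turn semi-structured customer records into rows with the same four columns."""
    rows = []
    for record in json.loads(json_text):
        rows.append({
            "id": record["id"],
            "name": record["name"],
            # Optional nested field: becomes None when it is missing.
            "email": record.get("contact", {}).get("email"),
            # Optional list field: joined into one comma-separated string.
            "tags": ",".join(record.get("tags", [])),
        })
    return rows


rows = flatten_customers(raw)
for row in rows:
    print(row)
```

The records carry their own field names, which gives them some structure, but missing or extra fields have to be handled explicitly before the data fits a table.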
4. Quantitative Data
5. Qualitative Data
6. Time-Series Data
7. Cross-Sectional Data
2. External Sources
3. Primary Data
4. Secondary Data
2. Web Scraping
3. APIs
• Use Case: Accessing data from third-party platforms (e.g., Twitter, Google Maps).
• Tools: Postman, Python (requests library).
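A minimal sketch of collecting data through an API with the requests library is shown below. The function name fetch_json and the http_get parameter are assumptions introduced for illustration, and any URL or query parameters passed in would depend on the actual provider; real platforms usually also require an API key or token, which is not shown here.

```python
from typing import Any, Callable, Optional


def fetch_json(
    url: str,
    params: dict,
    http_get: Optional[Callable[..., Any]] = None,
    timeout: float = 10.0,
) -> Any:
    """Send a GET request with query parameters and return the decoded JSON body."""
    if http_get is None:
        # requests is a third-party package (pip install requests); it is imported
        # here so the sketch can still be loaded on a machine without it.
        import requests

        http_get = requests.get
    response = http_get(url, params=params, timeout=timeout)
    response.raise_for_status()  # raise an error for 4xx/5xx responses
    return response.json()
```

A call would look like fetch_json("https://api.example.com/v1/places", {"city": "Pune"}), where the URL and the parameter are placeholders rather than a real endpoint. The response is typically semi-structured JSON, so it usually needs flattening before analysis.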
5. Transactional Data
Example 2: Healthcare
• Data Types: Structured (patient records), time-series (vital signs over time).
• Sources: Internal (hospital databases), external (government health reports).
Example 3: Finance
8. Key Takeaways
1. Data Types: Structured, unstructured, semi-structured, quantitative, qualitative, time-series,
and cross-sectional.
2. Data Sources: Internal (e.g., sales data), external (e.g., social media), primary (e.g., surveys),
and secondary (e.g., industry reports).
3. Data Collection Methods: Surveys, web scraping, APIs, sensors, and transactional systems.
4. Challenges: Data quality, privacy, integration, and scalability.
5. Best Practices: Define objectives, ensure data quality, use the right tools, protect privacy, and
document the process.
By understanding data types and sources, BBA students can effectively collect, manage, and analyze
data to drive business decisions.