0A057 Advanced Data Preparation Using IBM SPSS Modeler CourseDesc
Topics Covered
• Using functions to cleanse and enrich data
• Using additional field transformations
• Working with sequence data
• Sampling, partitioning and balancing data
• Improving efficiency
Intended Audience
• This advanced course is intended for anyone who wishes to become familiar with the full range of techniques
available in IBM SPSS Modeler for data manipulation.
Prerequisites
• General computer literacy
• Experience using IBM SPSS Modeler including familiarity with the Modeler environment, creating streams,
reading data files, exploring data, setting the unit of analysis, combining datasets, deriving and reclassifying
fields, and basic knowledge of modeling.
• Prior completion of Introduction to IBM SPSS Modeler and Data Mining (v18) is recommended.
IBM Analytics
Advanced Data Preparation Using IBM SPSS Modeler (v18)
1: Using functions to cleanse and enrich data
• Use date functions
• Use conversion functions
• Use string functions
• Use statistical functions
• Use missing value functions
2: Using additional field transformations
• Replace values with the Filler node
• Recode continuous fields with the Binning node
• Change a field’s distribution with the Transform
4: Sampling, partitioning and balancing data
• Draw simple and complex samples with the Sample node
• Create a training set and testing set with the Partition node
• Reduce or boost the number of records with the Balance node
5: Improving efficiency
• Use database scalability by SQL pushback
• Process outliers and missing values with the Data