This document discusses representing term frequency (TF) and TF-IDF transformations in the Predictive Model Markup Language (PMML). It provides details on encoding TF, TF-IDF, and text indexing in PMML, including defining a centralized TF-IDF function and invoking it for multiple documents. It also covers techniques for string normalization, tokenization, and counting terms during text transformations. Finally, it discusses ensuring interoperability between PMML and scikit-learn for text feature extraction.