Testing in Data Science
Testing in Data Science
You can still do the property checks on the output data. Feature Forge is
specifically used in ML.
Then there are the Metrics we talked about in class yesterday that are used to
check the quality of the model.
In our specific problem, we could use the Hypothesis library to get a random
Dataframe to pass through our function and check if any rows still have
correlation more than a certain number. Since the data is random but
parameters can be defined, we can get exactly the kind of test we want.
I'll write a test for this later. I'll share the code once it works.
Hope this helps.