Your model seems to be sensitive to small perturbations in the input data. These perturbations can include adding typos, changing word order, or turning text into uppercase or lowercase.
To learn more about causes and solutions, check our guide on robustness issues.
Feature `text` | Add typos | Fail rate = 0.130 | 104/800 tested samples (13.0%) changed prediction after perturbation | 800 samples affected (91.7% of dataset) |
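To make the fail rate above concrete, here is a minimal sketch of how such a metric could be computed: perturb each sample, compare the prediction before and after, and divide the number of changed predictions by the number of samples tested. The names `add_typo`, `fail_rate`, and `toy_predict` are hypothetical and chosen for illustration; this is not the scanner's actual implementation, and the toy classifier only stands in for a real model.

```python
import random


def add_typo(text, rng):
    """Swap two adjacent characters at a random position (one simple kind of typo)."""
    if len(text) < 2:
        return text
    i = rng.randrange(len(text) - 1)
    return text[:i] + text[i + 1] + text[i] + text[i + 2:]


def fail_rate(predict, texts, perturb):
    """Return (changed, tested, rate) comparing predictions before and after perturbation."""
    tested = 0
    changed = 0
    for original in texts:
        perturbed = perturb(original)
        if perturbed == original:
            continue  # the perturbation had no effect, so the sample is not counted
        tested += 1
        if predict(perturbed) != predict(original):
            changed += 1
    rate = changed / tested if tested else 0.0
    return changed, tested, rate


def toy_predict(text):
    """Hypothetical, deliberately brittle classifier: case-sensitive keyword match."""
    return "positive" if "good" in text else "negative"


if __name__ == "__main__":
    texts = ["good movie", "bad movie", "really good", "GOOD"]

    # Upper-casing is deterministic: 2 of the 3 affected samples flip their label.
    print(fail_rate(toy_predict, texts, str.upper))

    # Typos are random, so a seeded generator keeps the run reproducible.
    rng = random.Random(0)
    print(fail_rate(toy_predict, texts, lambda t: add_typo(t, rng)))

    # The reported figure follows the same arithmetic: 104 changed out of 800 tested.
    print(f"{104 / 800:.3f}")
```

Samples that the perturbation leaves unchanged are skipped in this sketch, which is one plausible reason the number of affected samples can be smaller than the full dataset.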
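# `results` is assumed to be the scan report object that produced the findings above.
# Turn the detected issues into a reusable test suite, then execute it.
test_suite = results.generate_test_suite("My first test suite")
test_suite.run()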