Optimize parameters¶
The optimization process calculates the best confidence thresholds per class based on a model, a dataset with annotations, and a target unknown rate.
Predictions with a confidence below this threshold will get the "unknown" class.
Purpose¶
- Labeling: Use these thresholds on unlabeled data to identify samples that are good candidates for labeling and training because the model is uncertain about them. You can use predictive labeling in the label center or tag samples in the test center to generate these unknown predictions.
- Unknown rate monitoring: Track how the unknown rate changes over time with inference monitoring. If the rate diverges from the initial target, your training data may no longer represent the inference data, indicating the need for model retraining.
Tip
Confidence does not equal correctness. "Unknown" does not mean "probably wrong." Models are often overconfident.
To generate class thresholds¶
- Open an AI-ADC EfficientNet project.
- Go to the Optimization center, and then click Set up optimization.
- In Datasets, select one or more datasets as ground truth.
- In Models, select the model for generating predictions.
-
Configure optimization parameters:
- Target unknown ratio: Desired percentage of "unknown" classifications.
- Tolerance: Allowable deviation from the target unknown ratio.
- Trials: Number of optimization attempts.
-
Click Run optimization.
Output¶
- The optimization generates confidence threshold parameters for each class.
- These thresholds can be used in predictive labeling, tests, and inference.