A history is a sequence of pairs. A policy is a map from histories to . Given history , let be the set of that are consistent with . Define to be the number of tests run under policy when the state is .
Extensions
- Feedback -- for example, could be itself.
- Decision space -- for example, could choose weights to apply.
- Error tolerance -- cost of incorrect label.
- Noise -- could incorporate false positives or negatives.