Swap regret is a concept from online learning and game theory. It is a generalization of regret in a repeated, n-decision game.
Definition
editIn each round , the learner chooses decision with probability and the utility for decision is . A learner's swap-regret is defined to be the following:
Intuitively, it is how much a player could improve by switching each occurrence of decision i to the best decision j possible in hindsight. The swap regret is always nonnegative. Swap regret is useful for computing correlated equilibria.
References
edit- Blum, Avrim; Mansour, Yishay (2007), "From external to internal regret" (PDF), Journal of Machine Learning Research, 8: 1307–1324, MR 2332433.