Swap regret is a concept from online learning and game theory. It is a generalization of regret in a repeated, n-decision game.

Definition

edit

In each round  , the learner chooses decision   with probability   and the utility for decision   is  . A learner's swap-regret is defined to be the following:

 

Intuitively, it is how much a player could improve by switching each occurrence of decision i to the best decision j possible in hindsight. The swap regret is always nonnegative. Swap regret is useful for computing correlated equilibria.

References

edit
  • Blum, Avrim; Mansour, Yishay (2007), "From external to internal regret" (PDF), Journal of Machine Learning Research, 8: 1307–1324, MR 2332433.