Minimax Regret
   HOME

TheInfoList



OR:

In
decision theory Decision theory or the theory of rational choice is a branch of probability theory, probability, economics, and analytic philosophy that uses expected utility and probabilities, probability to model how individuals would behave Rationality, ratio ...
, regret aversion (or anticipated regret) describes how the human emotional response of
regret Regret is the emotion of wishing one had made a different decision in the past, because the consequences of the decision one did make were unfavorable. Regret is related to perceived opportunity. Its intensity varies over time after the decisi ...
can influence decision-making under
uncertainty Uncertainty or incertitude refers to situations involving imperfect or unknown information. It applies to predictions of future events, to physical measurements that are already made, or to the unknown, and is particularly relevant for decision ...
. When individuals make choices without complete information, they often experience regret if they later discover that a different choice would have produced a better outcome. This regret can be quantified as the difference in value between the actual decision made and what would have been the optimal decision in hindsight. Unlike traditional models that consider regret as merely a post-decision emotional response, the theory of regret aversion proposes that decision-makers actively anticipate potential future regret and incorporate this anticipation into their current decision-making process. This anticipation can lead individuals to make choices specifically designed to minimize the possibility of experiencing regret later, even if those choices are not optimal from a purely probabilistic expected-value perspective. Regret is a powerful
negative emotion In psychology, negative affectivity (NA), or negative affect, is a personality variable that involves the experience of negative emotions and poor self-concept. Negative affectivity subsumes a variety of negative emotions, including anger, contem ...
with significant social and reputational implications, playing a central role in how humans learn from experience and in the psychology of
risk aversion In economics and finance, risk aversion is the tendency of people to prefer outcomes with low uncertainty to those outcomes with high uncertainty, even if the average outcome of the latter is equal to or higher in monetary value than the more c ...
. The conscious anticipation of regret creates a
feedback loop Feedback occurs when outputs of a system are routed back as inputs as part of a chain of cause and effect that forms a circuit or loop. The system can then be said to ''feed back'' into itself. The notion of cause-and-effect has to be handle ...
that elevates regret from being simply an emotional reaction—often modeled as mere
human behavior Human behavior is the potential and expressed capacity (Energy (psychological), mentally, Physical activity, physically, and Social action, socially) of human individuals or groups to respond to internal and external Stimulation, stimuli throu ...
—into a key factor in
rational Rationality is the quality of being guided by or based on reason. In this regard, a person acts rationally if they have a good reason for what they do, or a belief is rational if it is based on strong evidence. This quality can apply to an ...
choice behavior that can be formally modeled in decision theory. This anticipatory mechanism helps explain various observed decision patterns that deviate from standard
expected utility theory The expected utility hypothesis is a foundational assumption in mathematical economics concerning decision making under uncertainty. It postulates that rational agents maximize utility, meaning the subjective desirability of their actions. Rational ...
, including
status quo bias A status quo bias or default bias is a cognitive bias which results from a preference for the maintenance of one's existing state of affairs. The current baseline (or status quo) is taken as a reference point, and any change from that baseline is p ...
, inaction inertia, and the tendency to avoid decisions that might lead to easily imagined counterfactual scenarios where a better outcome would have occurred.


Description

Regret theory is a model in
theoretical economics Economics () is a behavioral science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and interac ...
simultaneously developed in 1982 by
Graham Loomes Graham Loomes (born 5 August 1950) is a British economist and academic, specialising in behavioural economics. Since 2009, he has been Professor of Economics and Behavioural Science at the University of Warwick. He previously worked at the Univ ...
and
Robert Sugden Robert Sugden is a fictional character from the British ITV soap opera ''Emmerdale''. The character originally appeared on the show regularly between 22 April 1986 and 3 October 2005. During that time he was first played as a baby by Richard ...
, David E. Bell, and Peter C. Fishburn. Regret theory models choice under uncertainty taking into account the effect of anticipated regret. Subsequently, several other authors improved upon it. It incorporates a regret term in the
utility function In economics, utility is a measure of a certain person's satisfaction from a certain state of the world. Over time, the term has been used with at least two meanings. * In a Normative economics, normative context, utility refers to a goal or ob ...
which depends negatively on the realized outcome and positively on the best alternative outcome given the uncertainty resolution. This regret term is usually an increasing, continuous and non-negative function subtracted to the traditional utility index. These types of preferences always violate transitivity in the traditional sense, although most satisfy a weaker version. For independent lotteries and when regret is evaluated over the difference between utilities and then averaged over the all combinations of outcomes, the regret can still be transitive but for only specific form of regret functional. It is shown that only
hyperbolic sine In mathematics, hyperbolic functions are analogues of the ordinary trigonometric functions, but defined using the hyperbola rather than the circle. Just as the points form a unit circle, circle with a unit radius, the points form the right ha ...
function will maintain this property. This form of regret inherits most of desired features, such as holding right preferences in face of first order
stochastic dominance Stochastic dominance is a Partially ordered set, partial order between random variables. It is a form of stochastic ordering. The concept arises in decision theory and decision analysis in situations where one gamble (a probability distribution ov ...
, risk averseness for logarithmic utilities and the ability to explain
Allais paradox The Allais paradox is a choice problem designed by to show an inconsistency of actual observed choices with the predictions of expected utility theory. The Allais paradox demonstrates that individuals rarely make rational decisions consistently ...
. Regret aversion is not only a theoretical economics model, but a cognitive bias occurring as a decision has been made to abstain from regretting an alternative decision. To better preface, regret aversion can be seen through fear by either commission or omission; the prospect of committing to a failure or omitting an opportunity that we seek to avoid. Regret, feeling sadness or disappointment over something that has happened, can be rationalized for a certain decision, but can guide preferences and can lead people astray. This contributes to the spread of disinformation because things are not seen as one's personal responsibility.


Evidence

Several experiments over both incentivized and hypothetical choices attest to the magnitude of this effect. Experiments in
first price auction A first-price sealed-bid auction (FPSBA) is a common type of auction. It is also known as blind auction. In this type of auction, all bidders simultaneously submit sealed bids so that no bidder knows the bid of any other participant. The highest b ...
s show that by manipulating the feedback the participants expect to receive, significant differences in the average bids are observed. In particular, "Loser's regret" can be induced by revealing the winning bid to all participants in the auction, and thus revealing to the losers whether they would have been able to make a profit and how much could it have been (a participant that has a valuation of $50, bids $30 and finds out the winning bid was $35 will also learn that he or she could have earned as much as $15 by bidding anything over $35.) This in turn allows for the possibility of regret and if bidders correctly anticipate this, they would tend to bid higher than in the case where no feedback on the winning bid is provided in order to decrease the possibility of regret. In decisions over lotteries, experiments also provide supporting evidence of anticipated regret. As in the case of first price auctions, differences in feedback over the resolution of the uncertainty can cause the possibility of regret and if this is anticipated, it may induce different preferences. For example, when faced with a choice between $40 with certainty and a coin toss that pays $100 if the outcome is guessed correctly and $0 otherwise, not only does the certain payment alternative minimizes the risk but also the possibility of regret, since typically the coin will not be tossed (and thus the uncertainty not resolved) while if the coin toss is chosen, the outcome that pays $0 will induce regret. If the coin is tossed regardless of the chosen alternative, then the alternative payoff will always be known and then there is no choice that will eliminate the possibility of regret.


Anticipated regret versus experienced regret

Anticipated regret tends to be overestimated for both choices and actions over which people perceive themselves to be responsible. People are particularly likely to overestimate the regret they will feel when missing a desired outcome by a narrow margin. In one study, commuters predicted they would experience greater regret if they missed a train by 1 minute more than missing a train by 5 minutes, for example, but commuters who actually missed their train by 1 or 5 minutes experienced (equal and) lower amounts of regret. Commuters appeared to overestimate the regret they would feel when missing the train by a narrow margin, because they tended to underestimate the extent to which they would attribute missing the train to external causes (e.g., missing their wallet or spending less time in the shower).


Applications

Besides the traditional setting of choices over lotteries, regret aversion has been proposed as an explanation for the typically observed overbidding in first price auctions, and the
disposition effect The disposition effect is an anomaly discovered in behavioral finance. It relates to the tendency of investors to sell assets that have increased in value, while keeping assets that have dropped in value. Hersh Shefrin and Meir Statman identifie ...
, among others.


Minimax regret

The
minimax Minimax (sometimes Minmax, MM or saddle point) is a decision rule used in artificial intelligence, decision theory, combinatorial game theory, statistics, and philosophy for ''minimizing'' the possible loss function, loss for a Worst-case scenari ...
regret approach is to minimize the worst-case regret, originally presented by
Leonard Savage Leonard Jimmie Savage (born Leonard Ogashevitz; 1917 – 1971) was an American mathematician and statistician. Economist Milton Friedman said Savage was "one of the few people I have met whom I would unhesitatingly call a genius." Education and ...
in 1951. The aim of this is to perform as closely as possible to the optimal course. Since the minimax criterion applied here is to the regret (difference or ratio of the payoffs) rather than to the payoff itself, it is not as pessimistic as the ordinary minimax approach. Similar approaches have been used in a variety of areas such as: *
Hypothesis testing A statistical hypothesis test is a method of statistical inference used to decide whether the data provide sufficient evidence to reject a particular hypothesis. A statistical hypothesis test typically involves a calculation of a test statistic. T ...
*
Prediction A prediction (Latin ''præ-'', "before," and ''dictum'', "something said") or forecast is a statement about a future event or about future data. Predictions are often, but not always, based upon experience or knowledge of forecasters. There ...
*
Economics Economics () is a behavioral science that studies the Production (economics), production, distribution (economics), distribution, and Consumption (economics), consumption of goods and services. Economics focuses on the behaviour and interac ...
One benefit of minimax (as opposed to expected regret) is that it is independent of the probabilities of the various outcomes: thus if regret can be accurately computed, one can reliably use minimax regret. However, probabilities of outcomes are hard to estimate. This differs from the standard minimax approach in that it uses ''differences'' or ''ratios'' between outcomes, and thus requires interval or ratio measurements, as well as
ordinal measurement Level of measurement or scale of measure is a classification that describes the nature of information within the values assigned to variables. Psychologist Stanley Smith Stevens developed the best-known classification with four levels, or scale ...
s (ranking), as in standard minimax.


Example

Suppose an investor has to choose between investing in stocks, bonds or the money market, and the total return depends on what happens to interest rates. The following table shows some possible returns: The crude maximin choice based on returns would be to invest in the money market, ensuring a return of at least 1. However, if interest rates fell then the regret associated with this choice would be large. This would be 11, which is the difference between the 12 which could have been received if the outcome had been known in advance and the 1 received. A mixed portfolio of about 11.1% in stocks and 88.9% in the money market would have ensured a return of at least 2.22; but, if interest rates fell, there would be a regret of about 9.78. The regret table for this example, constructed by subtracting actual returns from best returns, is as follows: Therefore, using a minimax choice based on regret, the best course would be to invest in bonds, ensuring a regret of no worse than 5. A mixed investment portfolio would do even better: 61.1% invested in stocks, and 38.9% in the money market would produce a regret no worse than about 4.28.


Example: Linear estimation setting

What follows is an illustration of how the concept of regret can be used to design a linear
estimator In statistics, an estimator is a rule for calculating an estimate of a given quantity based on Sample (statistics), observed data: thus the rule (the estimator), the quantity of interest (the estimand) and its result (the estimate) are distinguish ...
. In this example, the problem is to construct a linear estimator of a finite-dimensional parameter vector x from its noisy linear measurement with known noise covariance structure. The loss of reconstruction of x is measured using the
mean-squared error In statistics, the mean squared error (MSE) or mean squared deviation (MSD) of an estimator (of a procedure for estimating an unobserved quantity) measures the average of the squares of the errors—that is, the average squared difference betwee ...
(MSE). The unknown parameter vector is known to lie in an
ellipsoid An ellipsoid is a surface that can be obtained from a sphere by deforming it by means of directional Scaling (geometry), scalings, or more generally, of an affine transformation. An ellipsoid is a quadric surface;  that is, a Surface (mathemat ...
E centered at zero. The regret is defined to be the difference between the MSE of the linear estimator that doesn't know the parameter x, and the MSE of the linear estimator that knows x. Also, since the estimator is restricted to be linear, the zero MSE cannot be achieved in the latter case. In this case, the solution of a convex optimization problem gives the optimal, minimax regret-minimizing linear estimator, which can be seen by the following argument. According to the assumptions, the observed vector y and the unknown deterministic parameter vector x are tied by the linear model :y=Hx+w where H is a known n \times m matrix with full column rank m, and w is a zero mean random vector with a known covariance matrix C_w. Let :\hat=Gy be a linear estimate of x from y, where G is some m \times n matrix. The MSE of this estimator is given by :MSE = E\left(, , \hat-x, , ^2\right) = Tr(GC_wG^*) + x^*(I-GH)^*(I-GH)x. Since the MSE depends explicitly on x it cannot be minimized directly. Instead, the concept of regret can be used in order to define a linear estimator with good MSE performance. To define the regret here, consider a linear estimator that knows the value of the parameter x, i.e., the matrix G can explicitly depend on x: :\hat^o=G(x)y. The MSE of \hat^o is :MSE^o=E\left(, , \hat^o-x, , ^2\right) = Tr(G(x)C_wG(x)^*) + x^*(I-G(x)H)^*(I-G(x)H)x. To find the optimal G(x), MSE^o is differentiated with respect to G and the derivative is equated to 0 getting :G(x)=xx^*H^*(C_w+Hxx^*H^*)^. Then, using the Matrix Inversion Lemma :G(x)=\fracxx^*H^*C_w^. Substituting this G(x) back into MSE^o, one gets :MSE^o=\frac. This is the smallest MSE achievable with a linear estimate that knows x. In practice this MSE cannot be achieved, but it serves as a bound on the optimal MSE. The regret of using the linear estimator specified by G is equal to :R(x,G)=MSE-MSE^o=Tr(GC_wG^*) + x^*(I-GH)^*(I-GH)x-\frac. The minimax regret approach here is to minimize the worst-case regret, i.e., \sup_ R(x,G). This will allow a performance as close as possible to the best achievable performance in the worst case of the parameter x. Although this problem appears difficult, it is an instance of
convex optimization Convex optimization is a subfield of mathematical optimization that studies the problem of minimizing convex functions over convex sets (or, equivalently, maximizing concave functions over convex sets). Many classes of convex optimization problems ...
and in particular a numerical solution can be efficiently calculated. Similar ideas can be used when x is random with uncertainty in the
covariance matrix In probability theory and statistics, a covariance matrix (also known as auto-covariance matrix, dispersion matrix, variance matrix, or variance–covariance matrix) is a square matrix giving the covariance between each pair of elements of ...
.


Regret in principal-agent problems

Camara, Hartline and Johnsen study principal-agent problems. These are incomplete-information games between two players called ''Principal'' and ''Agent'', whose payoffs depend on a state of nature known only by the Agent. The Principal commits to a policy, then the agent responds, and then the state of nature is revealed. They assume that the principal and agent interact repeatedly, and may learn over time from the state history, using
reinforcement learning Reinforcement learning (RL) is an interdisciplinary area of machine learning and optimal control concerned with how an intelligent agent should take actions in a dynamic environment in order to maximize a reward signal. Reinforcement learnin ...
. They assume that the agent is driven by regret-aversion. In particular, the agent minimizes his ''counterfactual internal regret''. Based on this assumption, they develop mechanisms that minimize the principal's regret. Collina, Roth and Shao improve their mechanism both in running-time and in the bounds for regret (as a function of the number of distinct states of nature).


See also

*
Regret-free mechanism In mechanism design, a regret-free truth-telling mechanism (RFTT, or regret-free mechanism for short) is a mechanism in which each player who reveals his true private information does not feel regret after seeing the mechanism outcome. A regret-free ...
* Competitive regret *
Decision theory Decision theory or the theory of rational choice is a branch of probability theory, probability, economics, and analytic philosophy that uses expected utility and probabilities, probability to model how individuals would behave Rationality, ratio ...
*
Info-gap decision theory Info-gap decision theory seeks to optimize robustness to failure under severe uncertainty,Yakov Ben-Haim, ''Information-Gap Theory: Decisions Under Severe Uncertainty,'' Academic Press, London, 2001.Yakov Ben-Haim, ''Info-Gap Theory: Decisions Und ...
*
Loss function In mathematical optimization and decision theory, a loss function or cost function (sometimes also called an error function) is a function that maps an event or values of one or more variables onto a real number intuitively representing some "cost ...
*
Minimax Minimax (sometimes Minmax, MM or saddle point) is a decision rule used in artificial intelligence, decision theory, combinatorial game theory, statistics, and philosophy for ''minimizing'' the possible loss function, loss for a Worst-case scenari ...
*
Swap regret Swap regret is a concept from online learning and game theory. It is a generalization of regret in a repeated, ''n''-decision game. Definition In each round t, the learner chooses decision i with probability x^t_i and the utility for decision i i ...
* Wald's maximin model


References


External links

* {{cite web, url=http://philosophy.hku.hk/think/strategy/decision.php, title=TUTORIAL G05: Decision theory, archive-url=https://web.archive.org/web/20150703104008/http://philosophy.hku.hk/think/strategy/decision.php, archive-date=3 July 2015 Choice modelling Optimal decisions Decision theory