Time Discounting and Time Preference: A Critical Review

Journal of Economic Literature, Vol. XL (June 2002), pp. 351–401
Frederick, Loewenstein, and O’Donoghue: Time Discounting

SHANE FREDERICK, GEORGE LOEWENSTEIN, and TED O’DONOGHUE 1

1. Introduction

Intertemporal choices, decisions involving tradeoffs among costs and benefits occurring at different times, are important and ubiquitous. Such decisions not only affect one’s health, wealth, and happiness, but may also, as Adam Smith first recognized, determine the economic prosperity of nations. In this paper, we review empirical research on intertemporal choice, and present an overview of recent theoretical formulations that incorporate insights gained from this research.

Economists’ attention to intertemporal choice began early in the history of the discipline. Not long after Adam Smith called attention to the importance of intertemporal choice for the ...

1 Frederick: Sloan School of Management, Massachusetts Institute of Technology. Loewenstein: Department of Social and Decision Sciences, Carnegie Mellon University. O’Donoghue: Department of Economics, Cornell University. We thank Colin Camerer, David Laibson, John McMillan, Drazen Prelec, Daniel Sicherman, Duncan Simester, and anonymous referees for useful comments. We thank Cara Barber, Rosa Blackwood, Mandar Oak, and Rosa Stipanovic for research assistance. For financial support, Frederick and Loewenstein thank the Integrated Study of the Human Dimensions of Global Change at Carnegie Mellon University (NSF Grant SBR-9521914), and O’Donoghue thanks the National Science Foundation (SES-0078796).

... Samuelson’s reservations about the descriptive validity of the DU model were justified. Section 4 reviews the growing list of “DU anomalies”: patterns of choice that are inconsistent with the model’s theoretical predictions. Virtually every assumption underlying the DU model has been tested and found to be descriptively invalid in at least some situations. Moreover, as we discuss at the end of the section, these anomalies are not anomalies in the sense that they are regarded as errors by the people who commit them.
Unlike many of the better-known expected-utility anomalies, the DU anomalies do not necessarily violate any standard or principle that people believe they should uphold.

The insights about intertemporal choice gleaned from this empirical research have led to the proposal of numerous alternative theoretical models, which we review in section 5. Some of these modify the discount function, permitting, for example, declining discount rates or “hyperbolic discounting.” Others introduce additional arguments into the utility function, such as the utility of anticipation. Still others depart from the DU model more radically, by including, for instance, systematic mispredictions of future utility. Many of these new theories revive psychological considerations discussed by Rae and other early economists that were extinguished with the adoption of the DU model and its expression of intertemporal preferences in terms of a single parameter.

In section 6, we review attempts to estimate discount rates. While the DU model assumes that people are characterized by a single discount rate, this literature reveals spectacular variation across (and even within) studies. The failure of this research to converge toward any agreed-upon average discount ...

... not as a result of empirical research demonstrating its validity.

Intertemporal choice became firmly established as a distinct topic with John Rae’s publication of The Sociological Theory of Capital. Like Adam Smith, Rae sought to determine why wealth differed among nations. Smith had argued that national wealth was determined by the amount of labor allocated to the production of capital, but Rae recognized that this account was incomplete because it failed to explain the determinants of this allocation. In Rae’s view, the missing element was “the effective desire of accumulation,” a psychological factor that differed across countries and determined a society’s level of saving and investment.
Along with inventing the topic of intertemporal choice, Rae also produced the first in-depth discussion of the psychological motives underlying intertemporal choice. Rae believed that intertemporal-choice behavior was the joint product of factors that either promoted or limited the effective desire of accumulation. The two main factors that promoted the effective desire of accumulation were the bequest motive (“the prevalence throughout the society of the social and benevolent affections,” p. 58) and the propensity to exercise self-restraint (“the extent of the intellectual powers, and the consequent prevalence of habits of reflection, and prudence, in the minds of the members of society,” p. 58). One limiting factor was the uncertainty of human life:

When engaged in safe occupations, in healthy countries, men are much more apt to be frugal, than in unhealthy, or hazardous occupations, and in climates pernicious to human life. Sailors and soldiers are prodigals. In the West Indies, New Orleans, ... Indies, the expenditure of the ... profuse. The same people, coming to reside in the healthy parts of Europe, and not get...

The anticipatory-utility and abstinence perspectives share the idea that intertemporal tradeoffs depend on immediate feelings: in one case, the immediate pleasure of anticipation, and in the other, the immediate discomfort of self-denial. The two perspectives, however, explain variability in intertemporal-choice behavior in different ways. The anticipatory-utility perspective attributes variations in intertemporal-choice behavior to differences in people’s abilities to imagine the future and to differences in situations that promote or inhibit such mental images. The abstinence perspective, on the other hand, explains variations in intertemporal-choice behavior on the basis of individual and situational differences in the psychological discomfort associated with self-denial.
In this view, one should observe high rates of time discounting by people who find it painful to delay gratification, and in situations in which deferral is generally painful (e.g., when one is, as Rae worded it, in the “actual presence of the immediate object of desire”).

Eugen von Böhm-Bawerk, the next major figure in the development of the economic perspective on intertemporal choice, added a new motive to the list proposed by Rae, Jevons, and Senior, arguing that humans suffer from a systematic tendency to underestimate future wants:

It may be that we possess inadequate ... to imagine and to abstract, ... willing to put forth the necessary effort, but in any event we limn a more or less incomplete picture of our future wants and especially of the remotely distant ones. And then there are all those wants ... come to mind at all. (Böhm-Bawerk 268–69) 2

2 In a frequently cited passage from The Economics of Welfare, Arthur Pigou (1920) proposed a similar account of time preference, suggesting that it results from a type of ...

... rate of substitution on the diagonal, where consumption is equal in both periods.

Fisher’s writings, like those of his predecessors, included extensive discussions of the psychological determinants of time preference. Like Böhm-Bawerk, he differentiated “objective factors,” such as projected future wealth and risk, from “personal factors.” Fisher’s list of personal factors included the four described by Rae, “foresight” (the ability to imagine future wants, the inverse of the deficit that Böhm-Bawerk postulated), and “fashion,” which Fisher believed to be “of vast importance... in its influence both on the rate of interest and on the distribution of wealth itself” (Fisher 1930, p. 88):

The most fitful of the causes at work is probably fashion. This at the present time acts, on the one hand, to stimulate men to save and become millionaires, and, on the other hand, to stimulate millionaires to live in an ostentatious manner. (Fisher 1930, p.
87)

Hence, in the early part of the twentieth century, “time preference” was viewed as an amalgamation of various intertemporal motives. While the DU model condenses these motives into the discount rate, we will argue that resurrecting these distinct motives is crucial for understanding intertemporal choices.

3. The Discounted Utility Model

In 1937, Paul Samuelson introduced the DU model in a five-page article titled “A Note on Measurement of Utility.” Samuelson’s paper was intended to offer a generalized model of intertemporal choice that was applicable to multiple time periods (Fisher’s graphical indifference-curve analysis was difficult to extend to more than two time periods) and to make the point that representing intertemporal tradeoffs required a cardinal measure of utility. But ...

... reservations, the simplicity and elegance of this formulation was irresistible, and the DU model was rapidly adopted as the framework of choice for analyzing intertemporal decisions.

The DU model received a scarcely needed further boost to its dominance as the standard model of intertemporal choice when Tjalling C. Koopmans (1960) showed that the model could be derived from a superficially plausible set of axioms. Koopmans, like Samuelson, did not argue that the DU model was psychologically or normatively plausible; his goal was only to show that under some well-specified (though arguably unrealistic) circumstances, individuals were logically compelled to possess positive time preference. Producers of a product, however, cannot dictate how the product will be used, and Koopmans’ central technical message was largely lost while his axiomatization of the DU model helped to cement its popularity and bolster its perceived legitimacy.

In the remainder of this section, we describe some important features of the DU model as it is commonly used by economists, and briefly comment on the normative and positive validity of these assumptions.
These features do not represent an axiom system (they are neither necessary nor sufficient conditions for the DU model) but are intended to highlight the implicit psychological assumptions underlying the model. 4

3.1 Integration of New Alternatives with Existing Plans

A central assumption in most models of intertemporal choice, including the DU model, is that a person evaluates ...

4 There are several different axiom systems for the DU model; in addition to Koopmans, see Peter Fishburn (1970), K. J. Lancaster (1963), Richard F. Meyer (1976), and Rubinstein (1982).

... regarding consumption in future time periods.

3.2 Utility Independence

The DU model explicitly assumes that the overall value, or “global utility,” of a sequence of outcomes is equal to the (discounted) sum of the utilities in each period. Hence, the distribution of utility across time makes no difference beyond that dictated by discounting, which (assuming positive time preference) penalizes utility that is experienced later. The assumption of utility independence has rarely been discussed or challenged, but its implications are far from innocuous. It rules out any kind of preference for patterns of utility over time, e.g., a preference for a flat utility profile over a roller-coaster utility profile with the same discounted utility. 5

3.3 Consumption Independence

The DU model explicitly assumes that a person’s well-being in a given period is independent of her consumption in any other period, i.e., that the marginal rate of substitution between consumption in periods τ and τ′ is independent of consumption in period τ″.

Consumption independence is analogous to, but fundamentally different from, the independence axiom of expected-utility theory. In expected-utility theory, the independence axiom specifies that preferences over uncertain prospects ...

5 “Utility independence” ... one literally interprets u(ct ... experienced in period t + k. We believe ... fact, the common interpretation.
For a model that relaxes the assumption of utility independence, see Benjamin Hermalin and Alice Isen (2000), who consider a model in which ... period t depends on well-being ..., i.e., they assume u_t = u(c_t, ... Kahneman, Peter Wakker, and Rakesh Sarin (1997), who propose a set of axioms ... justify an assumption of additive separability in instantaneous utility.

... and unpredictable ways. Though this unrealistic assumption is often retained for analytical convenience, it becomes less defensible as economists gain insight into how tastes change over time (see Loewenstein and Angner, forthcoming, for a discussion of different sources of preference change). 6

3.5 Independence of Discounting from Consumption

The DU model assumes that the discount function is invariant across all forms of consumption. This feature is crucial to the notion of time preference. If people discount utility from different sources at different rates, then the notion of a unitary time preference is meaningless. Instead we would need to label time preference according to the object being delayed: “banana time preference,” “vacation time preference,” and so on. In section 7, we discuss in more detail the validity of the assumption that the same rate of time preference applies to all forms of consumption.

3.6 Constant Discounting and Time Consistency

Any discount function can be written in the form $D(k) = \prod_{n=0}^{k-1} \frac{1}{1+\rho_n}$, where $\rho_n$ represents the per-period discount rate for period n, that is, the discount rate applied between periods n and n + 1. Hence, by assuming that the discount function takes the form $D(k) = \left(\frac{1}{1+\rho}\right)^{k}$, the DU model assumes a constant per-period ...

6 As we discuss in section ..., ... preference changes, due to things such as habit formation or reference dependence, are best understood in terms of consumption interdependence and not nonstationary utility.
In some situations, nonstationarities clearly play an important role in behavior; e.g., Steven Suranovic, Robert Goldfarb, and Thomas Leonard (1999), and O’Donoghue and Matthew Rabin (1999a; 2000) discuss the importance of nonstationarities in behavior.

... marginal utility (that the instantaneous utility function $u(c_t)$ is concave) and positive time preference (that the discount rate ρ is positive). 9 These two assumptions create opposing forces in intertemporal choice: diminishing marginal utility motivates a person to spread consumption over time, while positive time preference motivates a person to concentrate consumption in the present.

Since people do, in fact, spread consumption over time, the assumption of diminishing marginal utility (or some other property that has the same effect) seems strongly justified. The assumption of positive time preference, on the other hand, is more questionable. Several researchers have argued for positive time preference on logical grounds (Jack Hirshleifer 1970; Koopmans 1960; Koopmans, Peter A. Diamond, and Richard E. Williamson 1964; Olson and Bailey 1981). The gist of their arguments is that a zero or negative time preference, combined with a positive real rate of return on saving, would command the infinite deferral of all consumption. 10 But this conclusion assumes, unrealistically, that individuals have infinite life-spans and linear (or weakly concave) utility functions. Nevertheless, in econometric analyses of savings and intertemporal substitution, positive time preference is sometimes treated as an identifying restriction whose violation is interpreted as evidence of misspecification.

The most compelling argument supporting the logic of positive time preference ...

9 Discounting is not inherent to the DU model, because the model could be applied with ρ ≤ 0.
However, the inclusion of ρ in the model strongly implies that it may take a value other than zero, and the name discount rate certainly suggests that it is greater than zero.

10 In the context of intergenerational choice, Koopmans (1967) called this result the paradox of the indefinitely postponed splurge. See also Kenneth J. Arrow (1983), S. Chakravarty (1962), and Robert M. Solow (1974).

... and found no relation between monetary discount rates (as imputed from procedures such as “I would be indifferent between $100 tomorrow and $____ in five years”) and self-perceived stability of identity (as defined by the following similarity ratings: “Compared to now, how similar were you five years ago [will you be five years from now]?”), nor did he find any relation between such monetary discount rates and the presumed correlates of identity stability (e.g., the extent to which people agree with the statement “I am still embarrassed by stupid things I did a long time ago”).

4. DU Anomalies

Over the last two decades, empirical research on intertemporal choice has documented various inadequacies of the DU model as a descriptive model of behavior. First, empirically observed discount rates are not constant over time, but appear to decline, a pattern often referred to as hyperbolic discounting. Furthermore, even for a given delay, discount rates vary across different types of intertemporal choices: gains are discounted more than losses, small amounts more than large amounts, and explicit sequences of multiple outcomes are discounted differently than outcomes considered singly.

4.1 Hyperbolic Discounting

The best documented DU anomaly is hyperbolic discounting. The term “hyperbolic discounting” is often used to mean, in our terminology, that a person has a declining rate of time preference (in our notation, $\rho_n$ is declining in n), and we adopt this meaning here. Several results are usually interpreted as evidence for hyperbolic discounting.
First, when subjects are asked to compare a smaller-sooner reward to a ...

... preferences between two delayed rewards can reverse in favor of the more proximate reward as the time to both rewards diminishes; e.g., someone may prefer $110 in 31 days over $100 in 30 days, but also prefer $100 now over $110 tomorrow. Such “preference reversals” have been observed both in humans (Green, Nathaniel Fristoe, and Myerson 1994; Kirby and Herrnstein 1995; Andrew Millar and Douglas Navarick 1984; Jay Solnick et al. 1980) and in pigeons (Ainslie and Herrnstein 1981; Green et al. 1981). 14

Fourth, the pattern of declining discount rates suggested by the studies above is also evident across studies. In section 6, we summarize studies that estimate discount rates. Figure 1a plots the average estimated discount factor (= 1/(1 + discount rate)) from each of these studies against the average time horizon for that study. 15 As the regression line reflects, the estimated discount factor increases with the time horizon, which means that the discount rate declines. We note, however, that after excluding studies with very short time horizons (one year or less) from the analysis (see figure 1b), there is no ...

14 These studies all demonstrate preference reversals in the synchronic sense: subjects simultaneously prefer $100 now over $110 tomorrow and prefer $110 in 31 days over $100 in 30 days, which is consistent with hyperbolic discounting. But there seems to be an implicit belief that such preference reversals would also hold in the diachronic sense: that if subjects who currently prefer $110 in 31 days over $100 in 30 days were brought back to the lab thirty days later, they would prefer $100 at that time over $110 one day later. Under the assumption of stationary discounting (as discussed in footnote 8), synchronic preference reversals imply diachronic preference reversals.
To the extent that subjects anticipate diachronic reversals and want to avoid them, evidence of a preference for commitment could also be interpreted as evidence for hyperbolic discounting (we discuss this issue more in section 5.1.1).

15 In some cases, the discount rates were computed from the median respondent. In other cases, the mean discount rate was used.

Figure 1a. Discount Factor as a Function of Time Horizon (all studies). Vertical axis: imputed discount factor, from 0.0 to 1.0.

... although they did not interpret their results the same way.

If Read is correct about subadditive discounting, its main implication for economic applications may be to provide an alternative psychological underpinning for using a hyperbolic discount function, because most intertemporal decisions are based primarily on discounting from the present. 17

17 A few studies have actually ... discount rates. Frederick (1999) asked respondents to imagine that they worked at a job that consisted of both pleasant work ... unpleasant work (“bad days”) ... attractiveness of having additional ... year or in a future year. On average, respondents were indifferent between 20 extra good days this year, 21 the following year, ... implying a one-year discount ... a five-year discount rate of ... explanation is that a desire for improvement is evoked more strongly for two ... (this year and next) than for ... (this year and five years hence). ... asked students in a political ... between the following two payment ... March 1 ... A: $997 ... April 1 ... B: $1000 ... Then, two weeks later, he asked them to choose between $997 on November 1 and $1000 on December 1. Fifty-four percent of respondents preferred $997 in November to $1000 in December, but only 34 percent preferred ... sequence B. These two results suggest increasing discount rates. To explain them Rubinstein speculated that the three more proximate ...
To explain them Rubinstein specu- lated that the three more proximate Frederick, Loewenstein, asked subjects to imagine they had re- ceived a traffic ticket that could be paid either now or later and to state how much they would be willing to pay if payment could be delayed (by three months, one year, or three years). The discount rates imputed from these an- swers were much lower than the discount rates imputed from comparable questions about monetary gains. This pattern is prevalent in the literature. Indeed, in many studies, a substantial proportion of sub- jects prefer to incur a loss immediately rather than delay it (Benzion, Rapoport, and Yagil 1989; Loewenstein 1987; L. D. MacKeigan et al. 1993; Walter Mischel, Joan Grusec, and John C. Masters 1969; Redelmeier and Heller 1993; J. Frank Yates and Royce A. Watts 1975). 4.2.2 The “Magnitude Effect” (small outcomes are discounted more than large ones) Most studies that vary outcome size have found that large outcomes are discounted at a lower rate than small ones (Ainslie and Varda Haendel 1983; Benzion, Rapoport, and Yagil 1989; Green, Fristoe, and Myerson 1994; Green, Astrid Fry, and Myerson 1994; Hol- comb and Nelson 1992; Kirby 1997; Kirby and Marakovic 1995; Kirby, Nancy Petry and Warren Bickel 1999; Loewenstein 1987; Raineri and Rachlin 1993; Marjorie K. Shelley 1993; Thaler 1981). In Thaler’s (1981) study, for ex- ample, respondents were, on average, indifferent between $15 immediately and $60 in a year, $250 immediately and $350 in a year, and $3000 immedi- ately and $4000 in a year, implying dis- count rates of 139 percent, 34 percent, and 29 percent, respectively. 4.2.3 The “Delay-Speedup” Asymmetry Loewenstein (1988) demonstrated that imputed discount rates can be dramatically affected by whether the 364 Journal vacation trips) on consecutive weekends or consecutive months generally pre- ferred to save the better thing for last. 
Chapman (2000) presented respondents with hypothetical sequences of headache pain that were matched in terms of total pain that either gradually lessened or gradually increased with time. Sequence durations included one hour, one day, one month, one year, five years, and twenty years. For all sequence durations, the vast majority (from 82 percent to 92 percent) of subjects preferred the sequence of pain that lessened over time. (See also W. T. Ross, Jr. and I. Simonson 1991).

4.2.5 Violations of Independence and Preference for Spread

The research on preferences over sequences also reveals strong violations of independence. Consider the following pair of questions from Loewenstein and Prelec (1993):

Imagine that over the next ... decide how to spend your Saturday ... pair of sequences of dinners ... would prefer. “Fancy French” ... fancy French restaurant. “Fancy ... exquisite lobster dinner at a ... scheduling considerations (e.g., ...

Weekend: first; second; third; fourth; fifth
Option A: Fancy French; Eat at home; Eat at home; Eat at home; ...
Option B: Eat at home; Eat at home; Fancy French; Eat at home; ...
Option C: Fancy French; Eat at home; Eat at home; Eat at home; ...
Option D: Eat at home; Eat at home; Fancy French; ...

As discussed in section 3.3, consumption independence implies that preferences between two consumption profiles should not be affected by the nature of the consumption in periods in ...

... that have been documented are regarded as errors by the people who commit them. For example, in the “conjunction fallacy” discovered by Tversky and Kahneman (1983), many people will, with some reflection, recognize that a conjunction cannot be more likely than one of its constituents (e.g., that it can’t be more likely for Linda to be a feminist bank teller than for her to be “just” a bank teller).
In contrast, the patterns of preferences that are regarded as “anomalies” in the context of the DU model do not necessarily violate any standard or principle that people believe they should uphold. Even when the choice pattern is pointed out to people, they do not regard themselves as having made a mistake (and probably have not made one!). For example, there is no compelling logic that dictates that one who prefers to delay a French dinner should also prefer to do so when that French dinner will be closely followed by a lobster dinner.

Indeed, it is unclear whether any of the DU “anomalies” should be regarded as mistakes. Frederick and Read (2002) found evidence that the magnitude effect is more pronounced when subjects evaluate both “small” and “large” amounts than when they evaluate either one. Specifically, the difference in the discount rates between a small amount ($10) and a large amount ($1000) was larger when the two judgments were made in close succession than when they were made separately. Analogous results were obtained for the sign effect, as the differences in discount rates between gains and losses were slightly larger in a within-subjects design, where respondents evaluated delayed gains and delayed losses, than in a between-subjects design where they evaluated only gains or only losses. Since respondents did not attempt to ...

5.1 Models of Hyperbolic Discounting

In the economics literature, R. H. Strotz (1955–56) was the first to consider alternatives to exponential discounting, seeing “no reason why an individual should have such a special discount function” (p. 172). Moreover, Strotz recognized that for any discount function other than exponential, a person would have time-inconsistent preferences.
18 He proposed two strategies that might be employed by a person who foresees how her preferences will change over time: the “strategy of precommitment” (wherein she commits to some plan of action) and the “strategy of consistent planning” (wherein she chooses her behavior ignoring plans that she knows her future selves will not carry out). 19 While Strotz did not posit any specific alternative forms, he did suggest that “special attention” be given to the case of declining discount rates.

Motivated by the evidence discussed in section 4.1, there has been a recent surge of interest among economists in the implications of declining discount rates (beginning with David Laibson 1994, 1997). This literature has used a particularly simple functional form that captures the essence of hyperbolic discounting:

$$D(k) = \begin{cases} 1 & \text{if } k = 0 \\ \beta\delta^{k} & \text{if } k > 0. \end{cases}$$

This functional form was first introduced by E. S. Phelps and Pollak (1968) to study intergenerational altruism, and was first applied to individual decision making ...

18 Strotz implicitly assumes ...

19 Building on Strotz’s strategy of consistent planning, some researchers have addressed the question of whether there exists a consistent path for general non-exponential discount functions. See in particular Robert Pollak ..., Peleg and Menahem Yaari (1973), and Steven Goldman (1980).

... Angeletos et al. (2001) describe how hyperbolic discounting can explain the coexistence of high preretirement wealth, low liquid asset holdings (relative to income levels and illiquid asset holdings), and high credit-card debt.

Carolyn Fischer (1999) and O’Donoghue and Rabin (1999c, 2001) have applied (β,δ) preferences to procrastination, where hyperbolic discounting leads a person to put off an onerous activity more than she would like from a prior perspective. 20 O’Donoghue and Rabin (1999c) examine the implications of hyperbolic discounting for contracting when a principal is concerned with combating procrastination by an agent.
They show how incentive schemes with “deadlines” may be a useful screening device to distinguish efficient delay from inefficient procrastination. O’Donoghue and Rabin (2001) explore procrastination when a person must not only choose when to complete a task, but also which task to complete. They show that a person might never carry out a very easy and very good option because she continually plans to carry out an even better but more onerous option. For instance, a person might never take half an hour to straighten the shelves in her garage because she persistently plans to take an entire day to do a major cleanup of the entire garage. Extending this logic, they show that providing people with new options might make procrastination more likely. If the person’s only option were to straighten the shelves, she might do it in a timely manner; but if the person can either straighten the shelves or do the major cleanup, she now may do nothing. O’Donoghue and Rabin (1999d) apply this logic to retirement planning.

20 While not framed in terms of hyperbolic discounting, George Akerlof’s ... procrastination is formally equivalent ... model.

... people lie somewhere in between these two extremes, behavioral evidence regarding the degree of awareness is quite limited.

One way to identify sophistication is to look for evidence of commitment. Someone who suspects that her preferences will change over time might take steps to eliminate an option that seems inferior now but might tempt her later. For example, someone who currently prefers $110 in 31 days to $100 in 30 days but who suspects that in a month she will prefer $100 immediately to $110 tomorrow, might attempt to eliminate the $100 reward from the later choice set, and thereby bind herself now to receive the $110 reward in 31 days.
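The reversal in this example can be reproduced with the (β,δ) discount function from section 5.1. The sketch below is illustrative only: the parameter values β = 0.9 and a daily δ = 0.9999 are assumptions chosen for the example, utility is taken to be linear in dollars, and the function names are hypothetical.

```python
def discount(k, beta, delta):
    """Quasi-hyperbolic discount function: D(0) = 1, D(k) = beta * delta**k for k > 0."""
    return 1.0 if k == 0 else beta * delta ** k

def prefers_later(soon, late, beta=0.9, delta=0.9999):
    """True if the larger-later reward has the higher discounted value.

    `soon` and `late` are (amount, delay_in_days) pairs; linear utility
    and the default parameter values are assumptions for illustration.
    """
    (amount_soon, delay_soon), (amount_late, delay_late) = soon, late
    value_soon = amount_soon * discount(delay_soon, beta, delta)
    value_late = amount_late * discount(delay_late, beta, delta)
    return value_late > value_soon

# Viewed from today: $100 in 30 days versus $110 in 31 days.
choice_today = prefers_later((100, 30), (110, 31))
# Viewed thirty days later: $100 immediately versus $110 tomorrow.
choice_in_a_month = prefers_later((100, 0), (110, 1))
print(choice_today, choice_in_a_month)  # True False
```

Calling `prefers_later((100, 0), (110, 1), beta=1.0)` returns `True`: with β = 1 the function reduces to exponential discounting, both comparisons favor the $110 reward, and there is nothing to commit against.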
Real-world examples of commitment include “Christmas clubs” or “fat farms.”

Perhaps the best empirical demonstration of a preference for commitment was conducted by Dan Ariely and Klaus Wertenbroch (2002). In that study, MIT executive-education students had to write three short papers for a class and were assigned to one of two experimental conditions. In one condition, deadlines for the three papers were imposed by the instructor and were evenly spaced across the semester. In the other condition, each student was allowed to set her own deadlines for each of the three papers. In both conditions, the penalty for delay was 1 percent per day late, regardless of whether the deadline was externally or self-imposed. Although students in the free-choice condition could have made all three papers due at the end of the semester, many did, in fact, choose to impose deadlines on themselves, suggesting that they appreciated the value of commitment. Few students chose evenly spaced deadlines, however, and those who did not performed worse in the course ...

5.2 Models That Enrich the Instantaneous Utility Function

Many discounting anomalies, especially those in section 4.2, can be understood as a misspecification of the instantaneous utility function. Similarly, many of the confounds we discuss in section 6 are caused by researchers attributing to the discount rate aspects of preference that are more appropriately considered as arguments in the instantaneous utility function. As a result, alternative models of intertemporal choice have been advanced that add additional arguments, such as utility from anticipation, to the instantaneous utility function.

5.2.1 Habit-Formation Models

James Duesenberry (1952) was the first economist to propose the idea of “habit formation”: that the utility from current consumption (“tastes”) can be affected by the level of past consumption.
This idea was more formally developed by Pollak (1970) and Harl Ryder and Geoffrey Heal (1973). In habit-formation models, the period-τ instantaneous utility function takes the form \(u(c_\tau; c_{\tau-1}, c_{\tau-2}, \ldots)\), where \(\partial^2 u / \partial c_\tau \partial c_{\tau'} > 0\) for \(\tau' < \tau\). For simplicity, most such models assume that all effects of past consumption on current utility enter through a state variable. That is, they assume that the period-τ instantaneous utility function takes the form \(u(c_\tau; z_\tau)\), where \(z_\tau\) is a state variable that is increasing in past consumption and \(\partial^2 u / \partial c_\tau \partial z_\tau > 0\). Ryder and Heal (1973) assume that \(z_\tau\) is the exponentially weighted sum of past consumption, or \(z_\tau = \sum_{i=1}^{\infty} \gamma^i c_{\tau-i}\).
Although habit formation is often said to induce a preference for an increasing consumption profile, it can, under some circumstances, lead a person to prefer a decreasing or even non[...]
[...] various shocks. The key feature of habit formation that drives many of these results is that, after a shock, consumption adjustment is sluggish in the short term but not in the long term.
5.2.2 Reference-Point Models
Closely related to, but conceptually distinct from, habit-formation models are models of reference-dependent utility, which incorporate ideas from prospect theory (Kahneman and Tversky 1979; Tversky and Kahneman 1991). According to prospect theory, outcomes are evaluated using a value function defined over departures from a reference point; in our notation, the period-τ instantaneous utility function takes the form \(u(c_\tau, r_\tau) = v(c_\tau - r_\tau)\). The reference point, \(r_\tau\), might depend on past consumption, expectations, social comparison, status quo, and such. A second feature of prospect theory is that the value function exhibits loss aversion: negative departures from one’s reference consumption level decrease utility by a greater amount than positive departures increase it.
A third feature of prospect theory is that the value function exhibits diminishing sensitivity for both gains and losses, which means that the value function is concave over gains and convex over losses.²³
Loewenstein and Prelec (1992) applied a specialized version of such a value function to intertemporal choice to explain the magnitude effect, the sign effect, and the delay-speedup [...]
23 Reference-point models sometimes [...] there is a direct effect of the [...] reference level, so that \(u(c_\tau, r_\tau)\) [...] \(u(c_\tau, r_\tau) = v(c_\tau - r_\tau) +\) [...] models could be interpreted as [...] models, where the state variable [...] reference point. Indeed, many habit-formation models, such as Pollak (1970) and Constantinides (1990), assume instantaneous utility [...] \(u(c_\tau - z_\tau)\), although they [...] loss aversion nor diminishing sensitivity.
[...] affect the rate of consumption growth. For example, if a person finds out that her permanent income will be lower than she formerly thought, she would reduce her consumption by, say, 10 percent in every period, leaving her consumption growth unchanged. If, however, this person were loss averse in current consumption, she would be unwilling to reduce this year’s consumption by 10 percent, forcing her to reduce future consumption by more than 10 percent, and thereby reducing the growth rate of her consumption. Two studies by John Shea (1995a,b) support [...]
[...] wealth of nations, the Scottish economist John Rae was examining the sociological and psychological determinants of these choices. In section 2, we briefly review the perspectives on intertemporal choice of Rae and nineteenth- and early twentieth-century economists, and describe how these early perspectives interpreted intertemporal choice as the joint product of many conflicting psychological motives.
All of this changed when Paul Samuelson proposed the discounted-utility (DU) model in 1937. Despite Samuelson’s manifest reservations about the normative and descriptive validity of the formulation he had proposed, the DU model was accepted almost instantly, not only as a valid normative standard for public policies (e.g., in cost-benefit analyses), but as a descriptively accurate representation of actual behavior. A central assumption of the DU model is that all of the disparate motives underlying intertemporal choice can be condensed into a single parameter: the discount rate. In section 3 we examine this and many other assumptions underlying the DU model. We do not present an axiomatic derivation of the model, but instead focus on those features that highlight the implicit psychological assumptions underlying the model.
[...] rate stems partly from differences in elicitation procedures. But it also stems from the faulty assumption that the varied considerations that are relevant in intertemporal choices apply equally to different choices and thus that they can all be sensibly represented by a single discount rate.
Throughout the paper, we stress the importance of distinguishing among the varied considerations that underlie intertemporal choices. We distinguish time discounting from time preference. We use the term time discounting broadly to encompass any reason for caring less about a future consequence, including factors that diminish the expected utility generated by a future consequence, such as uncertainty or changing tastes. We use the term time preference to refer, more specifically, to the preference for immediate utility over delayed utility. In section 7, we push this theme further, by examining whether time preference itself might consist of distinct psychological traits that can be separately analyzed. Section 8 concludes.
2. Historical Origins of the Discounted Utility Model
The historical developments that culminated in the formulation of the DU model help to explain the model’s limitations. Each of the major figures in the development of the DU model (John Rae, Eugen von Böhm-Bawerk, Irving Fisher, and Paul Samuelson) built upon the theoretical framework of his predecessors, drawing on little more than introspection and personal observation. When the DU model eventually became entrenched as the dominant theoretical framework for modeling intertemporal choice, it was due largely to its simplicity and its resemblance to the familiar compound interest formula, [...]
[...]ting into the vortex of extravagant fashion, live economically. War and pestilence have always waste and luxury, among the other evils that follow in their train. (Rae 1834, p. 57)
A second factor that limited the effective desire of accumulation was the excitement produced by the prospect of immediate consumption, and the concomitant discomfort of deferring such available gratifications:
Such pleasures as may now be enjoyed generally awaken a passion strongly prompting to the partaking of them. The actual presence of the immediate object of desire in the mind by exciting the attention, seems to rouse all the faculties, as it were to fix their view on it, and leads them to a very lively conception of the enjoyments which it offers to their instant possession. (Rae 1834, p. 120)
Among the four factors that Rae identified as the joint determinants of time preference, one can glimpse two fundamentally different views. One, which was later championed by William S. Jevons (1888) and his son, Herbert S. Jevons (1905), assumes that people care only about their immediate utility, and explains farsighted behavior by postulating utility from the anticipation of future consumption.
On this view, deferral of gratification will occur only if it produces an increase in “anticipal” utility that more than compensates for the decrease in immediate consumption utility. The second perspective assumes equal treatment of present and future (zero discounting) as the natural baseline for behavior, and attributes the overweighting of the present to the miseries produced by the self-denial required to delay gratification. N. W. Senior, the best-known advocate of this “abstinence” perspective, wrote, “To abstain from the enjoyment which is in our power, or to seek distant rather than immediate results, are among the most painful exertions of the human will” (Senior 1836, p. 60).
Böhm-Bawerk’s analysis of time preference, like those of his predecessors, was heavily psychological, and much of his voluminous treatise, Capital and Interest, was devoted to discussions of the psychological constituents of time preference. However, whereas the early views of Rae, Senior, and Jevons explained intertemporal choices in terms of motives that are uniquely associated with time, Böhm-Bawerk began modeling intertemporal choice in the same terms as other economic tradeoffs: as a “technical” decision about allocating resources (to oneself) over different points in time, much as one would allocate resources between any two competing interests, such as housing and food.
Böhm-Bawerk’s treatment of intertemporal choice as an allocation of consumption among time periods was formalized a decade later by the American economist Irving Fisher (1930). Fisher plotted the intertemporal consumption decision on a two-good indifference diagram, with consumption in the current year on the abscissa, and consumption in the following year on the ordinate.
This representation made clear that a person’s observed (marginal) rate of time preference (the marginal rate of substitution at her chosen consumption bundle) depends on two considerations: time preference and diminishing marginal utility. Many economists have subsequently expressed discomfort with using the term “time preference” to include the effects of differential marginal utility arising from unequal consumption levels between time periods (see in particular Mancur Olson and Martin Bailey 1981). In Fisher’s formulation, pure time preference can be interpreted as the marginal [...]
[...] cognitive illusion: “our telescopic faculty is defective, and we, therefore, see future pleasures, as it were, on a diminished scale.”
[...] in Samuelson’s simplified model, all the psychological concerns discussed over the previous century were compressed into a single parameter, the discount rate.
The DU model specifies a decision maker’s intertemporal preferences over consumption profiles \((c_t, \ldots, c_T)\). Under the usual assumptions (completeness, transitivity, and continuity), such preferences can be represented by an intertemporal utility function \(U^t(c_t, \ldots, c_T)\). The DU model goes further, by assuming that a person’s intertemporal utility function can be described by the following special functional form:
\[ U^t(c_t, \ldots, c_T) = \sum_{k=0}^{T-t} D(k)\, u(c_{t+k}) \quad \text{where} \quad D(k) = \left( \frac{1}{1+\rho} \right)^{k}. \]
In this formulation, \(u(c_{t+k})\) is often interpreted as the person’s cardinal instantaneous utility function (her well-being in period t + k) and D(k) is often interpreted as the person’s discount function (the relative weight she attaches, in period t, to her well-being in period t + k).
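As a minimal sketch of the functional form above, the following Python computes discounted utility for a finite consumption profile. The function names, the sample profile, the value ρ = 1, and the linear instantaneous utility are illustrative assumptions and do not come from the paper.

```python
from typing import Callable, Sequence


def discount_function(k: int, rho: float) -> float:
    """D(k) = (1 / (1 + rho)) ** k, the weight on well-being k periods ahead."""
    return (1.0 / (1.0 + rho)) ** k


def du_utility(
    profile: Sequence[float],
    rho: float,
    u: Callable[[float], float],
) -> float:
    """Sum of D(k) * u(c_{t+k}) over the profile (c_t, ..., c_T)."""
    return sum(discount_function(k, rho) * u(c) for k, c in enumerate(profile))


# Hypothetical three-period profile, linear utility, discount rate rho = 1,
# so the weights are 1, 1/2, and 1/4.
profile = [4.0, 2.0, 1.0]
print(du_utility(profile, rho=1.0, u=lambda c: c))  # 4 + 2/2 + 1/4 = 5.25
```

Passing the instantaneous utility function as an argument keeps the two ingredients of the model, u and D, visibly separate, which is the separation the surrounding text emphasizes.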
ρ represents the individual’s pure rate of time preference (her discount rate), which is meant to reflect the collective effects of the “psychological” motives discussed in section 2.³
Samuelson did not endorse the DU model as a normative model of intertemporal choice, noting that “any connection between utility as discussed here and any welfare concept is disavowed” (p. 161). He also made no claims on behalf of its descriptive validity, stressing, “It is completely arbitrary to assume that the individual behaves so as to maximize an integral of the form envisaged in [the DU model]” (p. 159). However, despite Samuelson’s manifest [...]
3 The continuous-time analogue is \(U^t(\{c_\tau\}_{\tau \in [t,T]}) = \int_{\tau=t}^{T} e^{-\rho(\tau - t)} u(c_\tau)\). For expositional ease, we shall restrict attention to discrete time throughout.
[...] new alternatives by integrating them with her existing plans. To illustrate, consider a person with an existing consumption plan \((c_t, \ldots, c_T)\) who is offered an intertemporal-choice prospect X, which might be something like an option to give up $5000 today to receive $10,000 in five years. Integration means that prospect X is not evaluated in isolation, but in light of how it changes the person’s aggregate consumption in all future periods. Thus, to evaluate the prospect X, the person must choose what her new consumption path \((c'_t, \ldots, c'_T)\) would be if she were to accept prospect X, and should accept the prospect if \(U^t(c'_t, \ldots, c'_T) > U^t(c_t, \ldots, c_T)\).
An alternative way to understand integration is to recognize that intertemporal prospects alter a person’s budget set. If the person’s initial endowment is \(E_0\), then accepting prospect X would change her endowment to \(E_0 \cup X\).
Letting B(E) denote the person’s budget set given endowment E (i.e., the set of consumption streams that are feasible given endowment E), the DU model says that the person should accept prospect X if:
\[ \max_{(c_t, \ldots, c_T) \in B(E_0 \cup X)} \sum_{\tau=t}^{T} \left( \frac{1}{1+\rho} \right)^{\tau-t} u(c_\tau) \;>\; \max_{(c_t, \ldots, c_T) \in B(E_0)} \sum_{\tau=t}^{T} \left( \frac{1}{1+\rho} \right)^{\tau-t} u(c_\tau). \]
While integration seems normatively compelling, it may be too difficult to actually do. A person may not have well-formed plans about future consumption streams, or be unable (or unwilling) to recompute the new optimal plan every time she makes an intertemporal choice. Some of the evidence we review below supports the plausible presumption that people evaluate the results of intertemporal choices independently of any expectations they have [...]
[...] prospects are not affected by the consequences that the prospects share, i.e., that the utility of an experienced outcome is unaffected by other outcomes that one might have experienced (but did not). In intertemporal choice, consumption independence says that preferences over consumption profiles are not affected by the nature of consumption in periods in which consumption is identical in the two profiles, i.e., that an outcome’s utility is unaffected by outcomes experienced in prior or future periods. For example, consumption independence says that a person’s preference between an Italian and Thai restaurant tonight should not depend on whether she had Italian last night, nor whether she expects to have it tomorrow. As the example suggests, and as Samuelson and Koopmans both recognized, there is no compelling rationale for such an assumption. Samuelson (1952, p. 674) noted that, “the amount of wine I drank yesterday and will drink tomorrow can be expected to have effects upon my today’s indifference slope between wine and milk.” Similarly, Koopmans (1960, p.
292) acknowledged that, “One cannot claim a high degree of realism for [the independence assumption], because there is no clear reason why complementarity of goods could not extend over more than one time period.”
3.4 Stationary Instantaneous Utility
When applying the DU model to specific problems, it is often assumed that the cardinal instantaneous utility function \(u(c_\tau)\) is constant across time, so that the well-being generated by any activity is the same in different periods. Most economists would acknowledge that stationarity of the instantaneous utility function is not sensible in many situations, because people’s preferences do, in fact, change over time in predictable [...]
[...] period discount rate (\(\rho_n = \rho\) for all n).⁷ Constant discounting entails an evenhandedness in the way a person evaluates time. It means that delaying or accelerating two dated outcomes by a common amount should not change preferences between the outcomes: if in period t a person prefers X at τ to Y at τ + d for some τ, then in period t she must prefer X at τ to Y at τ + d for all τ. The assumption of constant discounting permits a person’s time preference to be summarized as a single discount rate. If constant discounting does not hold, then characterizing one’s time preference requires the specification of an entire discount function.
Constant discounting implies that a person’s intertemporal preferences are time-consistent, which means that later preferences “confirm” earlier preferences. Formally, a person’s preferences are time-consistent if, for any two consumption profiles \((c_t, \ldots, c_T)\) and \((c'_t, \ldots, c'_T)\) with \(c_t = c'_t\), \(U^t(c_t, c_{t+1}, \ldots, c_T) \ge U^t(c'_t, c'_{t+1}, \ldots, c'_T)\) if and only if \(U^{t+1}(c_{t+1}, \ldots, c_T) \ge U^{t+1}(c'_{t+1}, \ldots, c'_T)\).
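One way to see why constant discounting delivers time consistency, under the additional assumption (noted in footnote 8) that the same discount function applies in every period, is the short derivation sketched below. The shorthand \(\delta = 1/(1+\rho)\) is introduced here only for convenience and is not the paper's notation.

```latex
\begin{align*}
U^{t}(c_t,\ldots,c_T)
  &= \sum_{k=0}^{T-t} \delta^{k}\, u(c_{t+k})
   = u(c_t) + \delta \sum_{k=0}^{T-t-1} \delta^{k}\, u(c_{t+1+k})
   = u(c_t) + \delta\, U^{t+1}(c_{t+1},\ldots,c_T). \\[4pt]
\text{If } c_t = c'_t:\quad
U^{t}(c_t,\ldots,c_T) \ge U^{t}(c'_t,\ldots,c'_T)
  &\iff \delta\, U^{t+1}(c_{t+1},\ldots,c_T) \ge \delta\, U^{t+1}(c'_{t+1},\ldots,c'_T) \\
  &\iff U^{t+1}(c_{t+1},\ldots,c_T) \ge U^{t+1}(c'_{t+1},\ldots,c'_T),
\end{align*}
```

The last step uses only \(\delta > 0\). The argument relies on the period-(t + 1) utility function being built from the same \(\delta\) as the period-t one, which is exactly the stationarity assumption; if the discount rate differed between the two dates, the first line would no longer hold.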
⁸ For an interesting discussion that questions the normative validity of constant discounting, see Martin Albrecht and Martin Weber (1995).
3.7 Diminishing Marginal Utility and Positive Time Preference
While not core features of the DU model, virtually all analyses of intertemporal choice assume both diminishing [...]
7 An alternative but equivalent definition of constant discounting is that D(k)/D(k + 1) is independent of k.
8 Constant discounting implies time-consistent preferences only under the ancillary assumption of stationary discounting, for which the discount function D(k) is the same in all periods. As a counterexample, if the period-t discount function is \(D_t(k) = \left( \frac{1}{1+\rho} \right)^{k}\) while the period-(t + 1) discount function is \(D_{t+1}(k) = \left( \frac{1}{1+\rho'} \right)^{k}\) for some ρ′ ≠ ρ, then the person exhibits constant discounting at both dates t and t + 1, but nonetheless has time-inconsistent preferences.
[...]erence was made by Derek Parfit (1971; 1976; 1982), who contends that there is no enduring self or “I” over time to which all future utility can be ascribed, and that a diminution in psychological connections gives our descendent future selves the status of other people, making that utility less than fully “ours” and giving us a reason to count it less:¹¹
We care less about our further future... because we know that less of what we are now – less, say, of our present hopes or plans, loves or ideals – will survive into the further future... [if] what matters holds to a lesser degree, it cannot be irrational to care less. (Parfit 1971, p. 99)
Parfit’s claims are normative, not descriptive.
He is not attempting to explain or predict people’s intertemporal choices, but is arguing that conclusions about the rationality of time preference must be grounded in a correct view of personal identity. However, if this is the only compelling normative rationale for time discounting, it would be instructive to test for a positive relation between observed time discounting and individuals’ changing identity. Frederick (2002) conducted the only study of this type, [...]
11 As noted by Frederick (2002), there is much disagreement about the nature of Parfit’s claim. In her review of the philosophical literature, Jennifer Whiting (1986, p. 549) identifies four different interpretations: (1) the strong absolute claim: that it is irrational for someone to care about their future welfare, (2) the weak absolute claim: that there is no rational requirement to care about one’s future welfare, (3) the strong comparative claim: that it is irrational to care more about one’s own future welfare than about the welfare of any other person, and (4) the weak comparative claim: that one is not rationally required to care more about their future welfare than about the welfare of any other person. We believe that all of these interpretations are too strong, and that Parfit endorses only a weaker version of the weak absolute claim. That is, he claims only that one is not rationally required to care about one’s future welfare to a degree that exceeds the degree of psychological connectedness that obtains between one’s current self and one’s future self.
[...] larger-later reward (see section 6 for a description of these procedures), the implicit discount rate over longer time horizons is lower than the implicit discount rate over shorter time horizons. For example, Richard Thaler (1981) asked subjects to specify the amount of money they would require in [one month/one year/ten years] to make them indifferent to receiving $15 now.
The median responses [$20/$50/$100] imply an average (annual) discount rate of 345 percent over a one-month horizon, 120 percent over a one-year horizon, and 19 percent over a ten-year horizon.¹² Other researchers have found a similar pattern (Uri Benzion, Amnon Rapoport, and Joseph Yagil 1989; Gretchen B. Chapman 1996; Chapman and Arthur S. Elstein 1995; John L. Pender 1996; Daniel A. Redelmeier and Daniel N. Heller 1993).
Second, when mathematical functions are explicitly fit to such data, a hyperbolic functional form, which imposes declining discount rates, fits the data better than the exponential functional form, which imposes constant discount rates (Kris N. Kirby 1997; Kirby and Nino Marakovic 1995; Joel Myerson and Leonard Green 1995; Howard Rachlin, Andres Raineri, and David Cross 1991).¹³ Third, researchers have shown that outcomes [...]
12 That is, \(\$15 = \$20\,e^{-(3.45)(1/12)} = \$50\,e^{-(1.20)(1)} = \$100\,e^{-(0.19)(10)}\). While most empirical studies report average discount rates over a given horizon, it is sometimes more useful to discuss average “per-period” discount rates. Framed in these terms, Thaler’s results imply an average (annual) discount rate of 345 percent between now and one month from now, 100 percent between one month from now and one year from now, and 7.7 percent between one year from now and ten years from now. That is, \(\$15 = \$20\,e^{-(3.45)(1/12)} = \$50\,e^{-(3.45)(1/12)}\,e^{-(1.00)(11/12)} = \$100\,e^{-(3.45)(1/12)}\,e^{-(1.00)(11/12)}\,e^{-(0.077)(9)}\).
13 Several hyperbolic functional forms have been proposed: George Ainslie (1975) suggested the function \(D(t) = 1/t\), Richard Herrnstein (1981) and James Mazur (1987) suggested \(D(t) = 1/(1 + \alpha t)\), and George Loewenstein and Drazen Prelec (1992) suggested \(D(t) = 1/(1 + \alpha t)^{\beta/\alpha}\).
[...] evidence that discount rates continue to decline.
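The discount rates attributed to Thaler's (1981) median responses in footnote 12 can be rechecked with a few lines of Python. The sketch below assumes continuous compounding, as the footnote's exponential expressions do; the function and variable names are illustrative only.

```python
import math


def annual_rate(present: float, future: float, years: float) -> float:
    """Continuously compounded annual rate r solving present = future * exp(-r * years)."""
    return math.log(future / present) / years


# Thaler's median responses: the amount required after each delay (in years)
# to match $15 received now.
responses = [(20.0, 1 / 12), (50.0, 1.0), (100.0, 10.0)]

# Average rate over each whole horizon, measured from today.
average = [annual_rate(15.0, amount, delay) for amount, delay in responses]

# Per-period rate between successive horizons.
per_period = []
prev_amount, prev_delay = 15.0, 0.0
for amount, delay in responses:
    per_period.append(annual_rate(prev_amount, amount, delay - prev_delay))
    prev_amount, prev_delay = amount, delay

print([round(r, 3) for r in average])
print([round(r, 3) for r in per_period])
```

The first list reproduces the horizon averages of roughly 345, 120, and 19 percent, and the second the per-period figures of roughly 345, 100, and 7.7 percent, so both ways of reporting the data show the implied rate falling as the horizon lengthens.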
In fact, after excluding the studies with short time horizons, the correlation between time horizon and discount factor is almost exactly zero (−0.0026).
Although the collective evidence outlined above seems overwhelmingly to support hyperbolic discounting, a recent study by Daniel Read (2001) points out that the most common type of evidence (the finding that implicit discount rates decrease with the time horizon) could also be explained by “subadditive discounting,” which means the total amount of discounting over a temporal interval increases as the interval is more finely partitioned.¹⁶ To demonstrate subadditive discounting and distinguish it from hyperbolic discounting, Read elicited discount rates for a two-year (24-month) interval and for its three constituent intervals: an eight-month interval beginning at the same time, an eight-month interval beginning eight months later, and an eight-month interval beginning sixteen months later. He found that the average discount rate for the 24-month interval was lower than the compounded average discount rate over the three eight-month subintervals, a result predicted by subadditive discounting but not predicted by hyperbolic discounting (or any type of discount function, for that matter). Moreover, there was no evidence that discount rates declined with time, as the discount rates for the three eight-month intervals were approximately equal. Similar empirical results were found earlier by J. H. Holcomb and P. S. Nelson (1992), [...]
16 Read’s proposal that discounting is subadditive is compatible with analogous results in other domains.
For example, Amos Tversky and Derek Koehler (1994) found that the total probability assigned to an event increases the more finely the event is partitioned; e.g., the probability of “death by accident” is judged to be higher if one separately elicits the probability of “death by fire,” “death by drowning,” “death by falling,” etc.
[Figure 1b. Discount Factor as a Function of Time Horizon (studies with avg. horizons > 1 year). Axes: imputed discount factor against time horizon (years).]
4.2 Other DU Anomalies
The DU model not only dictates that the discount rate should be constant for all time periods; it also assumes that the discount rate should be the same for all types of goods and all categories of intertemporal decisions. There are several empirical regularities that appear to contradict this assumption, namely: (1) gains are discounted more than losses; (2) small amounts are discounted more than large amounts; (3) greater discounting is shown to avoid delay of a good than to expedite its receipt; (4) in choices over sequences of outcomes, improving sequences are often preferred to declining sequences though positive time preference dictates the opposite; and (5) in choices over sequences, violations of independence are pervasive, and people seem to prefer spreading consumption over time in a way that diminishing marginal utility alone cannot explain.
[Footnote residue, Rubinstein (2000): a choice between payment sequences; recoverable entries are $997 dated June 1, Sept 1, Nov 1 and $1000 dated July 1, Oct 1, Dec 1.]
4.2.1 The “Sign Effect” (gains are discounted more than losses)
Many studies have concluded that gains are discounted at a higher rate than losses.
For instance, Thaler (1981) [...]
[...] additional elements may have masked the differences in the timing of the sequence of dated amounts, while making the differences in amounts more salient.
[...] change in delivery time of an outcome is framed as an acceleration or a delay from some temporal reference point. For example, respondents who didn’t expect to receive a VCR for another year would pay an average of $54 to receive it immediately, but those who thought they would receive it immediately demanded an average of $126 to delay its receipt by a year. Benzion, Rapoport, and Yagil (1989) and Shelley (1993) replicated Loewenstein’s findings for losses as well as gains (respondents demanded more to expedite payment than they would pay to delay it).
4.2.4 Preference for Improving Sequences
In studies of discounting that involve choices between two outcomes (e.g., X at τ vs. Y at τ′), positive discounting is the norm. Research examining preferences over sequences of outcomes, however, has generally found that people prefer improving sequences to declining sequences (for an overview, see Ariely and Carmon, in press; Frederick and Loewenstein 2002; Loewenstein and Prelec 1993). For example, Loewenstein and Nachum Sicherman (1991) found that, for an otherwise identical job, most subjects prefer an increasing wage profile to a declining or flat one (see also Robert Frank 1993). Christopher Hsee, Robert P. Abelson, and Peter Salovey (1991) found that an increasing salary sequence was rated as highly as a decreasing sequence that conferred much more money. Carol Varey and Kahneman (1992) found that subjects strongly preferred streams of decreasing discomfort to streams of increasing discomfort, even when the overall sum of discomfort over the interval was otherwise identical.
Loewenstein and Prelec (1993) found that respondents who chose between sequences of two or more events (e.g., dinners or [...]
[...] which consumption is identical in the two profiles. Thus, anyone preferring profile B to profile A (which share the fifth period “Eat at Home”) should also prefer profile D to profile C (which share the fifth period “Fancy Lobster”). As the data reveal, however, many respondents violated this prediction, preferring the fancy French dinner on the third weekend, if that was the only fancy dinner in the profile, but preferring the fancy French dinner on the first weekend if the profile contained another fancy dinner. This result could be explained by the simple desire to spread consumption over time, which, in this context, violates the dubious assumption of independence that the DU model entails.
[Table residue: choice shares for the two dinner profiles that end with “Eat at home”; recoverable entries are 11% and 89%.]
Loewenstein and Prelec (1993) provide further evidence of such a preference for spread. Subjects were asked to imagine that they were given two coupons for fancy ($100) restaurant dinners, and were asked to indicate when they would use them, ignoring considerations such as holidays, birthdays, and such. Subjects either were told that “you can use the coupons at any time between today and two years from today” or were told nothing about any constraints. Subjects in the two-year constraint condition actually scheduled both dinners at a later time than those who faced no explicit constraint: they delayed the first dinner for eight weeks (rather than three) and the second dinner for 31 weeks (rather than thirteen).
This counterintuitive result can be explained in terms of a preference for spread if the explicit two-year interval was greater than the implicit time horizon of subjects in the unconstrained group.
[Table residue: choice shares for the two dinner profiles that end with “Fancy Lobster”; recoverable entries are 49% and 51%.]
4.3 Are These “Anomalies” Mistakes?
In other domains of judgment and choice, many of the famous “effects” [...]
[...] coordinate their responses to conform to DU’s postulates when they evaluated rewards of different sizes, it suggests that they consider the different discount rates to be normatively appropriate. Similarly, even after Loewenstein and Sicherman (1991) informed respondents that a decreasing wage profile ($27,000, $26,000,... $23,000) would (via appropriate saving and investing) permit strictly more consumption in every period than the corresponding increasing wage profile with an equivalent nominal total ($23,000, $24,000,... $27,000), respondents still preferred the increasing sequence. Perhaps they suspected that they could not exercise the required self control to maintain their desired consumption sequence, or felt a general leeriness about the significance of a declining wage, either of which could justify that choice. As these examples illustrate, many DU “anomalies” exist as “anomalies” only by reference to a model that was constructed without regard to its descriptive validity, and which has no compelling normative basis.
5. Alternative Models
In response to the anomalies just enumerated, and other intertemporal-choice phenomena that are inconsistent with the DU model, a variety of alternative theoretical models have been developed. Some models attempt to achieve greater descriptive realism by relaxing the assumption of constant discounting. Other models incorporate additional considerations into the instantaneous utility function, such as the utility from anticipation.
Still others depart from the DU model more radically, by including, for instance, systematic mispredictions of future utility.

[...]ing by Jon Elster (1979). It assumes that the per-period discount rate between now and the next period is (1 − βδ)/(βδ), whereas the per-period discount rate between any two future periods is (1 − δ)/δ < (1 − βδ)/(βδ). Hence, this (β,δ) formulation assumes a declining discount rate between this period and next, but a constant discount rate thereafter. The (β,δ) formulation is highly tractable, and captures many of the qualitative implications of hyperbolic discounting.

Laibson and his collaborators have used the (β,δ) formulation to explore the implications of hyperbolic discounting for consumption-saving behavior. Hyperbolic discounting leads a person to consume more than she would like from a prior perspective (or, equivalently, to under-save). Laibson (1997) explores the role of illiquid assets, such as housing, as an imperfect commitment technology, emphasizing how a person could limit overconsumption by tying up her wealth in illiquid assets. Laibson (1998) explores consumption-saving decisions in a world without illiquid assets (or any other commitment technology). These papers describe how hyperbolic discounting might explain some stylized empirical facts, such as the excess comovement of income and consumption, the existence of asset-specific marginal propensities to consume, low levels of precautionary savings, and the correlation of measured levels of patience with age, income, and wealth. Laibson, Andrea Repetto, and Jeremy Tobacman (1998), and George-Marios Angeletos et al. (2001) calibrate models of consumption-saving decisions, using both exponential discounting and (β,δ) hyperbolic discounting.
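As a brief illustration of the two per-period discount rates stated above for the (β,δ) formulation, the following Python sketch may help. It assumes the usual weighting in which utility k ≥ 1 periods ahead is weighted by βδ^k and current utility by 1, with β < 1 as the stated inequality requires. The function names and the parameter values (β = 0.7, δ = 0.95) are hypothetical choices for illustration and are not taken from the article.

```python
def discount_weight(k: int, beta: float, delta: float) -> float:
    """Weight on utility received k periods from now under (beta, delta) preferences."""
    if k < 0:
        raise ValueError("k must be non-negative")
    # Current utility is undiscounted; all future utility is scaled by beta.
    return 1.0 if k == 0 else beta * delta ** k


def per_period_rate(k: int, beta: float, delta: float) -> float:
    """Discount rate between period k and period k + 1: D(k) / D(k + 1) - 1."""
    return discount_weight(k, beta, delta) / discount_weight(k + 1, beta, delta) - 1.0


# Illustrative (assumed) parameter values, not estimates from the literature.
BETA, DELTA = 0.7, 0.95

# Between now and the next period: (1 - beta*delta) / (beta*delta).
rate_now_to_next = per_period_rate(0, BETA, DELTA)
# Between any two adjacent future periods: (1 - delta) / delta.
rate_between_future = per_period_rate(1, BETA, DELTA)

print(f"now -> next period:      {rate_now_to_next:.4f}")
print(f"between future periods:  {rate_between_future:.4f}")
```

Under these assumed values the rate between now and the next period is much higher than the rate between any two future periods, which is the sense in which the formulation has a declining and then constant discount rate.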
By comparing simulated data to real-world data, they demonstrate how hyperbolic discounting can better explain a variety of empirical observations in the consumption-saving literature. In particular, [...]

O’Donoghue and Rabin (1999a, 2000), Jonathan Gruber and Botond Koszegi (2000), and Juan D. Carrillo (1999) have applied (β,δ) preferences to addiction. These researchers describe how hyperbolic discounting can lead people to overconsume harmful addictive products, and examine the degree of harm caused by such overconsumption.

Carrillo and Thomas Mariotti (2000) and Roland Benabou and Jean Tirole (2000) have examined how (β,δ) preferences might influence a person’s decision to acquire information. If, for example, a person is deciding whether to embark on a specific research agenda, she may have the option to get feedback from colleagues about its likely fruitfulness. The standard economic model implies that people should always choose to acquire this information if it is free. However, Carrillo and Mariotti show that hyperbolic discounting can lead to “strategic ignorance”: a person with hyperbolic discounting who is worried about withdrawing from an advantageous course of action when the costs become imminent might choose not to acquire free information if doing so increases the risk of bailing out.

5.1.1 Self Awareness

A person with time-inconsistent preferences may or may not be aware that her preferences will change over time. Strotz (1955–56) and Pollak (1968) discussed two extreme alternatives. At one extreme, a person could be completely “naïve” and believe that her future preferences will be identical to her current preferences. At the other extreme, a person could be completely “sophisticated” and correctly predict how her preferences will change over time.
While casual observation and introspection suggest that [...] than those with evenly spaced deadlines (whether externally imposed or self-imposed).21

O’Donoghue and Rabin (1999b) examine how people’s behaviors depend on their sophistication about their own time inconsistency. Some behaviors, such as using illiquid assets for commitment, require some degree of sophistication. Other behaviors, such as overconsumption or procrastination, are more robust to the degree of awareness, though the degree of misbehavior may depend on the degree of sophistication. To understand such effects, O’Donoghue and Rabin (2001) introduce a formal model of partial naïveté, in which a person is aware that she will have future self-control problems but underestimates their magnitude. They show that severe procrastination cannot occur under complete sophistication, but can arise even if the person is only a little naïve. For more discussion on self-awareness, see O’Donoghue and Rabin (in press).

The degree of sophistication versus naïveté has important implications for public policy. If people are sufficiently sophisticated about their own self-control problems, providing commitment devices may be beneficial. However, if people are naïve, policies might be better aimed at either educating people about loss of control (making them more sophisticated), or providing incentives for people to use commitment devices, even if they don’t recognize the need for them.

21 A similar “natural” experiment was recently conducted by the Economic and Social Research Council of Great Britain. They recently eliminated submission deadlines and now accept grant proposals on a “rolling” basis (though they are still reviewed only periodically). In response to this policy change, submissions have actually declined by about 15–20 percent (direct correspondence with Chris Caswill at ESRC).

[...] monotonic consumption profile.
The direction of the effect depends on things such as how much one has already consumed (as reflected in the initial habit stock), and, perhaps most importantly, whether current consumption increases or decreases future utility.

In recent years, habit-formation models have been used to analyze a variety of phenomena. Gary Becker and Kevin Murphy (1988) use a habit-formation model to study addictive activities, and in particular to examine the effects of past and future prices on the current consumption of addictive products.22 Habit formation can help explain asset-pricing anomalies such as the equity-premium puzzle (Andrew Abel 1990; John Campbell and John Cochrane 1999; George M. Constantinides 1990). Incorporating habit formation into business-cycle models can improve their ability to explain movements in asset prices (Urban Jermann 1998; Michele Boldrin, Lawrence Christiano, and Jonas Fisher 2001). Some recent papers have shown that habit formation may help explain other empirical puzzles in macroeconomics as well. Whereas standard growth models assume that high saving rates cause high growth, recent evidence suggests that the causality can run in the opposite direction. Christopher Carroll, Jody Overland, and David Weil (2000) show that, under conditions of habit formation, high growth rates can cause people to save more. Jeffrey Fuhrer (2000) shows how habit formation might explain the recent finding that aggregate spending tends to have a gradual “hump-shaped” response to [...]

22 For rational-choice models building on Becker and Murphy’s framework, see Athanasios Orphanides and David Zervos (1995), Ruqu Wang (1997), and Suranovic, Goldfarb, and Leonard (1999). For addiction models that incorporate hyperbolic discounting, see O’Donoghue and Rabin (1999a, 2000), Gruber and Koszegi (2000), and Carrillo (1999).

[...] asymmetry.
They show that if the elasticity of the value function is increasing in the magnitude of outcomes, people will discount smaller magnitudes more than larger magnitudes. Intuitively, the elasticity condition captures the insight that people are responsive to both differences and ratios of reward amounts. It implies that someone who is indifferent between, say, $10 now and $20 in a year should prefer $200 in a year over $100 now because the larger rewards have a greater difference (and the same ratio). Consequently, even if a person’s time preference is actually constant across outcomes, she will be more willing to wait for a fixed proportional increment when rewards are larger, and, thus, her imputed discount rate will be smaller for larger outcomes. Similarly, if the value function for losses is more elastic than the value function for gains, then people will discount gains more than losses. Finally, such a model helps explain the delay-speedup asymmetry (Loewenstein 1988). Shifting consumption in any direction is made less desirable by loss aversion, since one loses consumption in one period and gains it in another. When delaying consumption, loss aversion reinforces time discounting, creating a powerful aversion to delay. When expediting consumption, loss aversion opposes time discounting, reducing the desirability of speedup (and, occasionally, even causing an aversion to it).

Using a reference-dependent model that assumes loss aversion in consumption, David Bowman, Deborah Minehart, and Rabin (1999) predict that “news” about one’s (stochastic) future income affects one’s consumption growth differently than the standard Permanent Income Hypothesis predicts.
According to (the log-linear version of) the Permanent Income Hypothesis, changes in future income should not [...]

Loewenstein describes how utility from anticipation may play a role in many DU anomalies. Because near-term consumption delivers only consumption utility whereas future consumption delivers both consumption utility and anticipatory utility, anticipatory utility provides a reason to prefer improvement and for getting unpleasant outcomes over with quickly instead of delaying them as discounting would predict. It provides a possible explanation for why people discount different goods at different rates, because utility from anticipation creates a downwa
