N Problem Set 6 - Solution

N.1 Optimal Taxation: The “Supply Side” (Neoclassical) View

The firm’s problem is to maximize: \[\max_l \quad p\cdot f(l)-w\cdot l.\] Given that $f(l)=A\cdot l$, the firm equivalently maximizes: \[\max_l \quad p \cdot A\cdot l-w\cdot l.\] One way to get the result is, as usual, to take the first-order condition to this problem: \[\frac{w}{p}=A.\] However, I should warn you here that we should actually not be taking a first-order condition. A mathematician would be right in saying that this is wrong: the objective function is linear, and so the above function does not have a maximum ! At the same time, doing so does lead to the “right solution”. The real wage equals labor productivity in the end. What is going on here? One intuitive way to see this is to say that for any $\alpha$, however small, then with $f(l)=A l^{1-\alpha}$ we would get $w/p=A \cdot l^{-\alpha}$ as a first order condition. Taking then the limit of $\alpha$ to $0$ then leads to the above result since then $l^{-\alpha} \approx 1$. However, unfortunately, this is not a valid mathematical proof.⁵² So how do we think of what happens if the technology is completely linear? How do we think of an equilibrium in that case? The proper economic reasoning is as follows:

if we had $p \cdot A>w$, then the marginal gain of hiring an additional worker would be higher than the marginal cost of doing so, whatever $l$. Therefore, firms would like to hire an infinity of workers, and the solution would be $l=+\infty$. This is not a competitive equilibrium, because this level of demand can never be equal to supply.
if we had $p \cdot A<w$, then the marginal gain of hiring an additional worker would be lower than the marginal cost of doing so, whatever $l$. Therefore, the optimal thing for the firm to maximize its profit would be to hire no worker $l=0$. This is not a competitive equilibrum either, because this level of demand is not equal to supply either. As a consequence: \[p \cdot A=w \quad \Rightarrow \quad \boxed{\frac{w}{p}=A}.\] The labor demand curve is said to be infinitely elastic. Note. You should note that with $w/p=A$, the firms’ profit is $0$ regardless of what $l$ it chooses. Thus, the firm produces, potentially at a scale consistent with market clearing, but in terms of profit it is indifferent between producing or not.

Writing that $c=c_0+(1-\tau)\cdot (w/p) \cdot l$ and plugging the value of $c$ into the worker’s optimization problem: \[\max_l \quad c_0 +(1-\tau)\frac{w}{p}l -B\frac{l^{1+\epsilon}}{1+\epsilon}\] The first-order condition (which we can take here as long as $\epsilon>0$) implies: \[(1-\tau) \cdot \frac{w}{p} = B\cdot l^{\epsilon} \quad \Rightarrow \quad \boxed{l = \frac{(1-\tau)^{1/\epsilon}}{B^{1/\epsilon}}\left(\frac{w}{p}\right)^{1/\epsilon}}\]
The number of hours worked is given by replacing out the real wage $w/p$ from the labor demand equation to the labor supply equation, so that: \[\boxed{l = (1-\tau)^{1/\epsilon}\frac{A^{1/\epsilon}}{B^{1/\epsilon}}}.\] Real pre-tax income is when simply given by the real wage times the number of hours. Since the real wage is simply $A$, we get: \[y = \frac{w}{p} \cdot l = A\cdot(1-\tau)^{1/\epsilon}\frac{A^{1/\epsilon}}{B^{1/\epsilon}}.\] Finally: \[\boxed{y = (1-\tau)^{1/\epsilon}\frac{A^{1/\epsilon+1}}{B^{1/\epsilon}}}.\]
A numerical application shown on the Google Spreadsheet implies that: \[\underline{l}=(1-\underline{\tau})^{1/\epsilon}\frac{\underline{A}^{1/\epsilon}}{\underline{B}^{1/\epsilon}}.\] Therefore: \[ \begin{aligned} \underline{l}&=\left(1-\frac{1}{4}\right)^{1/2}\left(\frac{1000000}{28188}\right)^{1/2}\left(\frac{491569855488}{3000000}\right)^{1/2}\\ \underline{l}&=2088. \end{aligned} \] Note that under the assumption of 281 business days in a year, this implies 8 hours a day of work.
Note: Of course, this did not happen by random chance. This explains the weird-looking numbers for $A$ and $B$ in the examples. I reversed engineered $A$ and $B$ so that people in the model would happen to work a realistic number of hours.⁵³ According to the Google Spreadsheet, income is then given by: \[ \begin{aligned} \underline{y}&=\underline{A} \underline{l}\\ &=\frac{1000000}{28188}\cdot 2088\\ \underline{y}&\approx 74074.07 \end{aligned} \] Again, I reversed engineered the examples so that you would get the income of low-income people in lecture 9.

We note that this group has a lower disutility of working $B$ (which could again, provy for many things: education, religion, work ethic, etc.) - and also a higher productivity per hour $A$. Therefore they work more - both because they are more productive and because they have a lower disutility of working: \[ \begin{aligned} \bar{l}&=\left(1-\frac{1}{2}\right)^{1/2}\left(\frac{1000000}{6264}\right)^{1/2}\left(\frac{218475491328}{1000000}\right)^{1/2}\\ \bar{l}&=4176. \end{aligned} \] Note that under the assumption of 281 business days in a year, this implies 16 hours a day of work on average (more likely on weekends, or during Thanksgiving!). Income is then given by: \[ \begin{aligned} \bar{y}&=\bar{A} \cdot \bar{l}\\ &=\frac{1000000}{6264}\cdot 4176\\ \bar{y}&\approx666666.66 \end{aligned} \]
Total output is 20 trillion, 10 trillion coming from the bottom 90% and 10 trillion coming from the top 10%. In million dollars, output of the bottom 90% is: \[ \begin{aligned} \underline{Y} &= \lambda N \cdot \underline{y}\\ &\approx (135 \cdot 10^6) \cdot 74074.07\\ &= 10 \cdot 10^{12}\\ \underline{Y} &= 10 \text{ trillion} \end{aligned} \] In million dollars, output of the top 10% is: \[ \begin{aligned} \bar{Y} &= (1-\lambda) N \cdot \bar{y}\\ &\approx (15 \cdot 10^6) \cdot 666666.66\\ &= 10 \cdot 10^{12}\\ \bar{Y} &= 10 \text{ trillion} \end{aligned} \]
In this question, we assume a tax reform that lowers the marginal tax rate on the richest by 5 points. According to the Google Spreadsheet, an individual earning $\bar{y}=666,666$ pays taxes given by: \[ \begin{aligned} \bar{t}&=-5000+0.25 \cdot (200000-25000) + 0.5 \cdot (666666-200000)\\ &\approx -5000 + 43750 + 233333 \\ \bar{t}&\approx 272083 \end{aligned} \] which means that people in the top 10% pay $272,083 on average in taxes per year. The total paid by this group is: \[ \begin{aligned} \bar{T} &= (1-\lambda) N \cdot \bar{t}\\ &\approx \left(15 \cdot 10^{6}\right) \cdot 272083\\ &\approx 4081 \cdot 10^9\\ \bar{T} &\approx 4081 \text{ billion} \end{aligned} \] The Google Spreadsheet shows that lowering the top marginal tax rate leads to an increase in hours worked by the top 10%, which now is: \[ \begin{aligned} \bar{l}&=\left(1-\frac{45}{100}\right)^{1/2}\left(\frac{1000000}{6264}\right)^{1/2}\left(\frac{218475491328}{1000000}\right)^{1/2}\\ \bar{l}&\approx 4379.8 \end{aligned} \] The change in hours is therefore: \[ \begin{aligned} \Delta \bar{h} &\approx 4379.8 - 4176\\ \Delta \bar{h} &\approx 203.8 \end{aligned} \] Therefore, given the lowering of taxes, high income earners decide to work 203.8 hours more per year - which, accounting for 281 business days in a year, means approximately 45 minutes more per day (note that this is more than twice the increase, so it is higher in percentage terms). Given the productivity of high income people, which is $159.64/hour on average, the increase in annual income given these 45 more minutes of work per day is $32,534.6 annual since: \[ \begin{aligned} \Delta \bar{y} &= \bar{A} \cdot \Delta \bar{h}\\ & \approx 159.64 \cdot 203.8\\ \Delta \bar{y} & \approx 32534.6 \end{aligned} \] Since there are 15 million people who work this much more, then the total impact on GDP is 15 million times that additional income, which is 488 billion: \[ \begin{aligned} \Delta \bar{Y} &= (1-\lambda) N \cdot \Delta \bar{y}\\ & \approx (15 \cdot 10^6) \cdot 32534.6\\ & \approx 488 \cdot 10^9 \\ \Delta \bar{Y} & \approx 488 \text{ billion}. \end{aligned} \] Because the tax cut leads high income earners to work more, it in turns leads to higher tax receipts - there is higher GDP. Again, there are two concepts of multipliers: there is the ex-ante multiplier, calculated with respect to the change in tax receipts if hours did not change, and there is the ex-post multiplier, calculated with respect to the actual change in tax receipts.
Ex-ante tax cut. The ex-ante tax cut is the tax cut which corresponds to the change in taxes that results from the change in marginal tax rates, assuming that people do not adjust their number of hours. In other words, we assume that their income still is $\bar{y}=666666,$ and that the marginal tax rate is reduced from 50% to 45%, which applies on all income above $200000 (we have a bracket system). Thus, the change in individual taxes is: \[ \begin{aligned} \Delta \bar{t} &\approx -(666666-200000) \cdot 0.05 \\ \Delta \bar{t} &\approx -23333.3. \end{aligned} \] Therefore, the change in aggregate taxes is given by 15 million times that, or 350 billion: \[ \begin{aligned} \Delta \bar{T} &= (1-\lambda) N \cdot \Delta \bar{t}\\ &\approx - (1-\lambda) N \cdot 23333.3\\ & \approx - (15 \cdot 10^6) \cdot 23333.3\\ & \approx -350 \cdot 10^9 \\ \Delta \bar{T} & \approx -350 \text{ billion}. \end{aligned} \] Thus, the ex-ante cost of the policy is a fall in tax receipts of $\Delta \bar{T} \approx$ 350 billion. The ex-ante supply-side multiplier is the ratio of the change in GDP: \[ \begin{aligned} \text{Ex-ante Multiplier} &= -\frac{\Delta \bar{Y}}{\Delta \bar{T}}\\ & \approx \frac{488}{350}\\ \text{Ex-ante Multiplier} & \approx 1.39. \end{aligned} \] Ex-post tax cut. The ex-post tax cut is calculated taking into account that the tax reform is cheaper in reality because high income earners increase their labor hours, and they are taxed on this additional income. In other words, there are two effects from the tax cut. First, the marginal tax rate is reduced from 50% to 45%, which applies on all income above $200,000, until the old income $\bar{y}=666666.66.$ Second, all additional income is taxed at rate which is now 45%. Thus, the fall in individual taxes is: \[ \begin{aligned} \Delta \bar{t} & \approx -(666666.66-200000) \cdot 0.05 + \Delta \bar{y} \cdot 0.45\\ & \approx -23333.33 + 32534.6 \cdot 0.45 \\ & \approx -23333.33 + 14640.57 \\ \Delta \bar{t} & \approx -8692.76. \end{aligned} \] Therefore, the ex-post change in aggregate taxes is given by: \[ \begin{aligned} \Delta \bar{T} &= (1-\lambda) N \cdot \Delta \bar{t}\\ & \approx - (1-\lambda) N \cdot 8692.76\\ & \approx - (15 \cdot 10^6) \cdot 8692.76\\ & \approx -130 \cdot 10^9 \\ \Delta \bar{T} & \approx -130 \text{ billion}. \end{aligned} \] Thus, the ex-post cost of the policy is a fall in tax receipts of $\Delta \bar{T} \approx$ 130 billion. The ex-post supply side multiplier is: \[ \begin{aligned} \text{Ex-post Multiplier} &= -\frac{\Delta \bar{Y}}{\Delta \bar{T}}\\ & \approx \frac{488}{130}\\ \text{Ex-post Multiplier} & \approx 3.74. \end{aligned} \] In both cases, in the neoclassical view, it is better to cut taxes on high income people than on low income people (the opposite as from an aggregate demand perspective). I will explain why in the solution to the next exercise (as Robert Barro says, it is because marginal tax rates are initially higher).
According to the Google Spreadsheet, an individual earning $\underline{y}$ pays taxes given by: \[ \begin{aligned} \underline{t} &\approx -5000+0.25 \cdot (74074.07-25000)\\ &\approx -5000 + 12268 \\ \underline{t}&\approx 7268 \end{aligned} \] which means that this individual pays $7,268 on average per year in taxes. The total taxes paid by this group are: \[ \begin{aligned} \underline{T} &= \lambda N \cdot \underline{t}\\ &\approx (135 \cdot 10^6) \cdot 7268\\ &\approx 981 \cdot 10^9\\ \underline{T} &\approx 981 \text{ billion} \end{aligned} \] The Google Spreadsheet shows that lowering the bottom marginal tax rate leads to an increase in hours worked by the bottom 90%, which now is: \[ \begin{aligned} \underline{l}&=\left(1-\frac{20}{100}\right)^{1/2}\left(\frac{1000000}{28188}\right)^{1/2}\left(\frac{491569855488}{3000000}\right)^{1/2}\\ \underline{l}&\approx 2156.5 \end{aligned} \] The change in hours is therefore: \[ \begin{aligned} \Delta \underline{h} &\approx 2156.5 - 2088\\ \Delta \underline{h} &\approx 68.5 \end{aligned} \] Therefore, given the lowering of taxes, people decide to work 68.5 hours more per year - which, accounting for 281 business days in a year, means approximately 15 minutes more per day. Given the productivity of low income people, which is $35.5 an hour on average (low income here is the bottom 90 %, so their income is not that low!), the increase in annual income given these 15 more minutes of work per day is $2431.75 annual since: \[ \begin{aligned} \Delta \underline{y} &= \underline{A} \cdot \Delta \underline{h}\\ &\approx 35.5 \cdot 68.5\\ \Delta \underline{y} &\approx 2431.75. \end{aligned} \] Since there are 135 million people who work this much more, then the total impact on GDP is: \[ \begin{aligned} \Delta \underline{Y} &= \lambda N \cdot \Delta \underline{y}\\ & \approx (135 \cdot 10^6) \cdot 2431.75\\ & \approx 328 \cdot 10^9 \\ \Delta \underline{Y} & \approx 328 \text{ billion}. \end{aligned} \] Because the tax cut leads people to work more, it in turns leads to higher tax receipts - there is higher GDP. Thus, there are two concepts of multipliers: there is the ex-ante multiplier, calculated with respect to the change in tax receipts if hours did not change, and there is the ex-post multiplier, calculated with respect to the actual change in tax receipts.
Ex-ante tax cut. The ex-ante tax cut is the tax cut which corresponds to the change in taxes that results from the change in marginal tax rates, assuming that people do not adjust their number of hours. In other words, we assume that their income still is $\underline{y}=74074.07$, and that the marginal tax rate is reduced from 25% to 20%, which applies on all income above $25,000 (we have a bracket system). Thus, the change in individual taxes is: \[ \begin{aligned} \Delta \underline{t} &\approx -(74074.07-25000) \cdot 0.05\\ \Delta \underline{t} &\approx-2453.7. \end{aligned} \] Everyone’s taxes in the bottom 90% is thus reduced by $2,453.7. Therefore, the change in aggregate taxes is given by: \[ \begin{aligned} \Delta \underline{T} &= \lambda N \cdot \Delta \underline{t}\\ & \approx - (135 \cdot 10^6) \cdot 2453.7\\ & \approx -331 \cdot 10^9 \\ \Delta \underline{T} & \approx -331 \text{ billion}. \end{aligned} \] Thus, the ex-ante cost of the policy is a fall in tax receipts of $\Delta \underline{T} \approx$ 331 billion. The ex-ante multiplier is: \[ \begin{aligned} \text{Ex-ante Multiplier} &= -\frac{\Delta \underline{Y}}{\Delta \underline{T}}\\ & \approx \frac{328}{331}\\ \text{Ex-ante Multiplier} & \approx 0.99. \end{aligned} \] Ex-post tax cut. The ex-post tax cut is calculated taking into account that the tax reform is cheaper in reality because people increase their labor hours, and they are taxed on this additional income. In other words, there are two effects from the tax cut. First, the marginal tax rate is reduced from 25% to 20%, which applies on all income above $25,000, until the old income $\underline{y}=74074.07$, so on the portion $\underline{y}-25000$. Second, all additional income $\Delta \underline{y}$ is taxed at the marginal tax rate which is now 20%. Thus, the change in individual taxes is: \[ \begin{aligned} \Delta \underline{t} & \approx -(\underline{y}-25000) \cdot 0.05 + \Delta \underline{y} \cdot 0.20 \\ & \approx -(74074.07-25000) \cdot 0.05 + \Delta \underline{y} \cdot 0.20 \\ & \approx -2453.7 + 2431.75 \cdot 0.20\\ & \approx -2453.7 + 486.35 \\ \Delta \underline{t} & \approx -1967.35. \end{aligned} \] Therefore, the change in aggregate taxes is given by: \[ \begin{aligned} \Delta \underline{T} &=\lambda N \cdot \Delta \underline{t} \\ & \approx - (135 \cdot 10^6) \cdot 1967.35\\ & \approx -266 \cdot 10^9 \\ \Delta \underline{T} & \approx -266 \text{ billion}. \end{aligned} \] Thus, the ex-post cost of the policy is a fall in tax receipts of $\Delta \underline{T} \approx$ 266 billion. The ex-post multiplier is: \[ \begin{aligned} \text{Ex-post Multiplier} &= -\frac{\Delta \underline{Y}}{\Delta \underline{T}}\\ & \approx \frac{328}{266}\\ \text{Ex-post Multiplier} & \approx 1.23. \end{aligned} \]

N.2 Readings - Paradox of Thrift

Q: Paul Krugman, When Consumers Capitulate, New York Times, October 31, 2008; Paul Krugman, Paradox of Thrift, New York Times Blog, February 3, 2009; Paul Krugman, We’re Still In A Paradox Of Thrift World, New York Times Blog, Aug 26, 2010. In what sense have “American consumers long been living beyond their means?” How does Paul Krugman qualify the “paradox of thrift” idea? A: In the mid-1980s Americans saved about 10% of their income. Lately, however, the savings rate has generally been below 2% and consumer debt has risen to 98% of GDP, twice itrs level of quarter-century ago. Paul Krugman, in 2008, emphasizes very much the idea of monetary policy offsetting the negative effects of saving, as the central bank decreases interest rates, and thereby stimulates investment. As a consequence, in 2008, Paul Krugman is taking a neoclassical take on investment demand.
Q: Paul Krugman, Crowding In and the Paradox of Thrift, New York Times Blog, April 26, 2015 How did the IMF show empirically the existence of the paradox of thrift? A: It is hard to show empirically the existence of the paradox of thrift, because of the issue of reverse causation: if weak growth and weak investment are correlated, it could very well be that weak investment causes weak growth, instead of the other way around. (after all, investment is a component of GDP !) So when we say that the Keynesian investment function implies $I=b_0+b_1 Y$, it is in fact really hard to test this rigorously. We would want to get changes in GDP which are independant from changes in investment, to see whether these changes in GDP indeed lead to a change in investment. The International Monetary Fund (IMF) adopts an instrumental variables approach to deal with the problem of reverse causation. It uses fiscal consolidation as the instrument and finds the cases where spending cuts and/or tax hikes cause the decline of aggregate demand and weak growth, and it shows that this indeed results in a falling investment. This suggests that indeed, independant changes in $Y$ do lead to changes in investment $I$. Using this approach, the IMF finds that the introduction of deficit-reduction measures causes a decline of investment, which means the deficits were crowding in the investment.
Q: Neil Irwin, Interest Rates Just Keep Falling. Economic Orthodoxy Is Falling With Them, New York Times, July 4, 2019. Which economic orthodoxes are falling with interest rates, according to Neil Irwin? A: The economic orthodoxy that is falling with interest rate, is the idea that government deficits crowd out investment spending, through a rise in interest rates. He states that “generations of college economics students have been taught that this is simply how things work”. You have not !

N.3 Redistribution between the top 1% and the bottom 99%

For a household, the threshold to be in the top 1% is around $421,926 according to the Economic Policy Institute. Let me Google that for you: http://bfy.tw/KhF4.
Using the given notations (and those of lecture 9), total income is the sum of the top 1% income and that of the bottom 99%: \[Y=\lambda N \underline{y}+(1-\lambda) N \bar{y}.\] Since $\bar{y}=\gamma \underline{y}$ we have: \[Y=\lambda N \underline{y}+(1-\lambda) N \gamma \underline{y}.\] The total income for the bottom 99% $\underline{Y}$ is given by: \[\underline{Y}=\lambda N \underline{y}.\] Therefore, the share of total income captured by the bottom 99% is: \[ \begin{aligned} \frac{\underline{Y}}{Y}&=\frac{\lambda N \underline{y}}{\lambda N \underline{y}+(1-\lambda) N \gamma \underline{y}}\\ \frac{\underline{Y}}{Y}&=\frac{\lambda}{\lambda +(1-\lambda) \gamma} \end{aligned} \] Denoting by $\nu$ the share of income going to the low income: \[ \begin{aligned} \nu &\equiv \frac{\underline{Y}}{Y}\\ \nu &=\frac{\lambda}{\lambda +(1-\lambda) \gamma} \end{aligned} \] Solving for $\gamma$: \[ \begin{aligned} & \frac{\lambda}{\lambda +(1-\lambda) \gamma} = \nu \quad \Rightarrow \quad \lambda = \lambda \cdot\nu + (1-\lambda)\gamma \cdot \nu \\ & \quad \Rightarrow \quad \lambda \cdot (1-\nu)=\gamma \cdot (1-\lambda) \cdot \nu \quad \Rightarrow \quad \boxed{\gamma = \frac{\lambda}{1-\lambda}\frac{1-\nu}{\nu}} \end{aligned} \] A numerical application is $\nu=0.8$ and $\lambda = 0.99$ so that: \[ \begin{aligned} \gamma &= \frac{0.99}{1-0.99}\frac{1-0.8}{0.8}\\ &= \frac{99}{4} \\ \gamma &= 24.75 \end{aligned} \] This implies that on average, high income earners in the top 1% are approximately 25 times richer (exactly 24.75 times richer) than low income earners in the bottom 99% (note that you can use the above formula to recover the $\gamma = 9$ from the class, using $\lambda = 0.9$ and $\nu = 0.5$ since $\gamma = 0.9/0.1 \cdot0.5/0.5 = 9$).
Total consumption by the low income earners $\underline{C}$ is such that: \[ \begin{aligned} \underline{C}&=\lambda N \underline{c}\\ &=\lambda N \left(\underline{c}_{0}+\underline{c}_{1}(\underline{y}-\underline{t})\right)\\ &=\lambda N \underline{c}_{0} + \lambda N (1-t_1) \underline{c}_{1}\underline{y}-\lambda N \underline{c}_{1} \underline{t}_0\\ \underline{C}&=\left[\lambda N \underline{c}_{0}-\lambda N \underline{c}_{1} \underline{t}_0 \right]+ \frac{\lambda \underline{c}_{1}}{\lambda+(1-\lambda)\gamma}(1-t_1)Y \end{aligned} \] Symmetrically, consumption by the high income earners $\bar{C}$ is such that: \[ \begin{aligned} \bar{C}&=(1-\lambda) N \bar{c}\\ &=(1-\lambda) N \left(\bar{c}_{0}+\bar{c}_{1}(\bar{y}-\bar{t})\right)\\ &=(1-\lambda) N \bar{c}_{0} + (1-\lambda) N (1-t_1) \bar{c}_{1}\bar{y}-(1-\lambda) N \bar{c}_{1} \bar{t}_0\\ \bar{C}&=\left[(1-\lambda) N \bar{c}_{0}-(1-\lambda) N \bar{c}_{1} \bar{t}_0\right] + \frac{(1-\lambda) \gamma\bar{c}_{1}}{\lambda+(1-\lambda)\gamma}(1-t_1)Y \end{aligned} \] Therefore, aggregate consumption $C=\underline{C} + \bar{C}$ is given by: \[ \begin{aligned} C&=\underline{C} + \bar{C}\\ &=\left[\lambda N \underline{c}_{0}-\lambda N \underline{c}_{1} \underline{t}_0 \right]+ \frac{\lambda \underline{c}_{1}}{\lambda+(1-\lambda)\gamma}(1-t_1)Y + \left[(1-\lambda) N \bar{c}_{0}-(1-\lambda) N \bar{c}_{1} \bar{t}_0\right] + \frac{(1-\lambda) \gamma\bar{c}_{1}}{\lambda+(1-\lambda)\gamma}(1-t_1)Y\\ &=\left(\lambda N \underline{c}_{0}+(1-\lambda) N \bar{c}_{0}\right)-\left(\lambda N \underline{c}_{1} \underline{t}_0 +(1-\lambda) N \bar{c}_{1} \bar{t}_0\right) +\frac{\lambda\underline{c}_{1}+\left(1-\lambda\right)\gamma\bar{c}_{1}}{\lambda+(1-\lambda)\gamma}(1-t_1)Y\\ &=\left[\lambda N \underline{c}_{0}+(1-\lambda) N \bar{c}_{0}\right]-\left[\underline{c}_{1} (\lambda N \underline{t}_0) + \bar{c}_{1} ((1-\lambda) N \bar{t}_0)\right] +\frac{\lambda\underline{c}_{1}+\left(1-\lambda\right)\gamma\bar{c}_{1}}{\lambda+(1-\lambda)\gamma}(1-t_1)Y\\ C&=C_0 -\left(\underline{c}_{1}\underline{T}_0+\bar{c}_{1}\bar{T}_0\right)+c_1 (1-t_1) Y. \end{aligned} \] where we have used the suggested notations: \[ \begin{aligned} C_{0}& \equiv \lambda N \underline{c}_0 + (1-\lambda) N \bar{c}_0\\ \underline{T}_{0}& \equiv \lambda N \underline{t}_0\\ \bar{T}_0 & \equiv (1-\lambda) N \bar{t}_0\\ c_{1}&\equiv\frac{\lambda\underline{c}_{1}+\left(1-\lambda\right)\gamma\bar{c}_{1}}{\lambda+(1-\lambda)\gamma}. \end{aligned} \] Therefore, aggregate consumption is given by: \[\boxed{C=C_0 -\left(\underline{c}_{1}\underline{T}_0+\bar{c}_{1}\bar{T}_0\right)+c_1 (1-t_1) Y}.\]
$c_1$ is the average marginal propensity to consume, where the marginal propensity to consume of each group $\underline{c}_1$ and $\bar{c}_1$ is weighted by their share of income in the population $\underline{Y}/Y$ and $\bar{Y}/Y$: \[ \begin{aligned} c_1&=\frac{\lambda\underline{c}_{1}+\left(1-\lambda\right)\gamma \bar{c}_{1}}{\lambda+(1-\lambda)\gamma}\\ &=\frac{\lambda}{\lambda + (1-\lambda)\gamma}\underline{c}_{1} +\frac{(1-\lambda)\gamma}{\lambda+(1-\lambda)\gamma}\bar{c}_{1}\\ c_1&=\frac{\underline{Y}}{Y}\underline{c}_{1} + \frac{\bar{Y}}{Y}\bar{c}_{1} \end{aligned} \] This has a straightforward economic interpretation: for each additional dollar of output, a fraction $\underline{Y}/Y$ goes to low income earners who consume a fraction $\underline{c}_1$, and a fraction $\bar{Y}/Y$ goes to high income earners who consume a fraction $\bar{c}_1$. The average propensity to consume is the sum of these two fractions. We can then simply compute the average marginal propensity to consume when $\underline{c}_1=1$ and $\bar{c}_1=1/4$: \[ \begin{aligned} c_1 &= \frac{\underline{Y}}{Y}\underline{c}_{1} + \frac{\bar{Y}}{Y}\bar{c}_{1}\\ &=\frac{4}{5}\cdot 1 + \frac{1}{5}\cdot \frac{1}{4}\\ c_1 &= \frac{17}{20} \end{aligned} \] Therefore the average propensity to consume is: \[\boxed{c_1 = 0.85}.\]
Using the expression for aggregate consumption $C$ in question 3., and that $I=b_0+b_1 Y$, and plugging it into total aggregate demand $Z$ yields: \[ \begin{aligned} Z &=C+I+G\\ &=C_0 -\left(\underline{c}_{1}\underline{T}_0+\bar{c}_{1}\bar{T}_0\right)+c_1 (1-t_1) Y + b_{0}+b_{1}Y+G\\ Z &=\left[C_0 -\left(\underline{c}_{1}\underline{T}_0+\bar{c}_{1}\bar{T}_0\right)+ b_{0} + G \right]+ \left(c_1(1-t_1) + b_1\right) Y \end{aligned} \] Equating aggregate demand to aggregate income $Z = Y$ gives the value for output (see the lecture notes for details): \[\boxed{Y=\frac{1}{1-\left(1-t_{1}\right)c_{1}-b_{1}}\left[C_0-\underline{c}_{1}\underline{T}_{0}-\bar{c}_{1}\bar{T}_{0}+b_{0}+G\right]}\]
As in lecture 9, a 100 billion dollars tax cut on the top 1% $\Delta \bar{T}_0 = -100$ leads to an increase in GDP given by: \[ \begin{aligned} \Delta Y &=\frac{-\bar{c}_1 \Delta \bar{T}_0}{1-c_1(1-t_1)-b_1}\\ &=\frac{-1/4 \cdot (-100 \text{ billion})}{1-0.85 \cdot 0.75-1/6}\\ \Delta Y&\approx 127.6 \text{ billion}. \end{aligned} \] Thus, according to these numbers, we get a 127.6 billion dollars increase in GDP. The impact on the government surplus is given by: \[ \begin{aligned} \Delta\left(T-G\right)&=\Delta T\\ &=\Delta\bar{T}_{0}+\Delta\underline{T}_{0}+t_1\Delta Y\\ &=\Delta\bar{T}_{0}+t_1\Delta Y \\ & \approx -100 + \frac{1}{4} \cdot 127.6\\ \Delta\left(T-G\right) & \approx-68.1 \text{ billion} \end{aligned} \] Thus, we get a 68.1 billion dollars increase in the government deficit.
A 100 billion dollars tax cut on the bottom 99% $\Delta \underline{T}_0 = -100$ leads to an increase in GDP given by: \[ \begin{aligned} \Delta Y &=\frac{-\underline{c}_1 \Delta \underline{T}_0}{1-c_1(1-t_1)-b_1}\\ &=\frac{-1 \cdot (-100 \text{ billion})}{1-0.85 \cdot 0.75-1/6}\\ \Delta Y&\approx 510.6 \text{ billion}. \end{aligned} \] Thus, according to these numbers, we get a 510.6 billion dollars increase in GDP. The impact on the government surplus is given by: \[ \begin{aligned} \Delta\left(T-G\right)&=\Delta T\\ &=\Delta\bar{T}_{0}+\Delta\underline{T}_{0}+t_1\Delta Y\\ &=\Delta\underline{T}_{0}+t_1\Delta Y \\ & \approx -100 + \frac{1}{4} \cdot 510.6\\ \Delta\left(T-G\right) & \approx 27.6 \text{ billion} \end{aligned} \] Thus, despite the 100 billion dollars tax cut, we get a 27.6 billion dollars increase in the government surplus, or a reduction in the government deficit. In this situation, tax cuts more than pay for themselves. This seems like a much better policy than the tax reduction on the rich. However, we will see in the next lectures that things are not so straightforward.
Finally, a 100 billion dollars tax cut on the bottom 99% $\Delta \underline{T}_0 = -100$ financed by a 100 billion tax increase on the top 1% $\Delta \bar{T}_0 = 100$ leads to an increase in GDP given by: \[ \begin{aligned} \Delta Y &=\frac{(\underline{c}_1-\bar{c}_1) \Delta \bar{T}_0}{1-c_1(1-t_1)-b_1}\\ &=\frac{(1-1/4) \cdot (100 \text{ billion})}{1-0.85 \cdot 0.75-1/6}\\ \Delta Y&\approx 383.0 \text{ billion} \end{aligned} \] Thus, according to these numbers, we get a 383.0 billion dollars increase in GDP. The impact on the government surplus is given by: \[ \begin{aligned} \Delta\left(T-G\right)&=\Delta T\\ &=\Delta\bar{T}_{0}+\Delta\underline{T}_{0}+t_1\Delta Y\\ &=t_1\Delta Y \\ &\approx \frac{1}{4} \cdot 383.0\\ \Delta\left(T-G\right) & \approx 95.7 \text{ billion} \end{aligned} \] We get a 95.7 billion dollars increase in the government surplus, or a reduction in the government deficit.
If there are no automatic stabilizers ($t_1 = 0$), then the multiplier apparently becomes infinite since $1-c_1-b_1 = 1-0.85 - 1/6<0$. However, this is impossible, as there are only finite resources in the economy. Therefore, this implies that output is determined by supply constraints (amount of labor, capital, and technology), as in the Solow growth model, and that the Keynesian type of analysis no longer applies.
In fact, as I said during the lecture, this result is intuitive. If $1-c_1<b_1$, then this implies that the marginal propensity to invest $b_1$ is higher than the marginal propensity to save $1-c_1$. As a result, we never get to a Keynesian situation of “too much saving”.

N.4 Readings - Redistributive Policies

Q: Laura D’Andrea Tyson, Owen Zidar. “Tax Cuts for Job Creators.” New York Times Blog Post, October 19, 2012. How can we test “trickle-down economics” using state-level data? What additional arguments are given in favor of the main thesis in this article? A: According to this article, there are two types of tests that one can think of running in order to show that tax cuts are more effective at helping growth when they fall on relatively higher incomes, than when they are given to low income people (“trickle-down economics”). The first test consists in looking at state-level data: in order to test that hypothesis, we can use the fact that the number of rich households varies across states. If trickle-down economics was true, then states with a larger share of households in the top 5% of the income distribution should see faster economic growth, after a tax cut on the rich, than those with a smaller share. If they grow faster relative to other states after a tax cut to the rich, then we know that tax cuts for the rich are an effective way of stimulating the economy. The authors find that this is not true in the data. The other evidence given in the article that tax cuts for the rich are not an effective way of stimulating the economy is that job growth was slower after the Bush tax cuts than it was after the Clinton tax rises. When Bush cut taxes the job growth rate didn’t increase and was in fact slower than during the Clinton era, when taxes were increased. Moroever, the authors argue that tax cuts for low income households lead to a larger dollar-for-dollar increase in consumption and thus were a more effective way of stimulating the economy. Note however that this argument is not bullet proof: it could very well be that Bill Clinton experienced higher growth, just because his presidency happened to occur right during the Internet boom (for example), hence the importance of looking at state-level data rather than at US-level data only.
Q: “Inequality Will Eventually Hurt the Rich, Too”, Michael Pettis, New York Times, April 18, 2019. What is the link between debt accumulation and consumer demand? When is inequality potentially beneficial? A: Accumulating consumer debt allows poorer households to increase their demand for consumption goods even when their incomes are not growing. The author mentions how consumer demand continued to rise prior to 2008 even though consumer incomes were stagnant. In this sense consumer debt accumulation increased consumption demand. Inequality is potentially beneficial when capital is scarce, for the same reason as saving is good in the Solow model when we are below the Golden Rule of capital accumulation. When capital is scarce the lower MPCs of the rich means that higher levels of income inequality will lead to more savings and hence a greater amount of capital. Since higher levels of capital accumulation translate into higher incomes eventually (higher GDP), this can lead to higher incomes for everyone (through higher wages for excample), and therefore higher consumption. The author mentions how this was potentially true in the US during the 19th century. At that time, higher levels of income inequality probably implied both more saving and more investment. This led to higher incomes for everyone in the long run. However, it is not clear that this is still the case, and therefore it could very well be that higher desired saving is holding back growth, instead of helping it, as in the Keynesian model.

N.5 Paul Krugman VS Robert Barro on the Bush tax cuts

A transcript of the interview is available on Mark Thoma’s Economists’s View blog. Robert Barro’s view on taxes is in line with the previous exercise. He believes that what matters is incentives, and he has a purely “supply side” view of taxes:

And the basic way you do that is you cut the taxes where they start out having the highest rates. Now, a lot of that at the moment is among the rich people. It used to be it was more at the poor people. Though we’ve actually done a lot on that with respect to the earned income tax credits, so it is not so true now. But the way you really enhance the economy and make it work better is by cutting tax rates where they start out being the highest, because that is where the government is initially distorting the situation the most.

This is what we have found in the previous exercise where output was completely determined by the supply side. Cuts on the high income were more efficient than cuts on the low income in the calculations above. Robert Barro says that the reason why tax cuts work so well on the high income, is because taxes on the rich are initially higher so “that is where the government is initially distorting the situation the most.” How can we understand the logic of his argument? In fact, there are some mathematics behind Robert Barro’s comment (he was a physics major at Caltech). Recall that we found that income (which adjusts based on labor supply) is given as a function of taxes as: \[y = (1-\tau)^{1/\epsilon}\frac{A^{1/\epsilon+1}}{B^{1/\epsilon}}\] Taking logs of this expression, as we often do when we are dealing with power functions (as in the problem set on the labor market): \[\log(y) = \frac{1}{\epsilon}\log\left(1-\tau\right)+\left(\frac{1}{\epsilon}+1\right)\log A -\frac{1}{\epsilon} \log B.\] Therefore, a change in marginal taxes $\tau$ leads to a change in $y$ given by: \[\frac{d\log(y)}{d\tau}=\frac{1}{\epsilon}\frac{d \log(1-\tau)}{d \tau}.\] The derivative of $\log(1-\tau)$ with respect to $\tau$ is: \[\frac{d \log(1-\tau)}{d \tau}=-\frac{1}{1-\tau}.\] Therefore, a change in $\tau$ leads to a change in $y$ given by: \[\boxed{\frac{d\log(y)}{d\tau}=-\frac{1}{\epsilon (1-\tau)}}.\] Therefore, the percentage increase in income following a marginal tax cut for a person is an increasing function of $\tau$, or what is marginal tax rate initially is. So Robert Barro is validated in his statement that the government should cut taxes where marginal tax rates are initially the highest. For example, for $\epsilon=2$ as above, and $\tau=0.5$, we get an increase in income of \[ \begin{aligned} \Delta \log (y)&=-\frac{\Delta \tau}{\epsilon (1-\tau)}\\ &=-\frac{-0.01}{2 \cdot (1-0.5)}\\ \Delta \log (y)&=0.01 \end{aligned} \] or 1% for a fall in the marginal tax rate of $\Delta \tau = -1\% = -0.01$. For $\tau=0.25$, we get an increase in income of \[ \begin{aligned} \Delta \log (y)&=-\frac{\Delta \tau}{\epsilon (1-\tau)}\\ &=-\frac{-0.01}{2 \cdot (1-0.25)}\\ \Delta \log (y)&=0.0066 \end{aligned} \] or 0.66% for a fall in the marginal tax rate of $\Delta \tau = -1\%= -0.01$.

On the other hand, Robert Barro does not “like” Keynesian stimulus at all, which he views as just “heaping money” to people. To him, tax cuts must be used in order to increase the incentives to work (or rather, lower the desincentives to work):

And I should distinguish a lot between the 2003 and 2001 tax cut plans. They are really quite different. The big thing about the 2003 plan is that it didn’t just heap money to people. It didn’t just particularly give money to people at increased incentives to do things. It did that particularly by accelerating the marginal income tax rate cuts.

On the contrary, Paul Krugman, when asked by Charlie Rose “If, in fact, Senator Kerry is elected, and if, in fact, he is able to go forward with his program to roll back tax cuts for those making more than $200,000 a year, what impact do you think it will have on the economy and reducing the deficit?” responds:

I don’t think it would have a negative effect. Because I think that we have a problem of demand, not supply. And we have a problem of spending, not incentives. And since, in fact, Kerry is proposing to use the bulk of any additional revenue for new government programs, doesn’t do much for the deficit, actually. But it also doesn’t do anything to depress spending. And at the point about the scale of it, the scale of what Kerry is proposing is, in fact, almost identical to the Clinton ’93 tax increase, which did not exactly sink the U.S. economy. So, you know, you can’t make the case that this is a major thing one way or the other.

Paul Krugman does not think that higher taxes on the rich are very detrimental to the economy. For him, the economy suffers more from a lack of demand, of spending, than from the types of incentive effects in the previous exercise.

Perhaps paradoxically, Paul Krugman worries a lot more about rising public debt than Robert Barro. In particular, he worries that the reason why tax cuts on the rich are made is to later “starve the beast”:

Robert, you have written about and you are supporting starve-the- beast as a doctrine, cut revenues and then we can use that to squeeze down government spending. You know, I’m not in favor of that. I think that the government does important stuff. And I don’t think there is a lot of – … basically, we have this $400 billion deficit. It’s not going to go away simply through economic growth, it has to be some combination of either spending cuts or increased revenue. And I don’t like these tax cuts, which … will eventually force a cut in programs that I think are very important to people’s lives.

But he does not want to see expenditures phased out:

I want to maintain the social insurance institutions. I want to maintain Social Security, Medicare and Medicaid. And I want some from further expansion of health insurance. So I say look, you know, we don’t have enough revenue as is. I don’t want more tax cuts that will further undermine the revenue base that makes it possible to have these programs that sand off some of the rough edges of capitalism. (…) I just have to say, what kind of conservative, or better what kind of Republican is Bush? And the answer, of course, is he is a banana Republican. Just look at the fiscal irresponsibility.

On the contrary, Robert Barro is not that worried about public debt. He likes that tax cuts help provide incentives to work, as in the previous exercise. He says:

You can’t go that crazy about the deficit. It’s a little over 3 percent of the GDP now.

As we will see during the following lectures, Robert Barro believes in “Ricardian” equivalence: he thinks that rational optimizers anticipate future tax increases to come, and that they will offset by an increase in private saving the tax increases done by the government.⁵⁴

Robert Barro suggests at the end of the interview some reforms on the Social Security side “in terms of introducing some kind of private accounts, not changing the pensions for the existing retired people or people close to it, but down the road, people who are younger.” This is how he justifies his position:

Well, that is something that has to be argued out. But you know, you can imagine going up to somebody in early 40, but that’s something you would have to go through. But the idea would be to have a transition where a lot more in the next generation is going to be in terms of personal accounts, rather than the kind of plan that we’ve had since the 1930s.

Paul Krugman, on the other hand, has the following view:

Well, I think it is a terrible idea, but even aside from that, it is very expensive. Because the real problem in any of these things, is what about somebody – well, my age, your age, we have been paying into the system all our lives. We don’t have a private account. Where is the money going to come from to pay for our retirement? You and I don’t need it, but a lot of people do. And then you start to say well, we’re going to finance that. Anyway you cut it, you need several trillion dollars of additional money injected into the system to make any of these things work, to cover the transition. I don’t think it is a good idea to do it in any case, but to cover this transition for all the people who’ve been paying into what has been a pay-as-you-go program, where each generation supports the previous generation in its retirement.

Finally, Robert Barro compares the pay-as-you-go system the U.S. currently has to implicit public debt:

What you have now is a situation where there is a big public debt out there in the form of future Social Security payments. So it’s not an official part of the public debt, but it’s just like it. And, of course, the transition is where you increase the literal debt in order to fund it, you have to borrow. But you’re replacing implicit debt, which is the pension payments you are owing, with explicit debt.

As we have seen in problem set 7, this discussion on the pay-as-you-go makes perfect sense in terms of the overlapping-generations model. Robert Barro want to see life-cycle contribute to capital accumulation, and phase-out progressively the current system of pay-as-you-go. Of course, as Paul Krugman points out, promises have been made to the current generation of retirees, which means that public debt would need to be taken on by the government in order to pay these pension benefits (if the new young generation now invests their saving in private accounts, then they cannot at the same time contribute to the pay-as-you-go system). Therefore, privatizing social security would merely make existing government liabilities more visible, by transforming them from implicit liabilities into explicit liabilities.

There is no theorem in mathematics which says that the limit of the maxima of many functions converges to the maximum of the limit of these functions. I said “unfortunately”, but this is fortunate actually, as we just found a counterexample in this exercise: the limit of the functions does not, in fact, have a maximum!↩
This is also how economists work in practice. Economists rarely observe preferences directly (outside of a lab where they would run experiments), but only the choices that people make. (which comes from a variety of sources: surveys, administrative data, etc.) Therefore, they rationalize people’s choices picking values for the parameters in their models, so that their models are able to match people’s behavior.↩
The fact that high income earners do not increase their consumption much following tax cuts is not necessarily due to the fact that they expect future taxes to come. Another candidate explanation is simply that they have a lower marginal propensity to consume - which, in turn, might come from the various reasons outlined in lecture 4. They may have utility for wealth, in the form of prestige, or just the joy of owning. Or, in the case of the extremely rich, they simply would not know what to do with this money.↩