13.6 Tweedie Distribution

In this section, you learn how to:
  • Define the Tweedie distribution as a Poisson sum of gamma random variables
  • Express the distribution as a mixture of discrete and continuous components
  • Parameterize the Tweedie distribution to that it can be used in a GLM context

Video Overview of the Section (Alternative .mp4 Version -5:30 min)


We have seen that the natural exponential family includes continuous distributions, such as the normal and gamma, as well as discrete distributions, such as the binomial and Poisson. It also includes distributions that are mixtures of discrete and continuous components. In insurance claims modeling, the most widely used mixture is the Tweedie (1984) distribution. It has a positive mass at zero representing no claims and a continuous component for positive values representing the amount of a claim.

The Tweedie distribution is defined as a Poisson sum of gamma random variables, known as an aggregate loss in actuarial science. Specifically, suppose that (N) has a Poisson distribution with mean (lambda), representing the number of claims. Let {(y_j)} be an i.i.d. sequence, independent of (N), with each (y_j) having a gamma distribution with parameters (alpha) and (gamma), representing the amount of a claim. Then, (S_N = y_1 + ldots + y_N) is Poisson sum of gammas. Section 16.5 will discuss aggregate loss models in further detail. This section focuses on the important special case of the Tweedie distribution.

To understand the mixture aspect of the Tweedie distribution, first note that it is straightforward to compute the probability of zero claims as
begin{equation*}
Pr ( S_N=0) = Pr (N=0) = e^{-lambda}.
end{equation*} The distribution function can be computed using conditional expectations,
begin{equation*}
Pr ( S_N leq y) = e^{-lambda}+sum_{n=1}^{infty} Pr(N=n) Pr(S_n leq y), ~~~~~y geq 0.
end{equation*} Because the sum of i.i.d. gammas is a gamma, (S_n) (not (S_N)) has a gamma distribution with parameters (nalpha) and (gamma). Thus, for (y>0), the density of the Tweedie distribution is
begin{equation}
mathrm{f}_{S}(y) = sum_{n=1}^{infty} e^{-lambda} frac{lambda ^n}{n!} frac{gamma^{n alpha}}{Gamma(n alpha)} y^{n alpha -1} e^{-y gamma}.
end{equation}

At first glance, this density does not appear to be a member of the linear exponential family. To see the relationship, we first calculate the moments using iterated expectations as
begin{equation}
mathrm{E~}S_N = lambda frac{alpha}{gamma}~~~~~~mathrm{and}~~~~~ mathrm{Var~}S_N = frac{lambda alpha}{gamma^2} (1+alpha).
end{equation} Now, define three parameters (mu, phi, p) through the relations
begin{equation*}
lambda = frac{mu^{2-p}}{phi (2-p)},~~~~~~alpha = frac{2-p}{p-1}~~~~~~mathrm{and}~~~~~ frac{1}{gamma} = phi(p-1)mu^{p-1}.
end{equation*} Inserting these new parameters yields
begin{equation}
mathrm{f}_{S}(y) = exp left[ frac{-1}{phi} left( frac{mu^{2-p}}{2-p} + frac{y}{(p-1)mu^{p-1}} right) +S(y,phi) right].
end{equation} We leave the calculation of (S(y,phi)) as an exercise.

Thus, the Tweedie distribution is a member of the linear exponential family. Easy calculations show that begin{equation}
mathrm{E~}S_N = mu~~~~~~mathrm{and}~~~~~ mathrm{Var~}S_N = phi mu^p,
end{equation} where 1 < p < 2. Examining our variance function Table 13.1, the Tweedie distribution can also be viewed as a choice that is intermediate between the Poisson and the gamma distributions.

[raw] [/raw]