This post introduces the class of discrete distributions called the (a,b,0) class.
A counting distribution is the distribution of a discrete random variable that takes values in the non-negative integers 0, 1, 2, … Examples include the Poisson distribution, the binomial distribution and the negative binomial distribution. These distributions are potential models for the number of occurrences of some random event of interest, e.g. the number of losses in actuarial applications. The discussion below shows that the notion of the (a,b,0) class is another way to describe the big three counting distributions: Poisson, binomial and negative binomial. The notion of the (a,b,1) class is a generalization of the (a,b,0) class and is defined in a subsequent post.
The (a,b,0) Class
The (a,b,0) class is at heart a recursive algorithm to generate probabilities. Let's fix some notation. Let $N$ be a counting random variable. For each $k = 0, 1, 2, \dots$, let $P_k = P(N = k)$. The counting random variable $N$ is said to be a member of the (a,b,0) class of distributions if for some constants $a$ and $b$ the following recursive relation holds:

$$\frac{P_k}{P_{k-1}} = a + \frac{b}{k}, \qquad k = 1, 2, 3, \dots \qquad (1)$$
Note that the recursive relation (1) generates the probabilities $P_k$ for all integers $k$ starting at 1. The relation (1) does not account for $P_0$. Does that mean that the initial probability $P_0$ can be any arbitrary probability value? Note that the recursive relation (1) means that each $P_k$ is ultimately expressed in terms of $P_0$: applying (1) repeatedly gives $P_k = P_0 \prod_{j=1}^{k} \left(a + \frac{b}{j}\right)$.
When $a$ and $b$ are fixed, the value of $P_0$ is also fixed since the probabilities must sum to 1. In fact $P_0$ is the following value.

$$P_0 = \left[ 1 + \sum_{k=1}^{\infty} \prod_{j=1}^{k} \left( a + \frac{b}{j} \right) \right]^{-1} \qquad (2)$$

Thus a member of the (a,b,0) class has two parameters, namely $a$ and $b$, which completely determine the distribution.
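The recursion and the normalization described above can be sketched in a few lines of Python. This is only an illustration, not code from the original post: the function name `ab0_probabilities` and the `tail_terms` cutoff are hypothetical, the infinite sum for $P_0$ is truncated, and the sketch assumes $a < 1$, $a + b > 0$ and moderate parameter values so that the weights neither diverge nor overflow.

```python
import math


def ab0_probabilities(a, b, max_k, tail_terms=5000):
    """Return [P_0, ..., P_max_k] under P_k = (a + b/k) * P_{k-1}.

    Hypothetical helper: assumes a < 1 and a + b > 0.
    """
    if a >= 1 or a + b <= 0:
        raise ValueError("need a < 1 and a + b > 0")
    # Unnormalized weights w_k = prod_{j=1..k} (a + b/j), with w_0 = 1.
    weights = [1.0]
    for k in range(1, max_k + tail_terms + 1):
        factor = a + b / k
        if factor <= 1e-12:
            # The support ends here; in the binomial case the factor
            # is exactly zero at k = n + 1.
            break
        weights.append(weights[-1] * factor)
    # P_0 is the reciprocal of the sum of all weights (sum truncated
    # after tail_terms extra terms, an approximation for infinite support).
    p0 = 1.0 / math.fsum(weights)
    probs = [p0 * w for w in weights[: max_k + 1]]
    # Pad with zeros beyond a finite support.
    probs += [0.0] * (max_k + 1 - len(probs))
    return probs


# Illustrative parameter values only: a = 0 and b = 2.
print(ab0_probabilities(0.0, 2.0, 5))
```

With $a$ close to 1 the tail decays slowly, so a larger `tail_terms` would be needed for the truncated sum to be accurate.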
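Example 1. Let $a = 0$ and $b = \lambda$ where $\lambda$ is a fixed positive constant. Using (1), we see that

$$P_k = \frac{\lambda}{k} P_{k-1}, \qquad \text{so that} \qquad P_1 = \lambda P_0, \quad P_2 = \frac{\lambda^2}{2!} P_0, \quad \dots, \quad P_k = \frac{\lambda^k}{k!} P_0$$

According to (2),

$$P_0 = \left[ 1 + \sum_{k=1}^{\infty} \frac{\lambda^k}{k!} \right]^{-1} = \left[ e^{\lambda} \right]^{-1} = e^{-\lambda}$$

With $P_0 = e^{-\lambda}$, the probabilities $P_k = e^{-\lambda} \lambda^k / k!$ are from a Poisson distribution with mean $\lambda$. Thus, when the parameter $a$ is 0 and the parameter $b$ is a positive constant, the corresponding distribution from the (a,b,0) class is a Poisson distribution.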
Only Three Members in the (a,b,0) Class
In essence, the (a,b,0) class has only three members, namely the big three discrete distributions: the Poisson distribution, the binomial distribution and the negative binomial distribution, with each distribution represented by a different sign of the parameter $a$. Using the recursive relation (1), it can be shown that each of the big three distributions belongs to the (a,b,0) class. The following table shows the parameters $a$ and $b$ in the three cases.
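Table 1

| Distribution | Usual Parameters | Parameter a | Parameter b |
|---|---|---|---|
| Poisson | $\lambda > 0$ | $0$ | $\lambda$ |
| Binomial | $n$ and $p$ | $-\frac{p}{1-p}$ | $\frac{(n+1)p}{1-p}$ |
| Negative binomial | $r$ and $p$ | $1-p$ | $(r-1)(1-p)$ |
| Negative binomial | $r$ and $\theta$ | $\frac{\theta}{1+\theta}$ | $\frac{(r-1)\theta}{1+\theta}$ |
| Geometric | $p$ | $1-p$ | $0$ |
| Geometric | $\theta$ | $\frac{\theta}{1+\theta}$ | $0$ |

In the negative binomial rows, the probability function is taken to be $P_k = \binom{k+r-1}{k} p^r (1-p)^k$ for $k = 0, 1, 2, \dots$, and the second parametrization uses $p = \frac{1}{1+\theta}$.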
Table 1 shows how to parametrize the three distributions. For example, for the binomial distribution with parameters $n$ (the number of trials) and $p$ (the probability of success), the (a,b,0) parameters are $a = -\frac{p}{1-p}$ and $b = \frac{(n+1)p}{1-p}$. The two rows for negative binomial reflect two different parametrizations. Of course, the geometric distribution is simply a negative binomial distribution with parameter $r = 1$. Essentially Table 1 consists of three different distributions.
Table 1 works in the opposite direction as well. Any set of (a,b,0) parameters $a$ and $b$ must fit into one of the distributions listed in Table 1. In other words, the recursive relation (1) produces no new counting distribution. Any counting distribution satisfying (1) must be one of the big three counting distributions listed in Table 1.
Note that under the recursive relation (1), not all combinations of $a$ and $b$ produce a probability distribution. For example, when both $a$ and $b$ are negative constants, the resulting probabilities are negative for odd $k$. When $a + b < 0$, the relation gives $P_1 = (a + b) P_0 < 0$, so the resulting values cannot all be non-negative. When $a + b = 0$, $P_0 = 1$, i.e. the distribution is a point mass at 0. So we restrict attention to the case where $a + b > 0$.
To echo the point made previously, when $a + b > 0$ and the recursive relation (1) produces a viable probability distribution, the resulting distribution must be one of the three distributions listed in Table 1. This point is not entirely obvious. Interested readers can see chapter 6 of Panjer and Willmot (1992), listed at the end of this post.
Table 1 indicates that the sign of the parameter $a$ determines the form of the (a,b,0) distribution. If $a = 0$, it is a Poisson distribution. If $a$ is negative, it is a binomial distribution. If $a$ is positive, it is a negative binomial distribution.
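The reverse direction of Table 1 can be sketched as a small Python function that takes $a$ and $b$ and returns the matching distribution with its usual parameters. The function name `classify_ab0` is hypothetical, and the mapping assumes the conventions $a = -p/(1-p)$, $b = (n+1)p/(1-p)$ for the binomial and $a = 1 - p$, $b = (r-1)(1-p)$ for the negative binomial (where $p$ is the success probability and the variable counts failures); a different parametrization would need different inversion formulas.

```python
def classify_ab0(a, b, tol=1e-9):
    """Map (a, b) to a distribution name and its usual parameters.

    Hypothetical helper; see the lead-in for the assumed conventions.
    """
    if a >= 1 or a + b <= 0:
        raise ValueError("need a < 1 and a + b > 0")
    if abs(a) <= tol:
        # a = 0: Poisson with mean b.
        return "poisson", {"lambda": b}
    if a < 0:
        # Invert a = -p/(1-p) and b = (n+1)p/(1-p).
        p = -a / (1.0 - a)
        n = -b / a - 1.0
        if n < 1 - tol or abs(n - round(n)) > 1e-6:
            raise ValueError("a < 0 requires -b/a - 1 to be a positive integer")
        return "binomial", {"n": int(round(n)), "p": p}
    # a > 0: invert a = 1 - p and b = (r - 1)(1 - p).
    return "negative binomial", {"r": b / a + 1.0, "p": 1.0 - a}


# Illustrative values only.
print(classify_ab0(-1.0, 5.0))
print(classify_ab0(0.4, 0.8))
```

The integer check in the binomial branch reflects the point made above that not every pair $(a, b)$ yields a probability distribution.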
We now present a few more examples illustrating the working of the (a,b,0) recursive relation.
Example 2. This example illustrates that knowing three consecutive probabilities of a member of the (a,b,0) class determines the entire distribution. For example, suppose we know that
These three consecutive probabilities produce the following two linear equations in $a$ and $b$.
Solving these two linear equations produces the values of $a$ and $b$. Since $a$ is positive, this is a negative binomial distribution. The corresponding negative binomial parameters then follow from Table 1. With this information, the (a,b,0) distribution in question is completely determined. The following are several distributional quantities.
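The step of going from three consecutive probabilities to $a$ and $b$ can be sketched as follows. Given $P_k$, $P_{k+1}$, $P_{k+2}$, relation (1) gives $P_{k+1}/P_k = a + b/(k+1)$ and $P_{k+2}/P_{k+1} = a + b/(k+2)$, which are two linear equations in $a$ and $b$. The function name and the numerical inputs below are hypothetical and chosen only for illustration; they are not the values used in the example above.

```python
def solve_ab_from_consecutive(k, p_k, p_k1, p_k2):
    """Solve for (a, b) given P_k, P_{k+1}, P_{k+2} of an (a,b,0) member."""
    r1 = p_k1 / p_k    # equals a + b / (k + 1)
    r2 = p_k2 / p_k1   # equals a + b / (k + 2)
    # Subtracting the two equations: r1 - r2 = b / ((k + 1)(k + 2)).
    b = (r1 - r2) * (k + 1) * (k + 2)
    a = r1 - b / (k + 1)
    return a, b


# Hypothetical inputs: P_0 = 0.216, P_1 = 0.2592, P_2 = 0.20736.
a, b = solve_ab_from_consecutive(0, 0.216, 0.2592, 0.20736)
print(a, b)  # approximately 0.4 and 0.8, so a > 0 (negative binomial)
```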
Example 3. Actually any three given probabilities determine the entire (a,b,0) distribution. They do not have to be consecutive. Suppose we are given the following probabilities.
Applying the recursive relation (1) produces the following equations.
The above three equations lead to the following two equations.
Of the above two equations, one is a linear equation and one is a quadratic equation. Solving these two equations produces the values of $a$ and $b$. Since $a$ is negative, this is a binomial distribution. Using the translation in Table 1 gives the following equations.
Solving these equations gives $n$ and $p$. The (a,b,0) distribution in question is then completely determined.
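Another distributional quantity that can give insight into the (a,b,0) class is the factorial moment. For any random variable $N$, its $k$th factorial moment is

$$\mu_{(k)} = E[N(N-1)(N-2) \cdots (N-k+1)] \qquad (3)$$

For example, the first three factorial moments are:

$$\mu_{(1)} = E[N], \qquad \mu_{(2)} = E[N(N-1)], \qquad \mu_{(3)} = E[N(N-1)(N-2)]$$

For any member of the (a,b,0) class with parameters $a$ and $b$, the first factorial moment is:

$$\mu_{(1)} = E[N] = \frac{a+b}{1-a} \qquad (4)$$

The higher (a,b,0) factorial moments can be obtained recursively as follows:

$$\mu_{(k)} = \frac{a k + b}{1-a} \, \mu_{(k-1)}, \qquad k = 2, 3, \dots \qquad (5)$$

The recursive formula (5) is a good way to determine the raw moments of a member of the (a,b,0) class. For example, the following calculation gives the second raw moment and the variance of the random variable $N$, assumed to be a member of the (a,b,0) class with parameters $a$ and $b$.

$$E[N(N-1)] = \frac{2a+b}{1-a} \cdot \frac{a+b}{1-a} = \frac{(2a+b)(a+b)}{(1-a)^2}$$

$$E[N^2] = E[N(N-1)] + E[N] = \frac{(a+b)(a+b+1)}{(1-a)^2}$$

$$Var(N) = E[N^2] - E[N]^2 = \frac{a+b}{(1-a)^2}$$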
One interesting characteristic of the (a,b,0) class is that knowing limited distributional information determines the distribution. Example 2 and Example 3 show that knowing three point masses completely determines the (a,b,0) distribution. The above derivation shows that knowing the mean and the variance also completely determines the (a,b,0) distribution.
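As a sketch of the last point, the standard (a,b,0) results $E[N] = \frac{a+b}{1-a}$ and $Var(N) = \frac{a+b}{(1-a)^2}$ give $E[N] / Var(N) = 1 - a$, so $a$ and then $b$ can be recovered from the mean and the variance. The function name and the numerical inputs below are hypothetical and serve only as an illustration.

```python
def ab_from_mean_variance(mean, variance):
    """Recover (a, b) from the mean and variance of an (a,b,0) member."""
    if mean <= 0 or variance <= 0:
        raise ValueError("mean and variance must be positive")
    # mean / variance = 1 - a
    a = 1.0 - mean / variance
    # mean = (a + b) / (1 - a), so b = mean * (1 - a) - a
    b = mean * (1.0 - a) - a
    return a, b


# Hypothetical inputs: mean 2 and variance 1 give a < 0 (binomial case).
print(ab_from_mean_variance(2.0, 1.0))  # (-1.0, 5.0)
```

A variance equal to the mean gives $a = 0$ (Poisson), a variance below the mean gives $a < 0$ (binomial), and a variance above the mean gives $a > 0$ (negative binomial).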
Fitting (a,b,0) Distributions
If the (a,b,0) recursive formula in (1) generates no new distributions, why study the (a,b,0) class rather than just focus on the Poisson, binomial and negative binomial distributions individually? One reason for studying the recursive (a,b,0) formula is that it gives a graphical way to choose an appropriate member of the (a,b,0) class. To see this, rewrite (1) as follows:

$$k \, \frac{P_k}{P_{k-1}} = a k + b, \qquad k = 1, 2, 3, \dots \qquad (6)$$

Note that the quantity on the right side of (6) is a linear function of the integer $k$. If we plot the left-hand side of (6) with $k$ on the x-axis, the plot should be linear with the slope being the parameter $a$ and the y-intercept being the parameter $b$ (assuming, of course, that the distribution is a member of the (a,b,0) class).
The relation (6) is a way to quickly determine whether a given sample is taken from a member of the (a,b,0) class. To do this, calculate the ratio of two consecutive data categories times $k$. In other words, compute ratios such as the following for values of $k = 1, 2, 3, \dots$:

$$k \, \frac{n_k}{n_{k-1}}$$

where $n_k$ is the observed frequency for the category $k$. The ratio of $n_k$ to $n_{k-1}$ multiplied by $k$ is a stand-in for the left-hand side of (6). Then plot these values against $k$. A linear trend in the graph is evidence that the data in the sample are taken from an (a,b,0) distribution.
The slope of the plotted line gives an indication of which (a,b,0) member to use. If the plot is approximately horizontal, then the Poisson model is appropriate. If the plot is approximately a line with negative slope, then the binomial model is more appropriate. If the plot is approximately a line with positive slope, use the negative binomial model. For this approach to work properly, a large observed data set is preferred.
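The ratio calculation described above can be sketched in Python. The post reads the slope off a plot; as a stand-in for eyeballing the graph, this sketch fits an unweighted least-squares line to the points, which is an assumption of the sketch and not a method from the post. The function names and the frequency counts are hypothetical.

```python
def ratio_points(counts):
    """Return the points (k, k * n_k / n_{k-1}) for k >= 1 with n_{k-1} > 0."""
    return [
        (k, k * counts[k] / counts[k - 1])
        for k in range(1, len(counts))
        if counts[k - 1] > 0
    ]


def fit_line(points):
    """Unweighted least-squares slope and intercept through the points."""
    n = len(points)
    if n < 2:
        raise ValueError("need at least two points")
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Hypothetical observed frequencies n_0, n_1, ..., n_4.
counts = [216000, 259200, 207360, 138240, 82944]
slope, intercept = fit_line(ratio_points(counts))
print(slope, intercept)  # a positive slope points to the negative binomial
```

With real data the high categories usually have few observations, so their ratios are noisy and a weighted fit or a visual check of the plot may be more reliable.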
The (a,b,1) Class
It is possible that the (a,b,0) distributions do not adequately describe a random counting phenomenon being observed. For example, the sample data may indicate that the probability at zero is larger than is indicated by the distributions in the (a,b,0) class. One alternative is to assign a larger value for $P_0$ and recursively generate the subsequent probabilities $P_k$ for $k = 2, 3, \dots$. The class of distributions defined by this recursive scheme is called the (a,b,1) class, which is discussed in the next post.
- Panjer H. H., Willmot G. E., Insurance Risk Models, Society of Actuaries, Chicago, 1992.
Dan Ma actuarial topics