A New Method for Generalizing Burr and Related Distributions

Tanujit Chakraborty; Suchismita Das; Swarup Chattopadhyay

doi:10.1515/ms-2022-0016

Article Open Access

A New Method for Generalizing Burr and Related Distributions

Tanujit Chakraborty , Suchismita Das and Swarup Chattopadhyay

Published/Copyright: February 16, 2022

Published by

Become an author with De Gruyter Brill

Author Information Explore this Subject

From the journal Mathematica Slovaca Volume 72 Issue 1

Abstract

A new method has been proposed to generalize Burr-XII distribution, also called Burr distribution, by adding an extra parameter to an existing Burr distribution for more flexibility. In this method, the exponent of the Burr distribution is modeled using a nonlinear function of the data and one additional parameter. The models of this newly introduced generalized Burr family can significantly increase the flexibility of the former Burr distribution with respect to the density and hazard rate shapes. Families expanded using the method proposed here is heavy-tailed and belongs to the maximum domain of attractions of the Frechet distribution. The method is further applied to yield three-parameter classical Pareto and generalized exponentiated distributions which shows the broader application of the proposed idea of generalization. A relevant model of the new generalized Burr family has been considered in detail, with particular emphasis on the hazard functions, stochastic orders, estimation procedures, and testing methods are derived. Finally, as empirical evidence, the new distribution is applied to the analysis of large-scale heavy-tailed network data and compared with other commonly used distributions available for fitting degree distributions of networks. Experimental results suggest that the proposed Burr distribution with nonlinear exponent better fits the large-scale heavy-tailed networks better than the popularly used Marhsall-Olkin generalization of Burr and exponentiated Burr distributions.

MSC 2010: Primary 60E05

Keywords: Burr distribution; exponentiated distributions; stochastic ordering; reliability properties; maximum likelihood

1. Introduction

The development of statistical distributions is one of the oldest research topic in the field of statistics. There has been a renewed interest in developing more flexible statistical distributions in recent decades. Since the seminal work by Karl Pearson in 1895 [37], several general methods have been developed for generating family of distributions. Pearson presented a systematic approach for generating statistical distribution to model non-symmetric type of data using differential equation. The Pearson system of continuous distributions is a system for which every probability density function (PDF) f(x) satisfies the differential equation of the following form:

(1.1)1f(x)⋅df(x)dx=a+xb0+b1x+b2x2,

where a, b₀, b₁, and b₂ are the shape parameters. Different types of distributions correspond to different forms of solution to Eqn. (1.1). The form of solution of Eqn. (1.1) depends on the root of the equation

b0+b1x+b2x2=0.

Pearson presented four types of distributions [14] and characterizations of Pearson type distributions are also available in the literature [4]. Irving W. Burr proposed another important development in this category. In Burr’s method [9], system of distributions satisfy the following differential equation:

(1.2)dF=F(1−F)g(x)dx,

where 0 ≤ F ≤ 1 and g(x) is a non-negative function over x. Twelve different solutions to the Eqn. (1.2) in the form of cumulative distribution functions (CDF) were given and named as Burr Types I–XII distributions. The Burr type-XII distribution, being a member of the Burr system has gained more attention in the last decade due to its potential use in practical situations. The Burr Type XII, simply called Burr distribution, is highly useful for fitting heavy-tailed data sets from the field of reliability, economics, hydrology, actuarial science, and network science among many others [24]. Burr distribution also emerges as a suitable model to describe stationary states of complex and non-equilibrium systems [38, 39]. The main advantage of Burr distribution from the extreme value statistics’ point of view is that it has algebraic tails which are effective for modeling failures that occur with lesser frequency than with corresponding models based on exponential tails [41]. The CDF and PDF of the Burr distribution are defined as follows.

Definition 1.

A random variable X follows Burr distribution with parameters c, λ, α, if the CDF is of the following form:

(1.3)F(x;c,λ,α)=1−1+xλc−α, x>0, c, α, λ>0,

where α and c are shape parameters and λ is a scale parameter. The density function of the Burr distribution is given by

(1.4)f(x; c, λ, α)=cαλ−cxc−11+xλc−α−1, x>0, c, α, λ>0.

The corresponding survival function is given by

(1.5)S(x; c, λ, α)=1+xλc−α, x>0.

The hazard rate function is given by

(1.6)h(x;c, λ, α)=f(x; c, λ, α)S(x; c, λ, α)=cαλ−cxc−11+xλc−1, x>0.

It is interesting to note that the CDF in (1.3) is regularly varying at infinity, viz. they satisfy for some γ > 0, called the tail-index, and for all t > 0,

limx→∞1−F(tx)1−F(x)=t−γ.

This result suggests that Burr distribution well-suited for modeling extreme values as a heavytailed distribution. The bivariate and multivariate extensions of Burr distribution are available in the past literature [13, 42]. Consequently, numerous modifications or generalizations of the Burr distribution, using different compounding and weighting techniques, have been suggested. See, for instance, the beta Burr distribution introduced by [36], the McDonald Burr distribution introduced by [17], order-statistics based generalized Burr distribution [5], the Marshall-Olkin Burr (MO Burr) distribution developed by [22], modified Burr distribution [21] the exponentiated Burr distribution introduced by [26] and the log-Weibull Burr distribution developed by [1]. Though all these modifications and generalizations resulted well in specific data analysis but all of these assumed constant exponents for the Burr distribution which causes potential failure when working with large-scale heavy-tailed data sets from various applied domains [24]. In this work, the primary hypothesis is that the exponent α of the Burr distribution is not a constant and varies according to a nonlinear function g which depends on the data.

This article aims to introduce an extra parameter to Burr, Pareto, and exponentiated distributions to bring more flexibility to the given families when dealing with real-world large-scale heavy-tailed data sets. We introduce a new method of generalization, namely the shape-parameter transformation (SPT) method, where the non-negative shape parameter is assumed to be expressible as a nonlinear function of the empirical data and adds an additional parameter to the base distribution. The proposed SPT method is straightforward to use; hence it can effectively be used for data analysis purposes. The proposed SPT method is first applied to a Burr distribution to yield generalized Burr (GBurr) family of distributions. The newly introduced GBurr family has some excellent statistical properties and belongs to the maximum domain of attractions of the Frechet distribution. The method presented in this paper adds an additional parameter in the distribution and can be a competitor to the popularly used Marshall-Olkin (MO) [30] and Lehmann alternatives-based [27] method of generalizing continuous distributions. The SPT method is further applied to the Pareto and the exponentiated family of distributions which shows the broader applicability of the proposed SPT method. Further, a specific nonlinear variant of the GBurr family, we call it NBurr distribution, is discussed in detail. Complementary theoretical aspects are studied, such as shapes, asymptotes, quantiles, stochastic ordering, reliability parameter, and inferential statistics. The application of the proposed NBurr distribution is shown using large-scale heavy-tailed network data sets from various disciplines.

The rest of the paper is organized as follows. We discuss the SPT method and its application to the Burr, Pareto, and exponentiated family of distributions in Section 2 along with some structural properties. In Section 3, we study a simple model from the GBurr family of distributions and discuss its stochastic and inferential characteristics. The empirical application of this new NBurr distribution to various large scale heavy-tailed network data sets is presented in Section 4. Finally, we conclude this paper in Section 5.

2. Shape parameter transformation (SPT) method

In this section, we introduce the SPT method to generalize Burr, Pareto, and exponentiated distributions by modeling the shape parameter of these families as a nonlinear function of the data. It also adds an additional parameter to the families as mentioned above of distributions for flexible modeling of real-life data sets. The SPT method is first applied to yield generalized Burr distributions which are regularly varying distributions at infinity and are heavy-tailed. Furthermore, this new SPT method is applied to Pareto distribution and exponentiated family of distributions to generalize these families.

2.1. The generalized Burr (GBurr) family

Definition 2.

A continuous random variable X follows generalized Burr (Burr) family of distributions if and only if it has the following CDF:

(2.1)FGBurr(x; c, λ, α, β)=1−1+xλc−gxλ,α, β, x≥0, α, λ, c>0,

and F(x) = 0 if x < 0, where the real valued continuous, positive function g: (0, ∞) → ℝ⁺ is differentiable on (0, ∞). The shape parameter α of the Burr distribution is replaced with g(z) = g(x/λ; α, β) = g(x/λ), say, where β is an additional shape parameter and g(z) satisfies the following conditions:

The function g(z) is strictly positive and have finite limit at infinity, viz.
(2.2)limz→∞ g(z)=α(>0).
limz→0+ 1+zcg(z)=1 and limz→∞ 1+zcg(z)=∞.
g′(z)g(z)≥−czc−11+zclog(1+z), z > 0, where g′(z)=ddz [g(z)].

It is very easy to verify that:

F_GBurr(x) is non-decreasing since,
cxλc−1gxλ+g′xλ 1+xλc log 1+xλ>0, x>0.
limx→ −∞FGBurr(x)=FGBurr(−∞)=0, limx→∞FGBurr(x)=1.
F_GBurr(x) is right continuous.

Thus, (2.1) is a standard CDF and it can also be expressed as follows:

(2.3)FGBurr(x)=1−exp−gxλlog1+xλc, x>0.

The corresponding survival function is given by

(2.4)SGBurr(x)=1+xλc−g(x/λ),x>01,x≤0.

The probability density function is

(2.5)fGBurr(x)=1λcxλc−11+xλcgxλ+g′xλlog1+xλc1+xλc−g(x/λ),x>00,x≤0.

The hazard rate function is given by

(2.6)hGBurr(x)=cλ⋅xλc−11+xλcgxλ+1λg′xλlog1+xλc,x>00,x≤0.

The CDF (2.1) of the GBurr family of distributions is a function with regularly varying tails and belongs to the maximum domain of attraction (MDA) of the Frechet distribution with index α > 0, viz. F ∈ MDA(Φ_α), with Φ_α = exp{−x^−α}, x > 0, α > 0 [15].

Theorem 2.1.

GBurr family of distributions are:

heavy-tailed,
right tail-equivalent to a Pareto distribution, a
belongs to the MDA of the Frechet distribution.

Proof.

For the GBurr family of distributions, we note that
limx→∞ exp{kx} (1−F(x))=limx→∞ exp kx−gxλ log 1+xcλc=∞,
where k, λ, c > 0.
To show the tail-equivalent property, we show
limx→∞ 1−F(x)1−G(x)=limx→∞exp −gxλlog1+xcλcexp −αlog1+xλ=1, for all λ, c>0,
where G(x) is the CDF of the Pareto Type-II distribution.
It should be noted that any function g as defined in Definition 2 satisfying limz→∞g(z)=α>0, is slowly varying at infinity:
limz→∞ g(tz)g(z)=1, for all t>0.

Now,

limx→∞ 1−F(tx)1−F(x)=limx→∞ 1+(tx)cλc−gtxλ1+xcλc−gxλ=t−α, for all t>0 and λ, c>0.

□

It can be seen that Burr distribution belongs to this GBurr family and corresponds to the simplest choice g(z) = α. We give some examples of nonlinear g(z) satisfying the condition given in Definition 2 and present some distributions belonging to the GBurr family in Table 1. In order to select a model, one can choose the nonlinear exponent g(z) that meets the empirical characteristics of the given data sets. Obviously, this example list of GBurr models can further be expanded to more models by introducing some other forms of g(z) satisfying limz→∞ g(z)=α>0 or by increasing the number of parameters in the function g.

Table 1.

Some examples of GBurr family of distributions.

g(z) for all z > 0	β	f(x) for all x > 0
α−βz+1	β ≤ α	[βλ(x+λ)2log(1+(xλ)c)+(α−βλx+λ)cxc−1xc+λc][1+(xλ)c]−(α−β1+x/λ)
αz1+zβ	β > −1	αλ xx+λββλ2x(x+λ)+λcxc−1xc+λcexp −αlog 1+xcλc xx+λβ
α log(1+z)1+log(1+z)β	β > −1	αλlog(1+x/λ)1+log(1+x/λ)ββlog1+xcλc(1+x/λ)[1+log(1+x/λ)][log(1+x/λ)]+λcxc−1xc+λc×exp−αlogβ(1+x/λ)log1+xcλc[1+log(1+x/λ)]β

We can also obtain some popular size distributions as the particular cases of the GBurr family:

By taking the constant g function, viz., when g(z) = 1, then GBurr distributions reduces to the Fisk distribution [16].
For α → ∞ and β = 0, GBurr family of distributions converges to the Weibull distribution [25].
By taking c = 1 and β = 0, GBurr distribution reduces to the Lomax distribution [29].

2.2. Generalized classical Pareto distribution

In this section, we apply the SPT method to the classical Pareto distribution to yield generalized classical Pareto (GCP) distributions as defined below.

Definition 3.

A continuous random variable X follows GCP family of distributions if and only if it has the following CDF:

(2.7)FGCP(x)=1−xγ−mxγ, x≥γ

and F(x) = 0 if x ≤ γ, γ (> 0) is a scale parameter, m: (1, ∞) → ℝ⁺ is a real, continuous, positive function which is differentiable on (1, ∞), and the function m satisfies the following conditions:

m is strictly positive and have finite limit at infinity, viz.
(2.8)limz→∞ m(z)=α(>0).
limz→1+ zm(z)=1 and limz→∞ zm(z)=∞.
m′(z)m(z)>−1zlog(z),z>1 .

It is noted that the condition (3) is equivalent to

ddzlog(g(z))≥−ddzlog[log(z)], z>1.

It is very easy to verify that F(x) in (2.7) is a standard distribution function and any continuous random variable X satisfying the above-mentioned conditions are called as GCP distributions. An alternative expression for F(x) can also be given as follows:

FGCP(x)=1−exp−mxγlogxγ, x≥γ.

The survival function is given by

(2.9)SGCP(x)=xγ−m if x>γ1 if x≤γ.

The PDF for the family of GCP distributions is given by

(2.10)fGCP(x)=mxγ+m′xγxγlogxγSGCP(x)x if x>γ0 if x≤γ.

The hazard function is given by

(2.11)hGCP(x)=gxγ+m′xγxγlogxγx if x>γ0 if x≤γ.

Table 2 shows some examples of GCP distributions satisfying condition (2.8) (i.e., limz→ ∞ m(z)=α>0). It is noted that the simplest choice m(z) = α leads the classical Pareto distribution. The rest of the models are completely new. In order to select a model, one can choose the nonlinear exponent m(z) that meets the empirical characteristics of the given data sets. Obviously, this example list of GCP models can further be expanded to more models by introducing some other forms of m(z) satisfying limz→ ∞ m(z)=α>0 or by increasing the number of parameters in the function g.

Table 2.

Some examples of GCP family of distributions with limz→ ∞ m(z)=α>0..

m(z) for all z > 1	β	f(x) for all x > γ
α−βz	β ≤ α	xγ−α+βγ/xαx+γβxlog(x/γ)−1x
αz−1zβ	β > −1	αxxγ−α(x/γ)−1(x/γ)β(x/γ)−1+βlog(x/γ)(x/γ)−1 (x/γ)−1(x/γ)β
αlog(z)1+log(z)	β > −1	αxxγ−αlog(x/γ)log(x/γ)+1βlog(x/γ)+1+βlog(x/γ)+1 log(x/γ)log(x/γ)+1β

2.3. Generalized exponentiated distributions

The idea of exponentiated family of distributions is based on Lehmann alternatives [27] that can add a non-negative shape parameter to any continuous probability distributions:

(2.12)F*(x)=[F(x)]α, where α>0 (real),

where F(x) is a standard CDF. This new family F* is called ‘exponentiated family’ where one raises the CDF of an existing distribution to a power of an additional parameter. Lehmann motivated it in the following way: [F(x)]^α is the distribution of the maximum of α independent and identically distributed variables with distribution F when α is an integer and when α is a rational number, [F(x)]^α is one-parameter family of nonparametric class of alternatives. In the theory of generalized probability distributions, Lehmann alternatives has been widely used to generate so-called exponentiated Weibull family of distribution by [31]. A systematic treatment of exponentiated weibull, exponentiated gamma, and exponentiated pareto distribution are available in [18]. Gupta and Kundu studied exponentiated exponential distribution [20] and recently Gupta et al. [19] proposed Power Normal distributions using the same idea of Lehmann’s alternatives for standard normal distribution. Nadarajah and Kotz [32] studied a list of exponentiated X family of distributions, including exponentiated Gumbel, Fréchet, gamma, etc. distributions. We use our SPT method to generalize any ‘exponentiated family’ of distributions.

Definition 4.

Any continuous random variable X with distribution function F(x) follows generalized exponentiated distribution if and only if it has the following CDF

G(x)=[F(x)]m(x), −∞<x<∞,

where m: (−∞, ∞) → (0, ∞) is a real, continuous function which is differentiable on (0, ∞). The function m satisfies the following conditions:

m is strictly positive and have finite limit at infinity, viz.
limx→ ∞ m(x)=α(>0).
limx→− ∞m(x)=β(>α) .
m′(x) < 0.

It is important to note that when m(x) = α, then g(x) corresponds to the exponentiated family of distributions [18, 32]. We give some examples of m(x) to be used for the distributions with X ∈ ℝ.

Example 1.

Let m(x) be a function of the following form satisfying limx→ − ∞ ξ(x)=2 and limx→∞ξ(x)=0:

m(x)=(β−α)2ξ(x)+α, β>α>0.

Some choices of ξ(x) are as follows:

(a) ξ(x)=e−x if x≥0,2−ex if x<0,

(b) ξ(x)=1x2k+1+1 if x≥0,2+1x2k+1−1 if x<0,

where k is any natural numbers.

Example 2.

Let m(x) be a function of the following form satisfying limx→ − ∞σ(x)=0 and limx→ − ∞σ(x)=1:

m(x)=ασ(x)+β(1−σ(x)), β>α>0.

Some choices of σ(x) are as follows:

σ(x)=11+e−xx∈ℝ.
σ(x)=12+1πarctan(x), x∈ℝ.
σ(x)=12π∫−∞xexp−y2/2dy, x ∈ ℝ.

It is interesting to see that

G(x)=F(x)αΦ(x)+βΦ(−x), x∈ℝ

is a standard CDF, where Φ(x) is a standard normal CDF. For all the above examples when α = β, we get G(x) = [F(x)]^α.

3. NBurr distribution: Definition and properties

In this section, we study a relevant model of the GBurr family of distributions with the choice of g(z)=α−β1+z in Table 1. The CDF is given by

(3.1)FNBurr(x)=1−1+xλc−α−β1+x/λ, x>0,

where α, λ, c > 0, β > −1 and α > β. We call this simple generalized form of Burr distribution as NBurr distribution (Burr distribution with nonlinear exponent). The NBurr distribution includes the Burr distribution when β = 0. The CDF in (3.1) can alternatively be written in the following form:

(3.2)FNBurr(x)=1−exp −α−β1+xλ log 1+xλc, x>0.

The corresponding survival function is given by,

(3.3)SNBurr(x)=1+xλc−α−β1+x/λ, x>0.

The probability density function for x > 0 is given by

(3.4)fNBurr(x)=βλ(x+λ)2log1+xλc+α−βλx+λcxc−1xc+λcSNBurr(x), x>0,

and the hazard rate function is given by

(3.5)hNBurr(x)=βλ(x+λ)2log1+xλc+α−βλx+λcxc−1xc+λc, x>0.

The proposed NBurr distribution satisfies the extreme value properties as given in Theorem 2.1. Some graphics of the NBurr model derived from this newly introduced GBurr family of distributions are illustrated in Figure 1. Remark that NBurr distribution with parameters α, β, λ, c, as a distribution of the MDA of the Frechet distribution, satisfies the von Mises condition:

limx→∞xfNBurr(x)SNBurr(x)=α>0.

FIGURE 1.

Plots of PDFs of the NBurr distribution.

Next, we study the reliability properties and inferential properties of the new NBurr distribution in the next subsections.

3.1 Reliability properties of NBurr distribution

In this section, we study some reliability properties of the newly introduced NBurr distribution including monotonicity of hazard rates, stochastic orderings, entropy, etc. The following theorem shows that the hazard rate function of the NBurr distribution is increasing and decreasing under certain conditions on c and β.

Theorem 3.1.

The hazard rate function of the NBurr distribution satisfies the following properties:

If 0 ≤ c ≤ 1 and β > 0 then for all x > 0, h_NBurr(x) is decreasing in x;
If c > 1 and β > 0, then for all x > 0,
1. h_NBurr(x) is increasing in x, whenc−1>xλc;
2. h_NBurr(x) is decreasing in x, whenc−1<xλc;
3. h_NBurr(x) is maximum at x = λ[c − 1]^1/c.

Proof.

Differentiating (3.5) with respect to x, we have

hNBurr′(x)=−2βλ(x+λ)3log(1+xcλc)+2cβλxc−1(x+λ)2(xc+λc) +(α−βλx+λ)cxc−2λc(xc+λc)2[c−1−xcλc]=2βλ(x+λ)3[cxc−1x+λxc+λc−log(1+xcλc)] +(α−βλx+λ)cxc−2(xc+λc)2[c−1−xcλc].

Now, α≥βλx+λ, cλcxc−2xc+λc2≥0 and if β > 0, 2βλ(x+λ)3xc+λc>0.

Thus, hNBurr′(x)≥ (≤)0 if

A(x)=cxc−1x+λxc+λc−log1+xcλc≥(≤)0;
c−1−xcλc≥(≤)0.

Now,

A′(x)=cλcxc−2(x+λ)xc+λc2c−1−xcλc.

We can see that A′(x) ≤ 0 if 0 ≤ c ≤ 1. Therefore, A(x) is decreasing in X and again A(0) = 0. Thus, we have A(x) ≤ 0.

Similarly, we can prove that if c > 1, then A(x) ≥ 0 when c−1>xλc and A(x) ≤ 0 when c−1<xλc. This implies that for 0 ≤ c ≤ 1, h_NBurr(x) is decreasing in x; for c > 1, h_NBurr(x) is increasing in x, when c−1>xλc and h_NBurr(x) is decreasing in x, when c−1<xλc. Thus, the hazard function h_NBurr(x) attain its maximum at X = λ [c − 1]^1/c. □

The following example shows that the NBurr distribution does not preserves the likelihood ratio ordering. It is useful to recall that a random variable X is said to be larger than another random variable Y in likelihood ratio ordering (written as X ≥_LRY) if f_X(x)/f_Y(x) is an increasing function for X > 0.

Example 3.

Let X and Y be two random variables following NBurr distributions with parameters α₁, β₁, λ₁, c₁and α₂, β₂, λ₂, c₂respectively. Then, for all x > 0, the ratio of the corresponding density functions of X and Y is given by

(3.6)fXNBurr (x)fYNBurr (x)=β1λ1x+λ12log1+xλ1c1+α1−β1λ1x+λ1c1xc1−1xc1+λ1c11+xλ1c1−α1−β11+x/λ1β2λ2x+λ22log1+xλ2c2+α2−β2λ2x+λ2c2xc2−1xc2+λ2c21+xλ2c2−α2−β21+x/λ2.

Now, for λ₁ = λ₂ = 1,

Case I, when c₁ = c₂ = 1, (3.6) reduces to

fXNBurr (x)fYNBurr (x)=β1(1+x)2log(1+x)+α1−β11+x11+x(1+x)−α1−β11+xβ2(1+x)2log(1+x)+α2−β21+x11+x(1+x)−α2−β21+x=P1(x), say.

We can see that for α₁ = 1.5, α₂ = 2, β₁ = 0.5 and β₂ = 1.5, P₁(0.1) = 1.366, P₁(2) = 0.889 and P₁(6) = 1.426, which implies that fXNBurr(x)fYNBurr(x) is not monotone. Thus X≥LRY.

Case II, when c₁ ≠ c₂, (3.6) reduces to

fXNBurr (x)fYNBurr (x)=β1(1+x)2log1+xc1+α1−β11+xc1xc1−11+xc11+xc1−α1−β11+xβ2(1+x)2log1+xc2+α2−β21+xc2xc2−11+xc21+xc2−α2−β21+x=P2(x), say .

Again, we can see that for c₁ = 2, c₂ = 4, α₁ = 1.5, α₂ = 2, β₁ = 0.5 and β₂ = 1.5, P₂(0.5) = 1.5753, P₂(1) = 0.4843 and P₂(2) = 2.8757, which implies that fXNBurr(x)fYNBurr(x) is not monotone. Thus X≥LRY. Hence, it can be concluded that the proposed NBurr distribution does not preserves the likelihood ratio ordering. □

In the next theorem, we show that the NBurr distribution preserves the usual stochastic ordering. It is useful to remind that a random variable X is said to be larger than another random variable Y in usual stochastic ordering (written as X ≥_STY) if S_X(x) ≥ S_Y(x), for all x > 0.

Theorem 3.2.

Let X and Y be two random variables following NBurr distribution with parameters α₁, β₁, λ₁, c₁and α₂, β₂, λ₂, c₂, respectively. Then, X ≥_ST (≤_ST)Y provided

λ₁ ≥ (≤)λ₂,
c₁ ≤ (≥)c₂,
α₁ ≤ (≥)α₂, and
β₁ ≤ (≥)β₂for β > 0.

Proof.

X ≥_ST (≤_ST)Y if and only if for all x > 0, S_{X_NBurr}(x) ≥ (≤)S_{Y_NBurr}(x), which is equivalent to

1+xλ2c2α2−β2λ2x+λ21+xλ1c1α1−β1λ1x+λ1≥(≤)1.

This holds if (i) λ₁ ≥ (≤)λ₂, (ii) c₁ ≤ (≥)c₂, (iii) α₁ ≤ (≥)α₂and (iv) β₁ ≤ (≥)β₂, for β > 0. □

The following theorem gives the condition under which the NBurr distribution preserves the hazard rate ordering. It is well known that a random variable X is said to be larger than another random variable Y in hazard rate ordering (written as X ≥_HRY) if, for all x > 0, h_X(x) ≤ h_Y(x).

Theorem 3.3.

Let X and Y be two random variables following NBurr distribution with parameters α₁, β₁, λ₁, c₁and α₂, β₂, λ₂, c₂, respectively.Then, X ≥_HR (≤_HR)Y provided

λ₁ ≥ (≤)λ₂,
c₁ ≤ (≥)c₂,
α₁ ≤ (≥)α₂, and
β₁ ≤ (≥)β₂for β > 0.

Proof.

X ≥_HR (≤_HR)Y if and only if for all x > 0, h_{X_NBurr}(x) ≤ (≥)h_{Y_NBurr}(x), which is equivalent to

β1λ11+xλ12log1+xλ1c1+α1−β11+xλ1c1xλ1c1−1λ11+xλ1c1≤(≥)β2λ21+xλ22log1+xλ2c2+α2−β21+xλ2c2xλ2c2−1λ21+xλ2c2.

This holds if (i) λ₁ ≥ (≤)λ₂, (ii) c₁ ≤ (≥)c₂, (iii) α₁ ≤ (≥)α₂ and (iv) β₁ ≤ (≥)β₂, for β > 0. □

Remark 1.

It is interesting to note that the hazard rate function of the NBurr distribution can be both increasing and decreasing under certain conditions as given in Theorem 3.1. Also, NBurr distribution preserves stochastic and hazard rate orderings as shown in Theorems 3.2 and 3.3, respectively.

The Shannon entropy is an important and well-known concept in information theory as well as engineering sciences. Let X be a random variable that follows NBurr distribution with parameters α, β, λ, c. Then Shannon’s entropy for the NBurr distribution is defined as

H(X)=−E[logfNBurr(X)]=−∫0∞fNBurr(x)logfNBurr(x)dx=−∫0∞(βλ(x+λ)2log(1+xcλc)+(α−βλx+λ)cxc−1xc+λc)[1+(xλ)c]−(α−β1+x/λ) ×[log(βλ(x+λ)2log(1+xcλc)+(α−βλx+λ)cxc−1xc+λc) −(α−β1+xλ)log(1+(xλ)c)]dx.

When we want to study the system that survived up to an age t, then Shannon’s entropy function is not useful in measuring the uncertainty about the residual lifetime of the system. Ebrahimi [3] has introduced residual entropy and defined as

H(X;t)=1−∫t∞fX(x)SX(t)logfX(x)SX(x)dx,

Then the residual entropy for NBurr distribution with parameters α, β, λ, c is given by

H(X;t)=1−1SNBurr(t)∫t∞fNBurr(x)loghNBurr(x)dx=1−1+tλcα−β1+t/λ ×∫t∞βλ(x+λ)2log1+xλc+α−βλx+λcxc−1xc+λc1+xλc−α−β1+x/λ ×logβλ(x+λ)2log1+xλc+α−βλx+λcxc−1xc+λcdx.

3.2. Parameter estimation

Let x₁, x₂,…, x_n be a sample of size n from NBurr(α, β, λ, c) distribution. We give procedure for parameter estimation the including the log-likelihood functions and corresponding normal equations. The log-likelihood function for the vector of parameters Θ = (α, β, λ, c)^T corresponding to NBurr distribution is given by

(3.7)l≡l(x;α,β,λ,c)=∑i=1nlogβλλ+xi2log1+xicλc+α−βλλ+xicxic−1λc+xic −α∑i=1nlog1+xicλc+βλ∑i=1n1λ+xilog1+xicλc,

where n is the sample size, and the maximum likelihood estimates of the unknown parameter vector (α, β, λ, c) are those that maximize the log-likelihood function l in (3.7). The normal equations can be obtained by taking the partial derivatives of (3.7) w.r.t. α, β, λ, c and equating them to zero:

(3.8)∂l∂α=∑i=1ncωi2ωi−1c−1β1+ωi−1clog1+ωi−1c+cωiαωi−βωi−1c −∑i=1nlog1+ωi−1c

(3.9)∂l∂β=∑i=1n1+ωi−1clog1+ωi−1c−cωiωi−1cβ1+ωi−1clog1+ωi−1c+cωiαωi−βωi−1c +∑i=1n1ωilog1+ωi−1c

(3.10)∂l∂λ=∑i=1nβxi−βλλc+xic2log1+xicλc−2cβxicλ+xiλc+xic−c2λc−1xic−1λ+xi2αλ+xi−βλβλλ+xiλc+xic2log1+xcλc+cxic−1αλ+xi−βλλ+xi2λc+xc +cαλ∑i=1nxicλc+xic+β∑i=1nxilog1+xcλcλ+xi2−cβ∑i=1n1λ+xiλc+xic

(3.11)∂l∂c=∑i=1nβωi−1c1+ωi−1clogωi−1+ωiωi−1c−1αωi−β1+c+ωi−1cβ1+ωi−1c2log1+ωi−1c+cωiωi−1c−1αωi−β1+ωi−1c −∑i=1nαωi−βωi−1clogωi−1ωi1+ωi−1c,

where ωi=1+xiλ.

The MLEs of the four parameters for the NBurr distribution with α, β, λ and c are obtained by setting the above partial derivatives to zero and solving them simultaneously. The closed-form solutions are not available for the equations (3.8), (3.9), (3.10) and (3.11). So, an iterative algorithm should be applied to solve these equations numerically. For practical implementation of the model, we fit the NBurr models in the whole range of the data sets with quasi-Newton BFGS numerical algorithm with initial values to be chosen as α0^,β0^,λ0^,c0^=(1,1,1,1) to find the MLE estimates of the parameters.

We also present the asymptotic distributions for the NBurr distribution. The Fisher information matrix (I) can be obtained by taking the expected values of the second-order and mixed partial derivatives of ℓ(x; α, β, λ, c) w.r.t. α, β, λ and c. Since the analytical expression is hard to compute, it can be approximated by numerically investigating the I = (I_ij) matrix. The asymptotic I matrix can be given as follows:

I=−∂2l∂α2−∂2l∂α∂β−∂2l∂α∂λ−∂2l∂α∂c−∂2l∂α∂β−∂2l∂β2−∂2l∂β∂λ−∂2l∂β∂c−∂2l∂α∂λ−∂2l∂β∂λ−∂2l∂λ2−∂2l∂λ∂c−∂2l∂α∂c−∂2l∂β∂c−∂2l∂λ∂c−∂2l∂c2

The second order partial derivatives of ℓ(x; α, β, λ, c) w.r.t. α, β, λ and c can be calculated but the calculations are very tedious. Hence, we omit the calculation part. The variance-covariance matrix is approximated by M = (M_ij) where Mij=Iij−1. The asymptotic distribution of MLEs for α, β, λ, and c can be written as

[(α^−α),(β^−β),(λ^−λ),(c^−c)]~N40,I−1(θ^).

Then the approximate 100(1 – k)% confidence intervals for α, β, λ, and c are given by α^±Zk2Var(α^), β^±Zk2Var(β^), λ^±Zk2Var(λ^), and c^±Zk2Var(c^); where Θ^=(α^,β^,λ^,c^) and Zk is the upper 100 k-th percentile of the standard normal distribution.

3.3. Goodness of fit

The measure of closeness between the hypothesized NBurr distribution and the observed real-world network can be well determined by goodness-of-fit test. We have used the Chi-square statistic test and its corresponding p value to determine the goodness of fit for the NBurr distribution. We calculate the respective p values using the bootstrap resampling computational technique as given below:

Initially the best fit NBurr distribution can be determined by estimating parameters through the MLE method given network data. Then we calculate the Chi-square statistic value as a measure of goodness-of-fit corresponding to the best-fitted NBurr model.
Next we generate 50000 synthetic data sets from the NBurr distribution and calculate the Chi-square statistic for each of the synthetic data sets.
Finally, we obtain the p value for the synthetic data sets as the fraction of NBurr synthetic data sets with a Chi-square value greater than the empirical one. Higher p values signify that the proposed model is ‘most’ suitable for the data set.

In addition, the effectiveness of the proposed NBurr distribution compared to other heavytailed distributions, is also verified by computing other well known statistical measures such as Kullback-Leibler divergence (KLDiv), root mean squared error (RMSE), and mean absolute error (MAE).

4. Real-world applications

We show the application of the NBurr distribution in the analysis of large-scale complex network data sets from various disciplines. The examples of such large-scale real-world complex networks include Twitter, Facebook, Orkut, Youtube, Amazon, LinkedIn, Wiki networks, etc. where the number of nodes is of the order of thousands or millions. There has been significant interest and attention devoted toward modeling aspects of such large-scale complex networks. Recent research [23, 28, 44] involved in the analysis of various important structural characteristics of network such as degree distribution, average nearest neighbor, clustering coefficient, community discovery, motif distribution, etc. Most of the interest has been focused on the analysis of the node degree distribution corresponding to these real-world networks [2,6,7]. Empirical observations suggest that the node degree distributions of such real-world networks, for example, collaboration networks, communication networks, social networks, biological networks, etc., follow a heavy-tailed power-law distribution [6, 33]. Previous researchers reported that a baseline power-law, exponential, Pareto, log-normal, and Burr models are insufficient to fit the empirical data properly in its whole range unless some of the lower degree nodes are left out while fitting the model [10–12, 40, 43]. In recent work, Broido et al. [8] pointed out that the recent data concentration on all these networks data shows that they no longer follow the power-law distribution.

4.1. Data

We consider large-scale real-world network data sets from different disciplines, namely social networks, collaboration networks, citation networks, web graphs, product co-purchasing networks, temporal networks, communication networks, and ground-truth networks. We study several individual data sets from each discipline to showcase the general applicability of the proposed NBurr distribution. These data sets are publicly available at http://snap.stanford.edu/data/index.html. These are standard network data sets with heavy-tail behaviors and used for modeling in the statistical analysis of networks [33–35]. An overview of these publicly available network data sets along with statistical measures (mean (μ), standard deviation (s), etc.) are presented in Table 3. Another interesting property of these network data sets is their coefficient of variation (s/μ) exceeding unity.

Table 3

Network data sets and estimated parameters of the proposed NBurr model

Networks		# Nodes	# Edges	Statistical measures			Estimated-Parameters				Bootstrap chi-square value (p)
Networks		# Nodes	# Edges	s	μ	sμ	α^	β^	λ^	c^	Bootstrap chi-square value (p)
Social	TwitterNet	81,306	1,768,149	57.965	21.747	2.6654	3.3395	0.0918	43.693	0.7324	0.9740
	GplusNet	107,614	13,673,453	1404.8	283.42	4.9568	1.3737	0.4503	24.924	0.5837	0.9890
	DeliciousNet	536,108	1,365,961	39.826	10.673	3.7312	7.6500	3.1845	26.999	0.2197	0.9651
	Live JournalNet	4,847,571	68,993,773	44.969	15.368	2.926	7.7807	3.7317	79.102	0.3536	0.9720
	AthletesFacebookNet	13,866	86,859	17.978	12.438	1.4453	2.8158	-0.4558	24.053	1.1022	0.9400
Citation	HepThNet	27,770	352,807	43.139	15.220	2.8342	2.9198	-0.4475	26.793	0.7954	0.8300
	PatentsNet	3,774,768	16,518,948	6.9125	5.0687	1.3637	6.7724	3.9977	15.758	0.5793	0.8130
	CiteseerNet	227,320	814,134	9.8260	5.4322	1.8088	3.4008	-0.8157	12.286	0.8391	0.6160
Web	GoogleNet	875,713	5,105,039	43.320	7.1444	6.0634	16.595	5.6714	61.718	0.1137	0.9865
	BerkStanNet	685,230	7,600,595	300.08	12.316	24.364	0.4522	0.1833	1.1626	2.2058	0.6550
	Wikipedia2009Net	1,864,433	4,507,315	12.846	4.8903	2.6268	1.8443	-0.9706	3.0329	0.9147	0.9750
Product CoPurchasing	Amazon0601Net	403,394	3,387,388	15.279	8.3989	1.8191	2.9534	1.8907	9.5274	0.9508	0.6920
	Amazon0505Net	410,236	3,356,828	15.313	8.1826	1.8714	4.3982	3.3724	12.739	0.6965	0.6670
	Amazon0312Net	400,727	3,200,444	15.073	7.9865	1.8873	6.2523	5.5583	16.563	0.4971	0.6517
Temporal	Mathover flowNet	24,818	506,550	31.476	10.424	3.0195	0.5426	0.7039	1.1057	1.6187	0.9890
	SuperuserNet	194,085	1,443,339	23.782	5.8239	4.0836	0.5987	0.5741	0.9920	1.8685	0.9800
	AskubuntuNet	159,316	964,437	18.404	4.3856	4.1966	0.5997	1.1289	0.7386	2.0061	0.9760
Communication	EmailEnronNet	36,692	183,831	36.100	10.021	3.6027	4.1159	2.6481	6.3332	0.3129	0.9900
	WikiTalkNet	2,394,385	5,021,410	12.259	2.1195	5.7844	0.2091	2.7853	0.1733	3.4821	0.9760
	RecLibimsetiNet	220,970	17,359,346	413.71	102.85	4.0227	4.6667	1.8315	123.52	0.2107	0.9843
GroundTruth	WikiTopcatsNet	1,791,489	28,511,807	283.78	15.915	17.831	1.5987	0.8982	2.8452	0.7260	0.8400
	OrkutNet	3,072,441	117,185,083	154.78	76.281	2.0291	5.4171	5.0492	137.14	0.5217	0.9891
	YoutubeNet	1,134,890	2,987,624	50.754	5.2650	9.6398	2.5928	-0.4403	0.3159	0.5004	0.8520

4.2. Experimental results

In this section, we compare the NBurr distribution with the other seven established models, namely Power-law, Pareto, Log-normal, Power-law with cutoff, Burr, exponentiated Burr [26], MO Burr [22] distributions. To estimate the parameters (α, β, λ, c) of the NBurr distribution numerically, we have used ‘optim’ function along with the quasi-Newton L-BFGS-B algorithm in R statistical software by taking the initial parameters value (α, β, λ, c) = (1,1,1,1). The estimated values of the parameters for all the network data sets satisfied the following conditions: α > 0, β; > −1 λ > 0 and c > 0 as depicted in Table 3, which characterize the proposed NBurr distribution. Empirically it is observed that in the case of social networks, the estimated value of the parameter λ attains the higher values as compared to the estimated value of α. From Table 3 it is also clear that the proposed NBurr distribution produces higher p values through bootstrapping chi-square test which suggests that the null hypothesis i.e., “the data are drawn from NBurr distribution”, cannot be ruled out at the 0.05 level of significance. This recommends in favor of the use NBurr distribution for fitting the node degree distribution of a network. Experimental results (given in Table 3) suggests that the proposed NBurr distribution is effective in modeling the entire degree distribution of real-world complex networks. Also, we used some other statistical measures, viz. KLDiv, RMSE, and MAE to compare the performance of the proposed NBurr distribution with the competitive heavy-tailed distributions as shown in Tables 4 and 5.

Table 4

Performances of the proposed NBurr model in terms of RMSE, KLDiv, and MAE compared to the competitive heavy-tailed models over real-world networks

Networks		Burr			NBurr			Exponentiated Burr			MO Burr
Networks		RMSE	KLDiv	MAE	RMSE	KLDiv	MAE	RMSE	KLDiv	MAE	RMSE	KLDiv	MAE
Social	TwitterNet	13.179	0.00788	1.1663	13.391	0.00787	1.1725	34.754	0.02012	3.1259	28.727	0.01911	2.7780
	GplusNet	2.7411	0.05621	0.1994	1.8217	0.05572	0.1849	9.0526	0.06382	0.3193	4.2128	0.05679	0.2171
	DeliciousNet	29.147	0.00499	1.7952	13.810	0.00453	1.1364	20.291	0.00464	1.4183	44.550	0.00627	2.4668
	LiveJournalNet	384.81	0.00168	13.924	219.73	0.00050	5.0602	464.51	0.00337	19.028	916.55	0.00951	32.342
	AthletesFacebookNet	4.9857	0.00886	1.3876	4.8669	0.00885	1.3721	10.288	0.01361	2.6042	8.0527	0.00965	1.8504
Citation	HepThNet	2.6780	0.01331	0.4718	2.4786	0.01327	0.4561	11.985	0.02091	1.0592	2.8134	0.01346	0.4756
	PatentsNet	626.74	0.00024	64.674	68.176	6.5E-05	10.397	173.55	9.4E-05	22.971	898.55	0.00030	84.917
	CiteseerNet	43.719	0.00200	3.6469	33.484	0.00186	3.1622	13.165	0.00216	1.8519	39.521	0.00192	3.4805
Web	GoogleNet	187.96	0.00699	8.3224	117.15	0.00428	5.8484	197.65	0.00640	8.1009	177.87	0.00708	8.2026
	BerkStanNet	32.321	0.02671	0.5353	32.388	0.02679	0.5355	31.945	0.02670	0.5260	32.198	0.02671	0.5301
	Wikipedia2009net	176.29	0.00115	9.4514	52.148	0.00104	4.7242	91.517	0.00097	6.0324	168.35	0.00115	9.1700
Product CoPurchasing	Amazon0601Net	92.912	0.00315	7.6485	77.387	0.00283	6.4876	188.63	0.00719	14.032	87.137	0.00265	6.9522
	Amazon0501Net	118.17	0.00375	9.1393	78.727	0.00297	6.9574	245.89	0.01183	18.526	112.00	0.00335	8.5741
	Amazon0312Net	105.14	0.00337	8.1901	71.210	0.00260	6.0427	106.63	0.00342	8.2086	80.538	0.00263	6.4877
Temporal	MathoverflowNet	8.8478	0.01432	1.3720	6.1401	0.01381	1.1713	6.5265	0.01421	1.2302	9.2084	0.01432	1.3693
	SuperuserNet	17.943	0.00319	1.3692	11.709	0.00303	1.0642	18.326	0.00319	1.3717	18.797	0.00319	1.3977
	AskubuntuNet	33.041	0.00352	2.1017	27.986	0.00334	1.8650	34.267	0.00375	2.1469	31.861	0.00326	1.9377
Communication	EmailEnronNet	76.787	0.03525	5.2356	73.139	0.03381	5.0662	74.244	0.03497	5.1816	76.118	0.03516	5.2236
	WikiTalkNet	517.42	0.00343	21.725	146.89	0.00109	8.2835	665.73	0.00356	25.758	668.30	0.00356	25.823
	RecLibimsetiNet	41.348	0.03263	0.9010	22.851	0.03400	0.7659	48.044	0.06648	1.4156	44.413	0.06273	1.3570
GroundTruth	WikiTopcat sNet	9.6105	0.00188	0.0913	6.1751	0.00187	0.0767	10.724	0.00189	0.0961	9.2087	0.00188	0.0898
	OrkutNet	194.24	0.00721	6.2222	95.580	0.00298	3.3811	654.44	0.12206	35.536	116.81	0.00329	3.1295
	YoutubeNet	37.006	0.00112	0.5858	33.861	0.00112	0.5684	46.425	0.00116	0.6403	31.298	0.00111	0.5312

Table 5

Performances of the proposed NBurr model in terms of RMSE, KLDiv, and MAE compared to the competitive heavy-tailed models over real-world networks

Networks		Burr			NBurr			Exponentiated Burr			MO Burr
Networks		RMSE	KLDiv	MAE	RMSE	KLDiv	MAE	RMSE	KLDiv	MAE	RMSE	KLDiv	MAE
Social	TwitterNet	204.35	0.1831	10.847	354.25	0.2857	15.603	53.863	0.0169	2.9494	68.004	0.0397	4.1974
	GplusNet	53.064	0.2299	0.9221	86.955	0.3113	1.1847	10.155	0.0678	0.2523	30.925	0.1475	0.6821
	DeliciousNet	349.66	0.2021	14.867	471.02	0.1349	17.874	281.34	0.0579	10.781	66.896	0.0185	4.2366
	LiveJournalNet	5025.2	0.1614	127.98	8100.9	0.1785	164.18	3473.6	0.0355	70.64	808.79	0.0101	24.501
	AthletesFacebookNet	100.16	0.2049	13.387	204.91	0.4164	23.839	15.461	0.0127	2.5674	25.099	0.0324	4.3180
Citation	HepThNet	73.531	0.1741	4.0821	122.79	0.2566	5.997	22.59	0.0255	2.331	25.42	0.0464	2.816
	PatentsNet	27.5K	0.2266	2049.5	34.8K	0.2366	2533.2	9612.7	0.0192	725.71	2424.5	0.0061	271.89
	CiteseerNet	889.88	0.3308	49.467	1156.2	0.2916	62.026	353.26	0.0299	21.921	195.67	0.0131	13.877
Web	GoogleNet	1809.1	0.124	45.023	1809.2	0.124	45.023	1514.5	0.0878	40.067	188.01	0.0157	9.6549
	BerkStanNet	615.03	0.1863	4.0722	615.01	0.1863	4.0721	185.04	0.1002	2.0198	322.63	0.1037	2.8203
	Wikipedia2009Net	4371.9	0.1352	164.58	4371.9	0.1352	164.58	2720.9	0.0798	116.43	781.87	0.0082	35.531
Product CoPurchasing	Amazon0601Net	1495.4	0.2708	70.281	2539.8	0.4022	114.59	286.61	0.0102	16.881	297.39	0.0382	22.199
	Amazon0505Net	1572.9	0.2463	73.003	2494.5	0.3711	111.56	358.59	0.0125	19.123	260.85	0.0342	20.136
	Amazon0312Net	1564.4	0.2425	71.875	2462.9	0.3686	109.21	338.03	0.0116	17.742	273.39	0.0352	20.381
Temporal	MathoverflowNet	213.91	0.2131	13.600	213.82	0.2132	13.612	41.934	0.0634	5.3161	92.603	0.0861	7.9912
	SuperuserNet	900.04	0.1808	33.837	900.33	0.1808	33.839	243.42	0.0616	13.199	354.72	0.0570	16.613
	AskubuntuNet	949.66	0.2091	39.419	949.73	0.2091	39.420	212.91	0.0649	12.451	389.14	0.0719	20.113
Communication	EmailEnronNet	246.51	0.1779	14.886	245.25	0.1778	14.859	121.47	0.0873	8.445	95.468	0.0689	7.664
	WikiTalkNet	9669.4	0.3376	293.63	9669.4	0.3376	293.63	7978.6	0.1902	246.26	672.32	0.0036	25.905
	RecLibimsetiNet	77.081	0.2198	2.1486	133.91	0.2096	2.7441	87.472	0.0755	1.4021	28.059	0.0359	0.6971
Groundtruth	WikiTopcatsNet	565.21	0.1377	2.6145	930.44	0.1612	3.8073	272.99	0.0464	1.5159	389.86	0.0629	2.2289
	OrkutNet	2443.6	0.5498	80.712	4299.3	0.8033	101.64	452.92	0.0459	19.624	261.75	0.0479	16.197
	YoutubeNet	1380.5	0.1342	15.690	1380.5	0.1342	15.691	1422.2	0.1416	17.219	143.79	0.0045	2.2564

Given a network, the information about the differences between actual and predicted degree frequencies can be determined by calculating the root mean squared error (RMSE) and mean absolute error (MAE). The higher similarity between actual and predicted distributions is achieved by generating smaller values of RMSE and MAE. From Tables 4 and 5, it is clear that the proposed NBurr distribution produces smaller RMSE and MAE values than other competitive distributions. This indicates that the proposed NBurr distribution is competitive in almost all the networks, except a few, where Burr, MO Burr, and power-law cutoff distribution perform better than the proposed one. Power-law and Pareto distributions provide lower performance compared to the other competing distributions in terms of RMSE and MAE measures over all the real-world networks, as clearly seen from Table 5. Similar performances have been observed through Table 4 in the case of Burr and MO Burr distributions as they produce similar RMSE and MAE values. The dissimilarity between two probability distributions can also be measured by calculating KLD values. The higher similarity between the actual and the predicted distribution is achieved by generating high KLD values. The proposed NBurr distribution produces smaller KLD values compared to others in almost all the networks as clearly seen from Tables 4 and 5. This in-turn suggests that the proposed NBurr distribution is much closure to the observed degree distribution and always superior to the state-of-the-art models in almost all the networks.

Thus in conclusion we can say that the proposed NBurr distribution performs better than the competing distributions by considering all the measures (RMSE, MAE, and KLDiv) together. This suggests/confirms that the observed distribution corresponding to a real-world network plausibly comes from the proposed NBurr distribution.

The scatter plot of the fitted results can be used to verify the effectiveness of the proposed NBurr distribution. To do this, the log-log plots of the original frequency distribution, the estimated frequency by NBurr distribution, and the frequency estimated by Burr, Exponentiated Burr, MO Burr, Power-law, Pareto Type-I, Log-normal and Power-law Cutoff distributions are drawn corresponding to a real-world network. For more clarification, eight such plots are given in Figures 2–5. These are the TwitterNet, LiveJournalNet, CiteseerNet, BerkStanNet, Amazon0601Net, SuperuserNet, EmailEnronNet, and WikiTopcatsNet. From Figures 2–5, it is quite clear that the proposed NBurr curve always passes through the middle of the scatter plot of each of the observed distribution. This signifies that the proposed NBurr distribution provides a better fit compared to the other competitive distributions in almost all of the networks. Thus, it is visually clear, through observing the plotted results, we may now conclude that the entire node degree distribution can be better represented by the NBurr distribution compared to other heavy-tailed distributions. Hence the proposed NBurr distribution, a modification of the Burr distribution with nonlinear exponent in the shape parameter, can be used for effective and efficient modeling of the entire degree distribution of real-world networks without ignoring the lower degree nodes. Finally, it is clear that these heavy-tailed network data sets when modeled in the whole range using the proposed NBurr distribution shows significant improvements in comparison to state-of-the-art models.

FIGURE 2.

Degree distribution of TwitterNet and LiveJournalNet in log-log scale

FIGURE 3.

Degree distribution of CiteseerNet and BerkStanNet in log-log scale

FIGURE 4.

Degree distribution of Amazon0601Net and SuperuserNet in log-log scale

FIGURE 5.

Degree distribution of EmailEnronNet and WikiTopcatsNet in log-log scale

5. Conclusion

In this paper, we present a new method called shape parameter transformation (SPT) to extend Burr and related families. The SPT method’s idea is to use a nonlinear exponent (depends on data and an additional parameter) instead of using a constant shape parameter in the Burr, Pareto, and exponentiated family of distributions. The method was first applied to the Burr distribution and a generalized Burr (GBurr) model is introduced. These newly introduced GBurr models belong to the maximum domain of attraction of the Frechet distribution and are right-tail equivalent to the Pareto distribution. Further, the SPT method is applied to the classical Pareto and exponentiated distributions. We also studied a relevant model, namely NBurr distribution, from the generalized Burr family and derived various statistical properties. The practical usefulness of the proposed NBurr distribution was shown using multiple heavy-tailed network data sets from different disciplines. It is interesting to note that the NBurr distribution is a new competitor for popularly used MO Burr and exponentiated Burr distributions. An immediate extension of this work is to apply these newly introduced probability models for modeling survival and lifetime data sets. Another possible extension of this work would be to look for implementing the proposed SPT method for other size distributions available in the statistics paradigm.

(Communicated by Gejza Wimmer)

Acknowledgement

The authors are thankful to Professor Gopal K. Basak of Indian Statistical Institute, Kolkata for constructive comments and insightful suggestions in this paper.

REFERENCES

1 [1] AFIFY, A. Z.—CORDEIRO, G. M.—ORTEGA, E. M.—YOUSOF, H. M.—BUTT, N. S.: The four-parameter Burr XII distribution: Properties, regression model, and applications, Comm. Statist. Theory Methods 47 (2018), 2605–2624.10.1080/03610926.2016.1231821Search in Google Scholar

2 [2] AMARAL, L. A. N.—SCALA, A.—BARTHELEMY, M.—STANLEY, H. E.: Classes of small-world networks, Proc. Natl. Acad. Sci. USA 97 (2000), 11149–11152.10.1073/pnas.200327197Search in Google Scholar

3 [3] ASADI, M.—EBRAHIMI, N.: Residual entropy and its characterizations in terms of hazard function and mean residual life function, Statist. Probab. Lett. 49 (2000), 263–269.10.1016/S0167-7152(00)00056-0Search in Google Scholar

4 [4] ASADL, M.: Characterization of the pearson system of distributions based on reliability measures, Statist. Papers 39 (1998), 347–360.10.1007/BF02927098Search in Google Scholar

5 [5] AUSTIN, J. A.: Control chart constants for largest and smallest in sampling from a normal distribution using the generalized Burr distribution, Technometrics 15 (1973), 931–933.10.1080/00401706.1973.10489126Search in Google Scholar

6 [6] BARABÁI, A. L.: The origin of bursts and heavy tails in human dynamics, Nature 435 (2005), 207–211.10.1038/nature03459Search in Google Scholar PubMed

7 [7] BARABÁSI, A. L.—ALBERT, R.: Emergence of scaling in random networks, Science 286 (1999), 509–512.10.1515/9781400841356.349Search in Google Scholar

8 [8] BROIDO, A. D.—CLAUSET, A.: Scale-free networks are rare, Nature Communications 10 (2019), 1–10.10.1038/s41467-019-08746-5Search in Google Scholar PubMed PubMed Central

9 [9] BURR, I. W.: Cumulative frequency functions, Ann. Math. Statist. 13 (1942), 215–232.10.1214/aoms/1177731607Search in Google Scholar

10 [10] CHATTOPADHYAY, S.—CHAKRABORTY, T.—GHOSH, K.—DAS, A. K.: Uncovering patterns in heavytailed networks: A journey beyond scale-free. In: 8th ACM IKDD CODS and 26th COMAD, 2021.10.1145/3430984.3431021Search in Google Scholar

11 [11] CHATTOPADHYAY, S.—MURTHY, C. A.—PAL, S. K.: Fitting truncated geometric distributions in large scale real world networks, Theoret. Comput. Sci. 551 (2014), 22–38.10.1016/j.tcs.2014.05.003Search in Google Scholar

12 [12] CLAUSET, A.—SHALIZI, C. R.—NEWMAN, M. E. J.: Power-law distributions in empirical data, SIAM Review 51 (2009), 661–703.10.1137/070710111Search in Google Scholar

13 [13] DOMMA, F.: Some properties of the bivariate Burr type III distribution, Statistics 44 (2010), 203–215.10.1080/02331880902986547Search in Google Scholar

14 [14] DUNNING, K. A.—HANSON, J. N.: Generalized pearson distributions and nonlinear programing, J. Stat. Comput. Simul. 6 (1977), 115–128.10.1080/00949657708810176Search in Google Scholar

15 [15] EMBRECHTS, P.—KLÜPPELBERG, C.—MIKOSCH, T.: Modelling Extremal Events: for Insurance and Finance, Springer Science & Business, Vol. 33, 2013.10.1007/BF01440733Search in Google Scholar

16 [16] FISK, P. R.: The graduation of income distributions, Econometrica 29 (1961), 171–185.10.2307/1909287Search in Google Scholar

17 [17] GOMES, A. E.—DA SILVA, C. Q.—CORDEIRO, G. M.: Two extended Burr models: Theory and practice, Comm. Statist. Theory Methods 44 (2015), 1706–1734.10.1080/03610926.2012.762402Search in Google Scholar

18 [18] GUPTA, R. C.—GUPTA, P. L.—GUPTA, R. D.: Modeling failure time data by lehman alternatives, Comm. Statist. Theory Methods 27 (1998), 887–904.10.1080/03610929808832134Search in Google Scholar

19 [19] GUPTA, R. D.—GUPTA, R. C.: Analyzing skewed data by power normal model, Test 17 (2008), 197–210.10.1007/s11749-006-0030-xSearch in Google Scholar

20 [20] GUPTA, R. D.—KUNDU, D.: Generalized exponential distributions, Aust. N. Z. J. Stat. 41 (1999), 173–188.10.1111/1467-842X.00072Search in Google Scholar

21 [21] JAMAL, F.—CHESNEAU, C.—NASIR, M. A.—SABOOR, A.—ALTUN, E.—KHAN, M. A.: On a modified Burr XII distribution having flexible hazard rate shapes, Math. Slovaca 70 (2020), 193–212.10.1515/ms-2017-0344Search in Google Scholar

22 [22] JAYAKUMAR, K.—MATHEW, T.: On a generalization to Marshall-Olkin scheme and its application to Burr type XII distribution, Statist. Papers 49 (2008), 421–439.10.1007/s00362-006-0024-5Search in Google Scholar

23 [23] KIM, M.—LESKOVEC, J.: Multiplicative attribute graph model of real-world networks, Internet Math. 8 (2012), 113–160.10.2172/1124904Search in Google Scholar

24 [24] KLEIBER, C.—KOTZ, S.: Statistical Size Distributions in Economics and Actuarial Sciences, John Wiley & Sons 470, 2003.10.1002/0471457175Search in Google Scholar

25 [25] KUMAR, D.: The Burr type XII distribution with some statistical properties, J. Data Sci. 16 (2017), 509–534.10.6339/JDS.201707_15(3).0008Search in Google Scholar

26 [26] KUMAR, D.—SARAN, J.—JAIN, N.: The exponentiated Burr XII distribution: moments and estimation based on lower record values, Sri Lankan J. Appl. Stat. 18 (2017), 1–18.10.4038/sljastats.v18i1.7930Search in Google Scholar

27 [27] LEHMANN, E. L.: The power of rank tests, Ann. Math. Statist. 24 (1953), 23–43.10.1007/978-1-4614-1412-4_33Search in Google Scholar

28 [28] LESKOVEC, J.—CHAKRABARTI, D.—KLEINBERG, J.—FALOUTSOS, C.—GHAHRAMANI, Z.: Kronecker graphs: An approach to modeling networks, J. Mach. Learn. Res. 11 (2010), 985–1042.Search in Google Scholar

29 [29] LOMAX, K. S.: Business failures: Another example of the analysis of failure data, J. Amer. Statist. Assoc. 49 (1954), 847–852.10.1080/01621459.1954.10501239Search in Google Scholar

30 [30] MARSHALL, A. W.—OLKIN, I.: A new method for adding a parameter to a family of distributions with application to the exponential and Weibull families, Biometrika 84 (1997), 641–652.10.1093/biomet/84.3.641Search in Google Scholar

31 [31] MUDHOLKAR, G. S.—SRIVASTAVA, D. K.—FREIMER, M.: The exponentiated Weibull family: A reanalysis of the bus-motor-failure data, Technometrics 37 (1995), 436–445.10.1080/00401706.1995.10484376Search in Google Scholar

32 [32] NADARAJAH, S.—KOTZ, S.: The exponentiated type distributions, Acta Appl. Math. 92 (2006), 97–111.10.1007/s10440-006-9055-0Search in Google Scholar

33 [33] NEWMAN, M. E. J.: The structure of scientific collaboration networks, Proc. Natl. Acad. Sci. USA 98 (2001), 404–409.10.1515/9781400841356.221Search in Google Scholar

34 [34] NEWMAN, M. E. J.: The structure and function of complex networks, SIAM Review 45 (2003), 167–256.10.1137/S003614450342480Search in Google Scholar

35 [35] NEWMAN, M. E. J.: Power laws, Pareto distributions and Zipf’s law, Contemp. Phys. 46 (2005), 323–351.10.1080/00107510500052444Search in Google Scholar

36 [36] PARANAíBA, P. F.—ORTEGA, E. M.—CORDEIRO, G. M.—PESCIM, R. R.: The beta Burr XII distribution with application to lifetime data, Comput. Statist. Data Anal. 55 (2011), 1118–1136.10.1016/j.csda.2010.09.009Search in Google Scholar

37 [37] PEARSON, K.: Contributions to the mathematical theory of evolution, Philosophical Transactions of the Royal Society of London 185 (1894), 71–110.10.1098/rsta.1894.0003Search in Google Scholar

38 [38] RODRIGUEZ, R. N.: A guide to the Burr type XII distributions, Biometrika 64 (1977), 129–134.10.1093/biomet/64.1.129Search in Google Scholar

39 [39] SÁNCHEZ, E.: Burr type XII as a superstatistical stationary distribution, Physica A: Stat. Mech. Appl. 516 (2019), 443–446.10.1016/j.physa.2018.10.044Search in Google Scholar

40 [40] STUMPF, M. P.—WIUF, C.—MAY, R. M.: Subnets of scale-free networks are not scale-free: sampling properties of networks, Proc. Natl. Acad. Sci. USA 102 (2005), 4221–4224.10.1073/pnas.0501179102Search in Google Scholar PubMed PubMed Central

41 [41] TADIKAMALLA, P. R.: A look at the Burr and related distributions, Int. Stat. Rev. 48 (1980), 337–344.10.2307/1402945Search in Google Scholar

42 [42] TAKAHASI, K.: Note on the multivariate Burr’s distribution, Ann. Inst. Statist. Math. 17 (1965), 257–260.10.1007/BF02868169Search in Google Scholar

43 [43] VOITALOV, I.—HOORN, P. V.—HOFSTAD, R. V.—KRIOUKOV, D.: Scale-free networks well done, Phys. Rev. Research 1 (2019), Art. 033034.10.1103/PhysRevResearch.1.033034Search in Google Scholar

44 [44] YANG, J.—LESKOVEC, J.: Defining and evaluating network communities based on ground-truth, Knowl. Inf. Syst. 42 (2015), 181–213.10.1007/s10115-013-0693-zSearch in Google Scholar

Received: 2020-09-19

Accepted: 2021-01-11

Published Online: 2022-02-16

Published in Print: 2022-02-16

This work is licensed under the Creative Commons Attribution 4.0 International License.

Articles in the same Issue

https://doi.org/10.1515/ms-2022-0016

Keywords for this article

Burr distribution; exponentiated distributions; stochastic ordering; reliability properties; maximum likelihood

Creative Commons

BY 4.0