0% found this document useful (0 votes)

13 views14 pages

Bayesian Test for Survival Distributions

The paper evaluates the Fully Bayesian Significance Test (FBST) for discriminating between survival distributions, specifically lognormal, gamma, and Weibull models. It introduces a linear mixture model approach to test hypotheses regarding the mixture weights, reparametrizing the distributions in terms of mean and variance to reduce parameter estimation complexity. Numerical results from simulations and a real dataset of patients with chronic kidney failure are presented to illustrate the application of the proposed method.

Uploaded by

silva.filipe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

13 views14 pages

Bayesian Test for Survival Distributions

Uploaded by

silva.filipe

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Communications in Statistics - Theory and Methods

ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: [Link]

Bayesian significance test for discriminating

between survival distributions

Cachimo Combo Assane, Basilio de Bragança Pereira & Carlos Alberto de

Bragança Pereira

To cite this article: Cachimo Combo Assane, Basilio de Bragança Pereira & Carlos
Alberto de Bragança Pereira (2017): Bayesian significance test for discriminating
between survival distributions, Communications in Statistics - Theory and Methods, DOI:
10.1080/03610926.2017.1406117

To link to this article: [Link]

Published online: 05 Dec 2017.

Submit your article to this journal

View related articles

View Crossmark data

Full Terms & Conditions of access and use can be found at

[Link]

Download by: [[Link]] Date: 05 December 2017, At: 15:50

COMMUNICATIONS IN STATISTICS—THEORY AND METHODS
, VOL. , NO. , –
[Link]

Bayesian significance test for discriminating between survival

distributions
Cachimo Combo Assanea , Basilio de Bragança Pereirab ,
and Carlos Alberto de Bragança Pereirac
a
Universidade Eduardo Mondlane, Maputo, Mozambique; b Universidade Federal do Rio de Janeiro (UFRJ),
Rio de Janeiro, Brazil; c Universidade de São Paulo (USP), São Paulo, Brazil

ABSTRACT ARTICLE HISTORY

An evaluation of FBST, Fully Bayesian Significance Test, restricted to Received  September 
Downloaded by [[Link]] at 15:50 05 December 2017

survival models is the main objective of the present paper. A Survival Accepted  November 
distribution should be chosen among the tree celebrated ones, lognor-
KEYWORDS
mal, gamma, and Weibull. For this discrimination, a linear mixture of FBST; Mixture model; Model
the three distributions is an important tool: the FBST is used to test the choice; Separate models;
hypotheses defined on the mixture weights space. Another feature of Signiﬁcance test; Survival
the paper is that all three distributions are reparametrized in that all the distributions.
six parameters are written as functions of the mean and the variance of
the population been studied. Some numerical results from simulations MATHEMATICS SUBJECT
with some right-censored data are considered. CLASSIFICATION
F; F

1. Introduction
In many scientific disciplines, researchers are constantly faced with the fundamental problem
of choosing among alternative statistical models. The Neyman-Pearson theory of hypothesis
testing applies only if the models belong to the same family of distributions. Alternatively, spe-
cial procedures are required if the models belong to families that are separate (or non-nested)
in the sense that an arbitrary member of one family cannot be obtained as a limit of members
of the other. The set of separate families of probability distributions includes the ones used
here: lognormal, gamma, and Weibull models (Pereira 1981; Araujo and Pereira 2007; Pereira
and Pereira 2017) which have been used widely to describe survival data (Lawless 2002;
Lee and Wang 2003).
A considerable amount of research on separate families of hypotheses has been realized
since the fundamental work of Cox (1961, 1962), who first dealt with the problem. For reviews
and references, see Araujo et al. (2005); Araujo and Pereira (2007); and Pereira and Pereira
(2017).
The Fully Bayesian Significance Test (FBST) introduced by Pereira and Stern (1999) is an
alternative test to the ones that are based on Bayes factor or on the classical p-value; mostly
for the case of precise hypotheses. The basis for the FBST is an index known as e-value
(e stands for evidence) that measures the inconsistency of the hypothesis. For this, it considers
the tangent set, T ; the set of all parameter values for which their posterior density values are

CONTACT Basilio de Bragança Pereira [Link]@[Link]/[Link] Department of Mathematics and

Informatics, Faculty of Sciences; Universidade Eduardo Mondlane; Av. Julius Nyerere/Campus , P.O. Box , Maputo,
Mozambique.
Color versions of one or more of the ﬁgures in the article can be found online at [Link]/lsta.
©  Taylor & Francis Group, LLC
2 C. C. ASSANE ET AL.

greater than the values of the posterior densities of all points that attend the hypothesis. For
reviews and further references on FBST, see Pereira et al. (2008) and Stern and Pereira (2014).
For a few interesting applications illustrating the use of e-values and the FBST to practical
problems, see Diniz et al. (2012), Lauretto et al. (2003), Lauretto et al. (2007), and Pereira and
Stern (1999).
In the present work, we consider the FBST for discriminating between the lognormal,
gamma and Weibull distributions. We formulate this problem in the context of linear mixture
model, as suggested by Cox (1961). It means that, the models under comparison are consid-
ered as components of a finite mixture model. The FBST is used for testing hypotheses defined
on the mixture weights space. The e-value is the complementary of the posterior probability
of the tangent set T ; ev = 1 − Pr(T |Data),
Additionally, the density functions of the mixture components are reparametrized in terms
of the mean μ and the variance σ 2 of the population. Hence, the models under discrimina-
tion share common parameters (Kamary et al. 2014; Pereira and Pereira 2017). A standard
Bayesian approach to finite mixture models is to consider different pairs of parameters for
Downloaded by [[Link]] at 15:50 05 December 2017

each of these models and to adopt independent prior distributions for each pair of param-
eters and a Dirichlet prior on the mixture weights (Lauretto and Stern 2005; Lauretto et al.
2007). However, since the comparison between the models is based on the same dataset and
on the same sample, we believe that it would be inappropriate to consider different means
and variances for these models. Note that when we try to define the prior distributions for
the population mean and variance, our uncertainties about these default parameters are
not related to the models under comparison. In this way, the parametrization can be used
to any distribution that may be reparametrized in the way was done here. This practical
argument was the reason we decided to present our “mixture” model. We are not certain
that mixture is the correct word because, in fact, we have a convex combination of density
functions.
Moreover, this reparametrization reduces the number of the parameters to be esti-
mated: in our case, including the weights, from eight to only four. The reduction of the
parameter space may lead to low computational costs..
Note that mean and variance are parameters that can be thought as existing invisible
quantities, but the weights of the convex combination do not. The vector of weights must
be defined in a simplex and it is an artifact that helps to discriminate, between the three
models, those who best adjust the observations. It can happen that the own combination
can be the best model as well the combination of a pair of them. To understand the role of
Dirichlet distributions we refer to Pereira and Stern (2008) and Stern (2011). It is impor-
tant to call attention to the fact that the posterior distribution of the weights is the “arti-
fact” that induces the model choice (Cox 1961, 1962).
To illustrate the procedure, numerical results based on simulated right-censored survival
times were considered. Also, a real example is introduced to use the lognormal-gamma-
Weibull mixture model to the dataset of patients, from Rio de Janeiro hospitals, with end-stage
chronic kidney failure who received hemodialysis.
Section 2 presents a brief review of basic concepts and notation for survival analysis. The
parametric distributions used in this paper are also described. Section 3 reviews the basic con-
cepts o FBST. Section 4 discusses the FBST formulation for discriminating between survival
distributions in the context of mixture models. Section 5 presents the results of the simulation
study. Section 6 is about the use of the lognormal-gamma-Weibull on the real dataset. Final
remarks are presented in Section 7.
COMMUNICATIONS IN STATISTICS—THEORY AND METHODS 3

2. Survival analysis

2.1. Basic concepts and notation

Survival analysis is concerned with the analysis of time to occurrence of a certain event of
interest, such as failure, death, relapse or development of a given disease.
Let T be a non-negative random variable representing the time until some event of interest.
There are three functions of primary interest used to characterize the distribution of T , namely
the survival function, the probability density function and the hazard function (Lee and Wang
2003).
The survival function, denoted by S(t ), is defined as the probability that an individual
survives beyond time t:

S(t ) = P(T > t ) = 1 − F (t ), for t > 0, (2.1)

where F (t ) is the distribution function of T . Note that S(t ) is a nonincreasing continuous

Downloaded by [[Link]] at 15:50 05 December 2017

function of time t with S(0) = 1 and S(∞) = lim∞ S(t ) = 0.

The probability density function, denoted by f (t ), is the probability of failure in a small
interval per unit time. It can be expressed as
dF (t ) d{1 − S(t )} dS(t )
f (t ) = = =− . (2.2)
dt dt dt
The hazard function, denoted by h(t ), represents the probability of failure during a very small
time interval, assuming that the individual has survived to the beginning of the interval:
P(t ≤ T < t + t|T ≥ t ) f (t )
h(t ) = lim = . (2.3)
t→0 t S(t )
This function is also known as the conditional failure rate. The cumulative hazard function is
defined as
t
H(t ) = h(u)d(u). (2.4)
0

Therefore, when t = 0 then, S(t ) = 1 and H(t ) = 0; and when t = ∞ then, S(t ) = 0 and
H(t ) = ∞.

2.2. Parametric survival distributions

In this paper, we consider the the FBST for discriminating between the lognormal, gamma
and Weibull distributions which are most frequently used in modeling survival data (Lawless
2002; Lee and Wang 2003). The probability density functions, the survival functions and the
hazard functions of these distributions are highlighted below.
i) Let T be a lognormal random variable with parameters α = (α1 , α2 ), denoted by
T ∼ LN(α1 , α2 ),

1 (log t − α1 )2
fL (t|α) = √ exp − , −∞ < α1 < ∞, α2 , t > 0;
t 2πα2 2α2
∞
1 1 (log t − α1 )2
SL (t|α) = √ exp − dy
2πα2 t t 2α2
4 C. C. ASSANE ET AL.

(log t − α1 )
=1− √ ;
α2
fLN ()
hL (t|α) = .
SLN ()

ii) If T has a Gamma distribution with parameters γ = (γ1 , γ2 ), denoted by

T ∼ G(γ1 , γ2 ), then

1 γ2 −1 t
fG (t|γ ) = t exp − , γ1 , γ2 , t > 0;
(γ2 )γ1γ2 γ1
t
1 γ2 −1 u
SG (t|γ ) = 1 − γ2 u exp − du;
0 (γ2 )γ1 γ1
fG ()
hG (t|γ ) = .
SG ()
Downloaded by [[Link]] at 15:50 05 December 2017

iii) If T has a Weibull distribution with parameters β = (β1 , β2 ), denoted by

T ∼ W (β1 , β2 ), then

β2 β2 −1 t β2
fW (t|β ) = β2 t exp − , β1 , β2 , t > 0;
β1 β1

t β2
SW (t|β ) = exp − ;
β1
β2
hW (t|β ) = t β2 −1 .
β1β2

3. Fully Bayesian significance test (FBST)

The FBST of Pereira and Stern (1999), which is reviewed in Pereira et al. (2008), is a Bayesian
version of significance testing, as considered by Cox (1977) and Kempthorne (1976), for pre-
cise (or sharp) hypotheses.
First, let us consider a real parameter θ, a point in the parameter space ⊂ , and an
observation y of the random variable Y . A frequentist looks for the set I ∈ of sample points
that are at least as inconsistent with the hypothesis as y is. A Bayesian looks for the tangential
set T (y) ⊂ (Pereira et al. 2008), which is a set of parameter points that are more consistent
with the observed y than the hypothesis is. An example of a sharp hypothesis in a parameter
space of the real line is of the type H : θ = θ0 . The evidence value in favor of H for a frequentist
is the usual p-value, P(Y ∈ I|θ0 ), whereas for a Bayesian, the evidence in favor of H is the e-
value, ev = 1 − Pr(θ ∈ T (y)|y).
In the general case of multiple parameters, ⊂ k , let the posterior distribution for θ
given y be denoted by q(θ|y) ∝ π (θ )L(y, θ ), where π (θ ) is the prior probability density of
θ and L(y, θ ) is the likelihood function. In this case, a sharp hypothesis is of the type H :
θ ∈ H ⊂ , where H is a subspace of smaller dimension than . Letting supH denote the
supremum of H , we define the general Bayesian evidence and the tangential set, T (y), as
follows:

q∗ = sup q(θ|y) and T (y) = {θ : q(θ|y) > q∗ }. (3.1)

H
COMMUNICATIONS IN STATISTICS—THEORY AND METHODS 5

The Bayesian evidence value against H is the posterior probability of T (y),

ev = Pr(θ ∈ T (y)|y) = q(θ|y)dθ; consequently, ev = 1 − ev. (3.2)
T (y)

It is important to note that evidence that favors H is not evidence against the alternative,
H = \ H, because it is not a sharp hypothesis. This interpretation also holds for p-values
in the frequentist paradigm. As in Pereira et al. (2008), we would like to point out that this
Bayesian significance index uses only the posterior distribution, with no need for additional
artifacts such as the inclusion of positive prior probabilities for the hypotheses or the elimi-
nation of nuisance parameters. The computation of the e-values does not require asymptotic
methods, and the only technical tools needed are numerical optimization and integration
methods.

4. Mixture of survival models

Downloaded by [[Link]] at 15:50 05 December 2017

Let us consider a dataset y = {y1 , . . . , yn } and m alternative parametric survival distributions

with densities f1 (y|ψ1 ), f2 (y|ψ2 ), . . . , fm (y|ψm ). Here, ψk , k = 1, . . . , m, are unknown (vec-
tor) parameters and the families of distributions are separate. The problem of interest is to
measure the evidence in favor of each model for fitting the dataset. As suggested by Cox
(1961), we can consider a general model including all candidate distributions where the choice
of a specific distribution is a special case. In this work, we formulate the FBST for the linear
mixture of the survival models as a selection procedure. Denoting θ = (ψ1 , . . . , ψm , p), the
density function for m−component mixture model is
m
f (y j |θ) = p1 f1 (y j |ψ1 ) + · · · + pm fm (y j |ψm ) pk ≥ 0, pk = 1. (4.1)
k=1

where p = (p1 , . . . , pm ) is the vector of the mixture weights.

In the presente work, the density functions of the mixture components in (4.1) are
reparametrized in terms of the mean μ and the variance σ 2 of the population. Hence, the
models under comparison share common parameters (Kamary et al. 2014; Pereira and Pereira
2017). The main reason for this reparametrization is that, since the comparison between the
models is based on the same dataset and on the same sample, we believe that it would be
inappropriate to consider different means and variances for these models as is commonly
performed in traditional Bayesian approach to finite mixture model. Therefore, we have
θ = (μ, σ 2 , p) denoting all parameters of the mixture model, where μ and σ 2 are the con-
necting parameters, with p corresponding to the vector of the mixture weights.
Assuming that the yi are conditionally (on the parameter) independent, the likelihood
function is defined as
n m
L(y, θ) = pk fk (y j |μ, σ ). (4.2)
j=1 k=1

The families of distributions considered include the lognormal, gamma and Weibull mod-
els. Hence, the relationship between the parameters of these models through the μ and σ 2 is
described as follows.
(i) Let y be a lognormal(α1 , α2 ), α1 ∈ R and α2 > 0, with probability density function

1 (log y − α1 )2
fL (y|α1 , α2 ) = √ exp − .
y 2πα2 2α2
6 C. C. ASSANE ET AL.

We then have
⎧
α1 +α2 /2
2
⎨ α1 = log √ μ2 2
μ = E(y|α1 , α2 ) = e μ +σ
⇒ (4.3)
σ 2 = Var(y|α1 , α2 ) = (eα2 − 1)e2α1 +α2 ⎩ α = log μ2 +σ 2 .
2 μ2

(ii) Let y be a gamma(γ1 , γ2 ), γ1 > 0 and γ2 > 0, with probability density function

1 γ2 −1 y
fG (y|γ1 , γ2 ) = y exp − .
(γ2 )γ1γ2 γ1
Therefore

σ2
μ = E(y|γ1 , γ2 ) = γ1 γ2 γ1 = μ
⇒ (4.4)
σ 2 = Var(y|γ1 , γ2 ) = γ2 γ12 γ2 = μ2
.
σ2

(iii) When y ∼ Weibull(β1 , β2 ), β1 > 0 and β2 > 0, with probability density function

Downloaded by [[Link]] at 15:50 05 December 2017

β2 β2 −1 y β2
fW (y|β1 , β2 ) = β2 y exp − ,
β1 β1

then

μ = E(y|β1 , β2 ) = β1 (1 + 1/β2 )
σ 2 = Var(y|β1 , β2 ) = β12 (1 + 2/β2 ) − β12 2 (1 + 1/β2 )
μ
β1 = (1+1/β2 )
⇒ 2 2 (4.5)
2 log (1 + 1/β2 ) − log (1 + 2/β2 ) + log μ μ+σ
2 = 0.

In order to find β2 , the Newton-Rapson method can be used to solve the nonlinear equa-
tion. Here, we use the nleqslv” function in the R” package of the same name.
A special feature of survival data is that survival times are frequently censored. The survival
time of an individual is said to be censored when the event of interest has not been observed
for that individual, but is known only to occur in a certain period of time. There are various
categories of censoring, such as right censoring, left censoring and interval censoring (see
Klein and Moeschberger (2003) for more details). In this paper, we restrict ourselves to data in
which the survival times are subject to right censoring, which is the most common censoring
mechanism in medical research.
In the model for right-censored data, it is convenient to consider the following notation.
Each individual j is assumed to have an event time Tj and a censoring time C j . The observa-
tions consist of (y1 , δ1 ), (y2 , δ2 ), . . . , (yn , δn ), where y j = min{Tj , C j } and δ j = I(Tj ≤ C j ),
indicating whether Tj was observed (δ j = 1) or not (δ j = 0).
Note that the likelihood function given by (4.2) is for uncensored (or exact) observations.
Assuming noninformative censoring, i.e, independence between Tj and C j , then, the likeli-
hood function for right-censored observations is
n n
L(y, θ) = f (y j , δ j |θ) ∝ [ f (y j |θ)]δ j [S(y j |θ)]1−δ j
j j

n
m
δ j m
1−δ j
∝ pk fk (y j |μ, σ ) pk Sk (y j |μ, σ ) , (4.6)
j k=1 k=1

where, Sk is the survival function associated with the mixture component k.

COMMUNICATIONS IN STATISTICS—THEORY AND METHODS 7

Assuming independence, the joint prior density function of θ = (μ, σ 2 , p) is given by

π (θ) = π1 (p)π2 (μ)π3 (σ 2 ). Therefore, according to the Bayesian paradigm, the posterior
density of θ is
f (θ|y) ∝ L(y, θ)π (θ). (4.7)
In this paper, the prior distributions for the connecting parameters, μ and σ 2 , are assumed
to be independent gamma distributions, both with a mean of one and a variance of 100, that
is, μ, σ 2 ∼ gamma(0.01, 100) (Pereira and Pereira 2017). For the mixture weights, we use a
Dirichlet prior, p ∼ Dir(1, 1, 1) when all families of models are considered (m = 3) or a Beta
prior with parameters (1,1) (uniform(0, 1)) for any combination of m = 2.
In order to measure the evidence in favour of each model, the hypotheses on the mixture
weights are tested (Kamary et al. 2014; Pereira and Pereira 2017).
The hypothesis specifying that y has the density function fk (y|ψk ) is equivalent to
Hk : pk = 1 ∧ pi = 0, i = k. (4.8)
Downloaded by [[Link]] at 15:50 05 December 2017

On the other hand, the hypothesis that y has not the density fk (y|ψk ) is equivalent to

H : pk = 0 ∧ pi = 1. (4.9)
i=k

The alternative hypotheses to (4.8) and (4.9) are Ak : pk < 1 and Ak : pk > 0, respectively,
which are not sharp anyway.
The FBST procedure is used to test Hk , k = 1, . . . , m, according to the expressions (3.1)
and (3.2). For the optimization step, we used the conjugate gradient method (Fletcher and
Reeves 1964). In order to perform the integration over the posterior measure, we used an
Adaptive Metropolis Markov chain Monte Carlo algorithm (MCMC) of Haario et al. (2001).
In this paper, the implementation of the Bayesian models is carried out using LaplacesDe-
mon” R” package. The LaplacesDemon” is an open-source package that provides a complete
environment for simulation in Bayesian inference (Statisticat, LCC 2016).

5. Simulations
In this section we present some numerical results based on simulated right-censored survival
times in order to evaluate the performance of the FBST for discriminating between the sur-
vival distributions via lognormal-gamma-Weibull mixture model (LGW). The main purpose
is to measure the convergence rate of correct decisions, concerning the identification of the
true model used to generate the survival times T .
The simulations of this paper were performed on a Intel(R) Core(TM) i7-5500U CPU@
2.40GHz computer.

5.1. Simulation scheme of sample points

Let HL , HG and HW be the hypotheses specifying the probability density functions of the log-
normal, gamma and Weibull distributions, respectively. From each distribution, we generate
200 samples of sizes n = 100, 200, 300, and 500. Each sample contain a desired proportion of
right-censored observations.
The steps used to simulate a sample, y, of size n, in which part of the observations is right-
censored, are shown below. For this example, we assume that the true survival times has a
lognormal distribution.
8 C. C. ASSANE ET AL.

1. Assign values to parameters μ e σ 2 ;

2. Calculate the lognormal parameters (α1 , α2 ) using the expressions (4.3);
3. For j = 1, . . . , n,
r Generate the survival time Tj from lognormal(α1 , α2 );
r Generate the right-censoring time C j from a exponential distribution, i.e, C j ∼
Exp(λ), where the parameter λ is chosen such that approximately a desired per-
centage of simulated observations are right-censored;
r Obtain the observed time y j = min{Tj , C j }
r Create an indicator random variable δ j = I(Tj ≤ C j )
Using this generated sample, we obtain the posterior samples for the mixture parameters
from Adaptive Metropolis algorithm and we use the FBST to calculate the evidence measures
in favor of each model.
The value for the censoring distribution parameter, λ, is determined by numerical methods
(Wan 2017). We let pc denote the right-censoring probability. We suppose that the censoring
time C has exponencial density function g(c|λ) and the independence assumption between T
Downloaded by [[Link]] at 15:50 05 December 2017

and C holds. In order to simulate a sample with approximately pc % of right-censored obser-

vations, the value of λ is obtained by solving the following equation:

pc = Pr(δ = 0|λ, μ, σ 2 )
= Pr(C ≤ T ≤ ∞, 0 ≤ C ≤ ∞)
= 1 − Pr(0 ≤ T ≤ C, 0 ≤ C ≤ ∞)
∞ c
=1− g(c|λ) fL (t|μ, σ )dtdc
∞
0 0

=1− g(c|λ)FL (c|μ, σ )dc, (5.1)

where fL and FL are the lognormal probability density and distribution functions of survival
times, respectively.
For generating right-censored survival times from the gamma and Weibull distributions,
an analogous procedure to that used for the lognormal distribution is employed.

5.2. Criteria for evaluating the performance of the FBST

In order to evaluate the performance of the FBST on selecting the true distribution used
to generate the survival times, we have compared the measures of evidence in favor of the
hypotheses H : pk = 0 and H : pk = 1, k = L, G, W , where pk are respectively the mixture
weights associated with the lognormal, gamma and Weibull components in the LGW mix-
ture model.
For instance, suppose again that the true survival time has a lognormal distribution. We
consider that the FBST has made a correct choice on the LGW model, if the evidence in favor
of H : pL = 0 is less than that in favor of H : pG = 0 and H : pW = 0, and the evidence in
favor of H : pL = 1 is greater than that in favor of H : pG = 1 e H : pW = 1.
The calculation of the proportions of correct decisions made by FBST is based on 200
replicates. In these simulations, we have assigned μ = 20 and σ 2 = 50. The FBST proce-
dure is evaluated considering the samples with different censoring percentages: 10%, 30%
and 50%.
COMMUNICATIONS IN STATISTICS—THEORY AND METHODS 9

5.3. Simulation results

Table 1 presents the mean of the estimates for the LGW mixture model parameters and the
percentages of correct decisions made by FBST on selecting the true distribution used to gen-
erate the survival times. It is observed that, regardless of the distribution used for generating
the survival times and the sample sizes, the estimates for the mean μ are very close to each
other and to the true value of the parameter. For the estimates of the variance σ 2 , we observe
a variation between them but, in general, they approach the true value of the parameter as the
sample size increases.
It is observed that the FBST presents a high performance on identifying the Weibull dis-
tribution as the true data generation process and low performance on identifying the gamma
distribution. This happens because, regarding the parameters chosen for these simulations,
the gamma and lognormal densities are very similar. The general pattern of the simulation
results shows that the FBST achieves good performance even for samples with 50% right-
censoring.
Downloaded by [[Link]] at 15:50 05 December 2017

Table . Mean of estimates for LGW model parameters and percentages of correct decisions made by FBST
on selecting the true distribution used to generate the survival times, using samples with diﬀerent right-
censoring percentages.
μ σ2 pL pG pW
% of Rc† Model n   — — — % of Cd†

 Lognormal  . . . . . 

 . . . . . 
 . . . . . 
 . . . . . 
Gamma  . . . . . 
 . . . . . 
 . . . . . 
 . . . . . 
Weibull  . . . . . 
 . . . . . 
 . . . . . 
 . . . . . 
 Lognormal  . . . . . 
 . . . . . 
 . . . . . 
 . . . . . 
Gamma  . . . . . 
 . . . . . 
 . . . . . 
 . . . . . 
Weibull  . . . . . 
 . . . . . 
 . . . . . 
 . . . . . 
 Lognormal  . . . . . 
 . . . . . 
 . . . . . 
 . . . . . 
Gamma  . . . . . 
 . . . . . 
 . . . . . 
 . . . . . 
Weibull  . . . . . 
 . . . . . 
 . . . . . 
 . . . . . 
† percentage of right-censoring.
‡ percentage of correct decision.
10 C. C. ASSANE ET AL.

6. Application: Choice of a survival model for patients with end-stage kidney

disease

6.1. Dataset
The dataset used in this paper refers to a cohort study of 473 patients with end-stage chronic
kidney failure who received hemodialysis (HD) in four centers in the State of Rio de Janeiro,
Brazil. The patients were followed up 11 years. The observed time for each patient was the
number of months from admission to hemodialysis until death or the end of the observation
period (kidney transplant or end of the study) which indicates a right-censored survival time.
For a complete description of this dataset, see Alves et al. (2014).
In this paper, our main interest is to apply the LGW model to the survival data for HD
patients and use the FBST procedure to examine the mixture parameters in order to choose
the parametric distribution that best fits the observed data. But before that, we have performed
pairwise comparisons by fitting the lognormal-Weibull, lognormal-gamma, and gamma-
Downloaded by [[Link]] at 15:50 05 December 2017

Weibull mixture models.

6.2. Results
The measures of evidence provided by HD data in favor of the three models concerning
the pairwise comparisons are presented in Table 2. For the comparison between the log-
normal and Weibull distributions, the FBST indicates to choose the lognormal model since
the e-values ev (HL ) = 0.874 and ev (HW ) = 0.043. For selecting between the lognormal and
gamma distributions, the evidence measures indicate that both models provide good fit to the
dataset. Nevertheless, also we would prefer to choose the lognormal model which is the most
plausible. The results of the tests for comparison between the gamma and Weibull distribu-
tions indicate that the Weibull distribution does not provide reasonable fit to the dataset.
Discrimination based on the LGW mixture model
In order to test simultaneously the three hypotheses, we have applied the the LGW model,

f (y|p, μ, σ ) = p1 fL (y|μ, σ ) + p2 fG (y|μ, σ ) + p3 fW (y|μ, σ ), (6.1)

to the HD data.
The estimates for the parameters of the model (6.1) are presented in Table 3. Here, SD, 2.5%
and 97.5% denote the standard deviation, the 2.5th and the 97.5th percentiles of the posterior
distribution of the LGW parameters, respectively. Both the classical and the Bayesian mea-
sures of evidence, presented in Table 4, indicate that neither the gamma and Weibull models
should be considered because the null hypotheses H : p2 = 0 e H : p3 = 0 are not rejected.

Table . Measures of evidence provided by HD data.

Evidence in favor of null hypothesis
Comparison Null hypothesis e-value p-value∗

HL × HW HL . .
HW . .
HL × HG HL . .
HG . .
HG × HW HG . .
HW . .
∗ p-value calculated according to Diniz et al. ().
COMMUNICATIONS IN STATISTICS—THEORY AND METHODS 11

Table . Summary of the posterior distribution of the LGW parameters.

Parameter Mean SD 2.5% Median 97.5%

p1 -lognormal . . . . .

p2 -gamma . . . . .
p3 -Weibull . . . . .
μ . . . . .
σ2 . . . . .

Table . Hypothesis testing on the mixture weights of LGW model.

Hiptese e-valor p-valor∗

p1 = 0 . .
p2 = 0 . .
p3 = 0 . .
∗ p-value calculated according to Diniz et al. ().
Downloaded by [[Link]] at 15:50 05 December 2017

Consequently, among the three models, the lognormal model is the most appropriate for mod-
eling HD data.
Figure 1 displays the survival curves calculated using Bayesian estimates of the lognormal
model (Table 5), the LGW mixture model (Table 3) and a procedure called the piecewise expo-
nential estimator (PEXE), introduced by Kim and Proschan (1976), representing the observed
data. Unlike the well-known Kaplan-Meier estimator, the PEXE is smooth and continuous
estimator of the survival function.
It appears reasonable to disregard both the gamma and the Weibull models; the lognormal
model by itself produces a good estimate of survival function.

Time to Survival/Progression
1.0

PEXE

Lognormal
0.8

LGW mixture
Probability

0.6
0.4

0 5 10 15 20 25

Time

Figure . Survival curves based on the estimates of the lognormal model, the LGW model and the PEXE.
12 C. C. ASSANE ET AL.

Table . Summary of the posterior distribution of lognormal parameters.

Parmetro Mean SD 2.5% Median 97.5%

μ . . . . .

σ2 . . . . .

Note that the preference for the lognormal model is evident in evaluating the LGW mixture
model more than in the comparison between the lognormal and gamma distributions, where
the evidence measures in favor of both models are very close. It means that the discrimination
power provided by LGW model is much higher than the power of the pairwise comparisons.
This finding is in agreement with the discussion of Sawyer (1984).

7. Final remarks
In this paper we considered the FBST for discriminating between survival distributions in
Downloaded by [[Link]] at 15:50 05 December 2017

the context of linear mixture model. The mixture approach allows us to compare between all
alternative models at once by testing the hypotheses on the mixture weights space. The fami-
lies of survival distributions considered include the lognormal, gamma and Weibull models.
In this work, the density functions of the mixture components were reparametrized in terms
of the mean μ and the variance σ 2 of the population so that all models under discrimination
share common parameters (Kamary et al. 2014; Pereira and Pereira 2017).
From the simulation results, we observed that the FBST achieves good performance on
identifying the true distribution used to generate the survival times.
The application of the LGW mixture model to the survival data for HD patients allowed
us to identify the lognormal distribution as the most appropriate in modeling observed data.
Therefore, one can construct a regression model to the HD data considering the lognormal
model as the distribution of the response variable.
It would be of interesting to apply the proposed procedure to survival data also considering
another censoring mechanisms.

Acknowledgements
The authors are grateful for the support of CNPq, COPPE/UFRJ and IME/USP.

References
Araujo, M. I., and B. B. Pereira. 2007. A comparison of bayes factors for separated models: Some simu-
lation results. Communications in Statistics–Simulation and Computation 36:297–309.
Araujo, M. I., B. B. Pereira, R. Cleroux, M. Fernandes, and A. Lazraq. 2005. Separate families of models:
Sir David Cox contributions and recent developments. Student 5:251–8.
Alves, M., N. A. Souza e Silva, L. H. A. Salis, B. B. Pereira, P. H. Godoy, E. M. Nascimento, and J. M.
F. Oliveira. 2014. Survival and predictive factors of lethality in hemodyalisis: D/I polymorphism of
the angiotensin I-Converting enzyme and of the angiotensinogen M235T genes. Arq Bras Cardiol
103:209–18.
Cox, D. R. 1961. Tests of separate families of hypotheses. Proceedings 4th Berkeley Symposium in Math-
ematical Statistics and Probability 1:105–23.
Cox, D. R. 1962. Further results on test of separate families of hypotheses. Journal of the Royal Statistical
Society 24:406–24.
Cox, D. R. 1977. The role of significance tests. Scand. J. Statist 4:49–70.
COMMUNICATIONS IN STATISTICS—THEORY AND METHODS 13

Diniz, M., C. A. B. Pereira, A. Polpo, J. M. Stern, and S. Wechsler. 2012. Relationship between Bayesian
and frequentist significance indices. International Journal for Uncertainty Quantification 2:161–72.
Fletcher, R., and C. M. Reeves. 1964. Function minimization by conjugate gradients. Computer Journal
7:148–54.
Haario, H., E. Saksman, and J. Tamminen. 2001. An adaptive Metropolis algorithm. Bernoulli 7:223–42.
Kamary, K., K. Mengersen, C. P. Robert, and J. Rousseau. 2014. Testing hypotheses via a mixture esti-
mation model. arXiv:1412.2044v2.
Kempthorne, O. 1976. Of what use are tests of significance and tests of hypothesis. Communications in
Statistics -Theory and Methods 8:763–77.
Kim, J. S., and F. Proschan. 1991. Piecewise exponential estimator of the survivor function. IEEE Trans-
actions on Reliability 40:134–9.
Klein, J., and M. L. Moeschberger. 2003. Survival analysis: Techniques for censored and truncated data.
2nd ed. New York, USA: Springer.
Lauretto, M., C. A. B. Pereira, J. M. Stern, and S. Zacks. 2003. Comparing parameters of two bivariate
normal distributions using the invariant full Bayesian significance test. Brazilian Journal of Proba-
bility and Statistics 17:147–68.
Lauretto, M. S., and J. M. Stern. 2005. FBST for mixture model selection. AIP Conference Proceedings
Downloaded by [[Link]] at 15:50 05 December 2017

803:121–8.
Lauretto, M. S., S. R. Faria Jr, C. A. B. Pereira, B. B. Pereira, and J. M. Stern. 2007. The problem of
separate hypotheses via mixture models. AIP Conference Proceedings 954:268–75.
Lawless, J. F. 2002. Statistical models and methods for lifetime data. 2nd ed. New York, USA: John Wiley
& Sons.
Lee, E. T., and J. W. Wang. 2003. Statistical methods for survival data analysis. 3rd ed. New Jersey, USA:
Wiley.
Pereira, B. B. 1981. Choice of a survival model for patients with a brain tumour. Metrika 28:53–61.
Pereira, B. B., and C. A. B. Pereira. 2017. Model choice in nonnested families. 1st edn. Berlin: Springer.
Pereira, C. A. B., and J. Stern. 1999. Evidence and credibility: Full Bayesian significance test for precise
hypotheses. Entropy 1:69–80.
Pereira, C. A. B., and J. Stern. 2008. Special characterization of standard discrete models. Revstat - The
Statistical Journal 6:199–230.
Pereira, C. A. B., J. Stern, and S. Wechsler. 2008. Can a significance test be genuinely Bayesian. Bayesian
Analysis 3:79–100.
Sawyer, K. R. 1984. Multiple hypotheses testing. Journal of Teh Royal Statistical 46:419–24.
Statisticat, LCC 2016. LaplacesDemon: A Complete Environment for Bayesian Inference within R.
R Package version 17.07.2016. [Link]
[Link].
Stern, J. 2011. Symmetry, invariance and ontology in physics and statistics. Symmetry 3:611–35.
Stern, J., and C. A. B. Pereira. 2014. Bayesian epistemic values: Focus on surprise, measure probability.
Logic Journal of The IGPL 22:236–54.
Wan, F. 2017. Simulating survival data with predefined censoring rates for proportional hazards models.
Statistics in Medicine 36:838–54.

Common questions

Reparametrization facilitates comparison by ensuring that the models share common parameters, specifically the mean (μ) and variance (σ²), which are consistent across all candidate models. This allows for an unbiased comparison by making sure that differences in model fits are due to the model structures themselves rather than differences in parameter scales or units. For a dataset with censored observations, this uniformity in parameterization helps in evaluating and distinguishing among the survival models, like the lognormal, gamma, and Weibull distributions .

The FBST is a Bayesian form of significance testing that evaluates precise hypotheses about model parameters. It is used to measure evidence in favor of specific survival models, such as lognormal, gamma, and Weibull distributions. In the context of survival data modeled with linear mixtures, the FBST is applied to discriminate between different parametric distributions by simultaneously considering all candidate models rather than performing pairwise comparison, thus providing an encompassing test of model suitability .

Using FBST over traditional pairwise comparisons ensures a comprehensive evaluation across all models jointly rather than isolated comparisons. This approach considers entire model frameworks and tests hypotheses regarding model probabilities simultaneously, increasing discrimination power between models. It avoids issues of multiple comparison biases and aligns the context with Bayesian inference practices, providing a consistent, unified basis for concluding model suitability within a multivariate parameter space .

Evidences include e-values and posterior distributions from the FBST process, which assess the fit of lognormal, gamma, and Weibull models. The lognormal distribution was preferred due to higher support from data (higher e-values), suggesting a better fit to hemodialysis patient survival times compared to gamma and Weibull distributions. The FBST indicators pointed towards greater plausibility of the lognormal model within the LGW mixture, which effectively distinguished model performances for this medical data .

A Weibull distribution might be inappropriate if evidence from the data, such as the FBST approach, suggests a poor fit compared to other models. In a study comparing distributions using the LGW mixture model for hemodialysis patient data, the Weibull model provided lower e-values indicating less evidence for its fit compared to the lognormal model. The results suggest that the lognormal model was more appropriate for the data, highlighting cases where the Weibull distribution does not align well with observed patterns .

Reparametrizing the mixture model components in terms of the mean and variance helps maintain consistency across models being compared, providing a common ground for evaluation. This approach ensures that the datasets share parameters, thereby enabling an equitable comparison of model fits to censored survival data. It simplifies the estimation and comparison process by allowing the sharing of common parameters in the presence of right-censored data .

The choice of censoring mechanism directly affects how survival probabilities and hazard rates are estimated, which in turn influences the interpretation of model parameters and the conclusions drawn from the analysis. Right censoring, for example, assumes that survival times extend beyond observed periods, requiring techniques like Kaplan-Meier or parametric models tailored to handle such partial information. It seeks to improve estimates under the assumption that non-informative censoring is maintained, influencing the reliability of the inferred survival curves and hazard functions .

For right-censored survival data, the likelihood function considers both uncensored (exact event times) and censored observations. It is constructed by multiplying the conditional likelihoods of observing each event time or censorship, given the model parameters. For each population, the likelihood combines the probability density and survival function values for each data point, reflecting noninformative censoring assumptions. The likelihood is expressed as L(y, θ) ∝ Σ(pk fk(yj|μ, σ))^δj(Sk(yj|μ, σ)^(1-δj), where δj indicates if an event was observed or censored .

The survival function, denoted as S(t), is defined as the probability that an individual survives beyond time t: S(t) = P(T > t) = 1 − F(t), where F(t) is the distribution function of T. It is a nonincreasing continuous function with boundary conditions S(0) = 1 and S(∞) = 0. The hazard function, denoted by h(t), is the probability of failure during a very small interval, conditioned on the individual having survived to the start of the interval. It is mathematically expressed as h(t) = f(t) / S(t), where f(t) is the probability density function. The hazard function provides the instantaneous failure rate at time t .

Right censoring is crucial in survival analysis because it accounts for cases where the event of interest, such as death or failure, has not been observed within the study period. This mechanism is particularly common in medical research where patients may not experience the studied event before the study concludes or they are lost to follow-up. Understanding and properly handling right censoring is essential to ensure the reliability and accuracy of survival estimates .

Weibull Mixture Models for Survival Data
No ratings yet
Weibull Mixture Models for Survival Data
13 pages
Bayesian Tests for Randomized Block Design
No ratings yet
Bayesian Tests for Randomized Block Design
7 pages
Bayesian Estimation of EWD Shape Parameter
No ratings yet
Bayesian Estimation of EWD Shape Parameter
11 pages
RTA Weibull Common Shape
No ratings yet
RTA Weibull Common Shape
14 pages
Introduction To Bayesian Statistics 1st Edition William M. Bolstad Ebook Downloadable Anytime
100% (4)
Introduction To Bayesian Statistics 1st Edition William M. Bolstad Ebook Downloadable Anytime
31 pages
Statistical Methods for AMS Circuit Design
No ratings yet
Statistical Methods for AMS Circuit Design
5 pages
EN - Bayesian Methods in Survival Analysis Enhancing Insights in Clinical Research
No ratings yet
EN - Bayesian Methods in Survival Analysis Enhancing Insights in Clinical Research
11 pages
DAE7
No ratings yet
DAE7
41 pages
Understanding the F Distribution in ANOVA
No ratings yet
Understanding the F Distribution in ANOVA
20 pages
Bayesian Analysis of Step-Stress Testing
No ratings yet
Bayesian Analysis of Step-Stress Testing
30 pages
Cochran 1947 Some Consequences When The Assumptions For The Analysis of Variance Are Not Satisfied
No ratings yet
Cochran 1947 Some Consequences When The Assumptions For The Analysis of Variance Are Not Satisfied
18 pages
Bayesian Statistics: Key Concepts and Methods
No ratings yet
Bayesian Statistics: Key Concepts and Methods
70 pages
One-Way ANOVA Explained: Key Concepts
No ratings yet
One-Way ANOVA Explained: Key Concepts
8 pages
Properties of Bayes Factors Explained
No ratings yet
Properties of Bayes Factors Explained
15 pages
Bayesian Unit Roots Significance Test
No ratings yet
Bayesian Unit Roots Significance Test
15 pages
Statistical Error Analysis in Measurements
No ratings yet
Statistical Error Analysis in Measurements
37 pages
Overview of ANOVA Methodology
No ratings yet
Overview of ANOVA Methodology
8 pages
ANOVA: Single Factor Experiment Design
No ratings yet
ANOVA: Single Factor Experiment Design
7 pages
Discriminating Weibull, Log-Normal, Log-Logistic
No ratings yet
Discriminating Weibull, Log-Normal, Log-Logistic
28 pages
Non-Linear Model Identification in Finance
No ratings yet
Non-Linear Model Identification in Finance
6 pages
Bayesian Analysis of Lottery Randomness
No ratings yet
Bayesian Analysis of Lottery Randomness
34 pages
Bayesian Methods in Medical Survival Analysis
No ratings yet
Bayesian Methods in Medical Survival Analysis
72 pages
Mudholkar1996 PDF
No ratings yet
Mudholkar1996 PDF
10 pages
Single-Factor ANOVA with Multiple Levels
No ratings yet
Single-Factor ANOVA with Multiple Levels
58 pages
Weighted Optimal Sequential Testing Methods
No ratings yet
Weighted Optimal Sequential Testing Methods
41 pages
EN - Bayesian Adaptive Designs For Clinical Trials
No ratings yet
EN - Bayesian Adaptive Designs For Clinical Trials
25 pages
B-S Distribution (Kundu)
No ratings yet
B-S Distribution (Kundu)
108 pages
Bayesian Priors for Exponential-Logarithmic Distribution
No ratings yet
Bayesian Priors for Exponential-Logarithmic Distribution
22 pages
Bayesian Estimation of 3-Component Mixture
No ratings yet
Bayesian Estimation of 3-Component Mixture
29 pages
Bayes Estimation for Survival Data Analysis
No ratings yet
Bayes Estimation for Survival Data Analysis
14 pages
Understanding ANOVA and F-Distribution
No ratings yet
Understanding ANOVA and F-Distribution
14 pages
Comparison of Estimates Using Record Statistics From Weibull Model: Bayesian and Non-Bayesian Approaches
No ratings yet
Comparison of Estimates Using Record Statistics From Weibull Model: Bayesian and Non-Bayesian Approaches
13 pages
Experimental Design and Anova-Vi
No ratings yet
Experimental Design and Anova-Vi
40 pages
Comparing Data Splitting Methods in ML
No ratings yet
Comparing Data Splitting Methods in ML
14 pages
Zzzz-Essential Bayes
No ratings yet
Zzzz-Essential Bayes
158 pages
Complete Randomized Design Overview
No ratings yet
Complete Randomized Design Overview
16 pages
Beta-Binomial ANOVA for Proportions
No ratings yet
Beta-Binomial ANOVA for Proportions
5 pages
Exact Statistical Inference For Categorical Data 1st Edition Shan Online Reading
100% (1)
Exact Statistical Inference For Categorical Data 1st Edition Shan Online Reading
192 pages
Bayesian Analysis of Weibull Survival
No ratings yet
Bayesian Analysis of Weibull Survival
19 pages
Nonparametric Statistical Tests Overview
No ratings yet
Nonparametric Statistical Tests Overview
12 pages
Simplified Unit 4 and 5 Study Material
No ratings yet
Simplified Unit 4 and 5 Study Material
34 pages
Bayesian Analysis of Lomax Mixture Model
No ratings yet
Bayesian Analysis of Lomax Mixture Model
18 pages
Data Quality and Methodology in Risk Assessment
No ratings yet
Data Quality and Methodology in Risk Assessment
7 pages
Chi-Square Test for Goodness of Fit
No ratings yet
Chi-Square Test for Goodness of Fit
15 pages
Manuscript of RTA
No ratings yet
Manuscript of RTA
15 pages
Block Design in Inferential Statistics
No ratings yet
Block Design in Inferential Statistics
43 pages
Observational Causality Testing Methods
No ratings yet
Observational Causality Testing Methods
19 pages
Understanding Hypothesis Testing Steps
No ratings yet
Understanding Hypothesis Testing Steps
5 pages
Rta 2 2025-43
No ratings yet
Rta 2 2025-43
14 pages
Bayesian Model Selection for Dominant-Lethal Assays
No ratings yet
Bayesian Model Selection for Dominant-Lethal Assays
12 pages
Fuzzy ANOVA Results Assessment
No ratings yet
Fuzzy ANOVA Results Assessment
9 pages
Limitations of The Anajkglysis of Variance
No ratings yet
Limitations of The Anajkglysis of Variance
5 pages
Random Behavior Patterns in Simulation
No ratings yet
Random Behavior Patterns in Simulation
6 pages
ANOVA in Pig Fattening Formulas
No ratings yet
ANOVA in Pig Fattening Formulas
10 pages
Competing-Risk Theory Methodology Review
No ratings yet
Competing-Risk Theory Methodology Review
4 pages
P-Value Recommendations in Hypothesis Testing
No ratings yet
P-Value Recommendations in Hypothesis Testing
9 pages
ANOVA and Population Spread Analysis
No ratings yet
ANOVA and Population Spread Analysis
46 pages
Testing Equality of k Population Means
No ratings yet
Testing Equality of k Population Means
36 pages
Optimal Significance Levels in Testing
No ratings yet
Optimal Significance Levels in Testing
27 pages
ISC XI B Timetable & Google Meet IDs
No ratings yet
ISC XI B Timetable & Google Meet IDs
2 pages
Maneuvering Derivatives of SUBOFF Model
No ratings yet
Maneuvering Derivatives of SUBOFF Model
11 pages
Defence Mathematics DPP 2026 Solutions
No ratings yet
Defence Mathematics DPP 2026 Solutions
3 pages
Mini Steam Turbine Lab Report Analysis
No ratings yet
Mini Steam Turbine Lab Report Analysis
9 pages
Methods of Integration Explained
No ratings yet
Methods of Integration Explained
15 pages
IISER Kolkata Joint Ph.D. Admission 2023-24
No ratings yet
IISER Kolkata Joint Ph.D. Admission 2023-24
3 pages
Algebra and Data Analysis Concepts
No ratings yet
Algebra and Data Analysis Concepts
5 pages
Guide to Alien Species and Traits
100% (1)
Guide to Alien Species and Traits
154 pages
Biographies of Notable Statisticians
No ratings yet
Biographies of Notable Statisticians
12 pages
MCQs on Laws of Motion
100% (1)
MCQs on Laws of Motion
20 pages
Optimization on Flag Varieties
No ratings yet
Optimization on Flag Varieties
21 pages
Awwa C561 Fabricated Stainless Steel Slide Gates
100% (1)
Awwa C561 Fabricated Stainless Steel Slide Gates
36 pages
FX-9860G Slim: Solving Equations Guide
No ratings yet
FX-9860G Slim: Solving Equations Guide
36 pages
SAOCOM-1 DInSAR Time Series Analysis
No ratings yet
SAOCOM-1 DInSAR Time Series Analysis
24 pages
Em Assignment 1
No ratings yet
Em Assignment 1
3 pages
Strength and Weight Optimization of Passenger Aircraft Fuselage Skin
No ratings yet
Strength and Weight Optimization of Passenger Aircraft Fuselage Skin
7 pages
Bue Gee Form: Mastering Wing Chun Techniques
No ratings yet
Bue Gee Form: Mastering Wing Chun Techniques
8 pages
Magnetic Forces on Current-Carrying Wires
No ratings yet
Magnetic Forces on Current-Carrying Wires
23 pages
Sketching Electric Field of Charge
No ratings yet
Sketching Electric Field of Charge
43 pages
NEET 2025 Practice Test Overview
No ratings yet
NEET 2025 Practice Test Overview
27 pages
History and Applications of Nanobiotechnology
No ratings yet
History and Applications of Nanobiotechnology
25 pages
LTHA and NLTHA Design Procedures
No ratings yet
LTHA and NLTHA Design Procedures
4 pages
7th Grade Math Test Answers
No ratings yet
7th Grade Math Test Answers
3 pages
Understanding Avogadro's Number
No ratings yet
Understanding Avogadro's Number
8 pages
Hardfacing of AISI H13 Tool Steel With Stellite 21 Alloy Using Cold Metal Transfer Welding Process
No ratings yet
Hardfacing of AISI H13 Tool Steel With Stellite 21 Alloy Using Cold Metal Transfer Welding Process
9 pages
Fourier Series and Transforms Quiz 1
No ratings yet
Fourier Series and Transforms Quiz 1
4 pages
Year 8 Physics Revision Guide
No ratings yet
Year 8 Physics Revision Guide
7 pages
Wind Load Calculation for Elevated Pipe
No ratings yet
Wind Load Calculation for Elevated Pipe
1 page
Rolling Process in Metal Forming
No ratings yet
Rolling Process in Metal Forming
89 pages
Topological Tools for Dynamicists
No ratings yet
Topological Tools for Dynamicists
21 pages

Bayesian Test for Survival Distributions

Uploaded by

Bayesian Test for Survival Distributions

Uploaded by

Communications in Statistics - Theory and Methods

ISSN: 0361-0926 (Print) 1532-415X (Online) Journal homepage: [Link]

Bayesian significance test for discriminating

Cachimo Combo Assane, Basilio de Bragança Pereira & Carlos Alberto de

To link to this article: [Link]

Published online: 05 Dec 2017.

Submit your article to this journal

View related articles

View Crossmark data

Full Terms & Conditions of access and use can be found at

Download by: [[Link]] Date: 05 December 2017, At: 15:50

Bayesian significance test for discriminating between survival

ABSTRACT ARTICLE HISTORY

CONTACT Basilio de Bragança Pereira [Link]@[Link]/[Link] Department of Mathematics and

2.1. Basic concepts and notation

S(t ) = P(T > t ) = 1 − F (t ), for t > 0, (2.1)

where F (t ) is the distribution function of T . Note that S(t ) is a nonincreasing continuous

function of time t with S(0) = 1 and S(∞) = lim∞ S(t ) = 0.

2.2. Parametric survival distributions

ii) If T has a Gamma distribution with parameters γ = (γ1 , γ2 ), denoted by

iii) If T has a Weibull distribution with parameters β = (β1 , β2 ), denoted by

3. Fully Bayesian significance test (FBST)

q∗ = sup q(θ|y) and T (y) = {θ : q(θ|y) > q∗ }. (3.1)

The Bayesian evidence value against H is the posterior probability of T (y),

4. Mixture of survival models

Let us consider a dataset y = {y1 , . . . , yn } and m alternative parametric survival distributions

where p = (p1 , . . . , pm ) is the vector of the mixture weights.

where, Sk is the survival function associated with the mixture component k.

Assuming independence, the joint prior density function of θ = (μ, σ 2 , p) is given by

5.1. Simulation scheme of sample points

1. Assign values to parameters μ e σ 2 ;

and C holds. In order to simulate a sample with approximately pc % of right-censored obser-

=1− g(c|λ)FL (c|μ, σ )dc, (5.1)

5.2. Criteria for evaluating the performance of the FBST

5.3. Simulation results

 Lognormal  . . . . . 

6. Application: Choice of a survival model for patients with end-stage kidney

Weibull mixture models.

f (y|p, μ, σ ) = p1 fL (y|μ, σ ) + p2 fG (y|μ, σ ) + p3 fW (y|μ, σ ), (6.1)

Table . Measures of evidence provided by HD data.

Table . Summary of the posterior distribution of the LGW parameters.

p1 -lognormal . . . . .

Table . Hypothesis testing on the mixture weights of LGW model.

Table . Summary of the posterior distribution of lognormal parameters.

μ . . . . .

Common questions

How does reparametrization facilitate comparison between different survival models in the presented research?

How does reparametrization facilitate comparison between different survival models in the presented research?

How does the Fully Bayesian Significance Test (FBST) approach model selection for survival data?

How does the Fully Bayesian Significance Test (FBST) approach model selection for survival data?

What is the significance of using Fully Bayesian Significance Tests (FBST) over traditional pairwise comparisons in model selection?

What is the significance of using Fully Bayesian Significance Tests (FBST) over traditional pairwise comparisons in model selection?

What evidences are considered when choosing the best fitting survival model for hemodialysis patient data within the LGW framework?

What evidences are considered when choosing the best fitting survival model for hemodialysis patient data within the LGW framework?

What are the conditions under which the Weibull distribution might be an inappropriate model for survival data?

What are the conditions under which the Weibull distribution might be an inappropriate model for survival data?

In the context of censored survival data, what is the specific challenge addressed by reparametrizing the mixture model components in terms of mean and variance?

In the context of censored survival data, what is the specific challenge addressed by reparametrizing the mixture model components in terms of mean and variance?

What impact does the choice of censoring mechanism have on the analysis of survival data?

What impact does the choice of censoring mechanism have on the analysis of survival data?

Describe how the likelihood function for a mixture model is constructed when dealing with right-censored survival data.

Describe how the likelihood function for a mixture model is constructed when dealing with right-censored survival data.

What is the survival function, and how is it related to the hazard function in survival analysis?

What is the survival function, and how is it related to the hazard function in survival analysis?

Why is right censoring a crucial consideration in survival analysis, particularly in medical research?

Why is right censoring a crucial consideration in survival analysis, particularly in medical research?

You might also like