Good practices for analysing MCMC results using arviz
Assessing the convergence of a set of MCMC chains is not an easy task in general. jaxspec provides a convenient way to analyse the results of a fit using the arviz library. This library provides powerful tools to explore Bayesian inference results, such as trace plots, pair plots, and summary statistics. Let's run some MCMCs!
%%capture
# Hide the output of this cell
import numpyro
numpyro.enable_x64()
numpyro.set_platform("cpu")
numpyro.set_host_device_count(4)
import numpyro.distributions as dist
from jaxspec.fit import MCMCFitter
from jaxspec.data.util import load_example_obsconf
from jaxspec.model.additive import Blackbodyrad, Powerlaw
from jaxspec.model.multiplicative import Tbabs
spectral_model = Tbabs()*(Powerlaw() + Blackbodyrad())
obsconf = load_example_obsconf("NGC7793_ULX4_PN")
prior = {
    "powerlaw_1_alpha": dist.Uniform(1, 3),
    "powerlaw_1_norm": dist.LogUniform(1e-5, 1e-3),
    "blackbodyrad_1_kT": dist.Uniform(0, 2),
    "blackbodyrad_1_norm": dist.LogUniform(1e-2, 1),
    "tbabs_1_nh": dist.Uniform(0, 1)
}
fitter = MCMCFitter(spectral_model, prior, obsconf)
result_nuts = fitter.fit(
    num_chains=4,
    num_warmup=1000,
    num_samples=1000,
    sampler="nuts",
    mcmc_kwargs={"progress_bar": True}
)
From the result object, you can access the inference_data attribute, which is an arviz.InferenceData object. This lets you use any arviz function to analyse the results of the fit.
inference_data = result_nuts.inference_data
inference_data
arviz.InferenceData with the following groups:
    > posterior             (chain: 4, draw: 1000): the five spectral parameters
    > posterior_predictive  (chain: 4, draw: 1000, obs_data_dim_0: 102)
    > log_likelihood        (chain: 4, draw: 1000, obs_data_dim_0: 102)
    > prior                 (chain: 1, draw: 4000)
    > observed_data         (obs_data_dim_0: 102)
This object carries all the information we need about our fit: the posterior samples, the associated likelihood, the posterior predictive draws, and so on. These groups can be a bit tricky to manipulate at first, as they are xarray Datasets, but they provide a lot of flexibility to analyse the results of the fit.
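For instance, here is a minimal sketch of how you might manipulate the posterior group directly with the xarray API (using one of the parameter names from the model defined above):
posterior = inference_data.posterior

# Each parameter is an xarray DataArray indexed by (chain, draw)
alpha = posterior["powerlaw_1_alpha"]

# Mean and standard deviation over all chains and draws
print(alpha.mean(dim=("chain", "draw")).values, alpha.std(dim=("chain", "draw")).values)

# Stack chains and draws into a single dimension to get a flat array of samples
flat_alpha = alpha.stack(sample=("chain", "draw")).values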
Trace plot
This visualization is useful to see the evolution of the parameters during the sampling process and can be used to diagnose convergence issues. The ideal situation is when the chains are well mixed and randomly scattered around the target distribution. If instead chains are stuck in some region of the parameter space, or show trends, this may indicate that the sampler did not explore the full parameter space.
import arviz as az
import matplotlib.pyplot as plt
with az.style.context("arviz-darkgrid", after_reset=True):
    az.plot_trace(inference_data, compact=False)
    plt.show()
However, this kind of plot can become messy pretty quickly if you work with numerous chains or walkers, which is what you must do when using ensemble samplers such as ESS or AIES. A general trick proposed by Vehtari et al. (2019) is to plot the rank of each sample within the global run instead of its value. These ranks should be evenly spread across the steps if the chains are well mixed. This can be done using the kind="rank_vlines" argument of the plot_trace function.
with az.style.context("arviz-darkgrid", after_reset=True):
    az.plot_trace(result_nuts.inference_data, compact=False, kind="rank_vlines")
    plt.show()
A more quantitative way to assess the convergence of the chains is to use the summary function. This function provides a summary of the posterior distribution of the parameters, including the mean, the standard deviation, and the highest density interval (94% by default).
az.summary(result_nuts.inference_data.posterior)
|                     | mean  | sd    | hdi_3% | hdi_97% | mcse_mean | mcse_sd | ess_bulk | ess_tail | r_hat |
|---------------------|-------|-------|--------|---------|-----------|---------|----------|----------|-------|
| blackbodyrad_1_kT   | 0.743 | 0.035 | 0.679  | 0.808   | 0.001     | 0.001   | 631.0    | 1056.0   | 1.01  |
| blackbodyrad_1_norm | 0.200 | 0.037 | 0.134  | 0.269   | 0.002     | 0.001   | 596.0    | 1040.0   | 1.01  |
| powerlaw_1_alpha    | 2.029 | 0.100 | 1.836  | 2.205   | 0.004     | 0.003   | 630.0    | 1050.0   | 1.01  |
| powerlaw_1_norm     | 0.000 | 0.000 | 0.000  | 0.000   | 0.000     | 0.000   | 604.0    | 967.0    | 1.01  |
| tbabs_1_nh          | 0.095 | 0.028 | 0.043  | 0.146   | 0.001     | 0.001   | 592.0    | 796.0    | 1.01  |
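Note that the width of the reported highest density interval can be changed through the hdi_prob argument of summary, for example to get a 95% interval:
az.summary(result_nuts.inference_data.posterior, hdi_prob=0.95)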
The r_hat column provides the rank-normalised split Gelman-Rubin statistic. The closer this value is to 1, the better; a value larger than 1.01 points to convergence issues. This statistic can be computed directly using the r_hat function, see Vehtari et al. (2019). The ess columns denote the Effective Sample Size of the chains, which is a measure of the quality of the samples. The larger the better; in general, we want this value to be larger than 400 for a reliable estimate of the posterior distribution. Here, the NUTS sampler was run with 1000 warmup and 1000 sampling steps per chain.
rhat = az.rhat(result_nuts.inference_data.posterior)
rhat
<xarray.Dataset>
Dimensions:  ()
Data variables:
    blackbodyrad_1_kT    float64 1.009
    blackbodyrad_1_norm  float64 1.007
    powerlaw_1_alpha     float64 1.008
    powerlaw_1_norm      float64 1.008
    tbabs_1_nh           float64 1.008
ess = az.ess(result_nuts.inference_data.posterior)
ess
<xarray.Dataset>
Dimensions:  ()
Data variables:
    blackbodyrad_1_kT    float64 631.1
    blackbodyrad_1_norm  float64 595.9
    powerlaw_1_alpha     float64 630.1
    powerlaw_1_norm      float64 603.5
    tbabs_1_nh           float64 591.8
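As a quick sanity check, these diagnostics can be compared to the thresholds mentioned above programmatically. This is a minimal sketch, assuming the rhat and ess variables computed above:
# Flag parameters whose diagnostics fall outside the recommended thresholds
bad_rhat = {name: float(value) for name, value in rhat.items() if value > 1.01}
bad_ess = {name: float(value) for name, value in ess.items() if value < 400}

if bad_rhat or bad_ess:
    print("Potential convergence issues:", bad_rhat, bad_ess)
else:
    print("All r_hat <= 1.01 and ESS >= 400")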
Pair plot
This visualization is useful to see the correlation between the parameters. The ideal situation is when the parameters show little correlation and the posterior distribution looks close to a multivariate Gaussian distribution.
with az.style.context("arviz-darkgrid", after_reset=True):
    az.plot_pair(result_nuts.inference_data)
    plt.show()
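plot_pair accepts several options; for instance, kind="kde" and marginals=True overlay density contours and the one-dimensional marginals (a sketch using standard arviz options, not anything jaxspec-specific):
with az.style.context("arviz-darkgrid", after_reset=True):
    # Same pair plot, but with KDE contours and marginal distributions on the diagonal
    az.plot_pair(result_nuts.inference_data, kind="kde", marginals=True)
    plt.show()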
Take a look at arviz's documentation to see what else you can do with this library.