Package 'pbox' reference manual

Title:	Exploring Multivariate Spaces with Probability Boxes
Description:	Advanced statistical library offering a method to encapsulate and query the probability space of a dataset effortlessly using Probability Boxes (p-boxes). Its distinctive feature lies in the ease with which users can navigate and analyze marginal, joint, and conditional probabilities while taking into account the underlying correlation structure inherent in the data using copula theory and models. A comprehensive explanation is available in the paper "pbox: Exploring Multivariate Spaces with Probability Boxes" to be published in the Journal of Statistical Software.
Authors:	Ahmed T. Hammad [aut, cre, cph]
Maintainer:	Ahmed T. Hammad <[email protected]>
License:	GPL-3
Version:	0.1.8
Built:	2025-02-04 05:11:07 UTC
Source:	https://github.com/athammad/pbox

Extract Coefficients

Description

This is an internal method to extract coefficients from the list of the fitted distributions for each variable resulting from fit_dist_pbox. This method handles potential issues with parameter extraction from the complex objects created by GAM-like models.

Usage

coefAll2(obj, deviance = FALSE)
coefAll2(obj, deviance = FALSE)

Arguments

`obj`	An object typically resulting from `fit_dist_pbox`.
`deviance`	Logical value indicating whether to compute deviance for the fitted model.

Value

A list of coefficients, possibly including 'mu', 'sigma', 'nu', and 'tau', depending on the model specification in obj. If deviance is TRUE, it also includes the deviance of the model.

Examples

data(SEAex)
pbx <- set_pbox(SEAex)
coefAll2(pbx@fit[[1]]$allDistrs$Thailand)
data(SEAex)
pbx <- set_pbox(SEAex)
coefAll2(pbx@fit[[1]]$allDistrs$Thailand)

Method for extracting coefficients from GAM-like models

Description

Method for extracting coefficients from GAM-like models

Usage

## S4 method for signature 'ANY'
coefAll2(obj, deviance = FALSE)
## S4 method for signature 'ANY'
coefAll2(obj, deviance = FALSE)

Arguments

`obj`	A model object, typically from a GAM-like fitting procedure.
`deviance`	A Boolean flag that when TRUE calculates the deviance of the model.

Value

A list containing model coefficients and optionally deviance.

Define Copula Families and Parameters

Description

Internal list of defined copula families and their corresponding parameters.

Usage

.copula_families
.copula_families

Format

An object of class list of length 3.

Compute Confidence Interval using Delta Method

Description

Internal method to compute the probability using delta method which approximates the variance of a function of random variables (in this case, the ratio) based on the variance of the original estimates.

'deltaCI' general method. Internal method to compute the probability using delta method which approximates the variance of a function of random variables (in this case, the ratio) based on the variance of the original estimates.

Usage

deltaCI(cond)

## S4 method for signature 'ANY'
deltaCI(cond)
deltaCI(cond)

## S4 method for signature 'ANY'
deltaCI(cond)

Arguments

cond

list with the result of the perturbed probability for 'mj' and 'co' and correspondent CI.

Value

The Confidence Interval for the conditional probability.

Numeric vector representing the computed probability and confidence intervals using the perturbed copula and delta method.

Examples

  cond <- list(
  c(P = 0.3597117, `2.5%` = 0.3074215, `97.5%` = 0.4075315),
  c(P = 0.5682882, `2.5%` = 0.4560553, `97.5%` = 0.6823438))
  deltaCI(cond)
cond <- list(
  c(P = 0.3597117, `2.5%` = 0.3074215, `97.5%` = 0.4075315),
  c(P = 0.5682882, `2.5%` = 0.4560553, `97.5%` = 0.6823438))
  deltaCI(cond)

Build a Multivariate Distribution from Copula

Description

Combines the results from 'fit_copula_pbox' and 'fit_dist_pbox' to build a multivariate distribution from copula, selecting the best copula based on AIC and utilizing the best-fitted marginal distributions. Note that

Method to construct a 'mvdc' object by combining best-fit copula and marginal distribution results. The method uses the best copula model as determined by the lowest AIC and combines it with marginal distributions fitted to each variable.

Usage

final_pbox(results_df, allDitrs, data, verbose = TRUE)

## S4 method for signature 'ANY'
final_pbox(results_df, allDitrs, data, verbose = TRUE)
final_pbox(results_df, allDitrs, data, verbose = TRUE)

## S4 method for signature 'ANY'
final_pbox(results_df, allDitrs, data, verbose = TRUE)

Arguments

`results_df`	A data.table with AIC and parameter estimates of evaluated copulas and families from 'fit_copula_pbox'.
`allDitrs`	A list containing fitted distributions for each variable from 'fit_dist_pbox'.
`data`	A data frame or data table; this will be coerced to a 'data.table' internally.
`verbose`	control verbosity of the output. Default to TRUE.

Value

An object of class 'mvdc' representing the combined multivariate distribution.

Examples

  data("SEAex")
  copulaFits <- fit_copula_pbox(data = SEAex, .copula_families)
  distFits <- fit_dist_pbox(data = SEAex)
  final_mvd <- final_pbox(copulaFits, distFits$allDitrs, SEAex)
  print(final_mvd)
data("SEAex")
  copulaFits <- fit_copula_pbox(data = SEAex, .copula_families)
  distFits <- fit_dist_pbox(data = SEAex)
  final_mvd <- final_pbox(copulaFits, distFits$allDitrs, SEAex)
  print(final_mvd)

Copula Fit

Description

Internal method to automatically find the best Copula given a data.frame. Wrapper around the function fitCopula.

Automatically fits a copula model using the provided pseudo-observations. This method supports various families of copulas and calculates the corresponding AIC and parameter estimates.

Usage

.fit_copula(copula, family, dim, u)

## S4 method for signature 'ANY'
.fit_copula(copula, family, dim, u)
.fit_copula(copula, family, dim, u)

## S4 method for signature 'ANY'
.fit_copula(copula, family, dim, u)

Arguments

`copula`	A `data.frame` or `data.table` (the data will be coerced to a `data.table` internally).
`family`	List of copula types and their corresponding families. Currently supported families are "clayton", "frank", "amh", "gumbel", and "joe" for Archimedean Copula; "galambos", "gumbel", and "huslerReiss" for Extreme-Value copula; "normal" and "t" for Elliptical copula.
`dim`	number of columns of data.
`u`	matrix of (pseudo-)observations. Consider applying the function `pobs()` first in order to obtain such data.

Value

A data.table with the corresponding AIC and the parameter estimates of the evaluated copulas and families.

Fit Copula Models to Data

Description

Automatically fits various copula models specified in a list to the provided data. This function is a wrapper around the underlying copula fitting function, facilitating the exploration of multiple copula families to identify the best fitting model based on criteria such as AIC.

'fit_copula_pbox' method to fit a variety of copula models to data. This method performs a grid search over specified copula families to find the best fit. It employs the pseudoinverse of the empirical distribution functions to standardize the data.

Usage

fit_copula_pbox(data, .copula_families)

## S4 method for signature 'ANY'
fit_copula_pbox(data, .copula_families)
fit_copula_pbox(data, .copula_families)

## S4 method for signature 'ANY'
fit_copula_pbox(data, .copula_families)

Arguments

`data`	A data frame or data table; the data will be coerced to a 'data.table' internally.
`.copula_families`	A list specifying copula families to evaluate. The list should be structured with names corresponding to the type of copula (e.g., 'archmCopula', 'evCopula', 'ellipCopula') and elements being vectors of strings naming the copula families (e.g., "clayton", "frank").

Value

A data table summarizing the AIC and parameter estimates for each copula family evaluated.

Examples

  data("SEAex")
  .copula_families <- list(
    archmCopula = c("clayton", "frank", "gumbel", "joe"),
    evCopula = c("galambos", "gumbel", "huslerReiss"),
    ellipCopula = c("normal")
  )
  distFits <- fit_copula_pbox(data = SEAex, .copula_families)
  print(distFits)
data("SEAex")
  .copula_families <- list(
    archmCopula = c("clayton", "frank", "gumbel", "joe"),
    evCopula = c("galambos", "gumbel", "huslerReiss"),
    ellipCopula = c("normal")
  )
  distFits <- fit_copula_pbox(data = SEAex, .copula_families)
  print(distFits)

Fit Marginal Distributions

Description

Fits the best marginal distribution for each variable in a data frame using the 'gamlss::fitDist' function from the GAMLSS package. This function is designed to evaluate multiple distributions, returning a summary of fit for each, along with the Akaike Information Criterion (AIC) for comparison.

Implements the generic function 'fit_dist_pbox' for data frames and data tables. This method utilizes statistical techniques to fit distributions to each column in the 'data' argument, evaluating fit using criteria like AIC to determine the best fitting model.

Usage

fit_dist_pbox(data, ...)

## S4 method for signature 'ANY'
fit_dist_pbox(data, ...)
fit_dist_pbox(data, ...)

## S4 method for signature 'ANY'
fit_dist_pbox(data, ...)

Arguments

`data`	A data frame or data table.
`...`	Additional parameters to pass to the fitting function.

Value

A list containing two elements:

`allDitrs`	List of the fitted distributions for each variable.
`distTable`	A data table displaying the AIC for each tested distribution.

Examples

  data(SEAex)
  distFits <- fit_dist_pbox(data=SEAex)
  print(distFits$allDitrs)
  print(distFits$distTable)
data(SEAex)
  distFits <- fit_dist_pbox(data=SEAex)
  print(distFits$allDitrs)
  print(distFits$distTable)

Summary Statistics

Description

Computes summary statistics for a numeric vector. This function is an S4 method for the generic 'fun_stats', specifically tailored for numeric vectors. It calculates the minimum, maximum, mean, and median values.

Usage

fun_stats(x)
fun_stats(x)

Arguments

`x`	A numeric vector for which summary statistics are to be computed.

Value

A list containing the minimum, maximum, mean, and median of the input vector.

Examples

x <- c(1, 2, 3, 4, 5)
fun_stats(x)

x <- c(1, 2, 3, 4, 5)
fun_stats(x)

Summary statistics method for numeric vectors

Description

This method is a specific implementation of the 'fun_stats' function for numeric vectors. It efficiently calculates and returns summary statistics including the minimum, maximum, mean, and median, excluding NA values.

Usage

## S4 method for signature 'numeric'
fun_stats(x)
## S4 method for signature 'numeric'
fun_stats(x)

Arguments

`x`	Numeric vector for which summary statistics are computed.

Value

A list with components min, max, mean, and median.

Generate Scenarios

Description

Internal method to Generate scenarios based on parameter list variations.

Usage

gen_scenario(params = "list")

## S4 method for signature 'ANY'
gen_scenario(params = "list")
gen_scenario(params = "list")

## S4 method for signature 'ANY'
gen_scenario(params = "list")

Arguments

params

List of parameters where each parameter can vary across scenarios.

Value

Nested list of scenarios.

Examples

some_distr<-list(A=list(mu = 31.07, sigma = 0.28),
B=list(mu = c(34.4,31.4,25.6), sigma = 0.98, nu = 1.7),# note mu!
C=list(mu = 31.4, sigma = 0.34),
D=list(mu = 25.6, sigma = 0.24))
gen_scenario(some_distr)
some_distr<-list(A=list(mu = 31.07, sigma = 0.28),
B=list(mu = c(34.4,31.4,25.6), sigma = 0.98, nu = 1.7),# note mu!
C=list(mu = 31.4, sigma = 0.34),
D=list(mu = 25.6, sigma = 0.24))
gen_scenario(some_distr)

Iterate Over a Grid of All Possible Quantiles and Calculate Probabilities

Description

This function queries the probabilistic space of a pbox object to calculate probabilities associated with specific marginal or conditional distributions on a quantile grid. It supports conditional probability calculations as well.

This method processes the pbox object to compute probabilities based on the specified marginal and conditional parameters. It handles both simple probability calculations and complex queries involving joint and conditional distributions, with an option for bootstrap confidence interval estimation.

Usage

grid_pbox(pbx, mj = character(), co = NULL, probs = seq(0, 1, 0.1), ...)

## S4 method for signature 'pbox'
grid_pbox(pbx, mj = character(), co = NULL, probs = seq(0, 1, 0.1), ...)
grid_pbox(pbx, mj = character(), co = NULL, probs = seq(0, 1, 0.1), ...)

## S4 method for signature 'pbox'
grid_pbox(pbx, mj = character(), co = NULL, probs = seq(0, 1, 0.1), ...)

Arguments

`pbx`	An object of class `pbox` from which to query the probabilistic space.
`mj`	A character vector specifying the variables to query.
`co`	A character vector specifying the variables to query
`probs`	A numeric vector of quantiles to calculate probabilities for (default: seq(0, 1, 0.1)).
`...`	Additional parameters passed to `qpbox`.

Value

A data.table containing estimated probabilities for each combination of quantiles and distributions queried.

Examples

  data("SEAex")
  pbx <- set_pbox(SEAex)
  grid_pbox(pbx, mj = c("Vietnam", "Malaysia"))
data("SEAex")
  pbx <- set_pbox(SEAex)
  grid_pbox(pbx, mj = c("Vietnam", "Malaysia"))

Create a Probability Box (Pbox) Object

Description

Constructs a probability box (Pbox) object from a given dataset and a pre-defined copula model. This auxiliary method facilitates the integration of data with a copula to form a comprehensive probabilistic model known as a Pbox.

Method for creating a 'pbox' object using a specified copula and data. This method ensures that the input data and copula are compatible in terms of dimensions and structurally fit to form a Pbox.

Usage

make_pbox(data, cop)

## S4 method for signature 'ANY'
make_pbox(data, cop)
make_pbox(data, cop)

## S4 method for signature 'ANY'
make_pbox(data, cop)

Arguments

`data`	A dataframe or data table; this data will be coerced to a 'data.table' internally.
`cop`	An object of class 'mvdc' representing the multivariate dependency structure (copula).

Value

An object of class 'pbox' with slots: - '$data': The data coerced into a 'data.table'. - '$copula': The provided copula object.

Examples

  library(copula)
  data("SEAex")

  cop <- normalCopula(param = 0.5, dim = 4)
  distList <- c("RG", "SN1", "RG", "RG")
  allDistrs <- list(list(mu = 31.07, sigma = 0.28),
                    list(mu = 34.4, sigma = 0.98, nu = 1.7),
                    list(mu = 31.4, sigma = 0.34),
                    list(mu = 25.6, sigma = 0.24))
  copSEA <- mvdc(cop, distList, allDistrs)
  pbx <- make_pbox(data = SEAex, cop = copSEA)
  print(class(pbx))
library(copula)
  data("SEAex")

  cop <- normalCopula(param = 0.5, dim = 4)
  distList <- c("RG", "SN1", "RG", "RG")
  allDistrs <- list(list(mu = 31.07, sigma = 0.28),
                    list(mu = 34.4, sigma = 0.98, nu = 1.7),
                    list(mu = 31.4, sigma = 0.34),
                    list(mu = 25.6, sigma = 0.24))
  copSEA <- mvdc(cop, distList, allDistrs)
  pbx <- make_pbox(data = SEAex, cop = copSEA)
  print(class(pbx))

Generate Query Vector

Description

This function defines a generic function for creating a query vector to explore the probabilistic space based on provided matches and data. It is used internally to handle different types of inputs efficiently.

Usage

match_maker(varSet, matches, data)
match_maker(varSet, matches, data)

Arguments

`varSet`	A data frame or list describing the variable set.
`matches`	A data frame describing the matches with potential additional control parameters.
`data`	A data frame representing the data to be queried.

Value

A modified version of 'varSet' with values updated based on 'matches'.

Method for match_maker

Description

This method implements the 'match_maker' function for handling specific types of 'varSet', 'matches', and 'data'. It modifies the 'varSet' based on 'matches' which can contain variable names and values to be matched or operations to be performed. It supports operations and direct value assignment.

Usage

## S4 method for signature 'ANY'
match_maker(varSet, matches, data)
## S4 method for signature 'ANY'
match_maker(varSet, matches, data)

Arguments

`varSet`	A data frame or list describing the variable set.
`matches`	A data frame describing the matches with variable names and corresponding values or operators.
`data`	A data frame representing the data to be queried.

Value

A modified version of 'varSet' that integrates conditions or values from 'matches'.

Modify Parameters Box

Description

Internal method to modify specific parameters in a nested list structure by applying deviations.

Usage

modify_pbox(all_parms, params_list, sigma = 0.05, range = seq(-3, 3, 1))

## S4 method for signature 'ANY'
modify_pbox(all_parms, params_list, sigma = 0.05, range = seq(-3, 3, 1))
modify_pbox(all_parms, params_list, sigma = 0.05, range = seq(-3, 3, 1))

## S4 method for signature 'ANY'
modify_pbox(all_parms, params_list, sigma = 0.05, range = seq(-3, 3, 1))

Arguments

`all_parms`	nested list of parameters from the pbox object.
`params_list`	Named list where each name corresponds to a variable in the dataset and the value is a vector of parameter names to modify (e.g. list(Vietnam="mu")).
`sigma`	Standard deviation used for calculating parameter deviations.
`range`	Range values for generating deviations.

Value

Modified list of parameters.

Examples

some_distr<-list(A=list(mu = 31.07, sigma = 0.28),
B=list(mu = 34.4, sigma = 0.98, nu = 1.7),
C=list(mu = 31.4, sigma = 0.34),
D=list(mu = 25.6, sigma = 0.24))
modify_pbox(some_distr, list(A = "mu"))
some_distr<-list(A=list(mu = 31.07, sigma = 0.28),
B=list(mu = 34.4, sigma = 0.98, nu = 1.7),
C=list(mu = 31.4, sigma = 0.34),
D=list(mu = 25.6, sigma = 0.24))
modify_pbox(some_distr, list(A = "mu"))

Compute Parameter Deviations

Description

Internal method to calculate ± 1, 2, 3 standard deviations for given parameters.

Usage

param_dev(param = "numeric", sigma = 0.05, range = seq(-3, 3, 1))

## S4 method for signature 'ANY'
param_dev(param = "numeric", sigma = 0.05, range = seq(-3, 3, 1))
param_dev(param = "numeric", sigma = 0.05, range = seq(-3, 3, 1))

## S4 method for signature 'ANY'
param_dev(param = "numeric", sigma = 0.05, range = seq(-3, 3, 1))

Arguments

`param`	Numeric vector of parameters.
`sigma`	Numeric value representing standard deviation (default is 0.05).
`range`	Numeric vector specifying range of deviations (default is seq(-3, 3, 1)).

Value

Numeric vector of parameters adjusted by the specified deviations.

Examples

param_dev(31)
param_dev(31)

Class "pbox": Main S4 class of the library pbox.

Description

"pbox" is a class representing the probabilistic space which combines data, copula and margins.

Slots

data: The original data coerced to a data.table.
copula: The copula object of class mvdc.
fit: The results of the automated selection for both the marginal distribution and the copula.

Compute Probability Using a Perturbed Copula

Description

Computes the probability by applying a perturbation to the copula parameters within a 'pbox' object, and then evaluating the probability for specified query values. This method ensures that variations in the copula parameters can be assessed for their impact on the computed probabilities.

‘perProb' method for objects of class ’pbox'. This method perturbs the parameters of the copula contained in the 'pbox' and then computes the probability of the vector query using the perturbed copula. The perturbation process adjusts the copula parameters and evaluates the impact on the outcome probability.

Usage

perProb(x, vecQuery)

## S4 method for signature 'pbox'
perProb(x, vecQuery)
perProb(x, vecQuery)

## S4 method for signature 'pbox'
perProb(x, vecQuery)

Arguments

`x`	A 'pbox' object, which is expected to contain a copula.
`vecQuery`	A numeric vector representing the query values.

Value

The probability computed using a perturbed copula.

Numeric value representing the computed probability using the perturbed copula.

Examples

  data(SEAex)
  pbx <- set_pbox(SEAex[, .(Malaysia, Thailand)])
  vecQuery <- c(31, 34)
  perProb(pbx, vecQuery)
data(SEAex)
  pbx <- set_pbox(SEAex[, .(Malaysia, Thailand)])
  vecQuery <- c(31, 34)
  perProb(pbx, vecQuery)

Perturb Parameters

Description

This function defines a generic function to perturbate parameter values for each distribution within a copula, using random perturbations to simulate variability or uncertainty.

Usage

perturbate_params(paramMargins)
perturbate_params(paramMargins)

Arguments

paramMargins

A list containing lists of parameter values for each distribution in the copula.

Value

A list of lists containing perturbed parameter values.

Examples

paramMargins <- list(list(0.2, 0.3), list(0.4, 0.5))
perturbed <- perturbate_params(paramMargins)
print(perturbed)
paramMargins <- list(list(0.2, 0.3), list(0.4, 0.5))
perturbed <- perturbate_params(paramMargins)
print(perturbed)

Perturb Parameters Method

Description

This method implements the generic 'perturbate_params' function specifically for lists of copula distribution parameters. It applies a random perturbation to each parameter based on a normal distribution centered at zero with a standard deviation of 0.05.

Usage

## S4 method for signature 'ANY'
perturbate_params(paramMargins)
## S4 method for signature 'ANY'
perturbate_params(paramMargins)

Arguments

paramMargins

A list containing lists of parameter values for each distribution in the copula.

Value

A list of lists containing perturbed parameter values.

Probability Confidence Interval

Description

Calculates the confidence interval around a vector of probabilities using the quantiles based on the specified significance level.

Usage

probCI(probabilities, alpha=0.05)
probCI(probabilities, alpha=0.05)

Arguments

`probabilities`	A numeric vector of probabilities for which the confidence interval is desired.
`alpha`	The significance level used for constructing the confidence interval; default is 0.05.

Value

A list containing the lower and upper bounds of the confidence intervals for each probability.

Examples

probabilities <- c(0.1, 0.2, 0.3, 0.4, 0.5)
probCI(probabilities)
probCI(probabilities, alpha = 0.1)
probabilities <- c(0.1, 0.2, 0.3, 0.4, 0.5)
probCI(probabilities)
probCI(probabilities, alpha = 0.1)

Method to calculate confidence intervals for a vector of probabilities

Description

This method calculates the lower and upper bounds of the confidence interval for each element in the input vector of probabilities using the given alpha level.

Usage

## S4 method for signature 'numeric'
probCI(probabilities, alpha = 0.05)
## S4 method for signature 'numeric'
probCI(probabilities, alpha = 0.05)

Arguments

`probabilities`	A numeric vector of probabilities.
`alpha`	A numeric value specifying the significance level for the confidence intervals; defaults to 0.05.

Value

A numeric vector containing the lower and upper quantile bounds for each probability in the input vector.

Parse Query

Description

This function defines a generic function to parse a string query into structured data that can be used to explore a pbox object. It extracts components of the query using regular expression matching.

Usage

q_parser(query)
q_parser(query)

Arguments

query

A string representing the query.

Value

A data table with columns 'Varnames', 'Value', 'Operator', and 'Varnames2', where numeric values are converted to numeric type, and unnecessary columns are removed.

Examples

query <- "Vietnam:23"
q_parser(query)
query <- "Vietnam:23"
q_parser(query)

Method for Parsing Queries

Description

Implements the 'q_parser' function specifically for string input. It uses a regular expression to split the query into its components, converting numeric strings to numeric values where applicable, and structuring the result as a data table for easy manipulation.

Usage

## S4 method for signature 'ANY'
q_parser(query)
## S4 method for signature 'ANY'
q_parser(query)

Arguments

query

A string representing the query.

Value

A data table with the parsed elements of the query.

Query the probabilistic space of a pbox object.

Description

This function queries the probabilistic space of a pbox object to calculate probabilities associated with specific marginal or conditional distributions. It supports conditional probability calculations and can optionally estimate confidence intervals through bootstrapping.

Usage

qpbox(
  pbx,
  mj = "character",
  co = "character",
  lower.tail = TRUE,
  fixed = FALSE,
  CI = FALSE,
  iter = 1000
)

## S4 method for signature 'pbox'
qpbox(
  pbx,
  mj = "character",
  co = "character",
  lower.tail = TRUE,
  fixed = FALSE,
  CI = FALSE,
  iter = 1000
)
qpbox(
  pbx,
  mj = "character",
  co = "character",
  lower.tail = TRUE,
  fixed = FALSE,
  CI = FALSE,
  iter = 1000
)

## S4 method for signature 'pbox'
qpbox(
  pbx,
  mj = "character",
  co = "character",
  lower.tail = TRUE,
  fixed = FALSE,
  CI = FALSE,
  iter = 1000
)

Arguments

`pbx`	An object of class `pbox` from which to query the probabilistic space.
`mj`	A character string specifying the marginal and or joint distribution of the variable. It must specify the variable and the value in the format 'Var:Val'.
`co`	A character string specifying the marginal and conditional distribution of the variable. It must specify the variable and the value in the format 'Var:Val'.
`lower.tail`	Logical; if TRUE (default), probabilities are calculated for the area to the right of the specified value.
`fixed`	Logical; if TRUE, calculates conditional probabilities with conditions treated as fixed.
`CI`	Logical; if TRUE, calculates bootstrap confidence intervals.
`iter`	Integer; the number of replications for the confidence interval calculation. Default is 1000.

Value

Estimated probabilities as a numeric value or a named vector including confidence intervals if requested.

Examples

  data("SEAex")
  pbx <- set_pbox(SEAex)
  # Get marginal distribution
  qpbox(pbx, mj="Malaysia:33")
  # Get conditional distribution
  qpbox(pbx, mj="Malaysia:33 & Vietnam:31", co="avgRegion:26")
data("SEAex")
  pbx <- set_pbox(SEAex)
  # Get marginal distribution
  qpbox(pbx, mj="Malaysia:33")
  # Get conditional distribution
  qpbox(pbx, mj="Malaysia:33 & Vietnam:31", co="avgRegion:26")

Scenario Analysis

Description

Performs scenario analysis by modifying underlying parameters of a pbox object. Query the probabilistic space under different scenarios with different combinations of parameters for a single query.

Usage

scenario_pbox(
  pbx,
  param_list = "list",
  sigma = 0.05,
  range = seq(-3, 3, 1),
  ...
)

## S4 method for signature 'pbox'
scenario_pbox(
  pbx,
  param_list = "list",
  sigma = 0.05,
  range = seq(-3, 3, 1),
  ...
)
scenario_pbox(
  pbx,
  param_list = "list",
  sigma = 0.05,
  range = seq(-3, 3, 1),
  ...
)

## S4 method for signature 'pbox'
scenario_pbox(
  pbx,
  param_list = "list",
  sigma = 0.05,
  range = seq(-3, 3, 1),
  ...
)

Arguments

`pbx`	object of class pbox
`param_list`	List specifying which parameters to modify.
`sigma`	Standard deviation for parameter deviations, defaulting to 0.05.
`range`	Range of deviation multipliers, default is seq(-3, 3, 1).
`...`	Additional arguments passed to qpbox.

Value

Named list of results from each scenario evaluation.

Examples

  data("SEAex")
  pbx <- set_pbox(SEAex)
  scenario_pbox(pbx,mj = "Vietnam:31 & avgRegion:26", param_list = list(Vietnam="mu"))

data("SEAex")
  pbx <- set_pbox(SEAex)
  scenario_pbox(pbx,mj = "Vietnam:31 & avgRegion:26", param_list = list(Vietnam="mu"))

Maximum yearly temperature data from 1901 to 2022 (CRU TS v4)

Description

Maximum yearly temperature data from 1901 to 2022 in 11 countries in Southeast Asia and the average temperature of the entire region extracted from Climatic Research Unit gridded Time Series Version 4. Data contains only temperatures for Malaysia, Thailand, Vietnam and the average regional temperature.

Usage

SEAex
SEAex

Format

## 'SEAex' A data frame with 122 rows and 4 columns:

Malaysia,Thailand,Vietnam: Yearly max temperatures in Celsius for each country over 122 years.
avgRegion: Average temperature in Celsius over the whole South East Asia region

Source

<https://crudata.uea.ac.uk/cru/data/hrg/cru_ts_4.07/crucy.2304181636.v4.07/countries/>

Examples


data(SEAex)
head(SEAex)
data(SEAex)
head(SEAex)

Create a Probability Box from Data

Description

Constructs a probability box (pbox) by automatically selecting the best marginal distribution and copula for a given dataset. This function facilitates the creation of a pbox object, which encapsulates the uncertainty and dependencies of the input data.

'set_pbox' method that utilizes data frames or data tables to configure a comprehensive pbox structure. The method involves stages of distribution fitting and copula selection, executed through external functions presumed to be available in the working environment or described in the package.

Usage

set_pbox(data, verbose = TRUE, ctype = "all", cfamily = "all", ...)

## S4 method for signature 'ANY'
set_pbox(data, verbose = TRUE, ctype = "all", cfamily = "all", ...)
set_pbox(data, verbose = TRUE, ctype = "all", cfamily = "all", ...)

## S4 method for signature 'ANY'
set_pbox(data, verbose = TRUE, ctype = "all", cfamily = "all", ...)

Arguments

`data`	A data frame or data table. The data will be coerced to a 'data.table' internally.
`verbose`	control verbosity of the output. Default to TRUE.
`ctype`	Charter indicating the type of copula among archmCopula,evCopula,ellipCopula.
`...`	Other arguments to be passed to the 'fitDist' function.
`cfamilly`	Charter indicating the family of copula among clayton,frank,gumbel,joe,galambos, huslerReiss, normal.

Value

An object of class 'pbox' with the following slots: - '@data': The original data coerced into a 'data.table'. - '@copula': The selected copula object, typically of class 'mvdc'. - '@fit': A list containing results from the automated selection processes for both the marginal distributions and the copula.

Examples

  data("SEAex")
  pbx <- set_pbox(data = SEAex)
  print(pbx)
  print(class(pbx))
data("SEAex")
  pbx <- set_pbox(data = SEAex)
  print(pbx)
  print(class(pbx))

Methods for 'show()' in Package 'pbox'

Description

Methods for function show in package pbox.

Usage

## S4 method for signature 'pbox'
show(object)
## S4 method for signature 'pbox'
show(object)

Arguments

object

an object of class pbox.

Calculate Basic Statistics

Description

Computes basic statistics such as mean and median for specified variables in a data frame or data table based on a set of operations specified in the 'matches' data frame. This function updates the 'varSet' with the computed results for each variable.

Method implementation for calculating statistics using 'data.table' and 'stats'. This method allows the computation of mean and median for subsets of data defined in 'matches' and updates 'varSet' with these results.

Usage

stats_calc(data, matches, varSet)

## S4 method for signature 'ANY'
stats_calc(data, matches, varSet)
stats_calc(data, matches, varSet)

## S4 method for signature 'ANY'
stats_calc(data, matches, varSet)

Arguments

`data`	A data frame or data table.
`matches`	A data frame describing the operations to apply.
`varSet`	A data frame to be updated with results.

Value

Returns a modified version of 'varSet' with updated values based on the calculations.

Package 'pbox'

Help Index

Extract Coefficients

Description

Usage

Arguments

Value

Examples

Method for extracting coefficients from GAM-like models

Description

Usage

Arguments

Value

Define Copula Families and Parameters

Description

Usage

Format

Compute Confidence Interval using Delta Method

Description

Usage

Arguments

Value

Examples

Build a Multivariate Distribution from Copula

Description

Usage

Arguments

Value

Examples

Copula Fit

Description

Usage

Arguments

Value

Fit Copula Models to Data

Description

Usage

Arguments

Value

Examples

Fit Marginal Distributions

Description

Usage

Arguments

Value

Examples

Summary Statistics

Description

Usage

Arguments

Value

Examples

Summary statistics method for numeric vectors

Description

Usage

Arguments

Value

Generate Scenarios

Description

Usage

Arguments

Value

Examples

Iterate Over a Grid of All Possible Quantiles and Calculate Probabilities

Description

Usage

Arguments

Value

Examples

Create a Probability Box (Pbox) Object

Description

Usage

Arguments

Value

Examples

Generate Query Vector

Description

Usage

Arguments

Value