Tutorials

Independence Tests

The independence testing problem is generalized as follows: consider random variables X and Y that have joint density FXY=FX|YFY. We are testing:

H0:FXY=FXFYHA:FXYFXFY

These tutorials overview how to use these tests as well as benchmarks comparing the algorithms included against each other.

K-sample Tests

The k-sample testing problem is generalized as follows: consider random variables X1,X2,,Xk that have densities F1,F2,,Fk. Then, we are testing

H0: F1=F2=FkHA:  jj s.t. FjFj

This tutorial overview how to use k-sample tests in hyppo.

Time-Series Tests

Time-series tests of independence consider the following problem: consider random variables X and Y with joint density FXY and marginal densities FX and FY. Let FXt, FYs, and FXtYs represent the marginal and joint distributions of time-indexed random varlables Xt and Ys at timesteps t and s. Let {(Xt,Yt)}t= be a full jointly-sampled strictly stationary time series with the observed sample {(X1,Y1),(Xn,Yn)}. Choose some nonnegative integer M as the maximium lag hyperparamater. Then we are testing,

H0:FXtYtj=FXtFYtj for each j{0,1,,M}HA:FXtYtjFXtFYtj for some j{0,1,,M}

This tutorial overview how to use time_series based tests in hyppo.

Sims

To evaluate existing implmentations and benchmark against other packages, we have developed a suite of 20 dependency structures. The simulation settings include polynomial (linear, quadratic, cubic), trigonometric (sinusoidal, circular, ellipsoidal, spiral), geometric (square, diamond, w-shaped), and other functions. We also include 3 sample Gaussian simulations as well, which are sampled from multivariate normal distribusions.