Baseline strategies¶

skpro offers simple baseline strategy strategies for model validation.

DensityBaseline¶

The DensityBaseline strategy wraps scikit-learn’s KernelDensity estimation to predict a density using the training labels.

The following example illustrates the baseline usage on Bosting housing data:

from sklearn.datasets.base import load_boston
from sklearn.model_selection import train_test_split

from skpro.baselines import DensityBaseline
from skpro.metrics import log_loss

# Load boston housing data
X, y = load_boston(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3)

# Train and predict on boston housing data using a baseline model
y_pred = DensityBaseline().fit(X_train, y_train)\
                          .predict(X_test)
# Obtain the loss
loss = log_loss(y_test, y_pred, sample=True, return_std=True)

print('Loss: %f+-%f' % loss)

# Plot performance
import utils
utils.plot_performance(y_test, y_pred)

>>> Loss: 3.444260+-0.062277

Please refer to the module documentation to learn more.