Baseline strategies
skpro offers simple baseline strategies for model validation.
DensityBaseline
The DensityBaseline strategy wraps scikit-learn’s KernelDensity estimator to predict a density estimated from the training labels.
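To make the idea concrete, here is a minimal sketch of what a label-only density baseline could look like when written directly against scikit-learn. It is not skpro's implementation: the synthetic labels, the Gaussian kernel, the bandwidth of 2.0, and the use of the standard error as a spread measure are all assumptions chosen for illustration.

```python
import numpy as np
from sklearn.neighbors import KernelDensity

# Synthetic stand-in labels (assumption: not real housing data)
rng = np.random.default_rng(0)
y_train = rng.normal(loc=22.0, scale=9.0, size=350)
y_test = rng.normal(loc=22.0, scale=9.0, size=150)

# KernelDensity expects a 2-D array of shape (n_samples, n_features),
# so the 1-D label vector is reshaped into a single column.
# The kernel and bandwidth are illustrative choices, not skpro defaults.
kde = KernelDensity(kernel="gaussian", bandwidth=2.0)
kde.fit(y_train.reshape(-1, 1))

# score_samples returns the log-density at each query point.
# The same fitted density is used for every test point.
log_density = kde.score_samples(y_test.reshape(-1, 1))

# Log loss per test point is the negative log-density at the true label
point_losses = -log_density
mean_loss = point_losses.mean()
# Standard error of the mean loss (an illustrative spread measure)
std_err = point_losses.std() / np.sqrt(len(point_losses))

print("Loss: %f+-%f" % (mean_loss, std_err))
```

Because such a baseline never looks at the features, any model that actually uses them should achieve a lower log loss on the same test split; if it does not, the model is adding little beyond the marginal distribution of the labels.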
The following example illustrates the baseline usage on the Boston housing data:
# Note: load_boston was removed in scikit-learn 1.2, so this example needs an older release
from sklearn.datasets import load_boston
from sklearn.model_selection import train_test_split
from skpro.baselines import DensityBaseline
from skpro.metrics import log_loss
# Load Boston housing data
X, y = load_boston(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3)
# Train and predict on Boston housing data using a baseline model
y_pred = DensityBaseline().fit(X_train, y_train).predict(X_test)
# Obtain the log loss on the test set, together with its spread
loss = log_loss(y_test, y_pred, sample=True, return_std=True)
print('Loss: %f+-%f' % loss)
# Plot performance
import utils
utils.plot_performance(y_test, y_pred)
>>> Loss: 3.444260+-0.062277
Please refer to the module documentation to learn more.