Release 0.9

New features

Grouped Decimal Features in Interpretability
Introduced the ability to handle and analyze grouped decimal features, enhancing model interpretability.
Event Attributions to interpret recommendation models
Users can now trace back and understand how specific events influence model outputs and predictions.
Prediction Storage in Snowflake Database
Added functionality to save predictions directly into a Snowflake database.
Data Source Name in Minimum Group Size Logs
Added logging of the data source name when enforcing minimum group size requirements.
Join Functionality for Attribute Data Sources (enhanced)
Expanded support to allow joining attribute data sources with multiple data sources.
Filtering on Extra Columns in Data Source Definition
Users can now filter, group, and leverage extra columns passed in the data source definition.
New Parameter inDataParams:training_end_date
Introduced the training_end_date parameter, providing more flexibility and control over model training timelines.
New Parameters inTestingParams: local_save_location,remote_save_location
Introduced local_save_location and remote_save_location as parameters within TestingParams.

🚧
Note
Please adapt your configuration file to reflect this syntax change.
Extended Group Max Retries
Default values of group computation retries and retry interval have been increased. Default forGROUPS_N_RETRIES is now set to 20 and default for GROUPS_RETRY_INTERVAL is now set to 60. This reduces the likelihood of failures due to transient issues and improves overall robustness. For more information refer to Dividing event tables section.
Entity Number Limit for Target Function Validation
The number of entities that can be used when validating target functions is now capped to ensure efficiency and prevent overload during the validation process.
Enhanced Debug Messages for Target Function Validation
More comprehensive debug messages have been added during target function validation to assist in troubleshooting and increase transparency in the validation process.

Fixes

Fixed issues with None values in grouping.
Fixed regression loss calculation and logging.
Fixed errors in pandas query parsing.
Improved Neptune alerter logging.
Removed unused validations and loss functions.
Optimized memory usage in interpretability.
Fixed handling of missing metrics in Neptune.
Reduced memory consumption.
Improved directory creation based on cache path.
Enhanced schema selection in Hive builder.
Handled potential NaN values in decimal calculations.

Docs

Updated the documentation navigation to be more readable and user-friendly.
Added Recipes section for easy reference when building target functions.