Release 0.14
April 10th, 2025
New features - Core BaseModel Repository:
- Modular foundation model training
- The two components of the pretrain stage — data preprocessing & representation fitting, and FM training — can now be run independently, or together, as before:
- Two respective functions:
fit_behavioral_representationandtrain_foundation_modelwere added to compute features and train foundation model separately. - Changed
monad.pretraincommand name tomonad.runwith optional--fitand--fmarguments to compute only features or foundation model.
- Two respective functions:
- The two components of the pretrain stage — data preprocessing & representation fitting, and FM training — can now be run independently, or together, as before:
- Flexible prediction outputs
- Introduced different types of predicted output defined by mandatory
output_typeparameter. This requires changes to previously written scripts. - Added functions
readout_sketchandread_target_entity_idsto map recommendation outputs to feature values.
- Introduced different types of predicted output defined by mandatory
- Enhanced model training with early stopping
- Introduced
early_stoppingparameter to prevent overfitting.
- Introduced
- Expanded model interpretability
- Introduced
interpret_entityfunction to compute event-level attributions for a single main entity.
- Introduced
- Automated model testing
- Introduced
testmethod to compute metrics based on predictions and ground truth.
- Introduced
- Flexible BigQuery connection
- Added
project_idparameter to define project different from the one in the service account.
- Added
New features - GUI Application (Snowflake Native):
- Cascading run execution – enables dependent jobs/runs to trigger in sequence.
- Run and job status tracking – added detailed status tracking for better monitoring.
- New table designs – updated tables with improved layout and readability.
- Validation improvements – enhanced input and data validation across the platform.
- Multi-GPU training – enabled distributed model training across multiple GPUs.
- Listing state restoration – automatically restores UI state when returning to listings.
Fixes:
- Fixed an issue where all main entities were loaded on one GPU when distributed training across multiple GPUs was used.
- Fixed an issue where default multiclass metric was returning an error.
- Fixed an issue where interpretability attributions for time-series features were empty.
Dependencies
- The
dasklibrary is no longer a dependency.
