Release 0.7.0

Features

  • Add dimension checks in interpretability
  • Allow to set maximum percentage of nulls in a column
  • Forbid undefined fields in configs
  • Handle none as entity_id in parquets
  • Create new data source definition.
  • New config.yaml design.
  • Enable caching queried data
  • Handle duplicated column names when joining tables.
  • Support parquet data source.
  • Fix monad metrics
  • Validate allowed_columns.
  • Support lambdas at config level
  • Implement mechanism for metric initialization
  • Add joins to benchmarking configs
  • Cast main_entity_id to string
  • Validate columns uniqueness
  • Allow defining lambdas in extra columns
  • Add recommendations to interpretability
  • Set max number of expressions via environment variable
  • Verify if data source name contains any forbidden sequences

Fixes

  • Add recency modality slices to feature value interpretability
  • Allow join_on column in select
  • Allow None value for limit_train_batches
  • Always use stored config at pretraining phase
  • Changed defaults for loader params
  • Check data source type before accessing date column
  • Fix Recommendation model
  • Fix to date parsing in hive
  • Make snowflake config work with new setup
  • Use alias and table name correctly
  • Fix metrics in training params
  • Append suffix to with clause alias
  • Fix detecting cyclic joins.