This extension wraps functionality from the Smile library ( and provides them as operators.

This extension wraps functionality from the Smile library ( and provides them as operators.

Smile is a fast and comprehensive machine learning engine. They focus on Speed, Ease of Use, Comprehensive, Natural Language Processing and Mathematics and Statisitcs.

Currently the extension provides the following Operators:

  • Anomaly:
    • Gaussian Mixture
  • Blending:
    • t-SNE
  • Cleansing:
    • Probabilistic Principal Component Analysis (PPCA) 
  • Clustering:
    • G-Means
  • Models:
    • Parametric Probability Estimator
  • Learner:
    • Lasso Regression
    • Random Forest (Smile) (now with classification in 0.4.0)
    • Gradient Boosted Tree (Smile)  (now with classification in 0.4.0)
  • Statisitics
    • Compare Distribution (enhanced in 0.4.0)

Version 0.4.1 (2021-04-08)

  • Fixed a bug that GMM was not able to handle one-class or unlabeled data even though it was able to do.

Version 0.4.0 (2019-12-18)

  • Random Forest (Smile) and Gradient Boosted Trees (Smile) now support Classification.
    • Random Forest Regression (Smile) renamed to Random Forest (Smile)
  • Compare Distributions: 
    • Added Kullback-Leibler and Jensen-Shannon as options to compare distributions. They run on a normalized bin version of the distribution.
    • Binning for Chi-Square, KL and JS are done on the superset of the data (i.e. min/max are determined on the superset).
    • A proper error message is thrown if you use Compare Distributions on data with missing values, which is not supported.

Version 0.3.0 (2019-09-11)

  • New operator: Compare Distributions
    • test the compatibility of two ExampleSets.
  • New operator: Gradient Boosted Tree (Smile)
    • Train a gradient boosted tree for Regression (classification currently not supported)
  • Renamed Regression operator folder to Learner
  • Major internal code refactoring. This may cause that previously trained models are not applicable anymore.


Version 0.2.0 (2019-07-30)

  • Added new operator Random Forest Regression (Smile)
  • Added the corresponding Random Forest Model

Version 0.1.0 (2019-02-08)

  • Extension release
  • New operator Gaussian Mixture
  • New operator G-Means 
  • New operator Probabilistic Principal Component Analysis 
  • New operator Lasso Regression
  • Operator t-SNE copied from Operator Toolbox Extension
  • Operator Parametric Probability Estimator copied from Operator Toolbox Extension

Product Details

Version 0.4.1
File size 1.3 MB
Downloads 5229 (2 Today)5229 downloads
Vendor RapidMiner Labs
Category Machine Learning
Released 4/8/21
Last Update 4/8/21 9:23 AM
License AGPL
Product web site
Rating 0.0 stars(0)