In-Database ProcessingIn-Database Processing Supported

Visually define data prep or ETL workflows and execute them directly in the database. Reduce data transfer by loading only the data you need after preparation.

With the new In-Database Processing extension you can design a subprocess with new, but familiar preprocessing operators. Computation of these operators is pushed down into a database, i.e. they are automatically translated into SQL code which is submitted to the database. You can then process the result with other operators just like in a normal RapidMiner process.

The main goal of this extension is to allow you to limit the data that you read from a database into the memory of RapidMiner Studio or Server. This is especially important when you are using cloud engines like Google BigQuery where you have to pay for the amount of data you retrieve. Another goal is to leverage your database's computing power which is also important when using distributed, scalable database or cloud engines. All this is done without the need to write SQL code.

This first version of the extension supports Google BigQuery (via OAuth 2), PostgreSQL and MySQL. Further database and cloud engine support is planned for the future.

Product Details

Version 9.7.0
File size 3.9 MB
Downloads 5830 (19 Today)5830 downloads
Vendor RapidMiner
Category Operators
Released 8/6/20
Last Update 8/6/20 1:36 PM
License RM_EULA
Product web site
Rating 0.0 stars(0)