Category: Data Sources and Formats

Connectors to new data sources, including parsers, generators, and converters for new file formats.

Braincube Connector
This extension contains a connector to Braincube Api and composant for read data from Braincube Api.

CAE Connectors
This extension allows the user to read CAE files. It uses Altair compose and python in the backend. You have to have an installation of both to be able to use this extension.

Execute MCMD Extension
MCMD (M-Command) is a set of open source commands developed to process large scale data structures (CSV data) efficiently. The commands can process more than 20 million records of CSV data on a regular PC. This operator runs in a UNIX environment (Linux / MacOS). To use this operator, firstly you need to install MCMD from NYSOL website below: http://www.nysol.jp/en/home This extension was developed by KSK Analytics, Inc.

Facebook Extension
An extension for fetching pages and content related to them from Facebook.

Freemarker operator
Freemarker Extension can apply a freemarker template on the Rapidminer's exampleset.

HDF 5 Extension
This extension brings HDF5 files to RapidMiner.

Hive Connector
The Hive Connector Extension provides an operator to pull data from Apache Hive.

Image Handling
This extension provides operators to work on image data. Capabilities provided include: extracting text from images, performing transformations and loading as ExampleSets or Tensor objects e.g. required for Deep Learning.

IoT Connector
IoT Connector Extension provides the Altair IoT Studio connection type to allow users to retrieve information for their IoT setup through the provided operators.

JSON Processing with jq
Use the powerful jq language to transform JSON data or extract parts in RapidMiner Studio.

Kafka Connector
This extension contains operators to interact with the Apache Kafka message broker. It adds two operators to interact with a specific Kafka topic: - Read messages from a topic, either old messages in a batch, or collect new published messages - Write data from an example set as new messages into a topic on a Kafka cluster

Mozenda Connector
The Mozenda Connector provides a connector to the Mozenda API.

NoSQL Connectors
The NoSQL Connectors Extension provides operators to connect to both MongoDB and Cassandra.

OPC-UA Connector
OPC-UA Connector Extension: This extensions provides a connection object and operators to read data from an OPC-UA server.

Parquet Extension
This extension contains an operator, that enables simple parquet file reading.

PDF Table Extraction
This extension provides a convenient way to extract data tables from a PDF document and converts them to RapidMiner exampleset(s). The PDF document can be loaded from a local path or a remote (URL) location.

PMML Extension
The PMML Extension adds a new operator for writing models into the PMML standard. PMML is a standard for statistical and data mining models and supported by many vendors and organizations.

Qlik Connector
The Qlik Connector provides a connector to the Business Intelligence and Self-Service Data Visualization software products from Qlik.

SAS Connector
The SAS connector provides an operator for reading SAS files.

Semweb
This toolkit facilitates to learn from Semantic Web data i.e. RDF within RapidMiner. It transforms RDF triples to an example set and then any kind of learning can be applied on this data.

Sensor Link
This extension provides connectors for the OSIsoft PI System.

SharePoint Connector
This extension contains operators, that enable access to Microsoft SharePoint Sites. It includes reading files and folders, writing and deleting files, as well as looping over files on a folder.

SMB Connector
This extension allows you to connect to and work with Windows SMB and Linux Samba shares. It includes the connection and operators to read, write, delete, and loop over files.

Solr Connector
The Solr Connector provides a connector to Apache Solr, an open-source enterprise full-text search platform allowing to index data and query indexes to retrieve data again.

Splunk Connector
The Splunk Connector provides a connector to the Splunk platform for Operational Intelligence, a platform to collect, search and analyze machine-generated data.

Spreadsheet Table Extraction
This extension provides operators to extract data tables from online spreadsheet applications and convert them to RapidMiner exampleset(s). Currently, it provides two operators to retrieve data tables from Google Spreadsheets, which uses Google Sheets API and Microsoft Excel Online, which uses Microsoft Graph API.

Tableau Table Writer
Export your data to Tableau.

Web Automation Extension
This extension adds functionality to efficiently connect to RESTful webservices and parse any given JSON string into one or more flat tables. The JSON can be part of a data table as attribute, a file or given as macro. The REST operators also can directly parse returned JSON and support managed rate limits.

Web Table Extraction
This extension provides a convenient way to extract data tables from HTML webpages and converts them to RapidMiner exampleset(s). The HTML document can be loaded from a local path or a remote (URL) location.

Webhose.io Extension
An extension for structured web data coming from news, blogs, discussion and reviews sites.