Category: Data Sources and Formats

Connectors to new data sources, including parsers, generators, and converters for new file formats.

Facebook Extension
An extension for fetching pages and content related to them from Facebook.

Freemarker operator
Freemarker Extension can apply a freemarker template on the Rapidminer's exampleset.

Google_cloud_platform
This extension contains an operator that allows you to use the Google Cloud Speech API to convert audio files to text.

HDF 5 Extension
This extension brings HDF5 files to RapidMiner.

Hive Connector
The Hive Connector Extension provides an operator to pull data from Apache Hive.

Mozenda Connector
The Mozenda Connector provides a connector to the Mozenda API.

NoSQL Connectors
The NoSQL Connectors Extension provides operators to connect to both MongoDB and Cassandra.

PDF Table Extraction
This extension provides a convenient way to extract data tables from a PDF document and converts them to RapidMiner exampleset(s). The PDF document can be loaded from a local path or a remote (URL) location.

PMML Extension
The PMML Extension adds a new operator for writing models into the PMML standard. PMML is the leading standard for statistical and data mining models and supported by over 20 vendors and organizations.

Qlik Connector
The Qlik Connector provides a connector to the Business Intelligence and Self-Service Data Visualization software products from Qlik.

SAS Connector
The SAS connector provides an operator for reading SAS files.

Semweb
This toolkit facilitates to learn from Semantic Web data i.e. RDF within RapidMiner. It transforms RDF triples to an example set and then any kind of learning can be applied on this data.

SharePoint Connector
This extension contains operators, that enable access to Microsoft SharePoint Sites. - List SharePoint Files: Generates a list of files and folders of a site - Download from SharePoint: Downloads files provided

Solr Connector
The Solr Connector provides a connector to Apache Solr, an open-source enterprise full-text search platform allowing to index data and query indexes to retrieve data again.

Splunk Connector
The Splunk Connector provides a connector to the Splunk platform for Operational Intelligence, a platform to collect, search and analyze machine-generated data.

Spreadsheet Table Extraction
This extension provides operators to extract data tables from online spreadsheet applications and convert them to RapidMiner exampleset(s). Currently, it provides two operators to retrieve data tables from Google Spreadsheets, which uses Google Sheets API and Microsoft Excel Online, which uses Microsoft Graph API.

Tableau Table Writer
Export your data to Tableau.

Web Table Extraction
This extension provides a convenient way to extract data tables from HTML webpages and converts them to RapidMiner exampleset(s). The HTML document can be loaded from a local path or a remote (URL) location.