Category: Data Sources and Formats
Connectors to new data sources, including parsers, generators, and converters for new file formats.
This extension contains a connector to Braincube Api and composant for read data from Braincube Api.
MCMD (M-Command) is a set of open source commands developed to process large scale data structures (CSV data) efficiently. The commands can process more than 20 million records of CSV data on a regular PC.
This operator runs in a UNIX environment (Linux / MacOS).
To use this operator, firstly you need to install MCMD from NYSOL website below: http://www.nysol.jp/en/home
This extension was developed by KSK Analytics, Inc.
Freemarker Extension can apply a freemarker template on the Rapidminer's exampleset.
The Hive Connector Extension provides an operator to pull data from Apache Hive.
This extension provides operators to work on image data. Capabilities provided include: extracting text from images, performing transformations and loading as ExampleSets or Tensor objects e.g. required for Deep Learning.
Use the powerful jq language to transform JSON data or extract parts in RapidMiner Studio.
This extension contains operators to interact with the Apache Kafka message broker.
It adds two operators to interact with a specific Kafka topic:
- Read messages from a topic, either old messages in a batch, or collect new published messages
- Write data from an example set as new messages into a topic on a Kafka cluster
The NoSQL Connectors Extension provides operators to connect to both MongoDB and Cassandra.
OPC-UA Connector Extension: This extensions provides a connection object and operators to read data from an OPC-UA server.
This extension provides a convenient way to extract data tables from a PDF document and converts them to RapidMiner exampleset(s). The PDF document can be loaded from a local path or a remote (URL) location.
The PMML Extension adds a new operator for writing models into the PMML standard. PMML is a standard for statistical and data mining models and supported by many vendors and organizations.
The Qlik Connector provides a connector to the Business Intelligence and Self-Service Data Visualization software products from Qlik.
This toolkit facilitates to learn from Semantic Web data i.e. RDF within RapidMiner. It transforms RDF triples to an example set and then any kind of learning can be applied on this data.
This extension contains operators, that enable access to Microsoft SharePoint Sites.
- List SharePoint Files: Generates a list of files and folders of a site
- Download from SharePoint: Downloads files provided
This extension allows you to connect to and work with Windows SMB and Linux Samba shares. It includes the connection and operators to read, write, delete, and loop over files.
The Solr Connector provides a connector to Apache Solr, an open-source enterprise full-text search platform allowing to index data and query indexes to retrieve data again.
The Splunk Connector provides a connector to the Splunk platform for Operational Intelligence, a platform to collect, search and analyze machine-generated data.
This extension provides operators to extract data tables from online spreadsheet applications and convert them to RapidMiner exampleset(s). Currently, it provides two operators to retrieve data tables from Google Spreadsheets, which uses Google Sheets API and Microsoft Excel Online, which uses Microsoft Graph API.
This extension adds functionality to efficiently connect to RESTful webservices and parse any given JSON string into one or more flat tables. The JSON can be part of a data table as attribute, a file or given as macro. The REST operators also can directly parse returned JSON and support managed rate limits.
This extension provides a convenient way to extract data tables from HTML webpages and converts them to RapidMiner exampleset(s). The HTML document can be loaded from a local path or a remote (URL) location.
An extension for structured web data coming from news, blogs, discussion and reviews sites.