Category: Data Sources and Formats
Connectors to new data sources, including parsers, generators, and converters for new file formats.
This extension contains a connector to the Braincube API and a component for reading data from it.
This extension allows the user to read CAE files.
It uses Altair Compose and Python in the backend. Both must be installed to use this extension.
MCMD (M-Command) is a set of open-source commands developed to process large-scale data structures (CSV data) efficiently. The commands can process more than 20 million records of CSV data on a regular PC.
This operator runs in a UNIX environment (Linux / macOS).
To use this operator, first install MCMD from the NYSOL website: http://www.nysol.jp/en/home
This extension was developed by KSK Analytics, Inc.
The Freemarker Extension applies a FreeMarker template to a RapidMiner ExampleSet.
The Hive Connector Extension provides an operator to pull data from Apache Hive.
This extension provides operators to work on image data. Capabilities include extracting text from images, performing transformations, and loading images as ExampleSets or as Tensor objects, e.g. as required for Deep Learning.
The IoT Connector Extension provides the Altair IoT Studio connection type, which lets users retrieve information about their IoT setup through the provided operators.
Use the powerful jq language to transform JSON data or extract parts in RapidMiner Studio.
This extension contains operators to interact with the Apache Kafka message broker.
It adds two operators to interact with a specific Kafka topic:
- Read messages from a topic, either old messages in a batch or newly published messages as they arrive
- Write data from an example set as new messages into a topic on a Kafka cluster
The NoSQL Connectors Extension provides operators to connect to both MongoDB and Cassandra.
OPC-UA Connector Extension: This extension provides a connection object and operators to read data from an OPC-UA server.
This extension provides a convenient way to extract data tables from a PDF document and convert them to RapidMiner ExampleSets. The PDF document can be loaded from a local path or a remote (URL) location.
The PMML Extension adds a new operator for writing models in the PMML standard. PMML is a standard for statistical and data mining models and is supported by many vendors and organizations.
The Qlik Connector provides a connector to the Business Intelligence and Self-Service Data Visualization software products from Qlik.
This toolkit facilitates learning from Semantic Web data, i.e. RDF, within RapidMiner. It transforms RDF triples into an example set, to which any kind of learning can then be applied.
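To illustrate the idea of turning RDF triples into a tabular example set, here is a minimal Python sketch. It is not the toolkit's implementation: the function name `triples_to_rows`, the sample triples, and the "one row per subject, one column per predicate" layout are assumptions chosen for illustration, and repeated predicates simply keep the last value.

```python
from collections import defaultdict


def triples_to_rows(triples):
    """Pivot (subject, predicate, object) triples into one row per subject.

    Hypothetical helper for illustration only; a repeated predicate on the
    same subject keeps the last value seen.
    """
    by_subject = defaultdict(dict)
    for subject, predicate, obj in triples:
        by_subject[subject][predicate] = obj

    # Every distinct predicate becomes a column; missing values become None.
    columns = sorted({p for attrs in by_subject.values() for p in attrs})
    rows = [
        {"subject": s, **{c: attrs.get(c) for c in columns}}
        for s, attrs in by_subject.items()
    ]
    return columns, rows


# Made-up sample triples (not real data).
triples = [
    ("ex:alice", "ex:age", "34"),
    ("ex:alice", "ex:city", "Berlin"),
    ("ex:bob", "ex:age", "41"),
]
columns, rows = triples_to_rows(triples)
for row in rows:
    print(row)
```

Each resulting row plays the role of one example, and each predicate plays the role of one attribute.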
This extension contains operators that enable access to Microsoft SharePoint sites. It includes reading files and folders, writing and deleting files, and looping over the files in a folder.
This extension allows you to connect to and work with Windows SMB and Linux Samba shares. It includes the connection and operators to read, write, delete, and loop over files.
The Solr Connector provides a connector to Apache Solr, an open-source enterprise full-text search platform that lets you index data and query those indexes to retrieve it.
The Splunk Connector provides a connector to the Splunk platform for Operational Intelligence, a platform to collect, search and analyze machine-generated data.
This extension provides operators to extract data tables from online spreadsheet applications and convert them to RapidMiner ExampleSets. Currently, it provides two operators: one retrieves data tables from Google Spreadsheets using the Google Sheets API, and the other retrieves them from Microsoft Excel Online using the Microsoft Graph API.
This extension provides a convenient way to extract data tables from HTML webpages and convert them to RapidMiner ExampleSets. The HTML document can be loaded from a local path or a remote (URL) location.
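As a rough illustration of what extracting data tables from HTML involves, the following Python sketch collects the cell text of each `<table>` using only the standard library. It is an assumption-laden sketch, not the extension's implementation: the class `TableExtractor`, the helper `extract_tables`, and the sample markup are hypothetical, and nested tables are not handled.

```python
from html.parser import HTMLParser


class TableExtractor(HTMLParser):
    """Collect each <table> as a list of rows, each row a list of cell strings."""

    def __init__(self):
        super().__init__()
        self.tables = []
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text fragments of the cell being read, or None

    def _flush_cell(self):
        # Store the current cell, if any, in the current row.
        if self._cell is not None and self._row is not None:
            self._row.append("".join(self._cell).strip())
        self._cell = None

    def handle_starttag(self, tag, attrs):
        if tag == "table":
            self.tables.append([])
        elif tag == "tr" and self.tables:
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._flush_cell()  # tolerate an omitted closing cell tag
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._flush_cell()
        elif tag == "tr" and self._row is not None:
            self._flush_cell()
            self.tables[-1].append(self._row)
            self._row = None


def extract_tables(html):
    parser = TableExtractor()
    parser.feed(html)
    parser.close()
    return parser.tables


# Made-up sample markup (not real data).
html = (
    "<table><tr><th>id</th><th>name</th></tr>"
    "<tr><td>1</td><td>Ada</td></tr></table>"
)
tables = extract_tables(html)
print(tables)
```

In this sketch the first row of each table holds the header cells, which could serve as attribute names when the table is turned into an example set.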
An extension for structured web data coming from news, blogs, discussion and reviews sites.
yassos ("yet another simple, stupid object storage") is a standalone object storage: Manage any file type by uploading, downloading and versioning it easily. To make your life even easier, organize your files in a folder-like structure.