Rosette Text AnalyticsRosette Text Analytics

Deepen your insight with Rosette Text Analytics for RapidMiner Studio by Basis Technology. Rosette enables users to quickly and comprehensively process documents, social media, emails, name lists, and other unstructured data in over 55 Asian, European, and Middle Eastern languages. Better understand your content and customers without leaving the RapidMiner platform.

Rosette uses natural language processing (NLP), statistical analysis, and machine learning to power advanced text analysis and names processing. Leveraging Basis Technology’s twenty-plus years of industry experience, Rosette offers:

Document Analysis

Entity Extraction… locate people, places, organizations, and 15 other entity types in your text

Entity Linking… connect the entities in your text across documents and into the real world

Categorization… classify documents based on the IAB QAG taxonomy

Sentiment Analysis… measure response feedback — positive, negative, or neutral at either the document or the entity level

Transliteration... transliterate Arabic text written in Arabizi or Romanized Arabic to native Arabic script and vice versa

Base Linguistics

Morphological Analysis… identify the linguistic elements of text, including parts of speech, lemmas (root forms), subtokens, and Han readings

Tokenization… segment text into its component words

Sentence Tagging… locate sentence boundaries in noisy text

Name Analysis

Name Translation… comprehensive, accurate multilingual name translation

Name Matching…  fuzzy, cross-lingual matching and resolution of people, place, and organization names

Name Deduplication… group potential name duplicates into clusters

Read more about all these features at


Signing Up

In order to use the Rosette RapidMiner extension, you will need an API key. You can signup for a free 30-day trial today without a credit card. After that, we offer a number of paid plans.

Get your Rosette API key today

Once you have your key, enter it as a Rosette Connection in RapidMiner Studio.

Check out the quick-start guide on our blog and our interactive documentation to get started. 


Additional Information

Learn more about Rosette for RapidMiner on our website. If you need help, contact support here.

Rosette Text Analytics is also available as an on-premise solution. Get in touch with us for more information.



Version 1.11.0

  • Icons have been updated.
  • Supported language lists have been updated.

Version 1.10.1

  • Translate Names operator now supports Greek to English.


Version 1.10.0

  • Analyze Sentiment and Entity Sentiment operators now support Persian.
  • Morphology operator now supports Catalan, Estonian, Serbian, and Slovak.
  • Match Names operator now supports Greek.
  • Bug fix: Morphology operator correctly reads user-specified Source Language parameter. 


Version 1.8.1

  • Bug fix: UI fix to supported languages drop-down menu for Entity Sentiment


Version 1.8.0

  • New Transliterate operator transforms Arabic text between Arabic chat (Arabizi) and Arabic script.
  • New Deduplicate Names operator returns a "cluster ID" value for each name in a given list of names. You can then sort on "cluster ID" to group together potentially duplicate names. 
  • Extract Entities operator adds an entity linking feature to link extracted entities to their QID in Wikidata.
  • Support for Japanese and French has been added to detect sentiment (negative, neutral, positive) for a whole document or for an entity in a document.
  • Bug fix: Correctly reads value of concurrency count returned by the Rosette API.


Product Details

Version 1.11.0
File size 7.0 MB
Downloads 52940 (25 Today)52940 downloads
Vendor Basis Technology
Category Domain specific operators
Released 9/5/18
Last Update 9/5/18 2:04 PM
License Vendor License
Product web site
Rating 5.0 stars(1)