Orange3 TextMining add-on.

Reason this release was yanked: does not fix Tweepy

Orange3 Text

Orange add-on for text mining. It provides access to publicly available data sources such as NY Times, Twitter, and PubMed. It also provides tools for preprocessing, for constructing vector spaces (bag-of-words, topic modeling, and word2vec), and for visualizations such as word cloud and geo map. All features can be combined with the data mining techniques of the Orange data mining framework.

See documentation.

Features

Access to data

  • Load a corpus of text documents
  • Access publicly available data (The Guardian, NY Times, Twitter, Wikipedia, PubMed)

Text analysis

  • Preprocess corpus
  • Generate bag of words
  • Embed documents into vector space
  • Perform sentiment analysis
  • Detect emotions in tweets
  • Discover topics in the text
  • Compute document statistics
  • Visualize frequent words in the word cloud
  • Find words that enrich selected documents
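
To make the "Generate bag of words" feature concrete, here is a minimal sketch of the underlying idea in plain Python. It does not use the add-on's own API: the function names `tokenize` and `bag_of_words` and the sample documents are hypothetical, chosen only for illustration, and the add-on's actual preprocessing and weighting options are likely to differ.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into simple word tokens (illustrative only)."""
    return re.findall(r"[a-z0-9']+", text.lower())


def bag_of_words(documents):
    """Return (vocabulary, matrix), where matrix[i][j] is the count of
    vocabulary[j] in documents[i]."""
    counts = [Counter(tokenize(doc)) for doc in documents]
    # The vocabulary is the sorted set of every token seen in any document.
    vocabulary = sorted(set().union(*counts))
    # A Counter returns 0 for terms that are absent from a document.
    matrix = [[c[term] for term in vocabulary] for c in counts]
    return vocabulary, matrix


# Hypothetical sample corpus.
docs = ["Orange is a data mining framework.", "Text mining with Orange."]
vocabulary, matrix = bag_of_words(docs)
```

Each row of `matrix` is one document expressed as term counts over a shared vocabulary, which is the kind of vector-space representation that downstream data mining techniques can consume.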
