CRAN
tm.plugin.webmining 1.3
Retrieve Structured, Textual Data from Various Web Sources
Released May 11, 2015 by Mario Annau
Dependencies
NLP 0.1-11 RCurl boilerpipeR 1.3 XML 3.98-1.9 tm 0.7-1 RJSONIO 1.3-0
Facilitate text retrieval from feed formats like XML (RSS, ATOM) and JSON. Also direct retrieval from HTML is supported. As most (news) feeds only incorporate small fractions of the original text tm.plugin.webmining even retrieves and extracts the text of the original text source.
Installation
Maven
This package can be included as a dependency from a Java or Scala project by including
the following your project's pom.xml
file.
Read more
about embedding Renjin in JVM-based projects.
<dependencies> <dependency> <groupId>org.renjin.cran</groupId> <artifactId>tm.plugin.webmining</artifactId> <version>1.3-b39</version> </dependency> </dependencies> <repositories> <repository> <id>bedatadriven</id> <name>bedatadriven public repo</name> <url>https://nexus.bedatadriven.com/content/groups/public/</url> </repository> </repositories>
Renjin CLI
If you're using Renjin from the command line, you load this library by invoking:
library('org.renjin.cran:tm.plugin.webmining')
Test Results
This package was last tested against Renjin 0.8.2523 on Nov 12, 2017.