Embedded Scientific Computing

Data-Driven Business and Open Science

Using OpenCPU for integrating scientific computing into the next generation of systems and applications.

Methods for scientific computing are traditionally implemented in specialised software packages such as R or Stata. However, many users and organisations wish to integrate statistical computing into third-party software. And so, rather than being run in a specialised statistical environment, methods to analyse and visualise data get incorporated into pipelines, web applications and big data infrastructures.

OpenCPU is a software system for embedded statistical computation and reproducible research. The server exposes a web API that interfaces with R, LaTeX and Pandoc. This API is used, for example, to integrate statistical functionality into systems, to share and execute scripts or reports on centralized servers, and to build R-based “apps”.

An OpenCPU app is an R package that includes one or more web pages which call the R functions in the package through the OpenCPU API. This provides a convenient way to develop, package and ship portable, standalone R web applications.
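As a rough illustration of what calling R through the web API can look like, here is a minimal sketch in R. It assumes the usual OpenCPU path layout `/ocpu/library/<package>/R/<function>/json` and that the `httr` and `jsonlite` packages are installed. The helper names `ocpu_url` and `ocpu_call` and the server address in the example are hypothetical, not part of OpenCPU itself.

```r
# Build the URL for calling an R function through the OpenCPU HTTP API.
# `server` is the base address of whichever OpenCPU server you use.
ocpu_url <- function(server, pkg, fun, format = "json") {
  # Strip trailing slashes so the path is joined cleanly.
  paste0(sub("/+$", "", server), "/ocpu/library/", pkg, "/R/", fun, "/", format)
}

# Call a function remotely; `args` is a named list of function arguments.
# Needs the httr and jsonlite packages (assumed to be installed).
ocpu_call <- function(server, pkg, fun, args) {
  resp <- httr::POST(ocpu_url(server, pkg, fun), body = args, encode = "json")
  httr::stop_for_status(resp)  # raise an R error on HTTP failure
  jsonlite::fromJSON(httr::content(resp, as = "text", encoding = "UTF-8"))
}

# Example (not run): draw three normal deviates on a hypothetical server.
# ocpu_call("https://ocpu.example.org", "stats", "rnorm", list(n = 3))
```

The same request could be sent from any client that speaks HTTP, which is what makes the API suitable for embedding R in non-R systems.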

Research

  • focuses on domain-specific challenges related to integrating scientific computing into the next generation of systems and applications.


Automatic Recognition of Product Mentions in Text Corpora

Kaggle Competition

Identify product mentions within a largely user-generated web-based corpus and disambiguate the mentions against a large product catalog.

Challenge

  • to automatically identify all mentions of consumer products in a largely user-generated collection of web content, and to correctly identify the product(s) that each product mention refers to from a large catalog of products.

Dataset

  • hundreds of thousands of text items, a product catalog with over fifteen million products, and hundreds of manually annotated product mentions supporting data-driven approaches.

Evaluation

 Winners:

1st: Zhanpeng Fang

2nd: Olexandr Topchylo

How to install the ‘RWordPress’ package in R

Scripts & Statistics

The RWordPress package is a very convenient tool for publishing blog posts from R to WordPress. In his blog post “Publish blog posts from R + knitr to WordPress”, Yihui Xie explains how to install and use the package. The blog post “How to publish with R Markdown in WordPress” gives some additional information on how to use it.

However, the package repository http://www.omegahat.org/R no longer seems to exist (as of 2016-04-04).

Fortunately, the RWordPress package is also available from GitHub and can therefore be installed easily using the devtools package.

Since RWordPress depends on the packages RCurl, XML, and XMLRPC, these packages need to be installed before we can actually install RWordPress.

Unlike RCurl and XML, the XMLRPC package is not available from the CRAN repository. Instead, it is available from GitHub.

Here is the code to install all required packages:
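The steps described above can be sketched as follows. The text does not name the GitHub repositories, so `duncantl/XMLRPC` and `duncantl/RWordPress` are assumptions here; check them before running the code.

```r
# RCurl and XML are on CRAN; devtools is needed to install from GitHub.
install.packages(c("devtools", "RCurl", "XML"))

# XMLRPC and RWordPress are not on CRAN. The repository names below are an
# assumption (they are not given in the text above); adjust them if needed.
devtools::install_github("duncantl/XMLRPC")      # dependency of RWordPress
devtools::install_github("duncantl/RWordPress")  # install last
```

The order matters: XMLRPC is installed before RWordPress because RWordPress depends on it.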
