textTinyR 1.0.4

Text Processing for Small or Big Data Files

Homepage: https://github.com/mlampros/textTinyR

Platform: CRAN

Language: R

License: GPL-3.0

View on registry: https://cran.r-project.org/package=textTinyR

Documentation: http://cran.r-project.org/web/packages/textTinyR/textTinyR.pdf

Direct download link: https://cran.r-project.org/src/contrib/textTinyR_1.0.4.tar.gz

CRAN_Status_Badge Travis-CI Build Status codecov.io AppVeyor build status Downloads


The textTinyR package consists of text pre-processing functions for small or big data files. More details on the functionality of the textTinyR can be found in the blog-post and in the package Vignette. The R package can be installed, in the following OS's: Linux, Mac and Windows. However, there are some limitations :

  • there is no support for chinese, japanese, korean, thai or languages with ambiguous word boundaries.
  • there is no support functions for utf-locale on windows, meaning only english character strings or files can be input and pre-processed.

System Requirements ( for unix OS's )


sudo apt-get install libboost-all-dev

sudo apt-get update

sudo apt-get install libboost-locale-dev


yum install boost-devel

Macintosh OSX/brew

The boost library will be installed on the Macintosh OSx using the Homebrew package manager,

If the boost library is already installed using brew install boost then it must be removed using the following command,

brew uninstall boost

Then the formula for the boost library should be modified using a text editor (TextEdit, TextMate, etc). The formula on a Macintosh OS Sierra is saved in:


The user should open the boost.rb formula and replace the following code chunk beginning from (approx.) line 71,

# layout should be synchronized with boost-python
args = ["--prefix=#{prefix}",

if build.with? "single"
  args << "threading=multi,single"
  args << "threading=multi"

with the following code chunk,

# layout should be synchronized with boost-python
args = ["--prefix=#{prefix}",

#if build.with? "single"
#  args << "threading=multi,single"
#  args << "threading=multi"

Then the user should save the changes, close the file and run,

brew update

to apply the changes.

Then he/she should open a new terminal (console) and type the following command, which installs the boost library using the modified formula from source, (warning: there are two dashes before : build-from-source)

brew install /usr/local/Homebrew/Library/Taps/homebrew/homebrew-core/Formula/boost.rb --build-from-source

That's it.

Installation of the textTinyR package (CRAN, Github)

To install the package from CRAN use,

install.packages('textTinyR', clean = TRUE)

and to download the latest version from Github use the install_github function of the devtools package,

devtools::install_github(repo = 'mlampros/textTinyR', clean = TRUE)

Use the following link to report bugs/issues,




1.0.4 March 28, 2017
1.0.3 January 29, 2017
1.0.2 January 20, 2017
1.0.1 January 11, 2017
1.0.0 January 07, 2017

Project Statistics

SourceRank 9
Dependencies 0
Dependent projects 0
Dependent repositories 0
Total releases 5
Latest release
First release
Stars 10
Forks 4
Watchers 6
Contributors 1
Repo Size: 465 KB

Top Contributors See all

Lampros Mouselimis

Something wrong with this page? Make a suggestion

Export .ABOUT file for this library

Last synced: 2017-03-28 10:31:26 UTC

Login to resync this project