text-ppf

Text pre-processing function for NLP


Keywords
nlp, preprocessing, python, tensorflow
License
MIT
Install
pip install text-ppf==1.0.0

Documentation

TextPreProcessing

Python package for preprocessing text for NLP models

INSTALLATION

Download PyPi Package , or pip install text-ppf

WHAT IT DOES

An all-in-one function which,

  • Removes punctuation
  • Splits the text into a list of words
  • Removes stopwords
  • Makes each word lowercase
  • Lemmatizes each word

HOW TO USE:

text_ppf

import pandas as pd
from text_ppf import text_ppf

#LOADING DATASET
df = pd.read_csv('<filename.csv>')
# print(df.head)

dfNew = df['heading'].apply(text_ppf.clean_up)
print(dfNew)