normality
Release 2.5.0

Micro-library to normalize text strings

Keywords: text, unicode, normalization, slugs, normalizer, unicode-characters
License: MIT
Install: pip install normality==2.5.0

Documentation

normality text cleanup

Normality is a Python micro-package containing a small set of reusable text normalization functions. Each function accepts a snippet of Unicode or UTF-8 encoded text and removes various classes of characters, such as diacritics and punctuation. This is useful as a preparation step for further text analysis.

WARNING: This library works much better when used in combination with pyicu, a Python binding for the International Components for Unicode C library. ICU provides much better text transliteration than the default text-unidecode.
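
Because transliteration quality depends on whether pyicu is installed, it can help to check for it before processing text. The snippet below is a minimal sketch using only the standard library; the helper name icu_available is hypothetical and not part of normality. It relies on the fact that pyicu is imported under the module name icu.

```python
import importlib.util


def icu_available() -> bool:
    """Return True if a module named `icu` (pyicu's import name) can be found.

    Hypothetical helper for illustration; not part of normality.
    """
    return importlib.util.find_spec("icu") is not None


if __name__ == "__main__":
    if icu_available():
        print("pyicu is importable")
    else:
        print("pyicu is not importable; consider installing it")
```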

Example

# coding: utf-8
from normality import normalize, slugify, collapse_spaces

text = normalize('Nie wieder "Grüne Süppchen" kochen!')
assert text == 'nie wieder grune suppchen kochen'

slug = slugify('My first blog post!')
assert slug == 'my-first-blog-post'

text = 'this \n\n\r\nhas\tlots of \nodd spacing.'
assert collapse_spaces(text) == 'this has lots of odd spacing.'
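
To make the steps in the example more concrete, here is a rough standard-library sketch of the kind of processing described above: removing diacritics, replacing punctuation, lowercasing, and collapsing whitespace. It is not normality's implementation, and the function names (strip_diacritics, sketch_normalize, sketch_slugify) are made up for illustration. In particular, it does no transliteration of non-Latin scripts.

```python
import unicodedata


def strip_diacritics(text: str) -> str:
    """Decompose characters (NFKD), then drop combining marks (category Mn)."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if unicodedata.category(ch) != "Mn")


def sketch_normalize(text: str) -> str:
    """Lowercase, strip diacritics, drop punctuation, collapse whitespace."""
    text = strip_diacritics(text).lower()
    # Keep letters (L*) and numbers (N*); turn everything else into a space.
    kept = [ch if unicodedata.category(ch)[0] in ("L", "N") else " " for ch in text]
    # str.split() with no arguments splits on any run of whitespace.
    return " ".join("".join(kept).split())


def sketch_slugify(text: str, sep: str = "-") -> str:
    """Join the normalized words with a separator."""
    return sep.join(sketch_normalize(text).split())


if __name__ == "__main__":
    print(sketch_normalize('Nie wieder "Grüne Süppchen" kochen!'))
    print(sketch_slugify("My first blog post!"))
```

Unlike collapse_spaces in the example above, sketch_normalize also removes punctuation, so a trailing full stop would be dropped.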

License

normality is open source, licensed under a standard MIT license (included in this repository as LICENSE).

Dependencies: 4
Dependent packages: 29
Dependent repositories: 224
Total releases: 51
Latest release: Oct 7, 2023
First release: Jan 24, 2015
Stars: 131
Forks: 19
Watchers: 6
Contributors: 9
Repository size: 96.7 KB
SourceRank: 14

Releases

2.5.0: Oct 7, 2023
2.4.0: Jul 29, 2022
2.3.3: Apr 11, 2022
2.3.2: Mar 24, 2022
2.3.1: Mar 10, 2022
2.3.0: Mar 10, 2022
2.2.5: Nov 6, 2021
2.2.4: Nov 5, 2021
2.2.3: Aug 9, 2021
2.2.2: May 6, 2021

Last synced: 2023-12-02 01:19:16 UTC