pattern-clustering

This tool clusterizes lines of text given a collection of input patterns modeled using regular expressions.


Keywords
pattern_clustering, clustering-algorithm, dynamic-programming, language-theory, pattern-matching
License
BSD-3-Clause
Install
pip install pattern-clustering==1.0.0

Documentation

Pattern clustering

PyPI Status Build Status Documentation Status Readthedocs Status Code Coverage

This tool clusterizes lines of text given a collection of input patterns modeled using regular expressions.

This work has been published to:

[ICPR'2022] A novel pattern-based edit distance for automatic log parsing, Maxime Raynal, Marc-Olivier Buob, Georges Quénot.

Features

  • Forms groups of homogeneous line using a pattern based distance, based on customizable patterns.
  • Configured by default to use common patterns (IP addresses, numeric values, etc.)

License

This project is licensed under the BSD-3-Clause license - see the LICENSE.

More about pattern-clustering

For more information, feel free to visit the wiki:

Acks