ssb-klass-python

A Python package built on top of KLASS's API for retrieving classifications, codes, correspondences etc.


Keywords
ssb-project
License
MIT
Install
pip install ssb-klass-python==0.0.3

Documentation

ssb-klass-python / KLASS

PyPI Status Python Version License

Documentation Tests Coverage Quality Gate Status

pre-commit Black Ruff Poetry

A Python package built on top of Statistics Norway's code- and classification-system "KLASS".
The package aims to make Klass's API for retrieving data easier to use by re-representing Klass's internal hierarchy as python-classes. Containing methods for easier traversal down, search classes and widgets, reasonable defaults to parameters etc. Where data is possible to fit into pandas DataFrames, this will be preferred, but hiererachical data / objects containing metadata will be kept as json / dict structure.

Installing

The package is available on Pypi, and can be installed by for example poetry like this:

poetry add ssb-klass-python

Example usages

Getting started

from klass import search_classification
# Opens a ipywidget in notebooks for searching for classifications and copying code, to get started
search_classification(no_dupes=True)

The GUI available through the search_classification function

Getting a classification directly

from klass import get_classification # Import the utility-function
nus = get_classification(36)
# Does the same as the code above, but does not shy away from using the class directly
from klass import KlassClassification # Import the class for KlassClassifications
nus = KlassClassification(36)  # Use ID for classification

When you have the classification stored in an object, you can "dig into" the API from there.

codes = nus.get_codes() # codes from current date
print(codes)
codes.data  # Pandas dataframe available under the .data attribute

From searching through "families", down to a specific codelist

from klass import KlassSearchFamilies
search = KlassSearchFamilies(360)
print(search)
>>> "Family ID: 20 - Utdanning - Number of classifications: 5"
utdanning = search.get_family(20)
print(utdanning)
>>> "The Klass Family "Utdanning" has id 20."
>>> "And contains the following classifications:"
>>>  "36: Standard for utdanningsgruppering (NUS)"
nus = utdanning.get_classification(36)
print(nus)
>>> "Classification 36: Standard for utdanningsgruppering (NUS)..."
nus_codes = nus.get_codes("2023-01-01")
print(nus_codes)
>>> "Codelist for classification: 36"
>>> " From date: 2023-01-01"
nus_codes.data  # A pandas dataframe

For more examples check out the demo-notebooks in the demo/ folder in the repo.

Technical notes

Documentation for the endpoints we are using can be found on Statistics Norways pages.

Technical architecture of the API we are interacting with is detailed in Statistics Norway's internal wiki.

This project has been migrated to follow the SSB PyPI Template from Statistics Norway.