Watch our latest webinar to understand the difference between data from Libraries.io and the Tidelift Subscription.

totokenizers
Release 1.4.0

Text tokenizers.

Homepage PyPI Python

Install: pip install totokenizers==1.4.0

Documentation

totokenizers

A model-agnostic library to encode text into tokens and couting them using different tokenizers.

install

pip install totokenizers

usage

from totokenizers.factories import TotoModelInfo, Totokenizer

model = "openai/gpt-3.5-turbo-0613"
desired_max_tokens = 250
tokenizer = Totokenizer.from_model(model)
model_info = TotoModelInfo.from_model(model)

thread_length = tokenizer.count_chatml_tokens(thread, functions)
if thread_length + desired_max_tokens > model_info.max_tokens:
    raise YourException(thread_length, desired_max_tokens, model_info.max_tokens)

Dependencies: 3
Dependent packages: 1
Dependent repositories: 1
Total releases: 24
Latest release: 25 days ago
First release: Aug 10, 2023
Stars: 0
Forks: 0
Watchers: 2
Contributors: 7
Repository size: 7.27 MB
SourceRank: 8