speech-features-kit

This is a kit for simple speech feature extraction.


Keywords
speech, feature, extraction, machine, learning, emotion-analysis, mfcc-features, speech-emotion-recognition, speech-feature, volume-analysis
License
MIT
Install
pip install speech-features-kit==1.0.0.dev1

Documentation

Speech Feature Kit

A Python wrapper for convenient speech feature extraction

Installation

pip install speech-features-kit

Functions

  1. MFCC feature analysis
  2. Volume analysis
  3. Emotion analysis

Example of emotion analysis

from speech_features_kit.Emotion.speech_toolkit import SpeechEmotionToolkit

# set the path of pre-trained model for speech emotion model
# the used model here is optimized for Chinese speech; however, it is possible you can train your own model. 
speech_kit = SpeechEmotionToolkit()

# load the model
speech_kit.load()

# obtain emotion list with timestamp given an audio file
list_emo, list_timestamp = speech_kit.get_emotion_list_by_blocks(audio_file="../data/english.wav",
                                                                     num_sec_each_file=1)

# print the list of emotion over timestamp
print("Time interval\tEmotion")
for idx, e in enumerate(list_emo):
    print(list_timestamp[idx], "\t", e)

Note

Other functions please see the examples folder!