klingon-file-manager

A Python module for managing files on both local and AWS S3 storage.


License
MIT
Install
pip install klingon-file-manager==0.0.36

Documentation

Klingon File Manager

Introduction

Klingon File Manager is a Python module designed for managing files on both local and AWS S3 storage. It provides a unified interface for file operations such as 'get', 'post', 'delete', 'move', and additional utility functions.

Installation

Run the following command to install the package:

pip install klingon-file-manager

The module looks for the following environment variables:

  • AWS_ACCESS_KEY_ID
  • AWS_SECRET_ACCESS_KEY

Features

  • Supports both local and AWS S3 storage.
  • Single function interface (manage_file) to handle 'get', 'post', 'delete', and 'move' operations.
  • Enhanced debugging support
  • AWS credentials are fetched using the get_aws_credentials function
  • File operations are performed using the read_file, write_file, and delete_file functions
  • The manage_file function returns a dictionary containing the result of the file operation with the following schema:
{
    'action': str,                      # Action performed ('get', 'post', 'delete', or 'move')
    'path': str,                        # Path for the file operation
    'content': Union[str, bytes, None], # File content for 'get' and 'post' actions
    'content_size_mb': float,           # Size of the content in megabytes
    'binary': bool,                     # Flag indicating if the content is binary
    'md5': Optional[str],               # md5 hash of file content for 'get', 'post', and 'move' actions
    'status': int,                      # HTTP-like status code (e.g., 200 for success, 500 for failure)
    'debug': Optional[Dict[str, str]]   # Debug information (only included if 'debug' flag is True)
}
  • Utility functions such as is_binary_file, get_md5_hash, and check_file_exists to assist with file management tasks.
  • Internal functions _get_from_s3, _get_from_local, _post_to_s3, and _post_to_local for handling S3 and local file operations.

Usage Examples

Using manage_file function

Here's a basic example to get you started:

GET example

GET is the same as reading/downloading a file either locally or on S3.

from klingon_file_manager import manage_file

result = manage_file(action='get', path='path/to/local/file.txt')

print(result)

When the 'get' action is used with the manage_file function, the output is a dictionary (which can be converted to a JSON object) with the following schema:

{
    "status": "integer",
    "action": "string",
    "path": "string",
    "content": "string or bytes or null",
    "content_size_mb": "float or null",
    "binary": "boolean or null",
    "md5": "string or null",
    "debug": "object or null"
}
  • The move_file function allows moving files between local and S3 storage, ensuring data integrity with MD5 checksums.

Here is a description of each field:

  • status: An integer representing the status of the operation. A status of 200 indicates success, while a status of 500 indicates an error.
  • action: A string representing the action performed. In this case, it will be 'get'.
  • path: A string representing the path of the file that was read.
  • content: A string or bytes representing the content of the file that was read, or null if the file could not be read.
  • content_size_mb: A float representing the size of the content in megabytes, or null if the file could not be read.
  • binary: A boolean indicating whether the file is binary (true) or text (false), or null if the file could not be read.
  • debug: An object containing debug information, or null if debugging is not enabled.

POST example

POST is the same as saving/uploading a file either locally or on S3.

from klingon_file_manager import manage_file

# POST a file to S3
result = manage_file(action='post', path='s3://your-bucket/your-key', content='Your content here')

print(result)

When the 'post' action is used with the manage_file function, the output is a dictionary (which can be converted to a JSON object) with the following schema:

{
    "status": "integer",
    "action": "string",
    "path": "string",
    "content": "string or bytes or null",
    "content_size_mb": "float or null",
    "binary": "boolean or null",
    "md5": "string or null",
    "debug": "object or null"
}

The move_file function allows moving files between local and S3 storage, ensuring data integrity with MD5 checksums. Here is a description of each field:

  • status: An integer representing the status of the operation. A status of 200 indicates success, while a status of 500 indicates an error.
  • action: A string representing the action performed. In this case, it will be 'post'.
  • path: A string representing the path of the file that was written.
  • content: A string or bytes representing the content that was written to the file, or null if the file could not be written.
  • content_size_mb: A float representing the size of the content in megabytes, or null if the file could not be written.
  • binary: A boolean indicating whether the file is binary (true) or text (false), or null if the file could not be written.
  • debug: An object containing debug information, or null if debugging is not enabled.

DELETE example

DELETE allows you to delete files either locally or stored on S3.

from klingon_file_manager import manage_file

# To delete a file from local storage
result = manage_file(action='delete', path='path/to/local/file.txt')

print(result)

When the 'delete' action is used with the manage_file function, the output is a dictionary (which can be converted to a JSON object) with the following schema:

{
    "status": "integer",
    "action": "string",
    "path": "string",
    "md5": "string or null",
    "debug": "object or null"
}
  • The move_file function allows moving files between local and S3 storage, ensuring data integrity with MD5 checksums.

Here is a description of each field:

  • status: An integer representing the status of the operation. A status of 200 indicates success, while a status of 500 indicates an error.
  • action: A string representing the action performed. In this case, it will be 'delete'.
  • path: A string representing the path of the file that was deleted.
  • debug: An object containing debug information, or null if debugging is not enabled.

post_file function

The post_file function in klingon_file_manager/post.py is used to write content to a file at a given path, which can be either a local file or an S3 object.

Arguments

  • path (str): The path where the file should be written. Can be a local path or an S3 URI (e.g., 's3://bucket/key').
  • content (Union[str, bytes]): The content to write to the file.
  • md5 (Optional[str]): The MD5 hash of the content, used to verify the integrity of the data. Defaults to None.
  • metadata (Optional[Dict[str, str]]): Additional metadata to include with the file. Defaults to None.
  • debug (bool): Flag to enable debugging. Defaults to False.

Returns

A dictionary containing the status of the write operation with the following schema:

{
    "status": int,                      # HTTP-like status code (e.g., 200 for success, 500 for failure)
    "message": str,                     # Message describing the outcome
    "md5": Optional[str],               # The MD5 hash of the file content (only included if status is 200)
    "debug": Optional[Dict[str, str]]   # Debug information (only included if 'debug' flag is True)
}
  • The delete_file function enables deletion of files from local and S3 storage with simple error handling.

Usage Example

from klingon_file_manager.post import post_file

result = post_file('path/to/local/file.txt', 'Hello, World!')

print(result)

Running Tests

To run tests, execute the following command:

make test

Contribution Guidelines

If you wish to contribute to this project, please submit a pull request.