ZeroMq based multiprocessing framework.


License
BSD-3-Clause
Install
pip install pyzmp==0.0.10

Documentation

PyZMP

docs Documentation Status
tests Travis-CI Build Status Requirements Status Code Quality Status
Python PyPI Package latest release PyPI Package monthly downloads PyPI Wheel Supported versions Supported implementations
ROS Indigo ROS Indigo Build Jade ROS Jade Build Kinetic ROS Kinetic Build

PyZMP is a multiprocess library based on ZeroMQ.

The aim is to make experimenting with multiprocess and distributed architecture more solid and overall easier. If at all possible, the goal is to arrive at a minimal set of concepts, that makes solid and efficient distributed system easy to build.

Distributed systems models, as per wikipedia https://en.wikipedia.org/wiki/Distributed_computing#Models, can be classified as:

  • Parallel algorithms in shared-memory model: This seems applicable in distributed software using a consensus algorithm as the shared memory.
  • Parallel algorithms in message-passing model: This seems applicable in distributed software relying mostly on dataflow architecture, where the implementor can decide on the network structure
  • Distributed algorithms in message-passing model: This seems to be the most widely used currently, (web backend model, relying on services available from multiple places for example)

We will focus on the latter first, while keeping in mind it is likely just a special case of the second (network cannot be controlled, algorithm on each node has to be the same). A good exercise here is how to keep a representation of the distribution coherent on each node, despite potential network partition that cna occur.

Doing an Analysis on existing distributed software architecture is likely a very broad task, but we can focus on just a few here, at least as a first step. These should be enough to implement any of the distributed systems models cited above:

with different views for the user, more or less transparently : - make a request / send a task and (asynchronously or not) wait the response/result - receive dataflow from somewhere and send dataflow to somewhere else

Additionally, an interesting endeavour could be to see how https://en.wikipedia.org/wiki/Control_theory applies to such distributed systems (message passing <=> charge transfer).

Note : This is currently a personal perspective that likely require more thorough analysis, so feel free to send a Pull Request.

Repository structure

This repository has a few main branches:

  • master : main branch, python dev workflow, releasing version tags into a pip package.

Apart from these we follow a feature branching workflow

How to use

Install ` pip install pyzmp `

Run self tests ` pyzmp `

How to develop

Clone this repository ` git clone http://github.com/asmodehn/pyzmp `

Create you virtualenv to workon using virtualenvwrapper ` mkvirtualenv pyzmpenv `

Install all dependencies via dev-requirements ` pip install -r dev-requirements.txt `

Run self tests ` pyzmp `

Run all tests (with all possible configurations) with tox ` tox `

Note : Tox envs are recreated every time to ensure consistency. So it s better to develop while in a non-tox-managed venv.

Tutorials and examples

A good example showing use of pyzmp, simple RPC client/server example

tutorials implementing multi node communication (under development)

Roadmap

Distributed software means software being executed in different "nodes" and collaboration via communication through different "channels":

  • Node : A code executing entity. Can be a process, a thread, or a group of nodes communicating together.
  • Channels : A way to make two or more node communicate in a way that allow them to collaborate.

This will allow us to structure our software in a network graph. PYZMP aims to be the foundation on which such a network graph can be easily, and confidently, built and used.

Implementation Priorities :

  1. Local multiprocess first (we force data partition without forcing connections/sockets management)
  2. Multiple concurrency implementation (Thread (all kinds), entity-component as a monothread implementation)
  3. Remote concurrency (managing remote connections)

Type of Distributed Architecture that can be built with pyzmp:

  1. Service(RPC) based architecture http://zguide.zeromq.org/page:all#Ask-and-Ye-Shall-Receive :
  • It s a well proven way of architecture a distributed software, since it is the prevalent model used in the web architecture (REST, HTTP, RPC, etc.)
  • There are some constraints in the way this must be implemented to work.
  • There are more constraints if we want to implement it in a way that is easy to use.
  1. DataFlow based architecture http://zguide.zeromq.org/page:all#Getting-the-Message-Out :
  • It s a quite heavily used distributed architecture (topics, XMPP, ROS, etc.)
  • There are some constraints in the way this must be implemented to work.
  • There are more constraints if we want to implement it in a way that is easy to use.
  • It is theoretically more complex to grasp than the service based architecture, therefore will be dealt with at a later time.
  1. TBD : depending on analysis of existing system and what can be necessary to existing architecture, we will see what comes up.

Constraints:

  • we want to be able to control where is executed what (no full transparency of the distribution)
  • we want to create a solid platform on which other distributed algorithms can be implemented
  • usual distributed algorithms ( cache, proxy, feedback ) should be super easy to implement, and will eventually be provided here as examples, or part of a larger "toolbox".
  • We should minimize our software complexity on order to build a stable and easily maintainable system. A consensus algorithm (raft) would be very useful to implement distributed algorithms, but should be built outside of pyzmp. However pyzmp might need it to be able to function properly...