pycubexr

pyCubexR is a Python package for reading the Cube4 file format.


Keywords
performance-analysis, performance-measurement, performance-reports
License
BSD-3-Clause
Install
pip install pycubexr==2.0.0a0

Documentation

pyCubexR

PyPI - Python Version GitHub release (latest by date) PyPI version PyPI - License GitHub issues GitHub pull requests GitHub Workflow Status

pyCubexR is a Python package for reading the Cube4 (.cubex) file format. Cube is used as a performance report explorer for Scalasca and Score-P. It is used as a generic tool for displaying a multi-dimensional performance space consisting of the dimensions (i) performance metric, (ii) call path, and (iii) system resource. Each dimension can be represented as a tree, where non-leaf nodes of the tree can be collapsed or expanded to achieve the desired level of granularity. The Cube4 (.cubex) data format is provided for Cube files produced with the Score-P performance instrumentation and measurement infrastructure or the Scalasca version 2.x trace analyzer (and other compatible tools).

For additional information about the Cube file format and related software, see the pyCubexR report.

For questions regarding pyCubexR please send a message to extra-p-support@lists.parallel.informatik.tu-darmstadt.de.

Installation

To install the current release, which includes support for Ubuntu and Windows:

$ pip install pycubexr

To update pyCubexR to the latest version, add --upgrade flag to the above commands.

Usage

The following code provides a minimal example that shows how pyCubexR can be used to read all metrics, callpaths, and measurement values of a .cubex file:

from pycubexr import CubexParser

cubex_file_path = "some/profile.cubex"
with CubexParser(cubex_file_path) as cubex:

    # iterate over all metrics in cubex file
    for metric in cubex.get_metrics():
        metric_values = cubex.get_metric_values(metric=metric)
        
        # return the name of the current metric
        metric_name = metric.name
        
        # iterate over all callpaths in cubex file
        for callpath_id in range(len(metric_values.cnode_indices)):
            cnode = cubex.get_cnode(metric_values.cnode_indices[callpath_id])
            
            # return the current region i.e. callpath
            region = cubex.get_region(cnode)
            
            # return the name of the current region
            region_name = region.name
            
            # return the measurement values for all mpi processes for the current metric and callpath
            cnode_values = metric_values.cnode_values(cnode)

Not all .cubex files must contain measurement values for all metrics for each callpath. This is especially true for MPI functions such as MPI_Waitall. In some cases, metrics can be missing. Use the MissingMetricError to deal with these exceptions.

from pycubexr import CubexParser
from pycubexr.utils.exceptions import MissingMetricError

cubex_file_path = "some/profile.cubex"
with CubexParser(cubex_file_path) as cubex:

    for metric in cubex.get_metrics():
    
        try:
            metric_values = cubex.get_metric_values(metric=metric)
            
            for callpath_id in range(len(metric_values.cnode_indices)):
                
                cnode = cubex.get_cnode(metric_values.cnode_indices[callpath_id])
                
                # return only a specific number of measurement values for the current metric and callpath
                cnode_values = metric_values.cnode_values(cnode)[:5]
                
                region = cubex.get_region(cnode)
                
                # print the data read from the file
                print('\t' + '-' * 100)
                print(f'\tRegion: {region.name}\n\tMetric: {metric.name}\n\tMetricValues: {cnode_values})')
                
         except MissingMetricError as e:
            # Ignore missing metrics
            pass

The call tree of a .cubex file can be displayed with the following code:

from pycubexr import CubexParser

cubex_file_path = "some/profile.cubex"
with CubexParser(cubex_file_path) as cubex:

    # print the call tree of the .cubex file
    cubex.print_calltree() 

In special cases, it is also possible that a .cubex file is missing measurement values for some of the callpaths of a metric or that a .cubex file of the same application contains fewer callpaths than another file. These cases need to be handled externally and are not supported by pyCubexR.

License

BSD 3-Clause "New" or "Revised" License