GO-Stats
Compute statistics and changes for both the GO ontology and annotations at every release.
The following details each folder of this repository:
libraries
go-stats
This is the python package that is used to compute statistics over go annotations. You can read more here.
GO Notification System
The code checks the release date in the main pipeline (http://current.geneontology.org/metadata/release-date.json) and when the date changes, it triggers a secondary pipeline by publishing a message in a specific topic (SNS) and update the release date on the secondary pipeline
GO Store Changes
The code loads the GO obo file (http://purl.obolibrary.org/obo/go.obo) and compare the terms of the new release to the previous most recent release.
GO Update Statistics
The code send queries to GOLr to fetch statistics about the GO annotations (e.g. per aspect, per species, per group etc)
GO Update GO-CAMs
The code compute a number of views over the GO-CAMs data (e.g. models, gene products, go terms, etc) using the GO SPARQL endpoint.
File Access
- go-meta.json: basic statistics
- go-stats.json: detailed statistics
- go-last-changes.json: changes in the ontology
- go-terms-aspect.json: terms per aspect (BP/MF/CC)
- go-annotated-gps.json: annotations per gene
- gocam-models.json: detailed list of models
- gocam-pmids.json: list of articles/references per model
- gocam-gps.json: list of gene products permodel
- gocam-goterms: list of GO terms per model