stats-to-pandas

Import data from Statistics Norway, Sweden, Ireland , UK and others that use the stat-json format to a Pandas dataframe in Python


Keywords
pandas, Statistics, Norway, json-stat
License
Other
Install
pip install stats-to-pandas==0.0.7

Documentation

stats-to-pandas

Download data from statistical agencies to pandas dataframes.

Installation

pip install stats_to_pandas

Requirements

  • pandas
  • requests
  • pyjstat
  • Jupyter notebook, IPython, ipywidgets

Overview

0. Import package
Alternative 1: from stats_to_pandas import *
Alternative 2: import stats_to_pandas as stp
1. Search for tables based on keywords
search('cows')
2. Select a table id from the dataframe and create a gui-box to select the variables to be included

(Works when the user is in a jupyter notebook and has widgets installed)

box = select(table_id = '10714')
3. Read the variable selection from the gui-box and download the data from Statistics Norway
df = read_box(box)

4. More

Read all values for all variables for table with id 10714

(Avoid having to select variables, step 2 above, but may result in large tables)

df = read_all(table_id = '10714')
Get the json string for a full query of table number 10714
query = full_json(table_id = '10714', out = 'str')
Get the json dict for a full query of table number 10714
query = full_json(table_id = '10714', out = 'dict')
Get the json dict associated with the current selection from a widget box
query = get_json(box)
Read data from Statistics Ireland
irl_url = 'http://www.cso.ie/StatbankServices/StatbankServices.svc/jsonservice/responseinstance/CNA31'

df = read_url(full_url = irl_ulr)