hkdatasets

Datasets Related to Hong Kong


Keywords
dataset, hong-kong, r
License
GPL-3.0

Documentation

hkdatasets

📦 An R data package containing datasets related to Hong Kong

R build status CodeFactor License: MIT CRAN status CRAN last month downloads

📃 Introduction

This package contains datasets on Hong Kong. This package is developed as part of the Hong Kong Districts Info group of projects, which is dedicated to using data science to help make public information more accessible and consumable. The package contains the following datasets:

  • hkdc - A dataset containing information about Hong Kong District Councillors (elected 2019), with variables such as their constituency, region, share of vote, total votes, link to their Facebook pages, etc. (Sources: https://www.districtcouncils.gov.hk; https://dce2019.hk01.com/)
  • hkstreetnames20 - A dataset containing names of all the streets in Hong Kong as at 2020.
  • hkdistrict_summary - A summary table detailing Hong Kong's districts, their region classifications, and abbreviated labels.
  • hk_accidents* - A dataset containing traffic accidents between 2014 and 2019.
  • hk_casualties* - A dataset containing information of collisions by casualties between 2014 and 2019.
  • hk_vehicles* - A dataset containing information of vehicles by casualties between 2014 and 2019.

The datasets hk_accidents, hk_collisions, and hk_vehicles are related and can be joined together using the variable Serial_No_. This dataset was made available via a Freedom of Information request to the Hong Kong Transport Department. We would also like to thank the authors behind the {HK80} package to enable us to convert the HK1980GRID coordinate system to longitudes and latitudes in the hk_accidents dataset.

*Note that version 1.0.0 introduced major breaking changes, which moved hk_accidents, hk_casualties, and hk_vehicles to being only available via download_data() and will not be available directly as a namespace due to CRAN size limitations. See NEWS.md for more information.

Visit our GitHub

🔧 Installation

{hkdatasets} is now available on CRAN. You can install this with:

install.packages("hkdatasets")

You can also install the latest development version from GitHub with:

install.packages("devtools")
devtools::install_github("Hong-Kong-Districts-Info/hkdatasets")

📈 Datasets

If you are exploring the package, we recommend starting with 2019 District Councillors dataset:

library(hkdatasets)
head(hkdc)

To find out more about the variable and the source of the dataset, you can run:

?hkdc

Note: this dataset contains Chinese characters in the UTF-8 encoding.

📬 Contact

Please feel free to submit suggestions and report bugs: https://github.com/Hong-Kong-Districts-Info/hkdatasets/issues

Also check out our website for our other work and projects!

You can find out about our current backlog of work on our public Trello Doing board.