A synthetic health facility frame inspired by Service Provision Assessment (SPA) and Service Availability and Readiness Assessment (SARA) surveys. Uses real Kenya counties but contains entirely fictional data.

kenya_health

Format

A tibble with approximately 3,000 rows and 9 columns:

facility_id

Character. Unique facility identifier

region

Factor. Former province (8 regions)

county

Factor. County name (47 counties)

urban_rural

Factor. Urban/Rural classification

facility_type

Factor. Type of facility (Referral Hospital, County Hospital, Sub-County Hospital, Health Centre, Dispensary, Clinic, Maternity Home)

ownership

Factor. Ownership type (Public, Private, Faith-based, NGO)

beds

Integer. Number of inpatient beds

staff_count

Integer. Number of health workers

outpatient_visits

Integer. Monthly outpatient visits (measure of size)

Details

This dataset is designed for demonstrating:

  • Health facility surveys

  • Stratification by facility type and region

  • PPS sampling using patient volume

  • Sampling across different ownership types

Facility types follow the Kenyan health system hierarchy from referral hospitals down to dispensaries and clinics.

Note

This is a synthetic dataset. Counties and regions are real but all data values are fictional.

Examples

data(kenya_health)
head(kenya_health)
#> # A tibble: 6 × 9
#>   facility_id     region  county urban_rural facility_type      beds staff_count
#>   <chr>           <fct>   <fct>  <fct>       <fct>             <dbl>       <dbl>
#> 1 KE_Cen_Kia_0001 Central Kiambu Rural       Dispensary            2           5
#> 2 KE_Cen_Kia_0002 Central Kiambu Rural       Health Centre         8          11
#> 3 KE_Cen_Kia_0003 Central Kiambu Rural       Sub-County Hospi…    49          41
#> 4 KE_Cen_Kia_0004 Central Kiambu Rural       Dispensary            1           5
#> 5 KE_Cen_Kia_0005 Central Kiambu Rural       Dispensary            3           5
#> 6 KE_Cen_Kia_0006 Central Kiambu Rural       Dispensary            2           2
#> # ℹ 2 more variables: outpatient_visits <dbl>, ownership <fct>
table(kenya_health$facility_type)
#> 
#>   Referral Hospital     County Hospital Sub-County Hospital       Health Centre 
#>                  31                  91                 177                 460 
#>          Dispensary              Clinic      Maternity Home 
#>                1346                 755                 156 

# Stratified sample by facility type
if (FALSE) { # \dontrun{
sampling_design() |>
  stratify_by(facility_type, alloc = "proportional") |>
  draw(n = 400) |>
  execute(kenya_health, seed = 42)
} # }