The `housing_pts` dataset is a data frame of median house prices for California districts, derived from the 1990 census. It includes geographic coordinates together with district-level demographic, income, and housing attributes. The dataset has 20,640 observations and 10 variables. Some variables may contain missing values.
Usage
data(housing_pts)
Format
A data frame with 20,640 observations and 10 variables:
- longitude
Longitude coordinate of the district (numeric)
- latitude
Latitude coordinate of the district (numeric)
- housing_median_age
Median age of houses in the district (numeric)
- total_rooms
Total number of rooms in the district (numeric)
- total_bedrooms
Total number of bedrooms in the district (numeric)
- population
Population of the district (numeric)
- households
Number of households in the district (numeric)
- median_income
Median household income in the district (numeric, in tens of thousands of US dollars)
- median_house_value
Median house value in the district (numeric, in US dollars)
- ocean_proximity
Proximity of the district to the ocean (character, categorical)
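Because some variables may contain missing values, a quick check of the columns listed above is often a useful first step. The following is a minimal sketch in base R. It uses a hypothetical stand-in data frame named `toy` with the documented column names so that it runs without the package installed; the values, including the `ocean_proximity` labels, are illustrative assumptions and not rows from `housing_pts`.

```r
# Toy stand-in with the documented column names. The values are made up
# for illustration and are NOT rows from housing_pts.
toy <- data.frame(
  longitude          = c(-122.23, -122.22, -121.50),
  latitude           = c(37.88, 37.86, 38.55),
  housing_median_age = c(41, 21, 30),
  total_rooms        = c(880, 7099, 1500),
  total_bedrooms     = c(129, NA, 310),   # one missing value on purpose
  population         = c(322, 2401, 900),
  households         = c(126, 1138, 300),
  median_income      = c(8.3, 8.3, 3.1),
  median_house_value = c(452600, 358500, 120000),
  ocean_proximity    = c("NEAR BAY", "NEAR BAY", "INLAND"),  # assumed labels
  stringsAsFactors   = FALSE
)

# Count missing values per variable
na_counts <- colSums(is.na(toy))
print(na_counts)

# Keep only the rows without any missing value
toy_complete <- toy[complete.cases(toy), ]

# Tabulate the categorical variable
prox_counts <- table(toy$ocean_proximity)
print(prox_counts)
```

With the package loaded, the same calls should work on the real data by replacing `toy` with `housing_pts` after `data(housing_pts)`.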
Source
Data taken from Kaggle: https://www.kaggle.com/datasets/camnugent/california-housing-prices
Details
The name `housing_pts` was chosen to avoid confusion with other datasets in the R ecosystem and to identify the dataset as part of this package. The suffix `pts` indicates that the dataset contains georeferenced point data. The original content has not been modified in any way.
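Since the data are georeferenced points, one possible way to work with them spatially is to convert the coordinate columns into a point geometry. The sketch below assumes the optional `sf` package and assumes that `longitude` and `latitude` are WGS84 decimal degrees (EPSG:4326); neither assumption is stated by the dataset itself. The small `pts` data frame is a hypothetical stand-in with made-up values.

```r
# Hypothetical stand-in with the coordinate columns and one attribute.
# Values are illustrative, not taken from housing_pts.
pts <- data.frame(
  longitude          = c(-122.23, -121.50),
  latitude           = c(37.88, 38.55),
  median_house_value = c(452600, 120000)
)

# sf is optional, so only run the conversion when it is installed
if (requireNamespace("sf", quietly = TRUE)) {
  # Assumption: coordinates are WGS84 longitude/latitude (EPSG:4326)
  pts_sf <- sf::st_as_sf(
    pts,
    coords = c("longitude", "latitude"),
    crs = 4326
  )
  print(pts_sf)
}
```

The `coords` argument lists the x column (longitude) first and the y column (latitude) second. If the coordinate reference system differs from the one assumed here, the `crs` value would need to be changed accordingly.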