Skip to contents

This dataset, columbus_poly, is a data frame containing socioeconomic and housing characteristics for 49 neighborhoods in Columbus, Ohio, based on 1980 data. The dataset is widely used in spatial econometrics and geographic analysis.

Usage

data(columbus_poly)

Format

A data frame with 49 observations and 22 variables:

AREA

Area of the neighborhood (numeric)

PERIMETER

Perimeter of the neighborhood (numeric)

COLUMBUS.

Identifier variable (integer)

COLUMBUS.I

Identifier variable (integer)

POLYID

Polygon ID (integer)

NEIG

Neighborhood ID (integer)

HOVAL

Housing value (numeric)

INC

Household income (numeric)

CRIME

Crime rate (numeric)

OPEN

Open space (numeric)

PLUMB

Plumbing quality (numeric)

DISCBD

Distance to central business district (numeric)

X

X coordinate of centroid (numeric)

Y

Y coordinate of centroid (numeric)

AREA

Area variable (numeric, duplicated)

NSA

Neighborhood spatial attribute A (numeric)

NSB

Neighborhood spatial attribute B (numeric)

EW

East/West indicator (numeric)

CP

Central place indicator (numeric)

THOUS

Thousands of dollars (numeric)

NEIGNO

Neighborhood number (numeric)

PERIM

Perimeter variable (numeric, duplicated)

Source

Data taken from the spData package version 2.3.4

Details

In addition to the attributes, the original dataset also included a polygon list of neighborhood boundaries, a centroid matrix, and a neighbor list object, although these are not part of columbus_poly. The matrix bbs is deprecated but retained in other packages for compatibility.

The dataset name has been kept as columbus_poly to avoid confusion with other datasets in the R ecosystem. This naming convention helps distinguish this dataset as part of the lightsf package and assists users in identifying its specific characteristics. The suffix poly indicates that the dataset can be linked to polygon boundaries. The original content has not been modified in any way.