Hummingbird Data Schemas

Field-level documentation for all exported data formats
← Back to catalog

Tornado — buildings_damage_{event_id}.geojson

One feature per building within the modelled track corridor. Geometries are Overture Maps polygon footprints.
geometry: Polygon
FieldTypeDescription
idstringOverture Maps building UUID
statestringUS state abbreviation
wind_speed_mphfloatPeak wind speed at building centroid (mph)
r_milesfloatDistance from track centerline (miles)
track_sideintSide of track: 0 = left, 1 = right
along_track_fracfloatFractional position along track (0.0 = start, 1.0 = end)
area_m2floatBuilding footprint area (m²)
construction_classstringStructural class: W1 (light wood frame), W2 (heavy wood), C1–C3 (concrete), S1–S5 (steel), MH (manufactured housing), etc.
si_occtypestringOccupancy type from Structure Inventory: RES1 (single-family), RES2 (manufactured home), RES3 (multi-family), COM1–COM10 (commercial), IND1–IND6 (industrial)
si_roof_materialstringRoof covering material from SI (asphalt_shingle, metal, tile, etc.)
si_med_yr_bltfloatMedian year built for the census block group
damage_ratiofloatStructural damage ratio, 0.0 (no damage) to 1.0 (total destruction)
damage_indexfloatComposite damage severity index including SI modifiers
damage_statestringCategorical state: none / slight / moderate / extensive / complete
p_destructionfloatProbability of total destruction (damage_ratio ≥ 0.5)
in_debris_zoneboolWhether building is within the debris impact corridor
rcv_structurefloatReplacement cost value — structural component (USD)
rcv_contentsfloatRCV — contents (USD)
rcv_alefloatRCV — additional living expense (USD)
rcv_totalfloatTotal ground-up RCV (structure + contents + ALE) (USD)
rcv_insuredlossfloatEstimated insured loss after take-up rate and coverage limits (USD)

Hail — structures_hail_{date}.geojson

One feature per building within the MESH swath. Multiple files per event represent different processing dates or state/county batches; the latest-dated file is canonical.
geometry: Polygon
FieldTypeDescription
fd_idstringFEMA / HAZUS flood-district building identifier
lonfloatBuilding centroid longitude
latfloatBuilding centroid latitude
occtypestringOccupancy type (RES1, COM1, etc.)
bldgtypestringBuilding construction type (W1, C3, etc.)
val_structfloatStructure replacement value (USD)
footprint_sqftfloatBuilding footprint area (sqft)
hail_size_infloatMaximum MESH-derived hail diameter at building location (inches)
max_hail_mmfloatMaximum hail size (mm)
max_damage_indexlossfloatDamage index at peak hail size, 0.0–1.0. Multiply by val_struct for ground-up loss estimate.
duration_periodsintNumber of storm periods in which the building received hail ≥ threshold

Hurricane — buildings_damage_{event_id}.geojson

One feature per building within the wind-field footprint. Wind field modelled using a parametric RVM; damage via HAZUS-MH fragility curves with SI modifiers.
geometry: Point
FieldTypeDescription
overture_idstringOverture Maps building UUID
lat / lonfloatBuilding centroid coordinates
val_structfloatStructure replacement value (USD)
bldgtypestringBuilding type classification
construction_classstringStructural class (W1, C3, MH, etc.)
wind_speed_mphfloatModelled peak 3-second gust at building location (mph)
wind_speed_msfloatSame, in metres per second
footprint_area_sqmfloatBuilding footprint area (m²)
med_yr_bltfloatMedian year built for the census block group
roof_shapestringRoof shape (hip, gable, flat, etc.)
damage_ratiofloatMean damage ratio, 0.0–1.0
damage_statestringCategorical damage state (none / slight / moderate / extensive / complete)
damage_indexfloatComposite damage severity index
rcv_structurefloatStructural replacement cost value (USD)
economic_loss_usdfloatGround-up economic loss (USD)
loss_usdlossfloatEstimated insured loss (USD)

Wildfire — buildings.geojsonl

GeoJSON Lines format (one JSON object per line). One feature per building within the fire perimeter. Damage modelled using zone-based ignition probability combined with structural vulnerability.
geometry: Point
FieldTypeDescription
zonestringFire exposure zone: Z1 (within perimeter), Z2 (ember exposure), Z3 (buffer)
BDIfloatBuilding Damage Index — probability-weighted damage severity, 0.0–1.0
DR_midfloatDamage ratio, midpoint estimate, 0.0–1.0
val_structfloatStructure replacement value (USD)
ground_up_losslossfloatGround-up economic loss (DR_mid × val_struct) (USD)
occtypestringOccupancy type (RES1, COM1, etc.)
roof_materialstringRoof covering material — key vulnerability driver for wildfire

Flood — viewer snapshot GeoJSON

One feature per building. Inundation depth from HAND/NWM model, uncertainty bounds from ensemble runs. PMTiles format available for large events.
geometry: Polygon
FieldTypeDescription
fd_idstringFEMA / HAZUS building identifier
depth_ftfloatInundation depth, central estimate (ft)
depth_lo_ftfloatLower-bound inundation depth (ft)
depth_hi_ftfloatUpper-bound inundation depth (ft)
p_floodedfloatProbability that inundation depth > 0, 0.0–1.0
data_sourcestringDepth model source: HAND (Height Above Nearest Drainage), NWM (National Water Model), SLOSH (storm surge), or pluvial
occtypestringOccupancy type (RES1, COM1, etc.)
damage_indexfloatStructural damage index from depth-damage curve, 0.0–1.0
p_insuredfloatEstimated flood insurance take-up rate for the census tract
est_claim_klossfloatEstimated insured claim (thousands USD)

Straight-line Wind — structures_loss_{event_id}.geojson

One feature per building within the storm footprint. Wind field interpolated from WSR-88D peak gust observations; damage via HAZUS straight-line wind fragility curves.
geometry: Point
FieldTypeDescription
structure_idstringStructure identifier
peak_gust_mphfloatPeak 3-second gust at building location (mph)
eff_gust_mphfloatEffective gust after terrain roughness and exposure adjustments (mph)
const_classstringConstruction class (W1, C3, MH, etc.)
occtypestringOccupancy type (RES1, COM1, etc.)
footprint_sqftfloatBuilding footprint area (sqft)
replacement_valuefloatStructure replacement value (USD)
p_slightfloatProbability of slight damage state
p_moderatefloatProbability of moderate damage state
p_extensivefloatProbability of extensive damage state
p_completefloatProbability of complete damage / total loss
mdrfloatMean damage ratio (probability-weighted across all states), 0.0–1.0
damage_indexfloatComposite damage index
wind_multiplierfloatSite-specific wind amplification factor (topography, surface roughness)
calibration_factorfloatModel calibration adjustment applied to this event
event_typestringStorm type: derecho / convective / tropical_remnant
ground_up_losslossfloatGround-up economic loss (mdr × replacement_value) (USD)

PMTiles — {event_id}.pmtiles

Cloud-optimised vector tile archive. Contains the same building-level features as the corresponding GeoJSON, optimised for map rendering at zoom levels 8–16. Use with PMTiles JS or MapLibre GL. Hosted on Cloudflare R2 at https://media.hummingbirdsoftware.net/{peril}/event/{event_id}/{event_id}.pmtiles.
FieldTypeDescription
layer nameSame as peril: tornado, hail, hurricane, slw. Use this as the source-layer in MapLibre style expressions.
damage_indexfloatPrimary field for choropleth styling (0.0–1.0). Available at all zoom levels.
damage_statestringCategorical label for tooltip display.
rcv_insured / loss_usd / ground_up_lossfloatLoss field — name varies by peril (see GeoJSON schemas above). Available at zoom ≥ 12.
occtypestringOccupancy type filter field.

Behavioral Claim Risk Model — behavioral_scores.geoparquet

Census tract–level behavioral risk scores for 23 US metro areas, 21,834 tracts. GeoParquet format (Apache Parquet + embedded WKB geometry). Read with GeoPandas, DuckDB, or any Parquet-compatible tool. Hosted at gs://hummingbird-si/behavioral/behavioral_scores.geoparquet.

Architecture: Raw signals from ACS, CBP (Overture Places), CFPB, HMDA, Zillow, Eviction Lab, HUD, court records, and police dispatch are assembled into four composite indices. Those indices plus all raw signals feed a K-means cluster model that assigns each tract to one of four behavioral cohorts. All index fields are normalized 0–100 within city.
geometry: Polygon (Census tract boundary)
FieldTypeDescription
geoid_tractkeystring11-digit FIPS Census tract code (state 2 + county 3 + tract 6). Join key to any Census dataset.
geometrygeometryCensus tract polygon boundary (WGS 84 / EPSG:4326)
citystringMetro area name (e.g. "Chicago", "Austin", "Miami")
cohortprimarystringK-means behavioral cohort assignment — apply as a loss multiplier to peril model outputs:
Careful Keepers — low stress, low dispute ecosystem, high maintenance investment. Below-expected claiming.
Squeaky Wheels — moderate stress + dispute culture. Baseline claiming.
Duct Tape Dynasty — elevated financial stress + deferred maintenance. Elevated claiming.
Claim Cavalry — high dispute culture, active legal ecosystem, high stress. Highest claiming propensity.
tipping_zonestringClaim Cavalry neighbourhood concentration: high (>25% of nearby tracts are CC) / medium (15–25%) / normal (<15%). Contagion signal — CC clusters amplify each other.
cc_pctfloatShare of census tracts within 1 km radius that are Claim Cavalry. Used to compute tipping_zone.
cc_migration_riskfloat0–100 score estimating probability the tract shifts toward Claim Cavalry within 3 years. Driven by: trajectory_direction, tipping_zone proximity, behavioral_trajectory momentum, and ecosystem_growth_rate (Overture Places delta 2024–2026).
COMPOSITE INDICES (0–100, higher = more of that characteristic)
financial_stress_indexfloatWeighted composite of financial pressure signals. Inputs (approximate weights):
• income_pct_county (28%, ACS — inverted: lower income = higher stress)
• renter_share (18%, ACS)
• snap_rate (14%, ACS — SNAP recipients)
• vacancy_rate (12%, ACS)
• rent_burden (10%, ACS — gross rent >30% income)
• poverty_rate (10%, ACS)
• unemployment_rate (10%, ACS)
• single_parent_share (8%, ACS)
• eviction_filing_rate (8%, Eviction Lab 2015–18)
• income_trend_pct (5%, ACS 2019→2023 delta — inverted)
• pawn_shop_index (5%, Overture Places CBP)
• linguistic_isolation_rate (5%, ACS)
• disability_rate (5%, ACS)
• mobile_home_share (4%, ACS)
• overcrowded_rate (4%, ACS — >1 occupant/room)
• noncitizen_share (4%, ACS — institutional distrust proxy)
• no_internet_rate (3%, ACS)
• no_vehicle_rate (3%, ACS)
dispute_culture_indexfloatWeighted composite of behavioural willingness to dispute (activity and outcome signals — not supply-side infrastructure, which is in dispute_ecosystem_index). Inputs:
• cfpb_complaint_index (40%, CFPB Consumer Complaint Database — normalized complaint rate)
• claiming_culture_index (30%, derived from CFPB + legal ecosystem co-occurrence)
• atty_cfpb_interaction (10%, attorney density × CFPB complaint rate — areas with both legal infrastructure AND active disputing)
• dispatch_dispute_rate (10%, police dispatch — property/noise/neighbor calls; Socrata cities only)
• court_filing_rate (10%, civil court filing rate; FL only)
dispute_ecosystem_indexfloatWeighted composite of professional infrastructure that enables and amplifies claims (supply-side). Inputs:
• atty_density_index (30%, Overture Places CBP — law offices per capita)
• public_adjuster_index (30%, Overture Places CBP)
• chiropractor_index (15%, Overture Places CBP — post-loss billing proxy)
• diagnostic_imaging_index (15%, Overture Places CBP — urgent care / imaging centers)
• process_server_index (10%, Overture Places CBP)
maintenance_indexfloatWeighted composite of active investment and upkeep behaviour. Higher = better maintained, lower claiming propensity. Inputs:
• owner_occupancy_rate (25%, ACS)
• hmda_home_impr_rate (25%, HMDA — home improvement loan originations per unit)
• long_tenure_share (20%, ACS — owner-occupied 13+ years; community embeddedness)
• maintenance_response_index (15%, derived — HMDA/age-adjusted investment choice)
• permit_rate (15%, county permit records — pulling permits = active investment)
• pct_veteran (15%, ACS)
• median_housing_age (10%, ACS — inverted: older housing = more deferred maintenance)
• discretionary_services_index (10%, Overture Places — pet grooming/vet/fitness/specialty food: investment mindset proxy)
• in_migration_rate (5%, ACS — inverted: high in-migration = weaker maintenance culture)
• college_rate (5%, ACS — financial sophistication proxy)
• mobile_home_share (5%, ACS — inverted)
• elderly_share (5%, ACS — inverted: fixed income + physical limitation)
TRAJECTORY DIMENSIONS (0–100, 25% each in neighborhood_trajectory_score)
housing_trajectoryfloatHousing market momentum. Inputs:
• zhvi_delta_36m (25%, Zillow — 3-year home value change)
• zhvi_acceleration (20%, Zillow — ZHVI second derivative; price change rate-of-change)
• vacancy_rate_usps (13%, USPS — inverted: lower vacancy = improving)
• zri_delta_12m (18%, Zillow — 1-year rent index change)
• vacancy_trend_yoy (12%, USPS — inverted: falling vacancy = improving)
• zhvi_delta_12m (12%, Zillow — 1-year home value change)
• owner_occ_change (8%, ACS 2019→2023 delta)
• vacancy_change (6%, ACS 2019→2023 delta)
investment_trajectoryfloatProperty investment activity momentum. Inputs:
• permit_trend_yoy (22%, county permit records — year-over-year permit growth rate)
• permit_rate (18%, county permit records — permits per housing unit)
• hmda_home_impr_rate (12%, HMDA)
• discretionary_services_index (8%, Overture Places — investment confidence proxy)
• home_improvement_index (9%, Overture Places CBP)
• roofing_contractor_index (7%, Overture Places CBP)
• maintenance_response_index (7%, derived)
• owner_occupancy_rate (7%, ACS)
• long_tenure_share (6%, ACS)
• lihtc_unit_density (6%, HUD LIHTC — inverted: high LIHTC = low private investment)
• pct_veteran (3%, ACS)
• college_rate (2%, ACS)
• mobile_home_share (1%, ACS — inverted)
economic_trajectoryfloatEconomic vitality and trend. Inputs include:
• income_trend_pct (17%, ACS 2019→2023 delta)
• hmda_purchase_trend (13%, HMDA — purchase mortgage origination trend)
• qct_flag (7%, HUD — inverted: Qualified Census Tract = economic weakness)
• income_pct_county (8%, ACS — relative income)
• unemployment_rate (7%, ACS — inverted)
• poverty_rate (7%, ACS — inverted)
• lodes_prof+finance combined (6%, LODES — professional/finance employment density)
• severe_burden_rate (6%, HUD — inverted: >50% income on rent)
• subsidized_share (5%, HUD — inverted)
• snap_rate (5%, ACS — inverted)
• poverty_change (5%, ACS 2019→2023 — inverted)
• discretionary_services_index (5%, Overture Places)
• rent_burden (5%, ACS — inverted)
• unemployment_change (4%, ACS delta)
• lfpr_change (4%, ACS — labor force participation delta)
• eviction_filing_rate (4%, Eviction Lab — inverted)
• eviction_trend_yoy (4%, Eviction Lab — inverted)
• lihtc_expiring_pct (4%, HUD — expiring subsidized units = worsening)
• assisted_income_ratio (4%, HUD)
• labor_force_participation_rate (3%, ACS)
• extreme_poverty_rate (3%, ACS — inverted)
• restaurant_ratio_index (3%, Overture Places — full-service vs fast-food: economic confidence)
• lodes_health_index (2%), dollar_store_index (2%, inv), rent_to_own_index (2%, inv)
behavioral_trajectoryfloatDispute ecosystem and claiming pressure momentum. Higher = more pressure building. Inputs grouped by category:
Ecosystem (~37%): dispute_ecosystem_index (12%), atty_density_index (7%), public_adjuster_index (7%), chiropractor_index (3%), restoration_company_index (3%), ecosystem_growth_rate (5%, Overture Places delta 2024–2026)
Claiming (~39%): dispute_culture_index (12%), claiming_culture_index (8%), cfpb_complaint_index (8%), cfpb_trend_yoy (6%), dispatch_trend_yoy (5%)
Financial stress proxies (~20%): pawn_shop_index (7%), bail_bond_index (7%), payday_lender_index (6%)
Community (~10%): civic_engagement_index (5%), cc_migration_risk (5%)
neighborhood_trajectory_scorefloatEqual-weighted average of all four trajectory dimensions (housing 25% + investment 25% + economic 25% + behavioral 25%). Scores 0–100; higher = stronger neighbourhood momentum. Note: behavioral_trajectory is scored positively here but signals claiming pressure momentum — a high overall score reflects gentrification with active claim infrastructure, not just health.
trajectory_directionstringQualitative label derived from trajectory dimension scores and ZHVI acceleration: Improving / Stable / Declining / Reversing Down (was improving, now decelerating)

Structure Inventory — {state}_structures.parquet

National building-level inventory. One row per structure. State-level Parquet files on GCS at gs://hummingbird-si/si/. Source: FEMA NSI 2022, enriched with Overture Maps footprints, 3DEP LiDAR roof slope, USGS seismic hazard, FEMA FIRM flood zones, and WUI classification. Underlies all peril damage models.
geometry: Point (lon/lat columns — no embedded geometry)
FieldTypeDescription
fd_idkeyint64FEMA / HAZUS building identifier. Primary join key across all peril GeoJSON outputs.
lonfloatStructure centroid longitude (WGS 84)
latfloatStructure centroid latitude (WGS 84)
county_fipsstring5-digit FIPS county code
cbfipsstringCensus block FIPS code (12 digits)
occtypestringHAZUS occupancy type: RES1 (single-family), RES1-1SNB/1SWB (subtype), RES2 (manufactured home), RES3A–F (multi-family), COM1–COM10 (commercial), IND1–IND6 (industrial), AGR1 (agricultural), REL1 (religious), GOV1–GOV2 (government), EDU1–EDU2 (education)
bldgtypestringHAZUS building type / construction class: W (wood), S (steel), C (concrete), PC (precast concrete), RM (reinforced masonry), URM (unreinforced masonry), MH (manufactured housing)
sqftfloatFloor area (sqft)
num_storyintNumber of stories
val_structfloatStructure replacement cost value (USD) — NSI 2022 estimate
val_contfloatContents replacement cost value (USD)
med_yr_bltintMedian year built for the census block group
found_typestringFoundation type: S (slab), B (basement), C (crawlspace), P (pier/post)
found_htfloatFoundation height above grade (ft) — used in flood depth-damage calculations
first_floor_elvfloatFirst floor elevation above sea level (ft NAVD88)
ground_elvfloatGround elevation at structure location (ft NAVD88) — from 3DEP DEM
sourcestringNSI data source code: P (parcel), A (address), C (census), I (imputed)
parcel_idstringCounty assessor parcel identifier (where available)
overture_idstringOverture Maps building UUID matched to this structure (UUID v4). Null if no Overture footprint within 20m.
footprint_area_sqmfloatBuilding footprint area from Overture Maps (m²)
heightfloatBuilding height from Overture Maps (m) — null where not available
num_floorsfloatNumber of floors from Overture Maps — null where not available
roof_shapestringRoof shape from Overture Maps: gable, hip, flat, gambrel, mansard, etc. Null where not available.
roof_materialstringRoof covering material from Overture Maps: asphalt_shingle, metal, tile, concrete, wood_shingle, etc. Key vulnerability driver for hail and wildfire models.
roof_slope_degfloatRoof slope derived from 3DEP LiDAR DEM (degrees)
roof_slope_classstringRoof slope category: flat (<5°), low (5–15°), moderate (15–25°), steep (>25°)
roof_slope_sourcestringSource of slope estimate: lidar_1m / lidar_3m / lidar_10m / heuristic (from building type when LiDAR unavailable)
asce7_wind_speed_mphintASCE 7 design wind speed at location (mph) — used in SLW and hurricane vulnerability
firmzonestringFEMA FIRM flood zone designation: A, AE, X, X500, VE, etc.
ss / s1 / sds / sd1 / pgamfloatUSGS seismic hazard parameters (spectral acceleration values at 0.2s and 1.0s periods)
sdcstringASCE 7 Seismic Design Category (A–F)
nehrp_site_classstringNEHRP soil site class (A–F) based on vs30
vs30_msfloatTime-averaged shear wave velocity in top 30m of soil (m/s)
wui_classstringUSFS Wildland-Urban Interface class: Interface, Intermix, Very_Low_Dens_NoVeg, Non-WUI. Key vulnerability modifier for wildfire model.
enclosure_classstringOpening protection class for wind/hurricane: enclosed, partially_enclosed, open
enclosure_sourcestringSource of enclosure classification: model / assessor / default
Hummingbird Software · Data Schemas · April 2026 · Questions: ken.bywaters@hummingbirdsoftware.net