
[15pt] Incorporate buildings into FIM #1777

Open
AliForghani-NOAA wants to merge 16 commits into dev from dev-buildings

Conversation


@AliForghani-NOAA AliForghani-NOAA commented Mar 3, 2026

This PR closes issue #1739 and includes the following enhancements to address building FIMpacts:

  • Ingests FEMA buildings as a new input dataset for FIM.

  • Derives the threshold discharge required for buildings inundation. To achieve this, the minimum non-zero HAND value within each building is extracted as the inundation threshold stage. The corresponding threshold discharge values are then interpolated from the HydroTables.

  • Enhances tools/fimpacts_inundation.py (formerly road_inundation.py) to identify inundated buildings and calculate corresponding flood depths for specific events.

In addition to introducing building pre-clipping in the data/wbd/generate_pre_clip_fim_huc8.py script, this PR refactors the interface from --copy_* arguments (e.g., --copy_osm_roads) to direct layer arguments for preclipping (e.g., --osm_roads). Listed layers are pre-clipped, while unlisted layers are copied, simplifying the interface and making layer selection more intuitive.
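
The preclip-first interface described above can be sketched roughly as follows. This is a hypothetical illustration, not the script's actual CLI: the layer names in `ALL_LAYERS` and the helper name are assumptions.

```python
import argparse

# Hypothetical sketch of the preclip-first CLI: layers named on the command
# line are pre-clipped; any layer not named falls back to a plain copy.
# The layer list below is illustrative, not the script's real set.
ALL_LAYERS = ["osm_roads", "buildings", "nld_levees"]

def parse_preclip_args(argv):
    parser = argparse.ArgumentParser(description="Pre-clip selected layers per HUC8")
    for layer in ALL_LAYERS:
        parser.add_argument(
            f"--{layer}", action="store_true",
            help=f"Pre-clip the {layer} layer (unlisted layers are copied)",
        )
    args = parser.parse_args(argv)
    preclip = [name for name in ALL_LAYERS if getattr(args, name)]
    copy = [name for name in ALL_LAYERS if name not in preclip]
    return preclip, copy
```

Under this sketch, `--osm_roads --buildings` would pre-clip those two layers and copy everything else, which is the simplification the refactor aims for.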

The updated pre-clipped dataset with new FEMA buildings data has been prepared here: inputs/pre_clip_huc8/20260306/.

In-Depth Workflow Explanation

  1. data/buildings/get_fema_buildings.py (new script)
    This script downloads FEMA’s latest per-state building structure geodatabases from the official USA Structures page. It then converts the gdb files to GeoParquet format using the appropriate CRS for each region (CONUS, Alaska, Guam, and American Samoa). The script supports preparing data for specific states only if desired.
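
As a rough illustration of the conversion step (not the script's actual code), the gdb-to-GeoParquet workflow with a per-region CRS might look like the sketch below; the CRS codes, function name, and layer handling are assumptions:

```python
# Hypothetical sketch of the gdb -> GeoParquet conversion with a per-region
# CRS. The CRS codes and function name are assumptions made for illustration.
REGION_CRS = {
    "CONUS": "EPSG:5070",            # Albers Equal Area (assumed)
    "Alaska": "EPSG:3338",           # Alaska Albers (assumed)
    "Guam": "EPSG:32655",            # UTM zone 55N (assumed)
    "American Samoa": "EPSG:32702",  # UTM zone 2S (assumed)
}

def convert_state_gdb(gdb_path, region, out_parquet, layer=None):
    # Deferred import: requires geopandas with pyarrow-backed GeoParquet support.
    import geopandas as gpd

    gdf = gpd.read_file(gdb_path, layer=layer)  # read the structures layer from the gdb
    gdf = gdf.to_crs(REGION_CRS[region])        # reproject to the regional CRS
    gdf.to_parquet(out_parquet)                 # write GeoParquet
```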

  2. data/buildings/make_buildings_parts_per_huc.py (new script)
    This script splits state-level building parquet datasets into HUC8-based parquet “parts”, keeping only the following building attributes ["UUID", "HEIGHT", "OCC_CLS", "SOURCE", "VAL_METHOD"] plus geometry. It processes a mixed sequence of parquet row groups in parallel (taking row groups from different states in turn, instead of finishing one state at a time), and uses a bounding-box prefilter to efficiently identify which HUCs intersect each row group before running the spatial join. Outputs are written as per-HUC8 folders (for example, huc8_XXXXXXXX/STATE_rg001.parquet), and it can optionally run for only selected states.
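
The bounding-box prefilter described above can be sketched with plain coordinate arithmetic; function and variable names here are illustrative:

```python
# Sketch of the bounding-box prefilter: before running the expensive spatial
# join, a row group's total bounds are tested against each HUC8's bounds so
# that only plausibly intersecting HUCs are joined. Names are illustrative.

def bbox_intersects(a, b):
    """a and b are (minx, miny, maxx, maxy) tuples."""
    return not (a[2] < b[0] or b[2] < a[0] or a[3] < b[1] or b[3] < a[1])

def candidate_hucs(rowgroup_bounds, huc_bounds_by_id):
    """Return the HUC8 ids whose bounding box overlaps the row group's bounds."""
    return [huc for huc, b in huc_bounds_by_id.items() if bbox_intersects(rowgroup_bounds, b)]
```

Only the surviving candidates would then go through the actual spatial join (e.g., a geopandas sjoin), which is where the savings come from.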

  3. src/process_buildings_fimpact.py (new script)
    This script is run for each branch of a HUC using three inputs:

    • Buildings polygons
    • HAND raster
    • HAND-generated HydroIDs gpkg

    A single building segment may intersect multiple HydroIDs. To account for this, the script splits building segments at HydroID boundaries and calculates the minimum HAND value (excluding zeros) within each segment to serve as the inundation threshold.

    Three new columns are added to the building dataset: threshold_hand, HydroID, and feature_id. The results are saved as buildings_fimpact_***.csv for each branch, where *** represents the branch number. Each CSV file contains one record per UUID (the unique identifier for each building segment) within each HydroID, providing the minimum HAND value for that combination.
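
The minimum non-zero HAND extraction can be sketched as follows, assuming the HAND pixels inside one split building segment have already been clipped out of the raster (e.g., with a rasterio mask); the function name and nodata value are assumptions:

```python
import numpy as np

# Sketch of the threshold-stage extraction: given the HAND pixel values that
# fall inside one split building segment (already clipped from the raster),
# the minimum non-zero value is the inundation threshold stage. The nodata
# value and function name are assumptions for illustration.

def threshold_hand(segment_pixels, nodata=-9999.0):
    vals = np.asarray(segment_pixels, dtype=float)
    vals = vals[(vals != nodata) & (vals > 0)]  # exclude nodata and zeros
    return float(vals.min()) if vals.size else None  # None: no usable HAND pixels
```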

  4. src/aggregate_by_huc.py (updated script)
    For each branch, the script retrieves the discharge value corresponding to each threshold_hand from the branch’s HydroTable (per HydroID) and assigns it as threshold_discharge. Any record with a threshold_hand value greater than 25 m (the maximum stage listed in the HydroTables) is removed entirely. The outputs from all branches are combined into a single file: buildings_fimpact.csv.
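
The stage-to-discharge lookup can be sketched with linear interpolation, assuming each HydroID's synthetic rating curve is available as parallel stage/discharge arrays taken from the branch HydroTable; the function name and where the 25 m cutoff lives are assumptions:

```python
import numpy as np

# Sketch of the stage -> discharge lookup along a HydroID's synthetic rating
# curve. Records whose threshold stage exceeds the 25 m HydroTable maximum
# are dropped, matching the behavior described above. Names are illustrative.

MAX_STAGE_M = 25.0

def threshold_discharge(threshold_hand, stages, discharges):
    if threshold_hand > MAX_STAGE_M:
        return None  # record is removed entirely
    # linear interpolation between the tabulated stage/discharge pairs
    return float(np.interp(threshold_hand, stages, discharges))
```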

  5. tools/fimpacts_inundation.py (formerly tools/road_inundation.py)
    This tool now takes three inputs:

    • A FIM run directory (which includes buildings_fimpact.csv in addition to the osm_roads_fimpact.csv file)
    • A flow file
    • A new flag indicating whether the script should process buildings or roads

    The script identifies building segments where the given flow (referred to as evaluated_discharge) exceeds the threshold discharge and flags them as inundated. It also looks up the stage corresponding to the evaluated_discharge (called evaluated_stage) and subtracts the threshold_hand value from the evaluated_stage to calculate the flood_depth.

    Records with negative flood depth are currently removed, as these may result from non-monotonic synthetic rating curves—most commonly observed in branch zero.

    Note that a single building segment may have multiple inundation records, originating from different branches or intersecting multiple HydroIDs. The code retains only the record with the maximum flood depth for each building segment.
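
The selection rule above (flag exceedance, compute depth, drop negatives, keep the deepest record per building) can be sketched in pandas. Column and function names follow the description but are not the tool's exact code; depth is computed as evaluated_stage minus threshold_hand so that deeper inundation yields a larger positive depth:

```python
import pandas as pd

# Sketch of the selection rule: keep records whose evaluated_discharge exceeds
# threshold_discharge, compute depth as evaluated_stage - threshold_hand, drop
# negative depths (non-monotonic SRC artifacts), and retain only the deepest
# record per building (UUID). Names are illustrative.

def select_inundated(records: pd.DataFrame) -> pd.DataFrame:
    df = records[records["evaluated_discharge"] > records["threshold_discharge"]].copy()
    df["flood_depth"] = df["evaluated_stage"] - df["threshold_hand"]
    df = df[df["flood_depth"] >= 0]                       # drop rating-curve artifacts
    deepest = df.groupby("UUID")["flood_depth"].idxmax()  # one record per building
    return df.loc[deepest].reset_index(drop=True)
```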

    The figure below displays the output of the fimpacts_inundation.py tool with inundated buildings (with their flood depth) and non-inundated buildings (gray) overlaid on a FIM raster. Both results were generated from a common 50-year recurrence interval flow file for HUC 11070103.


Additions

  • data/buildings/get_fema_buildings.py
  • data/buildings/make_buildings_parts_per_huc.py
  • src/process_buildings_fimpact.py

Changes

  • Renamed tools/road_inundation.py to tools/fimpacts_inundation.py and extended the script to support building inundation processing in addition to roads.
  • data/wbd/clip_vectors_to_wbd.py -> Updated to enable pre-clipping of buildings dataset
  • data/wbd/generate_pre_clip_fim_huc8.py -> Updated to enable pre-clipping of buildings dataset. Also refactored the CLI to switch from copy-first arguments to preclip-first arguments (as described above).
  • src/aggregate_branches_to_huc.py -> Aggregates branch-level building FIMpact results by HUC
  • src/delineate_hydros_and_produce_HAND.sh -> Calls the new src/process_buildings_fimpact.py script
  • src/bash_variables.env -> Updated the reference to the new pre-clipped dataset and added a reference to the building parts dataset required for pre-clipping
  • src/calibrate_rating_curves.sh -> Enables aggregating buildings FIMpact results by HUC

Testing

Generally, you do not copy this part into the CHANGELOG. These are some quick notes on what you tested and/or notes to help the reviewer with their review testing.


Deployment Plan (For FIM developers use)

  • Does the change impact inputs, docker or python packages?

    • Yes
    • No (if no, skip the rest of the Deployment Plan section)
  • If you are not a FIM dev team member: Please let us know what you need and we can help with it.

  • If you are a FIM Dev team member:

    • Please work with the DevOps team and do not just go ahead and do it without some coordination.

    • Copy where you can, assign where you can not, and it is your responsibility to ensure it is done. Please ensure it is completed before the PR is merged.

    • Has new or updated Python packages, Pipfile, Pipfile.lock, or Dockerfile changes? DevOps can help or take care of it if you want. Just need to know if it is required.

      • Yes
      • No
    • Require new or adjusted data inputs? Does it have a way to version (folder or file dates)?

      • No
      • Yes
        • Require a new pre-clip set or any other data reloads, such as DEMs, OSM, etc. (i.e., prerequisite data updates upstream of your input changes).
          • Yes
          • No
        • Have the inputs been copied to / do they exist in all four environments:
          • FIM EFS
          • FIM S3
          • ESIP
          • Dev1
  • Please use caution in removing older versions unless they are at least two versions old. Confirm with DevOps if cleanup might be involved.

  • If new or updated data sets, has the FIM code, including running fim_pipeline.sh, been updated and tested with the new/adjusted data? You can dev test against subsets if you like.

    • Yes

Notes to DevOps Team or others:

Please add any notes that are helpful for us to make sure it is all done correctly. Do not put actual server names or full true paths, just shortcut paths like 'efs..../inputs/, or 'dev1....inputs', etc.


Issuer Checklist (For developer use)

You may update this checklist before and/or after creating the PR. If you're unsure about any of them, please ask, we're here to help! These items are what we are going to look for before merging your code.

  • Informative and human-readable title, using the format: [_pt] PR: <description>
  • Links are provided if this PR resolves an issue, or depends on another PR
  • If submitting a PR to the dev branch (the default branch), you have a descriptive Feature Branch name using the format: dev-<description-of-change> (e.g. dev-revise-levee-masking)
  • Changes are limited to a single goal (no scope creep)
  • The feature branch you're submitting as a PR is up to date (merged) with the latest dev branch
  • pre-commit hooks were run locally
  • Any change in functionality is tested
  • New functions are documented (with a description, list of inputs, and expected output)
  • Placeholder code is flagged / future todos are captured in comments
  • CHANGELOG updated with template version number, e.g. 4.x.x.x
  • Add yourself as an assignee in the PR as well as the FIM Technical Lead
  • Where applicable, has fim_pipeline been tested with multiple HUCs, including some other unaffected HUCs?

Reviewer / Approver Checklist

  • Where applicable, has fim_pipeline been tested with multiple HUCs, including some other unaffected HUCs?
  • If there are new inputs, have you confirmed that they have been copied to all environments?

Merge Checklist (For Technical Lead use only)

  • Update CHANGELOG with latest version number and merge date
  • Update the Citation.cff file to reflect the latest version number in the CHANGELOG
  • If applicable, update README with major alterations

@AliForghani-NOAA AliForghani-NOAA self-assigned this Mar 5, 2026
@AliForghani-NOAA AliForghani-NOAA added the enhancement New feature or request label Mar 5, 2026
@AliForghani-NOAA AliForghani-NOAA marked this pull request as ready for review March 6, 2026 16:46
@mluck mluck self-requested a review March 12, 2026 16:03

@ZahraGhahremani ZahraGhahremani left a comment


I ran fim_pipeline for HUC 12090301 and tested the fimpacts_inundation tool for it.


I also tested data/buildings/get_fema_buildings.py and data/buildings/make_buildings_parts_per_huc.py for Idaho and it works as expected.

mluck previously approved these changes Mar 17, 2026

@mluck mluck left a comment


I tested get_fema_buildings.py on Vermont (VT) and make_buildings_parts_per_huc.py on these data. It worked perfectly, although it still loops through all 2155 HUCs even though VT covers only a small handful of those HUCs. Is there a way to preselect the HUCs that intersect the data to be more efficient?

The refactoring and CLI logging are nice updates to preclipping.

huc_root/<HUC8>/wbd_buffered.gpkg
"""
if not current_preclip_directory.exists():
    raise RuntimeError(f"Prclip directory does not exist: {current_preclip_directory}")
Prclip spelled incorrectly

Just pushed a commit to address this.

Comment thread: src/bash_variables.env
# NOTE: $inputsDir is defined in Dockerfile

- export pre_clip_huc_dir=${inputsDir}/pre_clip_huc8/20260205
+ export pre_clip_huc_dir=${inputsDir}/pre_clip_huc8/20260306
Update pre_clip_huc_dir to 20260312

Should we use '20260312' when we merge your PR? My PR is making '20260306'.

@AliForghani-NOAA

I tested get_fema_buildings.py on Vermont (VT) and make_buildings_parts_per_huc.py on these data. It worked perfectly, although it still loops through all 2155 HUCs even though VT covers only a small handful of those HUCs. Is there a way to preselect the HUCs that intersect the data to be more efficient?

The refactoring and CLI logging are nice updates to preclipping.

I looked into this and tested an extent-based preselection so we would only load HUCs intersecting the selected state data instead of all 2,155 HUCs. In my testing, that change actually increased runtime from about 16 minutes to about 21 minutes (for VT only), which suggests that the added preselection work outweighed any savings from reducing the HUC load step. An alternative would be to maintain a separate static input file mapping HUCs to each state, but that would add maintenance overhead and potential issues if HUC boundaries change. Therefore, I’d prefer to keep the original approach.

@RobHanna-NOAA

Ya.. I agree. I am not sure we should allow by HUC only. It seems like there is a much greater risk of things getting out of sync unless we do full HUCs. And, the time is already negligible.
