Comstock to Thermal Loads #555

yingli-NREL · 2025-11-18T18:45:07Z

related issue: #535

This PR will update the current thermal load components data to the latest ComStock data.
COMSTOCK vs Current Scout Data comparison:

Key notes:

Current Scout data has extreme high (>1) and low values (<-1).
Heating Window Conduction Load: High values in current Scout data (average absolute difference = 0.5). Some building types (e.g., large office) show component loads >1 in the current Scout data.
Heating Light: Low values in current Scout data (average absolute difference = 0.59). some building types (e.g., food sales) show component loads <-1 in the current Scout data.
Floor component load: All values are zero in ComStock.
Heating Ground Floor has the large sign differences observed, only 99% samples have mismatched signs. High values in current Scout data (>0.5), while negative values in ComStock data.
Heating Ground Floor, Heating Non-Electric Equipment, Heating Floor, Cooling Window Conduction Load, Cooling Wall, Cooling Ground Floor, Cooling Non-Electric Equipment, Cooling Ventilation: Large sign differences observed; more than half of the samples have mismatched signs.
Cooling loads generally show smaller absolute differences compared to heating loads.

How to interpret these plots:

Color shows heating vs cooling.
Number shows building type.
Sign change happens in quadrants II and IV.
A large distance from the diagonal line means there was a large difference between new and old data.
Components are normalized values and can be negative or <1.

More stats:
HEATING

Variable	Mean_Difference	MAE	Sign_Agree_%	Correlation
WIND_COND	-0.48	0.50	100	0.13
WIND_SOL	0.37	0.37	100	0.11
ROOF	-0.15	0.16	100	0.28
WALL	-0.33	0.36	100	-0.07
INFIL	-0.02	0.17	91	0.37
PEOPLE	0.09	0.10	100	0.12
GRND	-0.23	0.23	1	-0.73
EQUIP_ELEC	0.16	0.17	68	0.32
EQUIP_NELEC	0.09	0.09	27	-0.13
FLOOR	-0.06	0.06	39	NaN
LIGHTS	0.58	0.59	99	0.26
VENT	-0.02	0.32	100	0.22

COOLING

Variable	Mean_Difference	MAE	Sign_Agree_%	Correlation
WIND_COND	0.12	0.12	14	-0.36
WIND_SOL	-0.28	0.28	100	0.55
ROOF	0.05	0.07	72	0.78
WALL	0.10	0.10	36	0.82
INFIL	0.04	0.05	69	-0.06
PEOPLE	0.01	0.04	100	0.74
GRND	0.12	0.13	33	-0.12
EQUIP_ELEC	0.01	0.10	100	0.51
EQUIP_NELEC	-0.04	0.04	27	0.06
FLOOR	0.03	0.03	36	NaN
LIGHTS	-0.46	0.46	100	0.54
VENT	0.29	0.29	6	-0.32

Calculation methods:
Mean diff = (ComStock-ScoutCurrentData).mean()
MAE = (|ComStock-ScoutCurrentData|).mean()
Sign Agree = (if sign_ComStock == sign_ScoutCurrentData).mean() to percentage
Correlation = ComStock.corr(ScoutCurrentData)

yingli-NREL · 2025-11-20T21:16:57Z

@rHorsey Please review this https://github.com/trynthink/scout/blob/comstock_component_load/scout/supporting_data/thermal_loads_data/comstock_to_thermalLoads.py#L129-L159

jmythms · 2025-11-21T14:58:47Z

scout/supporting_data/thermal_loads_data/comstock_to_thermalLoads.py

-    # Create NAREA column from calc.weighted.sqft..ft2
-    weight_column = df["calc.weighted.sqft..ft2"]
-    df["AREA"] = weight_column
-    print("Number of None in BLDG:", df["AREA"].isna().sum())


Nice! 🙂👍🏾

jmythms

We need to recreate the mseg_res_com_cz, mseg_res_com_emm, and mseg_res_com_state files. Only this will let Scout pick up our updates and trigger changes in the results.

How to do this:

python scout/com_mseg.py
This should output mseg_res_com_cdiv.json
Generating the final aggregated files:
python scout/final_mseg_converter.py

Select options 1,1 when prompted. This should output mseg_res_com_cz.json. Also run with options 1,2 and 1,3. This will generate mseg_res_com_emm, and mseg_res_com_state and we need to update these on the repo.

Otherwise, the changes look very nice, and great job generating the component loads file. Looking forward to seeing the results of the QA work.

jmythms · 2025-11-21T16:00:14Z

scout/supporting_data/thermal_loads_data/comstock_to_thermalLoads.py

+def add_missing_building_type(df):
+    avg_cols = list(set(COMSTOCK_SEGMENT_TO_CATEGORY.values()))
+    result_list = []
+
+    # Iterate over CDIV × ENDUSE combinations
+    for cdiv in df['CDIV'].unique():
+        for enduse in df['ENDUSE'].unique():
+            subset = df[(df['CDIV'] == cdiv) & (df['ENDUSE'] == enduse)]
+
+            # Establish rows for "Assembly" building type as an average of the rows
+            # for "Education", "Sm. Office", and "Merch./Service"
+            assembly_avg = subset[subset['BLDG'].isin([2, 8, 9])][avg_cols].mean().round(4)
+            # Establish rows for "Other" building type as an average of the rows
+            # for "Lodging", "Lg. Office", and "Warehouse"
+            other_avg = subset[subset['BLDG'].isin([6, 7, 10])][avg_cols].mean().round(4)
+            # Establish rows for "food sales" building type as an average of the rows
+            # for "food service" and "mercantile/service"
+            food_sales_avg = subset[subset['BLDG'].isin([4, 9])][avg_cols].mean().round(4)
+
+
+            for bldg in subset['BLDG'].unique():
+                block = subset[subset['BLDG'] == bldg].copy()
+
+                if bldg == 1:
+                    for col in avg_cols:
+                        block[col] = assembly_avg[col]
+                elif bldg == 11:
+                    for col in avg_cols:
+                        block[col] = other_avg[col]
+                elif bldg == 3:
+                    for col in avg_cols:
+                        block[col] = food_sales_avg[col]
+
+                result_list.append(block)
+
+    final_df = pd.concat(result_list, ignore_index=True)
+
+    return final_df


I suggest adding a docstring to clarify the rationale for this function, so it’s easier to understand when we revisit this PR.

jmythms · 2025-11-21T16:01:17Z

scout/supporting_data/thermal_loads_data/comstock_to_thermalLoads.py

+    # Create weight column
+    weight_column = df["weight"]
+    df["weight"] = weight_column


Did you encounter any rows in this column with no values like NAs, blank rows, etc?

Yingli added 7 commits November 18, 2025 11:44

first commit

2ed821f

code quality

64df165

code quality

eedd644

missing building type

e46cdcc

code quality

6a1bd62

code quality

38d5127

building type mapping

5703fc8

yingli-NREL requested review from jmythms and rHorsey November 20, 2025 21:15

remove area column

c24de96

jmythms reviewed Nov 21, 2025

View reviewed changes

Yingli and others added 7 commits November 24, 2025 14:06

revise building type mapping

e19c3cf

updated files

dfc1c3e

data update

6601c43

update document

8314145

code quality

eacd9af

code quality

2f06d0e

Upload results files from CI build

1754258

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Comstock to Thermal Loads #555

Comstock to Thermal Loads #555

Uh oh!

yingli-NREL commented Nov 18, 2025 •

edited

Loading

Uh oh!

yingli-NREL commented Nov 20, 2025

Uh oh!

jmythms Nov 21, 2025

Uh oh!

jmythms left a comment

Uh oh!

jmythms Nov 21, 2025

Uh oh!

jmythms Nov 21, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comstock to Thermal Loads #555

Are you sure you want to change the base?

Comstock to Thermal Loads #555

Uh oh!

Conversation

yingli-NREL commented Nov 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

yingli-NREL commented Nov 20, 2025

Uh oh!

jmythms Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

jmythms left a comment

Choose a reason for hiding this comment

Uh oh!

jmythms Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

jmythms Nov 21, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

yingli-NREL commented Nov 18, 2025 •

edited

Loading