Hail to High Variance: Rethinking Test Squares and Roof Damage Assessment

May 6, 2026 by Giancarlo Grant

For decades, the 10×10‑foot test square has been the industry standard for evaluating hail damage to roofing systems. It is simple, repeatable and widely understood. But convenience is not the same as reliability. When test squares are used to draw conclusions about an entire roof, important limitations often go unrecognized.

Hail damage may not be evenly distributed across a roof. Wind direction, roof geometry, slope orientation, shielding and material differences all influence the extent and severity of damage due to hail impacts. As a result, isolated test areas can both miss existing damage and exaggerate localized conditions. Understanding why this happens, and what can be concluded from test squares, is essential for inspectors who want defensible, reliable assessments.

Sampling, Not Certainty

At its core, a test square is a sample, not a measurement of the entire roof. Sampling is necessary when full inspection is impractical, but for sample results to represent the whole, the sampling process must be objective and probability‑based.

Too often, test squares are placed where damage “looks worst” or where access is easiest. These judgment-based selections introduce bias. A low‑exposure placement can drastically understate damage, while a high‑impact area can make a roof appear more severely damaged than average. Imagine for example a roof that is mostly shaded by trees or adjacent structures. A practitioner may seek out the unshaded portion and apply the results to the entire roof, but this neglects that the majority of the roof is shaded. In this example, it would be necessary to do at least two separate squares one for the shaded and another for the unshaded portion.

The key question is whether a square meaningfully represents the roof area it is being used to characterize.

Zero Hits Doesn’t Mean Zero Damage

One of the most common and most problematic conclusions in hail inspections is “no damage observed” based on a single 100-square-foot test area.

Hail impacts occur randomly across a surface. Even when a roof has real, measurable damage, small inspected areas can produce zero observed hits simply due to chance. This is especially true when damage density is low or moderate.

In practical terms, a zero‑hit result only becomes meaningful once enough area is inspected that damage would be likely to appear if it existed. While a larger inspected area rapidly reduces the chance of missing damage, the larger the square the more impractical it becomes for the practitioner.

For ease of understanding, hail damage was modeled as a Poisson distribution shown in Figure 1. From the chart we can see that event at 0 identified hits within a 100-square-foot test square, at a 95% confidence level there is still a good chance that the actual damage density could be up to three or four hits per 100 square feet.

In this case, the conclusion should not be “there is no damage,” but rather “no damage was observed within this limited area, and certain damage densities can be ruled out with stated confidence.”

The Risk of Mischaracterizing Damage Severity

Just as damage can be missed, it can also be mischaracterized.

Observed hit counts in a single test square can vary significantly from the true average damage across a roof slope. Variations in both storm and site characteristics can cause this. This is why multiple competent inspectors can assess the same roof and report different results. While their observations can be locally correct, without a robust sampling methodology, it rarely can be extrapolated to make roof wide conclusions.

As damage density increases, so does variability. Higher hit counts come with wider uncertainty, meaning that a single test square becomes less precise, not more, as damage increases. Relying on one test to determine whether a roof crosses a replacement threshold ignores this inherent variability.

Roof Size Does Matter

On a large roof facet, one or two test squares represent only a small fraction of the surface. Even if those squares show no damage, there is a likelihood that damage exists elsewhere but was not sampled. As roof facets increase in size or complexity, additional test areas become necessary to justify extending conclusions beyond the inspected locations.

What Can Defensibly Be Said From Test Squares

Test squares do support valid conclusions, but those conclusions must be probabilistic, not absolute.

Examples of defensible statements include:

“No hail hits were identified within the inspected 100-square-foot test area. Although isolated impacts elsewhere on the slope cannot be ruled out, the findings are not consistent with replacement‑level damage based on the sampled area without additional supporting test locations.”
“The observed hit counts of [X] are consistent with scattered or isolated hail impacts rather than uniform or widespread damage. These findings do not support a conclusion of slope‑wide replacement based on the inspected area alone; additional test areas would be required to determine whether damage density elsewhere materially differs. Conditions elsewhere on the slope may reasonably range from no damage to additional isolated impacts.”
“The observed damage density within the inspected area exceeds commonly used replacement thresholds. Assuming damage-inducing exposure was reasonably similar across the slope, it is probable that portions of the slope beyond the test area also exceed repairable conditions, supporting slope replacement unless contradictory sampling is observed.”

These statements acknowledge uncertainty while still providing actionable findings.

Improving Reliability with Better Sampling

When full‑roof inspection is not feasible, multiple randomly selected test areas dramatically improve reliability.

A practical approach is stratified sampling:

Divide the roof by orientation or exposure.
Overlay an imaginary grid of potential test locations.
Randomly select locations within each stratum before inspection.
Document substitutions and access limitations.

This transforms test squares from isolated observations into a defensible sampling method that supports roof‑level conclusions.

Practical Takeaways for the Industry

A single 10×10 test square is useful, but limited. Inspect as much roof as feasible.
Zero observed hits do not prove the roof is undamaged.
Single test areas commonly under‑ or over‑represent true roof conditions.
Placement bias materially affects outcomes.
Larger roofs require more sampling to justify broad conclusions.
Probabilistic language is more accurate and more defensible than absolute claims.
When stakes are high, more area is the answer.

Test squares remain a valuable tool for documenting localized hail damage, but they are not definitive measures of overall roof condition. Treating them as such introduces unnecessary risk, disagreement and error.

By recognizing hail impacts as inherently variable, and inspections as samples rather than certainties, roofing professionals can make clearer, more reliable, and more defensible assessments. In hail assessment, certainty does not come from tradition, it comes from understanding variance.

Grant is a senior managing consultant with BRG, licensed in multiple states. He has extensive experience evaluating hail and wind damage to roofing systems. His work now focuses on defensible damage and cost assessments and property claim analysis.

Popular Today

Popular This Month