Forecast Skill

Forecast skill score (SS) is defined as the average accuracy of the test forecasts relative to the accuracy of forecasts produced by a reference method:

,

where MSE is the mean squared error of the test forecasts and MSEr is that of the reference forecasts (NOAA Forecast Verification Glossary). MSE measures mean squared difference between the forecasts and observations:

.

For an individual station, N is the number of years of  forecast; for an area, N is the number of years times the number of stations.

Three reference methods are used here:

  1. Climatology, or climate normals. Climatology is obtained by averaging the corresponding weather derivative values over the 30-year base period, 1971-2000. Climatology is one of the two most widely used standards of reference in the field of forecast verification. The second one is persistence.
  2. Persistence. It represents a simple method of forecasting the most recent observation. Despite its apparent simplicity, persistence often provides a hard-to-beat forecast.
  3. Optimal climate normals (OCN). The OCN is one of the main forecasting tools used by the Climate Prediction Center (CPC). It  predicts a climate variable using its average values for a given month/season during the last 10 years (for temperature). The tool, as it is used at the CPC, is simply a measure of the trend. In the absence of a clearly defined ENSO phase, the OCN output dominates the forecast maps produced by the CPC, as shown in this Seasonal Forecast Guidance.

Positive (negative) skill score (presented here in per cent) means that the test forecast is better (worse) than that of the reference method. A score of 0 indicates that both forecasts are equally skillful. Note that the skill score is asymmetric relative to zero. For a perfect forecast, the skill score reaches a maximum of 100, but it can go to negative infinity for a badly missed forecast, if the reference forecast is close to the observation. Therefore, the skill score can be unstable for small sample sizes. 

To be consistent with our forecast maps, upon which the weather derivative forecasts are based, we pooled our European monthly forecasts into two groups: 1) November-December, and 2) January-February-March. For the North American cities, only seasonal forecasts are available so far. It should be noted that many cities are close to each other, so that their temperature variations are highly correlated. This further reduces the effective sample size, because the forecasts for those cities cannot be considered independent. Therefore, in order to get a better feeling of the accuracy of the forecasts, it is recommended to read our post-mortem analyses, which describe how successful (or otherwise) the forecast was in capturing the essential aspects of climatic processes during the forecast season. The latest post-mortem analysis can be found in our Forecast section, and for the earlier years in the Forecast Archive.

An example of a performance analysis for the CPC's climate outlooks is presented by Klaus Wolter (NOAA). It is shown that temperature forecast skill (relative to climatology) averages just under +20 (for non-CL forecasts) and under +10 for all forecasts, with wild swings from season to season and from year to year. There was no advance in forecast skill since 1995, when the first climate outlook was issued. Given a strong warming trend in recent decades, it would be interesting to see how the CPC's climate outlooks are scored relative to persistence or the OCN. 

Another way to estimate the accuracy of the forecasts is to see how profitable they can be when used for weather derivative trading on WeatherBill.com. See the results of betting on heating degree days for the winter (Nov-Mar) of 2008.



© 2007 Climate Logic
All Rights Reserved
 
Home | Forecasts | Services | Software | About Us | Contact Us
Disclaimer