Wednesday, August 12, 2020

A Tropical Cyclone Forecast Metric for Operations and Model Development

prospectus for a WAF paper 

  • why metrics matter: "you're only as good as what you measure"
    • .gov & .mil set standards for operational forecast quality or goodness
      • GPRA for NWS - mean 48-h PE/IE
      • PACOM for .mil - mean 24,48,72,120 PE
    • model 'goodness' based on mean PE/IE statistics
  • TC forecast is...
    • surface (10 m) wind field
    • 2-D functional representation using Position (lat/lon), Intensity (Vmax) and Radii (R34/50/64) POCI/ROCI (pressure and radius of outermost closed isobar) and other parameters...
    • NWS (noaa.gov) & PACOM (.mil) warnings/advisories based on onset of 34 kt winds
  • Standard TC metrics:
    • PE - position error falsely called 'track' error
    • IE - intensity error; intensity defined by Vmax not Pmin
    • forecast taus 0,12,24,36,48,60,72,96,120
    • primary statistic is the mean
    • best track uncertainty:
      • P ~ 5-20 nmi depending on I (big I small P uncertainty)
      • I ~ 10-20 kts largest for small I and (ironically) very large I 
  • Properties of PE (and IE)
    • first and foremost NOT equivalent to NWP metrics like 5-day 500 mb NHEM AnomCorr (NAC)
      • time series of  5DNAC --> mean
        • continuous from a continuous process (the model)
        • # of cases the same for day 5, day 10, day...
      • time series of 24/72/120 h PE -- 2019 LANT for hwrf,avno,tecm5
        • discontinuous 
        • 2 or more PE at a given time (2 or more storms)
      • # of cases at each tau is different showed by histogram of R34 at tau 24/72/120
        • varies with basins
        • in the LANT only 1 of every 2 forecasts has a 72-h verifying position
      • show how 24-h PE using only 120 h storms != as 24-h PE using all possible
    • display means of both NAC and PE as die off curves
      • NAC dieoff can be differentiated PE dieoff cannot
      •  PE should be displayed for each tau separately!
      • # of storms / mean PE
    • apples v orange problem!!
    • the 'population' is season/basin dependent
    • year-to-year variability in season/basin mean implies the population cannot be well defined
    • serial correlation between forecasts reduces number of cases
      • e-folding time ~ 12-18 h or for forecasts every 6-h Nind~Nall/3
    • for every forecast...
      • number of verifying cases at:
        • tau 24 ~ 80% (short range)
        • tau 72 ~ 50% (medium range)
        • tau 120 ~ 30% (long range)
      • mean PE/IE represent a subset of storms
      • contribution by storm highly variable
  • the most important part of the forecast is track
    • 80% v 20%?
  • How to improve mean PE?
    • separate from IE
    • a model must make a 'good' track forecast 
      • to use the intensity? maybe...but physically intensity does depend on track
        • can be seen in ensembles -- need to make this plot again...

    • improve the process that generates the forecast -- the model -- why ECMWF is the best TC forecast model
    • reduce 'big' errors
      • do big errors happen within a storm or by storm?
  • Forecast Error (FE)
    • is not the same as PE or IE
    • must be related to the wind field (the forecast) and particularly the extent of 34 kt winds represented by the Radius of 34 kt winds (R34)
    • conceptually FE=f(PE,IE,R34)
      • in the early years (60-80s) Charlie Neumann defined FE=PE
      • or FE=a*PE + b*IE ; a=1.0 and b=0.0
  • New FE=f(PE,IE)
    • only require the forecaster (human or model) to predict position and intensity use best track (BT) R34
    • define error IKE (integrated kinetic energy, Powell...) as symmetric difference of two (intersecting) circles of R34
      • symmetric R34
      • statistical relationship between Vmax and Rmax for forecast & BT
    • use of this simplified IKE represents a lower limit in FE 
  • How
  • demonstrate how 'large' errors contribute to the season/basin mean
    • for every one 'bad' forecast it takes 5-10 'good' forecasts to compensate
    • one storm can dominate the mean at the medium/long range
  • demonstrate how storms contribute to the mean PE/IE and why a model failure for a single storm can 'ruin' the seasonal mean
  • analyze 'large' model errors for both PE and FE
    • are large PE errors always large FE?
    • how do large FE compare to PE