Thursday, November 30, 2023

Why Model Track Forecasts do NOT Depend on Initial Intensity

The Question

Konno-san asked the following in his 6 June 2023 post to lists.tstorms.org:

How can ECMWF be the best in the world at forecast track while not being much good at intensity?

Some Answers


Sim Aberson and Mark DeMaria suggested a couple of reasons:
  • DeMaria (1985) barotropic model study showing a stronger dependence on outer wind structure ('strength' and 'size') than inner-core intensity
  • Intensity (wind) dependence on pressure profile and model resolution
  • Steering-flow dependence on intensity -- more intense storms move with a deeper mean flow (e.g., 850-400 hPa) than weaker storms --> track forecast depends more on the accuracy of the larger-scale, steering flow (DeMaria et al., 2022)
  • Variation in steering flow with basins?

First considerations


Initial and short-range (0-24 h) TC motion can be considered a combination of: 1) internal vortex dynamical motion (e.g., beta drift); and 2) steering (advection).  Vortex initialization will dominate the beta-drift component (~1-3 m/s) and can be significant even if smaller than the steering flow.  The role of vortex structure in TC motion was my PhD dissertation work with Russ Elsberry (Fiorino and Elsberry, 1989a and 1989b).  Our main finding is that the flow in the 300-500 km annulus strongly affects both speed and direction of beta drift.

For a model to make a good 12 & 24 h track forecast both the initial and forecast large-scale flow has to be accurate as well as the initial vortex flow in the 'critical' 300-500 km annulus.  The inner-core (intensity) largely does not affect beta drift.  

To understand the role of initial beta drift in model motion, we need to define a metric for a 'good' TC vortex initialization.  Conventionally, it is the initial position error (IPE) and initial intensity error (IIE) -- what a forecaster might desire of a vortex initialization -- the model has the storm where I analyze it to be with my estimate of intensity.

Before showing results from the trackers it is necessary to understand what a model TC forecast is and is not and that the model does not forecast TCs directly.  The 'model' TC forecast comes from a tracker that takes in model data and outputs TC tracks

There are many model issues that need to be understood before analyzing model TC tracks.  

First, NWP models do not forecast the 10-m wind, it is a diagnosed quantity from the model prognostic variables (and sometimes similarity theory).  

Second, the difference between the model spatial grid and analysis grid used in the tracker.  Rarely does a tracker use native-resolution grids, especially for Gaussian and limited-area model grids.  The tracker most commonly uses post-processed, lat-lon grids.  

Third, the time dimension of the model grid.  The instantaneous value of pressure and especially wind is a mean over a time step which can be as small as 10-15 s and is even more complicated with models that use different time steps for dynamics and physics.  

These model issues are important when interpreting model tracker output especially because of differences in model tracker algorithms.  There are always some apples-v-oranges problems.  

Finally, it is important to verify the raw output of the model tracker.  In operations the tracker output is both post-processed (bias correction) and interpolated in time to be consistent with the time when the advisory/warning is issued.  For example, the 06Z NHC forecast will not have 06Z model runs available, instead 00Z runs will be used.

The Models


We will consider three modeling systems for the period 2020-22 (4-character ATCF ID in BOLD): 

  • AVNO - GFS (the American global model)
  • HWRF - GFS-HWRF (dynamical downscaling of the GFS using the HWRF limited-area model)
  • ECMWF tracks.  The three ECMWF tracks are:

    • TECM5- GFDL tracker of the deterministic run of the IFS; 
    • EMDT - ECMWF tracker of the deterministic IFS run; and
    • TERA5FDL tracker of the ERA5 reanalysis forecasts.  

The most significant feature of the modeling systems is that only GFS-HWRF explicitly analyses the TC vortex.

Bottom line (up front)


  • HWRF has almost no bias in the initial intensity and the smallest IIE & IPE -- a 'good' vortex initialization(?).  
  • Conversely, ECMWF has the largest IIE & IPE, but the lowest  12- and 24-h mean position errors (PE)!  
  • Also noteworthy is that the ECMWF ERA5 reanalysis has lower error than the ECMWF IFS operational runs.

The big conclusions:

  • the HWRF vortex initialization is NOT good -- the TC must be analyzed as part of the large-scale flow
  • the global model 12- and 24-h PE are incredibly small.


The GFDL & ECMWF TC Trackers


The de facto standard tracker for the US operational models is the GFDL vortex tracker (Marchok 2021) and the code is available here.


The new ECMWF tracker described ECMWF tracker 2012 (page 17) was recently updated for improved analysis of the surface wind field and sea-level pressure in Enhancing tropical cyclone wind forecasts | ECMWF.  The new tracker is an update to these trackers:

and what is most distinctive about the ECMWF tracker is the use of a lower grid resolutions in finding the TC center (~200 km)  and full native resolution (~9 km) to find the max 10-m surface wind (intensity) and the wind radii.   The GFDL tracker uses a single resolution.

The grid resolution of the 3 global models using the GFDL tracker (AVNO, ECMF and ERA5) is 0.25 deg.  The resolution of the GFDL tracker grid for HWRF is near native ~3 km.

Comparing ECMF (gfdl) v EMDT (ecmwf) shows the effect of the tracking algorithm, whereas comparing ECMF (gfdl) v ERA5 (gfdl) shows the effect of the modeling system. 


All statistics are homogenous.  The bar charts include a table with the value and the counts in [] and a box-whisker (min / 25% / 75% / max) for the error distribution with the thick black line indicating the median.


Fig. 1 gives the mean PE (bars) for the standard forecast times of taus of 0, 12, 24,36, 48, 72 (3 d), 96 (4 d) and 120 (5 d).  Note how the mean PE of the GFDL tracker (TECM5) is lower than the ECMWF tracker (EMDT)



Figure 1.  Effect of tracker algorithm (EMDT v TECM5) and modeling (TECM5 v TERA5) on Position Error (PE).  The distribution is shown as a box-whisker.  The black line is the median.
  

To display how the ECMWF tracker degrades or has higher PE than the GFDL tracker using the same model output.

The % improvement of PE of model1 (PE1) relative to the PE of model2 (PE2 is:

%improve = -((PE1 - PE2)/PE2))*100%

When PE1 <  PE2 then the %improve is positive (a lower PE is good).  In Fig. 2 below we see how the ECMWF tracker produces higher PE than the GFDL tracker (negative %improve).  Both trackers use grids taken from the same native resolution model which implies grid resolution is important in finding the TC center/position.


Figure 2.  % improvement (lower PE) of ECMWF tracker relative to GFDL tracker.

Now consider the effect of grid resolution on intensity in Fig. 3 below:

Figure 3. As in Fig. 1, the effect of tracker algorithm (EMDT v TECM5) and modeling (TECM5 v TERA5) on Intensity Error (IE).  The lines are mean absolute IE, the bars and box-whisker the error itself.

Whereas the ECMWF tracker degraded PE, the intensity error (mean absolute IE) is only about1 kt lower which can be attributed to using 0.25 deg (~21 km) grids in the GFDL tracker compared to ~9 km native resolution in the ECMWF tracker used to find the max wind.  The bias (mean IE) is about 2 kt lower again showing how higher resolution is better for analyzing the wind fields.


These results suggest that ECMWF should use a higher-resolution grids for find the TC center.


The ERA5 IE are about 5-7 kts higher, but in Fig 4. below we find no effect on PE for taus 0-36 h and a slight degradation at days 3-5.


Figure 4.  As in Fig. 1.  The effect of modeling on PE (IFS v ERA5).  The bars are the %improve of EMDT (ecmwf)  and TERA5 (gfdl) over TECM5 (gfdl).

The ERA5 initial PE is higher, but it is very impressive how the lower-resolution model of ERA5 is better or even out to tau 48 h compared to the more recent and higher-resolution IFS.


References


DeMaria, M., 1985: Tropical Cyclone Motion in a Nondivergent Barotropic Model. Monthly Weather Review, 113, 1199–1210, https://doi.org/10.1175/1520-0493(1985)113<1199:TCMIAN>2.0.CO;2

DeMaria, M., and Coauthors, 2022: The National Hurricane Center Tropical Cyclone Model Guidance Suite. Weather and Forecasting, 37, 2141–2159, https://doi.org/10.1175/WAF-D-22-0039.1.

FIORINO, M., and R. ELSBERRY, 1989: SOME ASPECTS OF VORTEX STRUCTURE RELATED TO TROPICAL CYCLONE MOTION. J Atmos Sci, 46, 975–990, https://doi.org/10.1175/1520-0469(1989)046<0975:SAOVSR>2.0.CO;2.
Fiorino, M., and R. L. Elsberry, 1989: Contributions to Tropical Cyclone Motion by Small, Medium and Large Scales in the Initial Vortex. Monthly Weather Review, 117, 721–727, https://doi.org/10.1175/1520-0493(1989)117<0721:CTTCMB>2.0.CO;2.

Magnusson, L., and Coauthors, 2019: ECMWF Activities for Improved Hurricane Forecasts. Bulletin of the American Meteorological Society, 100, 445–458, https://doi.org/10.1175/BAMS-D-18-0044.1.

Marchok, T., 2021: Important Factors in the Tracking of Tropical Cyclones in Operational Models. Journal of Applied Meteorology and Climatology, 60, 1265–1284, https://doi.org/10.1175/JAMC-D-20-0175.1.

Van der Grijn, G., J.-E. Paulsen, F. Lalaurette & M. Leutbecher, 2005: Early medium-range forecasts of tropical cyclones. ECMWF Newsletter No. 102, 7–14.

Vitart, F., J. L. Anderson, and W. F. Stern, 1997: Simulation of Interannual Variability of Tropical Storm Frequency in an Ensemble of GCM Integrations. Journal of Climate, 10, 745–760, https://doi.org/10.1175/1520-0442(1997)010<0745:SOIVOT>2.0.CO;2.