Xarray Interpolation, Groupby, Resample, Rolling, and Coarsen

Contents

21. Xarray Interpolation, Groupby, Resample, Rolling, and Coarsen#

Attribution: This notebook is a revision of the Xarray Interpolation, Groupby, Resample, Rolling, and Coarsen notebook by Ryan Abernathey from An Introduction to Earth and Environmental Data Science. Thanks to Aiyin Zhang for preparing this notebook.

In this lesson, we cover some more advanced aspects of xarray.

import numpy as np
import xarray as xr
from matplotlib import pyplot as plt

21.5. An Advanced Example#

In this example we will show a realistic workflow with Xarray. We will

Load a “basin mask” dataset
Interpolate the basins to our SST dataset coordinates
Group the SST by basin
Convert to Pandas Dataframe and plot mean SST by basin

basin_surf.plot(vmax=10)
#basin_surf

<matplotlib.collections.QuadMesh at 0x78264ba04610>

../_images/dcc509bd7dc55da7e7e58dc5c8ceba276771a1c73cb1b3775b3a2d830f232c69.png

basin_surf_interp = basin_surf.interp_like(ds.sst, method='nearest')
basin_surf_interp.plot(vmax=10)
#basin_surf_interp

<matplotlib.collections.QuadMesh at 0x78264afdfe50>

../_images/3bad6ac95723bda3fa0dd93548232658868fb200cce3eb376a6b55e1f178f0c3.png

df = basin_mean_sst.mean('time').to_dataframe()
df

	Z	sst
basin
1.0	0.0	19.317692
2.0	0.0	21.204735
3.0	0.0	21.147755
4.0	0.0	19.902565
5.0	0.0	8.199746
6.0	0.0	15.138650
7.0	0.0	28.522148
8.0	0.0	26.654783
9.0	0.0	0.345633
10.0	0.0	1.550839
11.0	0.0	-0.799598
12.0	0.0	12.162644
53.0	0.0	14.433341
56.0	0.0	28.495367

import pandas as pd
basin_names = basin_surf.attrs['CLIST'].split('\n')
basin_df = pd.Series(basin_names, index=np.arange(1, len(basin_names)+1))
basin_df

               Atlantic Ocean
               Pacific Ocean 
                 Indian Ocean
            Mediterranean Sea
                   Baltic Sea
                    Black Sea
                      Red Sea
                 Persian Gulf
                   Hudson Bay
              Southern Ocean
                Arctic Ocean
                Sea of Japan
                    Kara Sea
                    Sulu Sea
                  Baffin Bay
          East Mediterranean
          West Mediterranean
              Sea of Okhotsk
                   Banda Sea
               Caribbean Sea
               Andaman Basin
             North Caribbean
              Gulf of Mexico
                Beaufort Sea
             South China Sea
                 Barents Sea
                 Celebes Sea
              Aleutian Basin
                  Fiji Basin
        North American Basin
         West European Basin
      Southeast Indian Basin
                   Coral Sea
           East Indian Basin
        Central Indian Basin
    Southwest Atlantic Basin
    Southeast Atlantic Basin
     Southeast Pacific Basin
             Guatemala Basin
         East Caroline Basin
              Marianas Basin
              Philippine Sea
                 Arabian Sea
                 Chile Basin
                Somali Basin
             Mascarene Basin
                Crozet Basin
                Guinea Basin
                Brazil Basin
             Argentine Basin
                  Tasman Sea
       Atlantic Indian Basin
                 Caspian Sea
                 Sulu Sea II
             Venezuela Basin
               Bay of Bengal
                    Java Sea
  East Indian Atlantic Basin
dtype: object

df = df.join(basin_df.rename('basin_name'))

df.plot.bar(y='sst', x='basin_name')

<Axes: xlabel='basin_name'>

../_images/889252c5ae016840bb9b8ff0ac35ccca054203c09d006feed946ffc250ce6c32.png