Xarray Fundamentals

20. Xarray Fundamentals

Attribution: This notebook is a revision of the Xarray Fundamentals notebook by Ryan Abernathey from An Introduction to Earth and Environmental Data Science.

In this lecture, we will take a deep dive into the xarray package. So far in this class, we have used rioxarray for raster data processing and stackstac for analyzing stacks of satellite imagery. Both packages are built on top of xarray; however, we did not use all of xarray's functionality with them.

We also worked with xarray DataArray and Dataset objects after reading data from different sources. Here, we will learn how to create a DataArray or Dataset from scratch.
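As a rough illustration of what "from scratch" can look like, the sketch below builds a small DataArray and a Dataset by hand. The values, the dimension names (level, date), and the variable names are made up for illustration and are not taken from the lecture's data.

```python
import numpy as np
import xarray as xr

# Hypothetical values on a 3 x 2 grid; all names below are illustrative only.
temperature = xr.DataArray(
    np.array([[20.0, 21.0], [10.0, 11.0], [4.0, 4.5]]),
    dims=("level", "date"),
    coords={"level": [0, 1, 2], "date": [0, 1]},
    name="temperature",
    attrs={"units": "degC"},
)

salinity = xr.DataArray(
    np.full((3, 2), 35.0),
    dims=("level", "date"),
    coords={"level": [0, 1, 2], "date": [0, 1]},
    name="salinity",
)

# A Dataset groups DataArrays that share coordinates under named variables.
ds = xr.Dataset({"temperature": temperature, "salinity": salinity})

print(temperature.dims)
print(list(ds.data_vars))
```

The Dataset here is assembled from already-constructed DataArrays; passing a dictionary of `(dims, values)` tuples to `xr.Dataset` is another common way to do the same thing.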

In the previous lecture on Raster Data Processing, we introduced the Dataset in rioxarray and discussed how it differs from a DataArray. Here we will review the data structures in xarray in full:

20.3. Computation

Xarray DataArrays and Datasets work seamlessly with arithmetic operators and numpy array functions.

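# Convert temperature from degrees Celsius to kelvin
temp_kelvin = argo.temperature + 273.15
# Plot with the y-axis inverted (yincrease=False)
temp_kelvin.plot(yincrease=False)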

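To make the point about arithmetic operators and NumPy functions concrete without relying on the argo data, here is a minimal sketch on a synthetic DataArray. The values and the dimension names (level, date) are assumptions chosen for illustration.

```python
import numpy as np
import xarray as xr

# Synthetic temperatures in degrees Celsius (made-up values, not the argo data).
temp_c = xr.DataArray(
    [[20.0, 21.0], [10.0, 11.0], [4.0, 4.5]],
    dims=("level", "date"),
    coords={"level": [0, 1, 2], "date": [0, 1]},
)

# Scalar arithmetic is applied elementwise and keeps dims and coords.
temp_k = temp_c + 273.15

# NumPy ufuncs applied to a DataArray return a DataArray, not a bare ndarray.
log_temp = np.log(temp_k)

print(type(log_temp).__name__)  # DataArray
print(log_temp.dims)            # ('level', 'date')
```

Because the result is still a DataArray, label-based selection such as `log_temp.sel(level=0)` keeps working after the computation.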

We can also combine multiple xarray variables, such as two DataArrays from the same Dataset, in arithmetic operations:

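g = 9.8  # gravitational acceleration (m/s^2)
# Combine temperature and salinity elementwise into a single DataArray
buoyancy = g * (2e-4 * argo.temperature - 7e-4 * argo.salinity)
buoyancy.plot(yincrease=False)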

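The same expression can be tried on a small stand-in Dataset, which makes it easier to check the result by hand. In this sketch the Dataset contents are made up, and the names alpha and beta for the two coefficients are labels introduced here for readability; they are not from the original notebook.

```python
import numpy as np
import xarray as xr

# Made-up stand-in for the argo Dataset: two variables on a shared (level, date) grid.
ds = xr.Dataset(
    {
        "temperature": (("level", "date"), np.array([[20.0, 21.0], [5.0, 5.5]])),
        "salinity": (("level", "date"), np.array([[35.0, 35.2], [34.5, 34.6]])),
    },
    coords={"level": [0, 1], "date": [0, 1]},
)

g = 9.8       # gravitational acceleration (m/s^2)
alpha = 2e-4  # coefficient on temperature (hypothetical name, value from the text)
beta = 7e-4   # coefficient on salinity (hypothetical name, value from the text)

# The two variables share dims and coords, so they combine elementwise.
buoyancy = g * (alpha * ds.temperature - beta * ds.salinity)

print(buoyancy.dims)
print(buoyancy.shape)
```

The result is a single DataArray on the same grid as its inputs, so it can be plotted or indexed exactly like temperature or salinity.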
