Performance considerations

Understanding performance characteristics helps you make informed decisions about how to use earthkit-hydro efficiently.

One-time costs vs. repeated operations

Creating/loading networks:

Recommendation: Export and reuse custom networks
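A minimal sketch of the export-and-reuse pattern: build an object once, store it on disk, and reload it on later runs. `build_network`, `load_or_build`, and the file name are hypothetical stand-ins for illustration only. They are not earthkit-hydro functions, and the library's own export and loading routines should be preferred where they exist.

```python
import pickle
import tempfile
from pathlib import Path


def load_or_build(cache_path, builder):
    """Return the cached object if the file exists, otherwise build and store it."""
    cache_path = Path(cache_path)
    if cache_path.exists():
        with cache_path.open("rb") as f:
            return pickle.load(f)
    obj = builder()
    with cache_path.open("wb") as f:
        pickle.dump(obj, f)
    return obj


build_calls = []


def build_network():
    # Hypothetical stand-in for an expensive custom network construction.
    build_calls.append(1)
    return {"downstream": [1, 2, 3, -1]}


cache_file = Path(tempfile.mkdtemp()) / "my_network.pkl"
first = load_or_build(cache_file, build_network)   # builds and writes the file
second = load_or_build(cache_file, build_network)  # reads the file, no rebuild
```

The second call never invokes the builder, which is the point of paying the creation cost only once.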

Running operations:

Once a network is loaded, operations are highly optimized:

Performance scales roughly with:

Example scaling:

Different backends have different performance characteristics:

NumPy (CPU):
- Mature, well-optimized
- Single-threaded for most operations
- Good for moderate problem sizes

CuPy (GPU):
- Major speedup for large datasets
- Requires GPU with appropriate VRAM
- Best for repeated operations on large grids

PyTorch (CPU or GPU):
- Similar to NumPy on CPU
- Good GPU performance
- Overhead from autograd if not using torch.no_grad()
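To illustrate the autograd point, the sketch below runs the same arithmetic with and without `torch.no_grad()`. It uses plain tensor operations instead of an earthkit-hydro call, so treat it as an assumption about how the pattern would apply to library operations.

```python
import torch

# A tensor that participates in autograd, as model parameters typically do.
x = torch.ones(1000, requires_grad=True)

# Default behaviour: PyTorch records the computation graph for backpropagation.
y_tracked = (x * 2.0).sum()

# Inside no_grad, no graph is recorded, which avoids the bookkeeping overhead.
with torch.no_grad():
    y_untracked = (x * 2.0).sum()
```

When gradients are not needed (for example pure inference or diagnostics), wrapping the computation in `torch.no_grad()` is the usual way to skip graph construction.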

JAX:
- JIT compilation can provide speedups
- Good for repeated operations with same shapes
- Initial JIT compilation has overhead

Strategies for large datasets:

GPU memory: More limited than system RAM. Monitor VRAM usage with nvidia-smi.
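For scripted monitoring, `nvidia-smi` can be queried from Python. The sketch below assumes the tool is on the `PATH` and reports plain integers for each GPU. The helper names `parse_gpu_memory` and `query_gpu_memory` are illustrative and not part of any library.

```python
import shutil
import subprocess


def parse_gpu_memory(csv_text):
    """Parse 'used, total' pairs in MiB, one line per GPU."""
    usage = []
    for line in csv_text.strip().splitlines():
        used, total = (int(value) for value in line.split(","))
        usage.append((used, total))
    return usage


def query_gpu_memory():
    """Return [(used_mib, total_mib), ...], or None if nvidia-smi is absent."""
    if shutil.which("nvidia-smi") is None:
        return None
    result = subprocess.run(
        [
            "nvidia-smi",
            "--query-gpu=memory.used,memory.total",
            "--format=csv,noheader,nounits",
        ],
        capture_output=True,
        text=True,
        check=True,
    )
    return parse_gpu_memory(result.stdout)


# Parsing a sample of the CSV format, so the example runs without a GPU.
sample = "1024, 16384\n512, 16384\n"
parsed = parse_gpu_memory(sample)
```

Calling `query_gpu_memory()` before and after a large operation gives a rough picture of how much VRAM that operation needs.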

For repeated analyses:

For large domains:

For ML workflows:

Repeated network creation: Cache networks instead of recreating
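Within a single process, an in-memory cache is often enough to avoid recreating the same network. The sketch below uses `functools.lru_cache`. `get_network` is a hypothetical wrapper, and the dictionary it returns stands in for whatever network object the real creation step produces.

```python
from functools import lru_cache

creation_count = 0


@lru_cache(maxsize=None)
def get_network(name):
    # Hypothetical stand-in for an expensive network creation step.
    global creation_count
    creation_count += 1
    return {"name": name}


a = get_network("demo")
b = get_network("demo")  # served from the cache, no second creation
```

Both calls return the same object, so code that asks for the network in several places pays the creation cost once.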

Unnecessary data copies: Many array operations create copies. Use in-place operations when possible.
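The difference between copying and in-place updates can be seen with plain NumPy arrays. This is a generic illustration, not an earthkit-hydro call, and it does not carry over to every backend: JAX arrays, for instance, are immutable.

```python
import numpy as np

field = np.ones(1_000_000)
increment = np.full(1_000_000, 0.5)

# Binary operators allocate a new array for the result.
copied = field + increment

# Keep a reference to check that later updates reuse the same array.
original_buffer = field

# The out= argument writes the result into the existing buffer.
np.add(field, increment, out=field)

# Augmented assignment on a NumPy array also updates it in place.
field *= 2.0
```

For large fields, reusing the buffer avoids allocating a second array of the same size on every step.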

Type conversions: Converting between array types is expensive, particularly when data has to move between CPU and GPU memory. Stick to one backend per workflow.

Reading data repeatedly: Load data once, process multiple times

Small batch sizes: Vectorization benefits from larger batches
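The batching point can be sketched with NumPy: one vectorized call over a stack of fields replaces a Python loop over individual fields. The weighted sum here is a generic placeholder operation, and the shapes are arbitrary assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
weights = rng.random(500)
fields = rng.random((64, 500))  # 64 fields, each with 500 values

# One field at a time: 64 separate Python-level calls.
looped = np.array([np.sum(field * weights) for field in fields])

# Whole batch at once: a single matrix-vector product.
batched = fields @ weights
```

Both approaches give the same numbers, but the batched form does the work in one call and leaves the looping to compiled code.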