StructureFunctions.jl v0.3.0

High-performance structure function calculations for turbulence and spatial correlation analysis.

StructureFunctions.jl computes structure functions (SFs) from scattered data, characterizing the spatial correlations and scaling properties of turbulent or otherwise spatially varying fields. It is optimized for multi-dimensional data, with typed backends for serial, threaded, distributed, and GPU execution.

Features

Structure Functions: 1st, 2nd, 3rd order; longitudinal & transverse projections in 1D, 2D, 3D
Typed Backend System: Serial, Threaded, Distributed, GPU, Auto — choose your parallelization strategy
Type-Stable Dispatch: No runtime overhead from symbolic dispatch; all paths validated with JET
Extensible Architecture: Optional extensions for parallelization and GPU acceleration
Production Ready: Comprehensive test coverage, numerical validation, performance benchmarking
Modern Julia: Julia 1.12+ with qualified imports and explicit type annotations

Quick Start

using StructureFunctions: Calculations as SFC, StructureFunctionTypes as SFT

# 2D data: 3 points
x = ([0.0, 1.0, 2.0], [0.0, 0.0, 0.0])
u = ([1.0, 1.1, 1.2], [0.0, 0.05, 0.1])

# Distance bins (physical units)
bins = [(0.0, 1.0), (1.0, 2.0), (2.0, 3.0)]

# Calculate 2nd-order longitudinal SF
sf_type = SFT.LongitudinalSecondOrderStructureFunctionType()
result = SFC.calculate_structure_function(sf_type, x, u, bins)

# result.values contains the SF values for each bin
println("SF values: ", result.values)

# Speed it up with threading (if available)
using Base.Threads
if nthreads() > 1
    result_threaded = SFC.calculate_structure_function(
        sf_type, x, u, bins;
        backend=SFC.ThreadedBackend()
    )
end

Architecture

Operator Types ✕ Result Container Pattern

The v0.3.0 API separates operators (structure function definitions) from result containers (computed outcomes):

AbstractStructureFunctionType (operators)
  ├── LongitudinalSecondOrderStructureFunctionType
  ├── TransverseSecondOrderStructureFunctionType
  ├── LongitudinalThirdOrderStructureFunctionType
  └── ... (3+ other variants)

StructureFunction (result container)
  ├── operator::AbstractStructureFunctionType
  ├── distance_bins::AbstractVector
  ├── values::AbstractVector
  └── order::Int

This split ensures:

Clear semantics: operators are inputs, containers are outputs
Type stability: dispatch is resolved at compile time
Extensibility: custom operators and containers are easy to add
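
The operator/container split can be illustrated with a small self-contained sketch. The names used here (AbstractOp, LongitudinalSecondOrder, LongitudinalThirdOrder, SFResult, order) are hypothetical stand-ins chosen for illustration, not the package's actual definitions:

```julia
using LinearAlgebra: dot

# Hypothetical stand-ins for the operator hierarchy (not the package's types).
abstract type AbstractOp end
struct LongitudinalSecondOrder <: AbstractOp end
struct LongitudinalThirdOrder <: AbstractOp end

# Operators are callable: each maps one increment `du` and one unit
# separation vector `rhat` to that pair's contribution.
(::LongitudinalSecondOrder)(du, rhat) = dot(du, rhat)^2
(::LongitudinalThirdOrder)(du, rhat) = dot(du, rhat)^3

# The order is a property of the operator type, resolved by dispatch.
order(::LongitudinalSecondOrder) = 2
order(::LongitudinalThirdOrder) = 3

# Result container: keeps the operator next to the numbers it produced.
struct SFResult{OT<:AbstractOp,BT<:AbstractVector,VT<:AbstractVector}
    operator::OT
    distance_bins::BT
    values::VT
    order::Int
end

# Convenience constructor that fills in the order from the operator.
SFResult(op::AbstractOp, bins, values) = SFResult(op, bins, values, order(op))

op = LongitudinalSecondOrder()
res = SFResult(op, [(0.0, 1.0), (1.0, 2.0)], [0.01, 0.04])
```

Because the operator type is a type parameter of the container, code that receives res can specialize on the operator without any runtime lookup.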

Backend Dispatch System

calculate_structure_function(sf_type, x, u, bins; backend=AutoBackend())
    ↓
_dispatch_execution_backend(backend, ...)
    ├── SerialBackend       → serial_calculate_structure_function
    ├── ThreadedBackend     → threaded_calculate_structure_function (from OhMyThreadsExt)
    ├── DistributedBackend  → parallel_calculate_structure_function (from DistributedExt)
    ├── GPUBackend(b)       → gpu_calculate_structure_function (from GPUExt)
    └── AutoBackend         → (tries distributed → threaded → serial)

All code paths are expected to produce numerically equivalent results, up to floating-point rounding from different summation orders (validated by an extensive test suite).
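
The dispatch diagram above follows a common Julia pattern: one method per backend type, selected from the type of the backend argument. A minimal sketch of that pattern, using hypothetical types (AbstractBackend, Serial, Threaded) and a toy workload (a sum) rather than the package's internals:

```julia
# Hypothetical backend types mirroring the names above (not the package's definitions).
abstract type AbstractBackend end
struct Serial <: AbstractBackend end
struct Threaded <: AbstractBackend end

# One method per backend type; Julia selects the method from the argument's type.
run_backend(::Serial, data) = sum(data)

function run_backend(::Threaded, data)
    # Split the indices into roughly one chunk per thread (assumes non-empty data),
    # sum each chunk in its own task, then combine the partial sums.
    nchunks = max(1, min(Threads.nthreads(), length(data)))
    chunks = Iterators.partition(eachindex(data), cld(length(data), nchunks))
    tasks = map(chunks) do idx
        Threads.@spawn sum(view(data, idx))
    end
    return sum(fetch.(tasks))
end

# Fallback for backend types whose implementation has not been loaded.
run_backend(b::AbstractBackend, data) =
    error("no implementation loaded for $(typeof(b))")

# Public entry point: the keyword only chooses which method runs.
compute(data; backend::AbstractBackend=Serial()) = run_backend(backend, data)

data = collect(1.0:100.0)
serial_total = compute(data)
threaded_total = compute(data; backend=Threaded())
```

The fallback method shows how an extension-based design can fail with a clear message when the package providing a backend has not been loaded.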

Backends

SerialBackend (Default Reference)

Single-threaded CPU execution. Use when:

Debugging or validating calculations
Data is small
Deterministic execution is required

result = SFC.calculate_structure_function(sf_type, x, u, bins)  # Defaults to Serial
result = SFC.calculate_structure_function(sf_type, x, u, bins; 
                                        backend=SFC.SerialBackend())

Performance: O(N²) pairwise distance/SF evaluations.
Memory: O(N + B) where N = points, B = distance bins.
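
To make the O(N²) time and O(N + B) memory figures concrete, here is a naive reference loop for the second-order longitudinal SF. It is an independent sketch, not the package's implementation; the function name naive_longitudinal_sf2 and the half-open bin convention [r_min, r_max) are assumptions:

```julia
using LinearAlgebra: dot, norm

# Naive O(N^2) reference with half-open bins [r_min, r_max).
# `x` and `u` are tuples of coordinate/component vectors, as in the Quick Start.
function naive_longitudinal_sf2(x::Tuple, u::Tuple, bins)
    npoints = length(first(x))
    sums = zeros(length(bins))          # O(B) accumulators
    counts = zeros(Int, length(bins))
    for i in 1:npoints-1, j in i+1:npoints
        r = [xd[j] - xd[i] for xd in x]     # separation vector
        du = [ud[j] - ud[i] for ud in u]    # field increment
        dist = norm(r)
        dist == 0 && continue               # coincident points have no direction
        b = findfirst(bin -> bin[1] <= dist < bin[2], bins)
        b === nothing && continue           # pair falls outside all bins
        sums[b] += dot(du, r / dist)^2      # squared longitudinal increment
        counts[b] += 1
    end
    # Empty bins yield NaN rather than a misleading zero.
    return [c > 0 ? s / c : NaN for (s, c) in zip(sums, counts)]
end

x = ([0.0, 1.0, 2.0], [0.0, 0.0, 0.0])
u = ([1.0, 1.1, 1.2], [0.0, 0.05, 0.1])
bins = [(0.0, 1.0), (1.0, 2.0), (2.0, 3.0)]
values = naive_longitudinal_sf2(x, u, bins)
```

With the Quick Start data, the two unit-separation pairs land in the second bin and the single pair at distance 2 lands in the third, so under this bin convention the first bin stays empty.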

ThreadedBackend (Multi-CPU)

Multi-threaded execution using OhMyThreads.jl.

using OhMyThreads  # loading OhMyThreads activates the threading extension (OhMyThreadsExt)

result = SFC.calculate_structure_function(sf_type, x, u, bins;
                                        backend=SFC.ThreadedBackend())

Prerequisites: Threads.nthreads() > 1 (start Julia with julia --threads=auto or set JULIA_NUM_THREADS)
Thread-local reductions: No locks or atomic operations
Speedup: Near-linear up to ~4 threads; diminishing returns beyond (memory bandwidth limit)
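
The "thread-local reductions" idea can be sketched with Base.Threads alone: each task accumulates into private arrays, and the partial results are merged once at the end, so no locks or atomics are needed. The function threaded_binned_sum and its inputs (precomputed pair distances and per-pair contributions) are hypothetical and are not the OhMyThreads-based code in the package:

```julia
# Lock-free binned reduction: each task owns private accumulators.
# `edges` are sorted bin edges; bin b covers [edges[b], edges[b+1]).
function threaded_binned_sum(dists::Vector{Float64}, contribs::Vector{Float64},
                             edges::Vector{Float64}; ntasks::Int=Threads.nthreads())
    nbins = length(edges) - 1
    chunk = max(1, cld(length(dists), ntasks))
    tasks = map(Iterators.partition(eachindex(dists), chunk)) do idx
        Threads.@spawn begin
            sums = zeros(nbins)             # task-local, never shared
            counts = zeros(Int, nbins)
            for k in idx
                b = searchsortedlast(edges, dists[k])
                1 <= b <= nbins || continue # outside the binned range
                sums[b] += contribs[k]
                counts[b] += 1
            end
            (sums, counts)
        end
    end
    partials = fetch.(tasks)
    # Merge step: a single pass over the per-task partial results.
    total_sums = reduce(+, first.(partials); init=zeros(nbins))
    total_counts = reduce(+, last.(partials); init=zeros(Int, nbins))
    return total_sums, total_counts
end

dists = [1.0, 1.0, 2.0, 5.0]
contribs = [0.01, 0.01, 0.04, 1.0]
edges = [0.0, 1.0, 2.0, 3.0]
sums, counts = threaded_binned_sum(dists, contribs, edges)
```

The extra memory is one set of B accumulators per task, which is small compared with the point data for typical bin counts.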

DistributedBackend (Multi-Process/Cluster)

Multi-worker execution using Distributed.jl.

using Distributed: addprocs

addprocs(4)  # Or specify SSH workers, etc.

result = SFC.calculate_structure_function(sf_type, x, u, bins;
                                        backend=SFC.DistributedBackend())

Prerequisites: Workers launched via addprocs() or similar
Communication overhead: One @distributed reduction loop
Ideal for: Large datasets, compute clusters
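
A single @distributed reduction loop can be sketched as follows. This is an illustration of the Distributed.jl macro, not the extension's code; the function distributed_binned_sum and the packed [sums; counts] layout are assumptions. Without added workers the loop simply runs on the main process:

```julia
using Distributed

# Each iteration returns a small vector holding per-bin sums followed by
# per-bin counts; `@distributed (+)` adds those vectors across processes.
# Allocating per iteration is wasteful and kept only for clarity.
# Assumes `dists` is non-empty.
function distributed_binned_sum(dists::Vector{Float64}, contribs::Vector{Float64},
                                edges::Vector{Float64})
    nbins = length(edges) - 1
    return @distributed (+) for k in 1:length(dists)
        acc = zeros(2 * nbins)              # layout: [sums; counts]
        b = searchsortedlast(edges, dists[k])
        if 1 <= b <= nbins
            acc[b] = contribs[k]
            acc[nbins + b] = 1.0
        end
        acc
    end
end

packed = distributed_binned_sum([1.0, 1.0, 2.0, 5.0],
                                [0.01, 0.01, 0.04, 1.0],
                                [0.0, 1.0, 2.0, 3.0])
bin_sums, bin_counts = packed[1:3], packed[4:6]
```

Only the reduced vector travels back to the caller, which is why the communication cost stays small relative to the pairwise work.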

GPUBackend (GPU Acceleration)

GPU execution via KernelAbstractions.jl.

using KernelAbstractions as KA

# NVIDIA GPU (after loading CUDA.jl)
using CUDA
result = SFC.calculate_structure_function(sf_type, x, u, bins;
                                        backend=SFC.GPUBackend(CUDA.CUDABackend()))

# AMD GPU (after loading AMDGPU.jl)
using AMDGPU
result = SFC.calculate_structure_function(sf_type, x, u, bins;
                                        backend=SFC.GPUBackend(AMDGPU.ROCBackend()))

# CPU backend for testing (no GPU required)
result = SFC.calculate_structure_function(sf_type, x, u, bins;
                                        backend=SFC.GPUBackend(KA.CPU()))

Ideal for: Very large datasets (1M+ points) where GPU memory is sufficient
Kernel architecture: Embarrassingly parallel pairwise loops
Precision: Full precision maintained; mixed-precision kernels supported

AutoBackend (Recommended Default)

Automatic selection based on environment:

result = SFC.calculate_structure_function(sf_type, x, u, bins;
                                        backend=SFC.AutoBackend())

# Selection order:
# 1. Distributed  (if nworkers() > 1)
# 2. Threaded     (if nthreads() > 1)
# 3. Serial       (fallback)
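
The selection order can be written out as a small helper. pick_backend is a hypothetical function that returns a Symbol purely for illustration; it only reproduces the documented order and says nothing about how the package implements it. Note that nworkers() returns 1 when no workers have been added:

```julia
using Distributed: nworkers

# Reproduces the documented order: distributed, then threaded, then serial.
# Keyword arguments allow the decision to be exercised without real workers.
function pick_backend(; workers::Int=nworkers(), threads::Int=Threads.nthreads())
    workers > 1 && return :distributed
    threads > 1 && return :threaded
    return :serial
end

choice = pick_backend()   # depends on how this Julia session was started
```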

API Reference

Main Entry Point

calculate_structure_function(sf_type::AbstractStructureFunctionType,
                            x::Union{Tuple, Matrix},
                            u::Union{Tuple, Matrix},
                            distance_bins::AbstractVector{<:Tuple};
                            backend=SerialBackend(),
                            return_sums_and_counts=false,
                            distance_metric=Euclidean(),
                            verbose=true,
                            show_progress=true) → StructureFunction

Arguments:

sf_type: Operator instance (e.g., LongitudinalSecondOrderStructureFunctionType())
x: Position data (Tuple of 1D vectors OR N×M matrix for N dimensions, M points)
u: Velocity/field data (same shape as x)
distance_bins: Vector of (r_min, r_max) tuples defining bins
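
The two accepted layouts for x and u carry the same information. A short sketch of converting between them, using hypothetical helpers to_matrix and to_tuple that are not part of the package:

```julia
# Tuple-of-vectors layout: one vector per spatial dimension.
x_tuple = ([0.0, 1.0, 2.0], [0.0, 0.0, 0.0])

# N×M matrix layout: row d holds coordinate d, column m holds point m.
to_matrix(t::Tuple) = reduce(vcat, permutedims.(t))
to_tuple(X::AbstractMatrix) = Tuple(X[d, :] for d in axes(X, 1))

x_matrix = to_matrix(x_tuple)   # 2 dimensions × 3 points
x_back = to_tuple(x_matrix)     # round trip back to the tuple layout
```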

Returns: StructureFunction result container

See also: serial_calculate_structure_function, parallel_calculate_structure_function, gpu_calculate_structure_function

Operator Types

All inherit from AbstractStructureFunctionType. Instantiate with ():

SFT.LongitudinalSecondOrderStructureFunctionType()    # 2nd order, longitudinal
SFT.TransverseSecondOrderStructureFunctionType()      # 2nd order, transverse
SFT.LongitudinalThirdOrderStructureFunctionType()     # 3rd order, longitudinal
SFT.TransverseThirdOrderStructureFunctionType()       # 3rd order, transverse
# ... and other variants (see docs/theory.md)

Each operator is callable (a functor) and evaluates a single pair:

sf_op = SFT.LongitudinalSecondOrderStructureFunctionType()
sf_op(du, rhat)  # Per-pair evaluation for increment du and unit separation rhat; calculate_structure_function averages these per bin

Result Container

struct StructureFunction{FT, OT, BT, VT} <: AbstractStructureFunction
    operator::OT                   # AbstractStructureFunctionType
    distance_bins::BT              # AbstractVector of (r_min, r_max)
    values::VT                     # AbstractVector{FT} — computed SF
    order::Int                     # 1, 2, 3, ...
end

Access results:

result.values       # SF values, one per bin
result.distance_bins  # Original input bins
result.operator     # The SF operator used
result.order        # Order of the SF

Theory & References

Structure functions quantify spatial correlations of a field u at separation distance r:

$$S_p(r) = \langle |u(\mathbf{x} + \mathbf{r}) - u(\mathbf{x})|^p \rangle$$

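where $\langle \cdot \rangle$ denotes an ensemble or spatial average over all point pairs separated by distance $r = |\mathbf{r}|$ (in practice, all pairs whose separation falls in a given distance bin).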
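
For vector fields, the longitudinal and transverse variants project the increment onto the separation direction. The standard textbook definitions are sketched below; the package's exact sign and projection conventions are documented in docs/theory.md and may differ in detail:

$$\delta\mathbf{u} = \mathbf{u}(\mathbf{x} + \mathbf{r}) - \mathbf{u}(\mathbf{x}), \qquad \hat{\mathbf{r}} = \frac{\mathbf{r}}{|\mathbf{r}|}$$

$$\delta u_L = \delta\mathbf{u} \cdot \hat{\mathbf{r}}, \qquad \delta\mathbf{u}_T = \delta\mathbf{u} - \delta u_L\,\hat{\mathbf{r}}$$

$$S_p^L(r) = \langle (\delta u_L)^p \rangle, \qquad S_p^T(r) = \langle |\delta\mathbf{u}_T|^p \rangle$$

In this form the longitudinal increment keeps its sign, so odd orders such as $S_3^L$ can be negative.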

Dimensional Variants

1D: Single coordinate axis (e.g., time series)
2D: Horizontal plane (e.g., satellite imagery)
3D: Full spatial field (e.g., atmospheric snapshots)

Order Variants

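1st order ($p=1$): Mean absolute increment
2nd order ($p=2$): Energy-like; related to the kinetic energy spectrum through the Wiener-Khinchin theorem
3rd order ($p=3$): Signed measure of increment asymmetry (skewness); tests Kolmogorov's 4/5 law, $S_3^L(r) = -\tfrac{4}{5}\varepsilon r$ in the inertial range, where $\varepsilon$ is the mean energy dissipation rate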
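
A common use of these orders is to estimate a scaling exponent from computed values, for example to compare a second-order SF against the Kolmogorov inertial-range prediction $S_2(r) \sim r^{2/3}$. A minimal sketch using a least-squares fit in log-log space; scaling_exponent is a hypothetical helper and the data are synthetic:

```julia
# Least-squares slope of log(S) against log(r): an estimate of the scaling exponent.
# Assumes strictly positive r and S.
function scaling_exponent(r::AbstractVector, S::AbstractVector)
    lx, ly = log.(r), log.(S)
    mx, my = sum(lx) / length(lx), sum(ly) / length(ly)
    return sum((lx .- mx) .* (ly .- my)) / sum((lx .- mx) .^ 2)
end

r = [1.0, 2.0, 4.0, 8.0]
S2 = 0.5 .* r .^ (2 / 3)      # synthetic data following a pure 2/3 power law
zeta2 = scaling_exponent(r, S2)
```

On real data the fit should be restricted to the range of separations where a power law is actually expected.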

References

Kolmogorov (1941): The Local Structure of Turbulence in Incompressible Viscous Fluid for Very Large Reynolds Numbers
- Foundational theory; predicts $S_2(r) \sim r^{2/3}$ in inertial range
Balwada et al. (2016): Scale-aware analysis of satellite sea surface temperature variability
- Applied SF analysis to geophysical gridded data; demonstrates multi-scale recovery
Wikipedia: Turbulence
- Accessible overview of Kolmogorov theory

See also: docs/theory.md for detailed mathematical formulations and dimensional projections.

Performance

Scaling Characteristics

Parameter	Quantity	Scaling
N points	Pairwise evaluations (time)	O(N²)
B bins	Memory	O(N + B)
D dimensions	CPU operations	~D² per pair
P threads	Parallel efficiency	~0.8–0.9 (speedup ≈ 0.8P–0.9P, dims ≤ 3)

Benchmarks (v0.3.0, Julia 1.12)

Setup: 10K points, 2D, 100 bins, Serial vs Multi-threaded

Backend	Time (ms)	Threads
Serial	120	1
Threaded	32	4
Threaded	18	8
(GPU)	5–15*	N/A

*Depends on GPU model and host-to-device transfer overhead; the GPU advantage grows with problem size and is most pronounced for 1M+ points.
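
The threaded rows of the benchmark table translate into speedup and parallel efficiency as follows. The helper names speedup and efficiency are introduced here only for the arithmetic:

```julia
# Speedup is serial time over parallel time; efficiency is speedup per thread.
speedup(t_serial, t_parallel) = t_serial / t_parallel
efficiency(t_serial, t_parallel, p) = speedup(t_serial, t_parallel) / p

serial_ms = 120.0
for (p, ms) in [(4, 32.0), (8, 18.0)]
    s = speedup(serial_ms, ms)
    e = efficiency(serial_ms, ms, p)
    println(p, " threads: speedup ", round(s; digits=2),
            "x, efficiency ", round(e; digits=2))
end
```

For the numbers in the table this gives a speedup of 3.75x at 4 threads (efficiency about 0.94) and about 6.67x at 8 threads (efficiency about 0.83).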

Optimization Tips

Use AutoBackend for deployment (automatic backend selection based on available workers and threads)
Reserve threading for larger datasets, where the per-task overhead is amortized
Pre-sort bins by distance to improve cache locality
Use Float32 if precision allows (halves memory traffic and speeds up GPU transfers)
Batch multiple SFs so that pairwise distance calculations are reused
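
The last tip can be sketched as a single pass that shares the distance, direction, and bin lookup between two orders. This is an independent illustration; sf2_and_sf3 and the sorted-edge bin convention are assumptions, not the package's batching API:

```julia
using LinearAlgebra: dot, norm

# One pass over pairs: distance and bin index are computed once and shared
# by the 2nd- and 3rd-order longitudinal accumulators.
# `edges` are sorted bin edges; bin b covers [edges[b], edges[b+1]).
function sf2_and_sf3(x::Tuple, u::Tuple, edges::Vector{Float64})
    nbins = length(edges) - 1
    s2, s3, counts = zeros(nbins), zeros(nbins), zeros(Int, nbins)
    n = length(first(x))
    for i in 1:n-1, j in i+1:n
        r = [xd[j] - xd[i] for xd in x]
        dist = norm(r)
        dist == 0 && continue
        b = searchsortedlast(edges, dist)
        1 <= b <= nbins || continue
        dul = dot([ud[j] - ud[i] for ud in u], r) / dist   # longitudinal increment
        s2[b] += dul^2
        s3[b] += dul^3
        counts[b] += 1
    end
    return s2 ./ counts, s3 ./ counts   # empty bins give NaN (0/0)
end

x = ([0.0, 1.0, 2.0], [0.0, 0.0, 0.0])
u = ([1.0, 1.1, 1.2], [0.0, 0.05, 0.1])
sf2, sf3 = sf2_and_sf3(x, u, [0.0, 1.0, 2.0, 3.0])
```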

Extensions

Optional packages extend StructureFunctions with additional functionality:

OhMyThreadsExt (ThreadedBackend)

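Loaded automatically once OhMyThreads.jl is loaded in the same session (using OhMyThreads). Package extensions are declared through weak dependencies in Project.toml:

[weakdeps]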
OhMyThreads = "67456a42-ebe4-4781-8ad1-67f7eda8d8f7"

[extensions]
StructureFunctionsOhMyThreadsExt = "OhMyThreads"

DistributedExt (DistributedBackend)

Requires Distributed.jl (stdlib) + SharedArrays.jl (stdlib):

using Distributed: addprocs
addprocs(4)
backend = StructureFunctions.DistributedBackend()

GPUExt (GPUBackend)

Requires KernelAbstractions.jl + GPU package (CUDA.jl, AMDGPU.jl, etc.), declared as a weak dependency:

[weakdeps]
KernelAbstractions = "63c18a36-062a-441e-b365-b594b6ce51b1"

[extensions]
StructureFunctionsGPUExt = "KernelAbstractions"

Migration from v0.2

Breaking Changes

v0.2	v0.3
Symbol-based backend selection	Typed backend objects
`backend=:serial`	`backend=SerialBackend()`
`backend=:threaded`	`backend=ThreadedBackend()`
`backend=:distributed`	`backend=DistributedBackend()`
No GPU support	`backend=GPUBackend(...)`

Recommended Updates

# OLD (v0.2)
result = calculate_structure_function(sf, x, u, bins; backend=:threaded)

# NEW (v0.3)
result = calculate_structure_function(sf, x, u, bins; backend=ThreadedBackend())

# Or request automatic selection explicitly:
result = calculate_structure_function(sf, x, u, bins; backend=AutoBackend())

Compatibility

v0.3 is not backward-compatible with v0.2 scripts
Update scripts by replacing symbol backends with typed backends
See CHANGELOG.md for full change log

Examples

Detailed worked examples are in the examples/ directory:

simple_2d.jl: Basic 2D structure function calculation
threaded_calculation.jl: Multi-threaded execution
gpu_acceleration.jl: GPU acceleration with KernelAbstractions
distributed_parallel.jl: Multi-process execution
real_data_climate.jl: Processing real climate/turbulence data
custom_operator.jl: Defining custom SF operators

Clone and run:

cd examples/
julia simple_2d.jl
julia threaded_calculation.jl

Contributing

Contributions welcome! Please:

Fork and create a feature branch
Add tests for new functionality
Ensure the full test suite passes: julia --project -e 'using Pkg; Pkg.test()'
Document changes in docstrings and CHANGELOG.md

License

See LICENSE file.

Citation

If you use StructureFunctions.jl in research, please cite:

@software{structurefunctions_jl_2024,
  author = {Benjamin, Jordan and Contributors},
  title = {StructureFunctions.jl: High-performance structure function calculations},
  year = {2024},
  doi = {10.5281/zenodo.14945669},
  url = {https://zenodo.org/records/14945669}
}

Last Updated: March 2026 | Version: 0.3.0 | Julia: 1.12+
