Packing Variables

Starting from a high level, at the beginning of a simulation a Parthenon based code defines fields on the Mesh by adding them to a StateDescriptor. The StateDescriptors for all packages are then passed to the Mesh constructor. A constructed Mesh includes a collection of MeshBlocks that cover the domain of the Mesh. The MeshBlocks contain information about the coordinate region they cover, their relationship to neighboring MeshBlocks, and importantly a container holding MeshBlockData objects. These MeshBlockData objects in turn hold Variable objects corresponding to the fields defined on the Mesh [1]. Each of the MeshBlockData objects stored in a given MeshBlock is labeled as a stage. By construction, every MeshBlock in a Parthenon contains the same stages. Memory for storing a field on a block on a stage is only actually allocated (on device) and held within a Variable object in a ParArray [2]. Putting the storage for fields in a separate object from MeshBlock, the MeshBlockData, easily allows for multiple storage locations for a given field on a given block within the mesh, e.g. to store multiple Runge-Kutta stages. Further, for performance reasons, downstream Parthenon codes generally should work with MeshData objects, which hold pointers to groups of MeshBlockData objects across different blocks but on the same stage.

As a result of this somewhat complicated structure, it is impractical to access variable storage by following through these different objects in a downstream code [3]. Therefore, Parthenon defines SparsePacks and SwarmPacks to allow seamless access to variables within compute kernels. Essentially, packs are objects that contain a ParArray of references to the ParArrays stored within Variable over a chosen set of fields on a given set of blocks. Said differently, given a SparsePack pack built from MeshData md and a set of fields var1, var2,… , a sparse pack allows one to access the field var1 on block b of md at position (k, j, i) using syntax like

Real &my_val = pack(b, var1_t(), k, j, i); // Pull out a reference to the value of var1 on block b in cell (k, j, i)
ParArray3D<Real> var1 = pack(b, var1_t()); // Pull out a reference to the 3D array containing var1 on block b

etc. within a kernel.

SparsePacks work for all types of variables (both dense and sparse). They were originally implemented to support sparse variables and to supersede the older VariablePacks and VariableFluxPacks and picked up the Sparse modifier to differentiate them. The latter have not been removed from ``Parthenon`` because some downstream codes still rely on them, but they are deprecated and will be removed eventually.

If you want to deal with particle fields, you will need to use SwarmPacks, which are described at SwarmPacks.

Type-based Packing

Parthenon provides functionality for accessing fields in a pack via a type that is associated with a field name [4]. As an example, if a downstream code includes a field with the name "var1" the code could also define a type

struct var1_t : public parthenon::variable_names::base_t<false> {
  template <class... Ts>
  KOKKOS_INLINE_FUNCTION varname(Ts &&...args)
      : parthenon::variable_names::base_t<false>(std::forward<Ts>(args)...) {}
  static std::string name() { return "var1"; }
}

which inherits from parthenon::variable_names::base_t [5]. Rather than write this boilerplate out for every variable name in a downstream code, it is often easier to define a macro that expands to this class for a given variable name argument. Additionally, fields can be added to a StateDescriptor using the variable name type directly using

StateDescriptor pkg;
Metadata metadata;
pkg->AddField<var1_t>(metadata);

The advantage of using types instead of just strings to denote field names is that the types are accessible within kernels on device. Under the hood, SparsePack::operator() is overloaded on each of the type list of variable name types used to create the pack, so an instance of the variable name type can be used to access desired field within a pack.

Scratch Variable Packing

Parthenon provides the capability to access non-persistent fields allocated on the meshblocks by aliasing a pool of Metadata::overrideable fields with different types. These scratch variables are registered and pooled per StateDescriptor and the total number fields allocated determined by the maximum registered to any single StateDescriptor.

Scratch variables can be defined using the SCRATCH_VARIABLE macro,

using TT = parthenon::TopologicalType;
namespace parthenon { // scratch variables need to be declared in the parthenon namespace
SCRATCH_VARIABLE(First, TT::Cell, 3)     // 3 fields
SCRATCH_VARIABLE(Second, TT::Cell, 2, 4) // 8 fields
SCRATCH_VARIABLE(Third, TT::Cell)        // 1 field
SCRATCH_VARIABLE(Fourth, TT::Cell, 5)    // 5 fields
} // namespace parthenon

// pkgA requests a total of 11 fields
pkgA->AddScratch<parthenon::First>();
pkgA->AddScratch<parthenon::Second>();

// pkgB requests a total of 6 fields
pkgB->AddScratch<parthenon::Third>();
pkgB->AddScratch<parthenon::Fourth>();

In the above example 4 scratch variables are registered to two different packages, totalling 17 indexable fields across the 4 types, but only 11 fields will be allocated. These types can then be packed and accessed as normal in type-based sparse packs. An example of usage can be found in the unit test <https://github.com/parthenon-hpc-lab/parthenon/blob/develop/tst/unit/test_scratch_variables.cpp>

It should not be expected that values in these fields will persist outside the local scope of the tasks where they are packed. For debugging purposes the cmake build can be configured with -DPARTHENON_DEBUG_SCRATCH=ON, in which case the fields will be registered with unique names, scratch_First as an example from the above snippet.

Warning

Parthenon’s tasking infrastructure can not make guarantees about the order that tasks are executed. Therefore care should be taken not to assume that scratch variables can persist between tasks, even when they are directly dependent on each other. Issues resolved by setting -DPARTHENON_DEBUG_SCRATCH=ON can be an indication of this issue.

Building and Using a `SparsePack`

SparsePacks are built in two stages, first a PackDescriptor is built that defines the set of fields to include in the pack using one of the overloaded MakePackDescriptor functions. Then PackDescriptor::GetPack(...) is called on a given MeshData or MeshBlockData object to return an actual pack. In practice, this will look like

parthenon::TaskStatus my_task(MeshData *md) {
  // Pull out indices, etc.
  std:::vector<MetadataFlags> md_flags; // Optional argument below
  std::set<PDOpt> options{PDOpt::WithFluxes}; // Optional argument below
  auto desc = parthenon::MakePackDescriptor<var1_t, var2_t>(pmd, md_flags, options);
  auto pack = desc.GetPack(md);
  parthenon::par_for(/*index ranges, etc. go here */,
      KOKKOS_LAMBDA(int b, int k, int j, int i) {
    for (int c = 0; c < ncomponents_2; ++c)
      pack(b, var1_t(), k, j, i) += 2.0 * pack(b, var2_t(c), k, j, i);
    pack.flux(b, X1DIR, var1_t(), k, j, i) = pack(b, var1_t(), k, j, i) * pack(b, var2_t(), k, j, i);
  });
  return TaskStatus::complete;
}

PackDescriptors can be somewhat expensive to build because they require searching through all fields in simulation. Therefore, they are automatically cached in the StateDescriptor where possible. Additionally, it is often possible to declare PackDescriptors that are created in task functions to be static.

PackDescriptor takes a std::set of PDOpt options to determine what to include in the pack:

Pack Descriptor Options
`PDOpt::WithFluxes`	Fluxes associated with variables in the pack are included in the pack and accessible through `pack.flux(...)`
`PDOpt::Coarse`	Pack the coarse buffers for the fields rather than the normal resolution buffers.
`PDOpt::Flatten`	Packs all blocks across all fields into the variable index so that the pack looks like it has a single block.

`SparsePack::operator()`

There are a number of different overloads for operator() in sparse packs that allow accessing field data:

template <class var_t> Real &operator()(int b, TopologicalElement te, const var_t &t, int k, int j, int i) and template <class var_t> ParArray3D operator()(int b, TopologicalElement te, const var_t &t) : Returns the value on block b for topological element te of var_t. The first call returns a reference to the value at position (k, j, i) while the second returns a ParArray3D (which obeys reference semantics) containing that component of the field on the block. var_t must be in the list of types used to create the pack.
template <class var_t> Real &operator()(int b, const var_t &t, int k, int j, int i) and template <class var_t> Real &operator()(int b, const var_t &t): Same as above, but with the topological element defaulted to cell-centered.
Real &operator()(int b, TopologicalElement te, int idx, int k, int j, int i) and Real &operator()(int b, int idx, int k, int j, int i) and Real &operator()(int b, TopologicalElement te, int idx) and Real &operator()(int b, int idx): Same as above, but directly accesses the field at position idx in the pack. This should be used with the bounds returned from SparsePack::GetLowerBound(...) and SparsePack::GetUpperBound(...).
Real &operator()(int b, TopologicalElement te, PackIdx idx, int k, int j, int i) and Real &operator()(int b, PackIdx idx, int k, int j, int i) and Real &operator()(int b, TopologicalElement te, PackIdx idx) and Real &operator()(int b, PackIdx idx): Same as above, but access the field using PackIdx idx. This only works for packs that were built using a list of names (as opposed to a list of types), see [4].

Other `SparsePack` Methods

Coordinates_t &GetCoordinates(const int b = 0): Returns a reference to the coordinates object associated with block b.
template <class var_t> int GetLowerBound(int b, var_t): Returns the first index in the pack where a field corresponding to var_t is stored. Returns -1 if var_t is not allocated on block b. A similar functions exist for PackIdx.
template <class var_t> int GetUpperBound(int b, var_t): Returns the last index in the pack where a field corresponding to var_t is stored. Returns -2 if var_t is not allocated on block b. A similar functions exist for PackIdx.
int GetLevel(int b, int off3, int off2, int off1): Returns the logical level of neighbor block(s) of block b offset in direction (off3, off2, off1).
bool IsPhysicalBoundary(int b, int off3, int off2, int off1): Returns if block b has a physical boundary in offset direction (off3, off2, off1).
template <class var_t> bool Contains(const int b, const var_t t): Returns if var_t is allocated on block b.
template <class var_t> Real &flux(int b, int dir, const var_t &t, int k, int j, int i): Gets the flux in direction dir associated with variable var_t.

`SparsePack`s with Sparse Fields

A given sparse field may or may not be allocated on each block within a pack. To safely access the fields in a given pack, SparsePacks provide checks on whether or not a given sparse variable/pool is allocated

parthenon::TaskStatus my_task(MeshData *md) {
  // Pull out indices, etc.
  auto desc = parthenon::MakePackDescriptor<sparse_var_t>(pmd);
  auto pack = desc.GetPack(md);
  parthenon::par_for(/*index ranges, etc. go here */,
      KOKKOS_LAMBDA(int b, int k, int j, int i) {
      // For a single sparse field
      if (pack.Contains(b, sparse_var_t())) {
        // The sparse field is allocated on block b, and so is safe to access
      }
      // This loop will go over all allocated sparse fields in the sparse_var sparse pool
      for (int idx = pack.GetLowerBound(b, sparse_var_t()); idx <= pack.GetUpperBound(b, sparse_var_t()); ++idx) {
        pack(b, idx, k, j, i) = 0.0;
      }
  });
  return TaskStatus::complete;
}

Packing Variables

Type-based Packing

Scratch Variable Packing

Building and Using a SparsePack

SparsePack::operator()

Other SparsePack Methods

SparsePacks with Sparse Fields

Building and Using a `SparsePack`

`SparsePack::operator()`

Other `SparsePack` Methods

`SparsePack`s with Sparse Fields