Per-scenario-feasible candidate xhat

Warning

This is an advanced topic. Most mpi-sppy users do not need feasible_xhat_creator to get started. However, if you know of some way to get a good solution that is admissible and implementable, this function could speed up the search for a good upper bound. You can use any method to get this solution, but the documentation here is biased toward a situation where you know how to modify average scenario data so it will work as a way to get a good xhat solution. There are some helper functions to facilitate that.

The contract

A scenario module that participates exposes:

def feasible_xhat_creator(*, solver_name,
                          solver_options=None,
                          **scenario_creator_kwargs):
    """Return {nodename: np.ndarray} -- a candidate point that is
    feasible to fix in every real scenario's per-scenario
    subproblem. Two-stage populates only "ROOT"; multistage
    populates every non-leaf node (see Multistage, below)."""

Discovery is via getattr(module, "feasible_xhat_creator", None), parallel to average_scenario_creator (see Jensen’s Bound as potential starting bound). The returned dict is in the cache form consumed by Xhat_Eval._fix_nonants: one np.ndarray per non-leaf node, each in that node’s nonant_vardata_list order. Two-stage models populate only "ROOT"; the multistage case is covered in Multistage, below.

File layout

By convention the model author’s feasible_xhat_creator (and any average_scenario_creator) lives in examples/<model>/<model>_auxiliary.py, not in the main example file. The introductory example (farmer.py, netdes.py, …) stays focused on what a first-time reader needs: scenario_creator, scenario_names_creator, kw_creator, inparser_adder, sample_tree_scen_creator, scenario_denouement. Auxiliary functions used only by advanced machinery go in the _auxiliary sibling.

Discovery

When one of the xhat-spoke flags below is set, cfg_vanilla._find_feasible_xhat_creator first tries getattr(scenario_module, "feasible_xhat_creator", None) on the user’s main scenario module; if that is None it imports <module_name>_auxiliary and looks there. Discovery is gated on the flag, so the auxiliary import does not happen unless requested. Downstream consumers (e.g. findW) that bypass the cylinder system import the auxiliary module directly and call feasible_xhat_creator themselves.

Why this convention exists

The Jensen’s xhat path (see Jensen’s Bound as potential starting bound) already exposes the average-scenario solution as a candidate first-stage. It tolerates infeasibility: if the candidate xhat value, when fixed for one or more scenarios is infeasible, then _evaluate_xhat returns None and the inner-bound spoke silently moves on. That is the right behavior for an inner-bound spoke whose only job is to opportunistically improve a bound.

feasible_xhat_creator strengthens the contract. Where Jensen’s xhat is opportunistic – silently skipping any scenario in which the candidate is not feasible to fix – feasible_xhat_creator tries to guarantee by construction that the candidate is feasible to fix in every real scenario. The inner-bound spoke that consumes it therefore produces a usable objective on each candidate rather than potentially never updating the inner bound.

That is a strictly stronger contract than “Jensen’s xhat plus luck,” and it is the contract that feasible_xhat_creator aims to provide.

In-cylinder use: `--<xhatter>-try-feasible-xhat-first` flags

The four xhat spokes that ship with mpi-sppy accept a feasible_xhat_creator candidate via a per-spoke flag, parallel to --*-try-jensens-first:

--xhatshuffle-try-feasible-xhat-first
--xhatxbar-try-feasible-xhat-first
--xhatlooper-try-feasible-xhat-first
--xhatspecific-try-feasible-xhat-first

When set, the spoke calls the module’s feasible_xhat_creator once before entering its main loop, fixes the candidate as the first-stage nonants, evaluates the expected objective across all real scenarios, and – if the evaluation is feasible – sends that as its first inner bound. Implementation lives in _PreLoopXhatMixin._try_feasible_xhat in mpisppy/cylinders/_preloop_xhat_mixin.py; the spoke main() methods call it once after _try_average_scenario_xhat.

Mutually exclusive with --*-try-jensens-first

--<xhatter>-try-jensens-first and --<xhatter>-try-feasible-xhat-first are mutually exclusive on the same spoke. cfg_vanilla._maybe_attach_feasible_xhat raises at spoke-setup time if both are enabled, with a message naming the conflicting CLI options.

The two pre-loop candidates serve overlapping purposes: Jensen’s often gives a tighter incumbent bound when its candidate happens to be feasible everywhere, while feasible_xhat_creator is guaranteed feasible by contract but can be a looser incumbent. Per spoke, pick whichever fits the model’s structure – not both.

Across spokes, mixing is fine: one xhat spoke can be configured with --xhatshuffle-try-jensens-first while another runs with --xhatxbar-try-feasible-xhat-first.

Average-data-based Methods

The two lp-based helpers in `mpisppy.utils.xhat_helpers`

Some feasible_xhat_creator implementations are short. mpi-sppy ships two reusable, lp-based engines that try to do the heavy lifting; the implementations call one of them and apply a model-specific repair.

average_xhat_nonants(average_scenario_creator, *, solver_name, ...): Builds the model returned by average_scenario_creator, optionally LP-relaxes it, solves it, and returns the ROOT first-stage values as np.ndarray. One deterministic solve over the average data.
lp_xbar_nonants(scenario_creator, scenario_names, *, solver_name, ...): For each real scenario, builds the model, applies core.relax_integer_vars, solves, and returns the probability-weighted average of ROOT first-stage values across scenarios. K LP solves, where K is the number of scenarios.

These two are not interchangeable for models with binary first-stage: averaging data and averaging solutions do not commute when the optimal first-stage is not a continuous function of the data. The averaged-data problem can omit first-stage activity that some real scenario individually needs; the per-scenario LP-xbar instead carries any activity that any scenario’s LP wanted positive into the average, where a feasibility- preserving rounding rule can promote it. For models with continuous first-stage, the distinction collapses.

Choosing between the two engines is the caller’s responsibility. The caller knows whether averaging data preserves enough information to cover per-scenario feasibility; the framework cannot detect that from the model.

The rounding rule is also yours

The output of either engine is a real-valued vector that has to be turned into a feasible candidate. Whether the right rule is np.ceil, np.floor, np.round, identity, or a per-component try-and-check is a model-specific decision that depends on monotonicity of recourse feasibility in each first-stage variable:

If raising \(x_e\) from 0 to 1 only loosens recourse constraints (as for “open the arc” binaries in netdes), np.ceil is feasibility-preserving.
If the variable indexes a covering decision (open the facility) and more open never tightens recourse, np.round typically suffices.
If recourse feasibility is non-monotone in the variable, neither rule is safe and the implementation must do something model-specific (a proof-of-feasibility per-component repair, an aggregation across scenarios, etc.).

mpi-sppy does not ship an automatic rounder. Even within a single model, different first-stage variables can need different rules; per-component try-and-check degenerates into solving an MIP-feasibility problem in itself. The repair belongs in the feasible_xhat_creator, where the model author has the domain knowledge.

Worked example: farmer (continuous first-stage)

Farmer’s first-stage variable DEVOTED_ACRES is bounded NonNegativeReals, and farmer has relatively complete recourse via the buy/sell variables (QuantityPurchased, QuantitySubQuotaSold, QuantitySuperQuotaSold), so any feasible acreage allocation – including the average-scenario optimum – is feasible to fix in every real scenario. No rounding is needed.

examples/farmer/farmer_auxiliary.py:

from mpisppy.utils.xhat_helpers import average_xhat_nonants
from farmer import average_scenario_creator


def feasible_xhat_creator(*, solver_name, solver_options=None,
                          **scenario_creator_kwargs):
    arr = average_xhat_nonants(
        average_scenario_creator,
        solver_name=solver_name,
        scenario_creator_kwargs=scenario_creator_kwargs,
        solver_options=solver_options,
    )
    return {"ROOT": arr}

This is the simplest case the convention has to handle, and it illustrates an important point about the convention: callers always go through feasible_xhat_creator rather than calling average_xhat_nonants directly. If a downstream model swap replaces farmer with a binary-first-stage model, only the auxiliary file has to change; the call site at the consumer (e.g., findW) is unchanged.

Worked example: netdes (binary, arc-open monotonicity)

Netdes model.x[e] is Binary for each candidate arc. The recourse constraint is \(y_e \le u_e \, x_e\); raising \(x_e\) from 0 to 1 only loosens this bound, and the flow-balance constraints do not involve x. So opening more arcs cannot make any per- scenario subproblem less feasible – np.ceil is feasibility- preserving for the arc-open variables.

The right engine for netdes is not average_xhat_nonants. The averaged-data problem can leave some \(x_e\) at 0 because the average demand pattern does not need that arc; a real scenario with peakier demand may need it. The averaged-solution path is lp_xbar_nonants: any arc that any scenario’s LP wanted positive contributes positively to the average, and np.ceil then promotes it to 1.

examples/netdes/netdes_auxiliary.py:

import numpy as np
from mpisppy.utils.xhat_helpers import lp_xbar_nonants
from netdes import scenario_creator, scenario_names_creator


def feasible_xhat_creator(*, solver_name, solver_options=None,
                          num_scens=None, **scenario_creator_kwargs):
    if num_scens is None:
        from parse import parse
        num_scens = parse(scenario_creator_kwargs["path"],
                          scenario_ix=None)["K"]
    snames = scenario_names_creator(num_scens)
    arr = lp_xbar_nonants(
        scenario_creator, snames,
        solver_name=solver_name,
        scenario_creator_kwargs=scenario_creator_kwargs,
        solver_options=solver_options,
    )
    return {"ROOT": np.ceil(arr - 1e-9)}

The \(-10^{-9}\) margin keeps integer-valued LP solutions from being inadvertently bumped up by floating-point dust.

Worked example: sslp (binary, set-covering monotonicity)

Sslp model.FacilityOpen[j] is Binary. Opening more facilities never tightens DemandConstraint (more capacity available) or ClientConstraint (the LHS does not involve FacilityOpen). The shipped model also carries a high-Penalty Dummy slack, so any fixed candidate is technically feasible; the rounded LP-xbar is still a meaningful low-slack candidate for the inner-bound spoke that consumes it.

Sslp does not currently ship an average_scenario_creator, so the auxiliary skips the average_xhat_nonants engine entirely and goes straight to lp_xbar_nonants. The feasibility-preserving rule chosen here is np.round.

examples/sslp/sslp_auxiliary.py:

import numpy as np
from mpisppy.utils.xhat_helpers import lp_xbar_nonants
from sslp import scenario_creator, scenario_names_creator


def feasible_xhat_creator(*, solver_name, solver_options=None,
                          num_scens, **scenario_creator_kwargs):
    snames = scenario_names_creator(num_scens)
    arr = lp_xbar_nonants(
        scenario_creator, snames,
        solver_name=solver_name,
        scenario_creator_kwargs=scenario_creator_kwargs,
        solver_options=solver_options,
    )
    return {"ROOT": np.round(arr)}

Multistage

The convention extends to multistage problems with no change to the machinery: the cache is {nodename: np.ndarray} over every non-leaf node (not just "ROOT"), each array in that node’s nonant_vardata_list order, and the spoke pins and evaluates it exactly as in the two-stage case (_fix_nonants already loops over every node of the scenario tree). What changes is how you build the candidate.

Inter-stage coupling

In two stages there is a single decision point, so feasibility factors scenario by scenario. In multiple stages the candidate is a whole policy over the tree: a vector at every non-leaf node, and the vectors are coupled – a later-stage decision lives downstream of the earlier decisions on the same path, through the model’s staircase constraints.

The consequence: you cannot assemble a multistage candidate by choosing each node’s vector in isolation (for instance, by averaging each node’s values across scenarios independently). Individually reasonable node choices can be jointly infeasible along a path.

The construction that stays sound is to derive all node vectors from one feasible solution of a single deterministic proxy whose tree has the same node structure as the real problem – typically the expected-value tree (the real branching factors with the random data pinned to its mean). Because the node values then come from one feasible point of the same staircase system, they are jointly feasible along every path by construction. This is the multistage analogue of “solve the average scenario”: solve the average tree.

The engine: `ef_xhat_nonants`

mpisppy.utils.xhat_helpers ships the multistage engine:

ef_xhat_nonants(scenario_creator, scenario_names, *, solver_name, ...): Builds the extensive form over the supplied (proxy) scenario set, optionally LP-relaxes it, solves it once, and returns {nodename: np.ndarray} over all non-leaf nodes. Pass the scenario names/kwargs that define your deterministic proxy tree.

The repair rule is still yours – now for whole paths

As in the two-stage case (see The rounding rule is also yours), the raw solve output may need a model-specific repair to become a candidate that is feasible to fix in the real, stochastic scenarios. Multistage raises the bar:

If the model has relatively complete recourse – every later stage stays feasible for any setting of the earlier (with proper feasibility) decisions – no repair is needed. The expected-value-tree solution is feasible to fix on every path as is. aircond (below) is this case.
If recourse is integer or tightly coupled, a feasible path is a stronger requirement than the two-stage per-variable rounding rules deliver: rounding a stage-\(t\) decision can render a later stage infeasible on some path, so per-node monotone rounding does not generally preserve path feasibility. mpi-sppy does not ship an automatic multistage repair. The repair belongs in your feasible_xhat_creator, where you have the domain knowledge to keep the whole path feasible (a forward pass that re-checks each stage against the fixed earlier stages, an aggregation across scenarios, a per-path proof-of-feasibility, etc.). When in doubt, evaluate the candidate and let the inner-bound spoke skip any path it cannot fix – but then you are back to the weaker “Jensen’s plus luck” contract.

Worked example: aircond (multistage, continuous, complete recourse)

aircond’s node decisions RegularProd and OvertimeProd are bounded NonNegativeReals; the only hard constraint on them is RegularProd <= Capacity. The material-balance constraint lets the free Inventory variable absorb any demand imbalance as penalized backorder, so aircond has relatively complete recourse: any capacity-respecting production plan is feasible to fix on every path. No rounding is needed – the multistage analogue of farmer.

mpisppy/tests/examples/aircond_auxiliary.py (kept beside the model so the <module>_auxiliary discovery resolves):

import numpy as np
from mpisppy.utils.xhat_helpers import ef_xhat_nonants
from mpisppy.tests.examples.aircond import (
    scenario_creator, scenario_names_creator,
)


def feasible_xhat_creator(*, solver_name, solver_options=None,
                          branching_factors=None, **scenario_creator_kwargs):
    proxy_kwargs = dict(scenario_creator_kwargs)
    proxy_kwargs["branching_factors"] = branching_factors
    proxy_kwargs["sigma_dev"] = 0.0   # expected-value tree
    proxy_kwargs["mu_dev"] = 0.0
    proxy_kwargs.setdefault("start_seed", 0)
    snames = scenario_names_creator(int(np.prod(branching_factors)))
    return ef_xhat_nonants(
        scenario_creator, snames, solver_name=solver_name,
        scenario_creator_kwargs=proxy_kwargs, solver_options=solver_options,
    )

The expected-value tree has the same node structure as the real problem (every real non-leaf node has a counterpart), but with sigma_dev=0 it is a trivial deterministic LP. For a model whose true EF is a hard MIP, the proxy’s LP relaxation is where the speedup lives; here aircond is already an LP, so the proxy’s value is to demonstrate the convention and to hand back a feasible deterministic policy. You are free to use any method that yields a feasible per-node candidate – a closed-form myopic rule (set each node’s RegularProd = min(Capacity, max(0, expected demand - incoming inventory))) builds one with no solve and no file at all.

Heuristics fixing methods

For some problems, you might have heuristic ways to fix many of the nonanticipative variables. Once they are fixed, the resulting problem might solve fairly quickly, which can be the basis for a feasible_xhat_creator function.