Configuration¶

Classes that control how BasanosEngine and BasanosStream behave.

BasanosConfig¶

`BasanosConfig` ¶

Bases: BaseModel

Configuration for correlation-aware position optimization.

The required parameters (vola, corr, clip, shrink, aum) must be supplied by the caller. The optional parameters carry carefully chosen defaults whose rationale is described below.

Shrinkage methodology¶

shrink controls linear shrinkage of the EWMA correlation matrix toward the identity:

.. math::

C_{\\text{shrunk}} = \\lambda \\cdot C_{\\text{EWMA}} + (1 - \\lambda) \\cdot I_n

where :math:\\lambda = shrink and :math:I_n is the identity. Shrinkage regularises the matrix when assets are few relative to the lookback (high concentration ratio :math:n / T), reducing the impact of extreme sample eigenvalues and improving the condition number of the matrix passed to the linear solver.

When to prefer strong shrinkage (low shrink / high 1-shrink\ ):

Fewer than ~30 assets with a corr lookback shorter than 100 days.
High-volatility or crisis regimes where correlations spike and the sample matrix is less representative of the true structure.
Portfolios where estimation noise is more costly than correlation bias (e.g., when the signal-to-noise ratio of mu is low).

When to prefer light shrinkage (high shrink\ ):

Many assets with a long lookback (low concentration ratio).
The EWMA correlation structure carries genuine diversification information that you want the solver to exploit.
Out-of-sample testing shows that position stability is not a concern.

Practical starting points (daily return data):

Here n = number of assets and T = cfg.corr (EWMA lookback).

+-----------------------+-------------------+--------------------------------+ | n (assets) / T (corr) | Suggested shrink | Notes | +=======================+===================+================================+ | n > 20, T < 40 | 0.3 - 0.5 | Near-singular matrix likely; | | | | strong regularisation needed. | +-----------------------+-------------------+--------------------------------+ | n ~ 10, T ~ 60 | 0.5 - 0.7 | Balanced regime. | +-----------------------+-------------------+--------------------------------+ | n < 10, T > 100 | 0.7 - 0.9 | Well-conditioned sample; | | | | light shrinkage for stability. | +-----------------------+-------------------+--------------------------------+

See :func:~basanos.math._signal.shrink2id for the full theoretical background and academic references (Ledoit & Wolf, 2004; Chen et al., 2010).

Default rationale¶

denom_tol = 1e-12 Positions are zeroed when the normalisation denominator inv_a_norm(μ, Σ) falls at or below this threshold. The value 1e-12 provides ample headroom above float64 machine epsilon (~2.2e-16) while remaining negligible relative to any economically meaningful signal magnitude.

position_scale = 1e6 The dimensionless risk position is multiplied by this factor before being passed to :class:~jquantstats.Portfolio. A value of 1e6 means positions are expressed in units of one million of the base currency, a conventional denomination for institutional-scale portfolios where AUM is measured in hundreds of millions.

min_corr_denom = 1e-14 The EWMA correlation denominator sqrt(var_x * var_y) is compared against this threshold; when at or below it the correlation is set to NaN rather than dividing by a near-zero value. The default 1e-14 is safely above float64 underflow while remaining negligible for any realistic return series. Advanced users may tighten this guard (larger value) when working with very-low-variance synthetic data.

max_nan_fraction = 0.9 :class:~basanos.exceptions.ExcessiveNullsError is raised during construction when the null fraction in any asset price column strictly exceeds this threshold. The default 0.9 permits up to 90 % missing prices (e.g., illiquid or recently listed assets in a long history) while rejecting columns that are almost entirely null and would contribute no useful information. Callers who want a stricter gate can lower this value; callers running on sparse data can raise it toward 1.0.

Sliding-window mode¶

When covariance_config is a :class:SlidingWindowConfig, the EWMA correlation estimator is replaced by a rolling-window factor model (Section 4.4 of basanos.pdf). At each timestamp t the :math:W \\times n submatrix of the :math:W most recent volatility-adjusted returns is decomposed via truncated SVD to extract :math:k latent factors. The resulting correlation estimate is

.. math::

\\hat{C}_t^{(W,k)}
= \\frac{1}{W}\\mathbf{V}_{k,t}\\mathbf{\\Sigma}_{k,t}^2
  \\mathbf{V}_{k,t}^\\top + \\hat{D}_t

where :math:\\hat{D}_t enforces unit diagonal. The linear system :math:\\hat{C}_t^{(W,k)}\\mathbf{x}_t = \\boldsymbol{\\mu}_t is solved via the Woodbury identity (:func:~basanos.math._factor_model.FactorModel.solve) at cost :math:O(k^3 + kn) per step rather than :math:O(n^3).

covariance_config Pass a :class:SlidingWindowConfig instance to enable this mode. The required sub-parameters are:

``window``
    Rolling window length :math:`W \\geq 1`.  Rule of thumb: :math:`W
    \\geq 2n` keeps the sample covariance well-posed before truncation.

``n_factors``
    Number of latent factors :math:`k \\geq 1`.  :math:`k = 1`
    recovers the single market-factor model; larger :math:`k` captures
    finer correlation structure at the cost of higher estimation noise.

Examples:

>>> cfg = BasanosConfig(vola=32, corr=64, clip=3.0, shrink=0.5, aum=1e8)
>>> cfg.vola
32
>>> cfg.corr
64
>>> sw_cfg = BasanosConfig(
...     vola=16, corr=32, clip=3.0, shrink=0.5, aum=1e6,
...     covariance_config=SlidingWindowConfig(window=60, n_factors=3),
... )
>>> sw_cfg.covariance_mode
<CovarianceMode.sliding_window: 'sliding_window'>

`vola: int = Field(..., gt=0, description='EWMA lookback for volatility normalization.')` `class-attribute` `instance-attribute` ¶

`corr: int = Field(..., gt=0, description='EWMA lookback for correlation estimation.')` `class-attribute` `instance-attribute` ¶

`clip: float = Field(..., gt=0.0, description='Clipping threshold for volatility adjustment.')` `class-attribute` `instance-attribute` ¶

shrink: float = Field(..., ge=0.0, le=1.0, description="Retention weight λ for linear shrinkage of the EWMA correlation matrix toward the identity: C_shrunk = λ·C_ewma + (1-λ)·I. λ=1.0 uses the raw EWMA matrix (no shrinkage); λ=0.0 replaces it entirely with the identity (maximum shrinkage, positions are treated as uncorrelated). Values in [0.3, 0.8] are typical for daily financial return data. Lower values improve numerical stability when assets are many relative to the lookback (high concentration ratio n/T). See shrink2id() for full guidance. Only used when covariance_mode='ewma_shrink'.") `class-attribute` `instance-attribute` ¶

`aum: float = Field(..., gt=0.0, description='Assets under management for portfolio scaling.')` `class-attribute` `instance-attribute` ¶

`denom_tol: float = Field(default=1e-12, gt=0.0, description='Minimum normalisation denominator; positions are zeroed at or below this value. The default 1e-12 is well above float64 machine epsilon (~2.2e-16) while remaining negligible for any economically meaningful signal.')` `class-attribute` `instance-attribute` ¶

`position_scale: float = Field(default=1000000.0, gt=0.0, description='Multiplicative scaling factor applied to dimensionless risk positions to obtain cash positions in base-currency units. Defaults to 1e6 (one million), a conventional denomination for institutional portfolios.')` `class-attribute` `instance-attribute` ¶

`min_corr_denom: float = Field(default=1e-14, gt=0.0, description='Guard threshold for the EWMA correlation denominator sqrt(var_x * var_y). When the denominator is at or below this value the correlation is set to NaN instead of dividing by a near-zero number. The default 1e-14 is safely above float64 underflow while being negligible for any realistic return variance.')` `class-attribute` `instance-attribute` ¶

`max_nan_fraction: float = Field(default=0.9, gt=0.0, lt=1.0, description='Maximum tolerated fraction of null values in any asset price column. ExcessiveNullsError is raised during construction when the null fraction strictly exceeds this threshold. The default 0.9 allows up to 90 % missing prices while rejecting columns that are almost entirely null.')` `class-attribute` `instance-attribute` ¶

`covariance_config: CovarianceConfig = Field(default_factory=EwmaShrinkConfig, description='Covariance estimation configuration. Pass EwmaShrinkConfig() (default) for EWMA correlation with linear shrinkage toward the identity, or SlidingWindowConfig(window=W, n_factors=k) for a rolling-window factor model. See Section 4.4 of basanos.pdf.')` `class-attribute` `instance-attribute` ¶

`cost_per_unit: float = Field(default=0.0, ge=0.0, description='One-way trading cost per unit of position change. At each period, the cost deduction is sum(|x_t - x_{t-1}|) * cost_per_unit where x_t is the cash position vector. Defaults to 0.0 (no cost). The resulting net-of-cost NAV is exposed via Portfolio.net_cost_nav.')` `class-attribute` `instance-attribute` ¶

`max_turnover: float | None = Field(default=None, gt=0.0, description='Optional turnover budget per period in cash-position units. When set, the L1 norm of position changes sum(|x_t - x_{t-1}|) is capped at this value at every solve step by proportionally scaling the position delta toward the previous position. Must be strictly positive when provided. Defaults to None (no turnover constraint).')` `class-attribute` `instance-attribute` ¶

`model_config = {'frozen': True, 'extra': 'forbid'}` `class-attribute` `instance-attribute` ¶

`covariance_mode: CovarianceMode` `property` ¶

Covariance mode derived from :attr:covariance_config.

`window: int | None` `property` ¶

Sliding window length, or None when not in sliding_window mode.

`n_factors: int | None` `property` ¶

Number of latent factors, or None when not in sliding_window mode.

`report: ConfigReport` `property` ¶

Return a :class:~basanos.math._config_report.ConfigReport facade for this config.

Generates a self-contained HTML report summarising all configuration parameters, a shrinkage-guidance table, and a theory section on Ledoit-Wolf shrinkage.

To also include a lambda-sweep chart (Sharpe vs λ), use :attr:BasanosEngine.config_report instead, which requires price and signal data.

Returns:

Type	Description
`ConfigReport`	basanos.math._config_report.ConfigReport: Report facade with
`ConfigReport`	`to_html()` and `save()` methods.

Examples:

>>> from basanos.math import BasanosConfig
>>> cfg = BasanosConfig(vola=10, corr=20, clip=3.0, shrink=0.5, aum=1e6)
>>> report = cfg.report
>>> html = report.to_html()
>>> "Parameters" in html
True

`_reject_legacy_flat_kwargs(data: dict[str, object]) -> dict[str, object]` `classmethod` ¶

Raise an informative TypeError when the pre-v0.4 flat kwargs are used.

Before v0.4 callers passed covariance_mode, n_factors, and window as top-level keyword arguments to :class:BasanosConfig. Those fields were replaced by the nested discriminated union covariance_config. Without this validator Pydantic raises a generic extra_forbidden error that gives no migration guidance.

Examples:

>>> BasanosConfig(
...     vola=10, corr=20, clip=3.0, shrink=0.5, aum=1e6,
...     covariance_mode="sliding_window", window=30, n_factors=2,
... )
Traceback (most recent call last):
    ...
TypeError: ...

`replace(*, vola: int | None = None, corr: int | None = None, clip: float | None = None, shrink: float | None = None, aum: float | None = None, denom_tol: float | None = None, position_scale: float | None = None, min_corr_denom: float | None = None, max_nan_fraction: float | None = None, covariance_config: CovarianceConfig | None = None, cost_per_unit: float | None = None, max_turnover: float | None = _SENTINEL) -> BasanosConfig` ¶

Return a new :class:BasanosConfig with selected fields replaced.

Unlike :meth:model_copy, this method uses explicit constructor kwarg forwarding so that any new required field added to :class:BasanosConfig surfaces immediately as a type or lint error at the call site, rather than silently failing at runtime.

All parameters default to None, meaning keep the existing value. Pass a non-None value for every field you want to change.

Parameters:

Name	Type	Description	Default
`vola`	`int \| None`	EWMA lookback for volatility normalisation.	`None`
`corr`	`int \| None`	EWMA lookback for correlation estimation.	`None`
`clip`	`float \| None`	Clipping threshold for volatility adjustment.	`None`
`shrink`	`float \| None`	Retention weight λ ∈ [0, 1] for linear shrinkage.	`None`
`aum`	`float \| None`	Assets under management for portfolio scaling.	`None`
`denom_tol`	`float \| None`	Minimum normalisation denominator.	`None`
`position_scale`	`float \| None`	Multiplicative scaling factor for cash positions.	`None`
`min_corr_denom`	`float \| None`	Guard threshold for the EWMA correlation denominator.	`None`
`max_nan_fraction`	`float \| None`	Maximum tolerated null fraction per price column.	`None`
`covariance_config`	`CovarianceConfig \| None`	Covariance estimation configuration.	`None`
`cost_per_unit`	`float \| None`	One-way trading cost per unit of position change.	`None`
`max_turnover`	`float \| None`	Optional turnover budget per period in cash-position units. Pass `None` explicitly to clear an existing budget.	`_SENTINEL`

Returns:

Type	Description
`BasanosConfig`	A new :class:`BasanosConfig` with the specified fields replaced and
`BasanosConfig`	all other fields copied from `self`.

Examples:

>>> cfg = BasanosConfig(vola=10, corr=20, clip=3.0, shrink=0.5, aum=1e6)
>>> cfg2 = cfg.replace(shrink=0.8)
>>> cfg2.shrink
0.8
>>> cfg2.vola == cfg.vola
True
>>> cfg3 = cfg.replace(cost_per_unit=0.001, max_turnover=1e5)
>>> cfg3.cost_per_unit
0.001
>>> cfg3.max_turnover
100000.0

`corr_greater_than_vola(v: int, info: ValidationInfo) -> int` `classmethod` ¶

Optionally enforce corr ≥ vola for stability.

Pydantic v2 passes ValidationInfo; use info.data to access other fields.

CovarianceMode¶

`CovarianceMode` ¶

Bases: StrEnum

Covariance estimation mode for the Basanos optimizer.

Attributes:

Name	Type	Description
`ewma_shrink`		EWMA correlation matrix with linear shrinkage toward the identity. Controlled by :attr:`BasanosConfig.shrink`. This is the default mode.
`sliding_window`		Rolling-window factor model. A fixed block of the `W` most recent volatility-adjusted returns is decomposed via truncated SVD into `k` latent factors, giving the estimator .. math:: `\\hat{C}_t^{(W,k)} = \\frac{1}{W} \\mathbf{V}_{k,t}\\mathbf{\\Sigma}_{k,t}^2\\mathbf{V}_{k,t}^\\top + \\hat{D}_t` where :math:`\\hat{D}_t` is chosen to enforce unit diagonal. The system is solved efficiently via the Woodbury identity (Section 4.3 of basanos.pdf) at :math:`O(k^3 + kn)` per step rather than :math:`O(n^3)`. Configured via :class:`SlidingWindowConfig`.

Examples:

>>> CovarianceMode.ewma_shrink
<CovarianceMode.ewma_shrink: 'ewma_shrink'>
>>> CovarianceMode.sliding_window
<CovarianceMode.sliding_window: 'sliding_window'>
>>> CovarianceMode("sliding_window")
<CovarianceMode.sliding_window: 'sliding_window'>

`ewma_shrink = 'ewma_shrink'` `class-attribute` `instance-attribute` ¶

`sliding_window = 'sliding_window'` `class-attribute` `instance-attribute` ¶

EwmaShrinkConfig¶

`EwmaShrinkConfig` ¶

Bases: BaseModel

Covariance configuration for the ewma_shrink mode.

This is the default covariance mode. No additional parameters are required beyond those already present on :class:BasanosConfig (shrink, corr).

.. note:: This class is intentionally minimal. The only field is the covariance_mode discriminator, which is required to make Pydantic's discriminated-union dispatch work correctly (see :data:CovarianceConfig). Before adding new EWMA-specific fields here, consider whether the field name clashes with existing :class:BasanosConfig top-level fields and whether it would constitute a breaking change to the public API.

Examples:

>>> cfg = EwmaShrinkConfig()
>>> cfg.covariance_mode
<CovarianceMode.ewma_shrink: 'ewma_shrink'>

`covariance_mode: Literal[CovarianceMode.ewma_shrink] = CovarianceMode.ewma_shrink` `class-attribute` `instance-attribute` ¶

`model_config = {'frozen': True}` `class-attribute` `instance-attribute` ¶

SlidingWindowConfig¶

`SlidingWindowConfig` ¶

Bases: BaseModel

Covariance configuration for the sliding_window mode.

Requires both window (rolling window length) and n_factors (number of latent factors for the truncated SVD factor model).

Effective component count — at each streaming step the number of SVD components actually used is

.. math::

k_{\text{eff}} = \min(k,\; W,\; n_{\text{valid}},\; k_{\text{max}})

where :math:k = n_factors, :math:W = window, :math:n_{\text{valid}} is the number of assets with finite prices at that step, and :math:k_{\text{max}} = max_components (or :math:+\infty when unset). This ensures the truncated SVD remains well-posed even when assets temporarily drop out of the universe. Setting max_components explicitly caps computational cost in large universes without changing the desired factor count used in batch mode.

Parameters:

Name	Description	Default
`window`	Rolling window length :math:`W \\geq 1`. Rule of thumb: :math:`W \\geq 2n` keeps the sample covariance well-posed before truncation.	required
`n_factors`	Number of latent factors :math:`k \\geq 1`. :math:`k = 1` recovers the single market-factor model; larger :math:`k` captures finer correlation structure at the cost of higher estimation noise.	required
`max_components`	Optional hard cap on the number of SVD components used per streaming step. When set, the effective component count is :math:`\\min(k_{\\text{eff}},\\, \\texttt{max\\_components})`. Useful for large universes where only a few factors dominate and you want to limit SVD cost below `n_factors`. Must be :math:`\\geq 1` when provided. Defaults to `None` (no extra cap).	required

Examples:

>>> cfg = SlidingWindowConfig(window=60, n_factors=3)
>>> cfg.covariance_mode
<CovarianceMode.sliding_window: 'sliding_window'>
>>> cfg.window
60
>>> cfg.n_factors
3
>>> cfg.max_components is None
True
>>> cfg2 = SlidingWindowConfig(window=60, n_factors=10, max_components=3)
>>> cfg2.max_components
3

`covariance_mode: Literal[CovarianceMode.sliding_window] = CovarianceMode.sliding_window` `class-attribute` `instance-attribute` ¶

`window: int = Field(..., gt=0, description='Sliding window length W (number of most recent observations). Rule of thumb: W >= 2 * n_assets to keep the sample covariance well-posed. Note: the first W-1 rows of output will have zero/empty positions while the sliding window fills up (warm-up period). Account for this when interpreting results or sizing positions.')` `class-attribute` `instance-attribute` ¶

`n_factors: int = Field(..., gt=0, description='Number of latent factors k for the sliding window factor model. k=1 recovers the single market-factor model; larger k captures finer correlation structure at the cost of higher estimation noise. At each streaming step the actual number of components used is min(n_factors, window, n_valid_assets[, max_components]), so the effective rank may be lower than n_factors when the number of valid assets or the window length is the binding constraint.')` `class-attribute` `instance-attribute` ¶

`max_components: int | None = Field(default=None, gt=0, description='Optional hard cap on the number of SVD components used per streaming step. When set, the effective component count is min(n_factors, window, n_valid_assets, max_components). Useful for large universes where only a few factors dominate and you want to limit SVD cost below n_factors. Must be >= 1 when provided. Defaults to None.')` `class-attribute` `instance-attribute` ¶

`model_config = {'frozen': True}` `class-attribute` `instance-attribute` ¶

`_validate_max_components() -> SlidingWindowConfig` ¶

Validate that max_components does not exceed n_factors.

CovarianceConfig¶

`CovarianceConfig = Annotated[EwmaShrinkConfig | SlidingWindowConfig, Field(discriminator='covariance_mode')]` `module-attribute` ¶

Discriminated union of covariance-mode configurations.

Pydantic selects the correct sub-config based on the covariance_mode discriminator field:

:class:EwmaShrinkConfig when covariance_mode="ewma_shrink"
:class:SlidingWindowConfig when covariance_mode="sliding_window"

Configuration¶

BasanosConfig¶

BasanosConfig ¶

Shrinkage methodology¶

Default rationale¶

Sliding-window mode¶

vola: int = Field(..., gt=0, description='EWMA lookback for volatility normalization.') class-attribute instance-attribute ¶

corr: int = Field(..., gt=0, description='EWMA lookback for correlation estimation.') class-attribute instance-attribute ¶

clip: float = Field(..., gt=0.0, description='Clipping threshold for volatility adjustment.') class-attribute instance-attribute ¶

aum: float = Field(..., gt=0.0, description='Assets under management for portfolio scaling.') class-attribute instance-attribute ¶

model_config = {'frozen': True, 'extra': 'forbid'} class-attribute instance-attribute ¶

covariance_mode: CovarianceMode property ¶

window: int | None property ¶

n_factors: int | None property ¶

report: ConfigReport property ¶

_reject_legacy_flat_kwargs(data: dict[str, object]) -> dict[str, object] classmethod ¶

corr_greater_than_vola(v: int, info: ValidationInfo) -> int classmethod ¶

CovarianceMode¶

CovarianceMode ¶

ewma_shrink = 'ewma_shrink' class-attribute instance-attribute ¶

sliding_window = 'sliding_window' class-attribute instance-attribute ¶

EwmaShrinkConfig¶

EwmaShrinkConfig ¶

covariance_mode: Literal[CovarianceMode.ewma_shrink] = CovarianceMode.ewma_shrink class-attribute instance-attribute ¶

model_config = {'frozen': True} class-attribute instance-attribute ¶

SlidingWindowConfig¶

SlidingWindowConfig ¶

covariance_mode: Literal[CovarianceMode.sliding_window] = CovarianceMode.sliding_window class-attribute instance-attribute ¶

model_config = {'frozen': True} class-attribute instance-attribute ¶

_validate_max_components() -> SlidingWindowConfig ¶

CovarianceConfig¶

CovarianceConfig = Annotated[EwmaShrinkConfig | SlidingWindowConfig, Field(discriminator='covariance_mode')] module-attribute ¶

`BasanosConfig` ¶

`vola: int = Field(..., gt=0, description='EWMA lookback for volatility normalization.')` `class-attribute` `instance-attribute` ¶

`corr: int = Field(..., gt=0, description='EWMA lookback for correlation estimation.')` `class-attribute` `instance-attribute` ¶

`clip: float = Field(..., gt=0.0, description='Clipping threshold for volatility adjustment.')` `class-attribute` `instance-attribute` ¶

`aum: float = Field(..., gt=0.0, description='Assets under management for portfolio scaling.')` `class-attribute` `instance-attribute` ¶

`model_config = {'frozen': True, 'extra': 'forbid'}` `class-attribute` `instance-attribute` ¶

`covariance_mode: CovarianceMode` `property` ¶

`window: int | None` `property` ¶

`n_factors: int | None` `property` ¶

`report: ConfigReport` `property` ¶

`_reject_legacy_flat_kwargs(data: dict[str, object]) -> dict[str, object]` `classmethod` ¶

`corr_greater_than_vola(v: int, info: ValidationInfo) -> int` `classmethod` ¶

`CovarianceMode` ¶

`ewma_shrink = 'ewma_shrink'` `class-attribute` `instance-attribute` ¶

`sliding_window = 'sliding_window'` `class-attribute` `instance-attribute` ¶

`EwmaShrinkConfig` ¶

`covariance_mode: Literal[CovarianceMode.ewma_shrink] = CovarianceMode.ewma_shrink` `class-attribute` `instance-attribute` ¶

`model_config = {'frozen': True}` `class-attribute` `instance-attribute` ¶

`SlidingWindowConfig` ¶

`covariance_mode: Literal[CovarianceMode.sliding_window] = CovarianceMode.sliding_window` `class-attribute` `instance-attribute` ¶

`model_config = {'frozen': True}` `class-attribute` `instance-attribute` ¶

`_validate_max_components() -> SlidingWindowConfig` ¶

`CovarianceConfig = Annotated[EwmaShrinkConfig | SlidingWindowConfig, Field(discriminator='covariance_mode')]` `module-attribute` ¶