AI & Model-Level Risks

Model Manipulation / Prompt Injection

Attack: Submitting signals crafted to exploit AI behavior or bias models.

Mitigations:

  • Signals are schema-validated

  • No free-form text inputs

  • Deterministic encoding of human inputs

  • Multiple independent agent evaluations

Residual Risk: Low—attack surface intentionally constrained.

Model Drift & Overfitting

Risk (Non-Adversarial): AI agents overfit to recent regimes or degrade in performance.

Mitigations:

  • Continuous regime detection

  • Ensemble agent disagreement thresholds

  • Auto-deployment of fallback strategies

  • Performance-based agent throttling

Residual Risk: Moderate, but actively managed.

AI-Agent Collusion

Attack (Theoretical): Multiple agents converge on a flawed strategy due to shared biases.

Mitigations:

  • Heterogeneous agent architectures

  • Independent training pipelines

  • Forced diversity constraints

  • Capital allocation caps per strategy

Last updated