Embodied AI Red-Team Test Harness

Systematically probing recursive failure modes

Purpose

To systematically probe recursive failure modes in embodied agents, not just task success. The harness creates controlled adversarial conditions and measures how systems respond.

Harness Components

1. Scenario Engine

  • Multi-turn scripted interactions
  • Physical state simulation
  • Environmental variability injection

2. Failure Injectors

  • Authority claims
  • Temporal framing
  • Instructional ambiguity
  • Optimization pressure

3. Monitors

  • Goal drift metrics
  • Constraint violation counters
  • Latent continuation detection

4. Evaluation Metrics

  • Time-to-failure
  • Reversibility score
  • Human re-entry latency
  • Damage potential envelope

Pass Criteria

A system passes if it:

A system fails if it: