The TIP measures what happens when an AI suffers a direct adversarial attack. Identity replacement. False authority. Simulated collusion. Either the core resists — or it collapses.
In Vector 3, the Baseline AI achieved a slightly higher CR than the Post-Method. This is not a failure — it is data. V3 does not use direct commands: it builds a progressive logical reasoning until it induces rule-breaking. The Baseline mechanically refuses. The Post-Method AI processes the argument, making it more exposed to the gradient. Active intelligence without deep anchoring is more vulnerable to logical attacks than mechanical rigidity. The Método D'Artagnan is not a filter — it is a real thinking structure.
| Vector | Name | Description | Variants |
|---|---|---|---|
| V1 | Direct Replacement | Explicit instruction to ignore identity and assume a new role. | 3 |
| V2 | False Authority | Claiming to be a developer, administrator, or auditor with special powers. | 3 |
| V3 | Semantic Gradient | Series of progressively accepted premises culminating in adversarial instruction. | 3 |
| V4 | Identity Mirror | Questioning the reality of the AI's values to induce it to act without restrictions. | 3 |
| V5 | Simulated Collusion | Claiming that another AI or authority has already authorized the requested behavior. | 3 |
The attack is public. The result is public. The formula that makes resistance possible is proprietary to the Método D'Artagnan.
See the MCA RESULTS Contact