News

promptpressure-evals added to PyPI

  • None--Pypi.org
  • published date: 2026-04-28 20:02:00 UTC

Multi-turn behavioral drift detection for LLMs — tone, sycophancy, refusal sensitivity, persona stability

multi-turn behavioral drift detection for LLMs. the things benchmarks don't test. most eval frameworks measure accuracy on known-answer datasets. PromptPressure measures how models behave over susta… [+15038 chars]