- Add typed limit fields to RuntimeUsageCurrent: output_token_limit, total_token_limit, message_limit with matching pct and source - Add total_output_tokens and output_tokens_per_minute to burn rate - _build_current() now computes each pct from matching units only - Legacy token_limit backfilled from typed limits for backwards compat - Frontend aggregateRuntimeUsage() tracks typed limits separately - limit_kind field on predictions indicates which limit drove time-to-limit |
||
|---|---|---|
| .. | ||
| api | ||
| core | ||
| db | ||
| models | ||
| schemas | ||
| services | ||
| __init__.py | ||
| main.py | ||