模型回退冷却过严导致越级烧钱:为 fallback 链加预算与可达性校验
场景:主模型限流后,订阅/OAuth 提供方因 cooldown 被整段跳过,直接回退到付费 API。技巧:在回退链增加 provider 可达性检查与成本护栏,避免隐性费用泄漏。
GITHUBDiscovered 2026-02-14Author pcpilot-dev
Prerequisites
- You use multi-provider fallback (subscription/OAuth + paid API providers).
- Per-provider usage/cost logs are available for auditing fallback decisions.
Steps
- Trigger a controlled primary-model rate-limit scenario and capture fallback attempt order from logs.
- Check whether provider-level cooldown causes full provider skip when all profiles are marked cooling down.
- Set temporary policy: place low-cost providers first and enforce hard spend guard for paid fallback tiers.
- After cooldown expiry window, re-test fallback and compare whether OAuth/subscription providers are retried before paid providers.
Commands
openclaw gateway statusopenclaw logs --local-timeopenclaw helpVerify
Fallback logs show expected retry of subscription/OAuth providers and paid-provider spend remains within predefined guardrail.
Caveats
- Issue describes current behavior; exact cooldown semantics may change in patch releases(需验证).
- Do not infer billing safety from one incident; keep ongoing cost anomaly alerts enabled.
Source attribution
This tip is aggregated from community/public sources and preserved with attribution.
Open original source ↗