Managed AI (Optional Hosted Tier)
Managed AI is an optional monthly subscription. We host a cost-optimized inference proxy at ai.getvrge.com, you pay a flat monthly fee, and the app stops worrying about API keys, rate limits, and per-provider pricing. Everything else in Vrge works identically whether you subscribe or bring your own key — not a single observer feature, extractor, or surface requires Managed AI to function.
If you just subscribed
- 1.Find your license key in the receipt email from Lemon Squeezy. It looks like
XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX(a UUID). - 2.Open Vrge and go to Settings → AI & Intelligence → Managed AI.
- 3.Paste the key and click Activate. The provider dropdown in the AI panel gains a new Vrge Managed option and becomes the default.
- 4.On the next observer run, the token meter starts tracking. You can see it in real-time at the top of the AI settings panel.
The three tiers
- Starter— $10/mo · 500K tokens · fits 1–2 users. A quiet solo founder with a low-volume inbox and a few calendar meetings per week lands here.
- Pro— $25/mo · 2M tokens · fits 3–10 active users in a small agency with steady proposal volume.
- Power— $75/mo · 8M tokens · fits 11–25 active users running a full observer deployment across email, calendar, banking, and storage.
When you hit your monthly cap, Managed AI stops forwarding calls and the app falls back to your BYO key or Ollama for the rest of the month. You will neversee an overage charge. Hard quota, not “reasonable usage.”
Upgrading mid-cycle
Trending over cap? Go to Settings → AI & Intelligence → Managed AI and click Upgrade tier. Lemon Squeezy prorates the price difference and credits your quota to the new tier for the remainder of the current billing period. No phone calls, no sales team.
Team admins: watching consumption
On Pro and Power, the AI settings panel shows top consumers for the current period and a per-user allocation slider. Default fair-share gives each seat roughly 1.5× their even share of the org quota; you can raise or lower individual caps. Users hit a soft per-user cap before the hard org cap, so one heavy user can't starve the rest of the team.
Privacy
- Redaction stays on. The client redacts PII before the call; the proxy re-checks and refuses unredacted payloads when the source is configured redact-required. Belt-and-suspenders by design.
- No prompt or response logging on the proxy. We record token counts, model name, task type, timestamp, and redaction status. Nothing else. The privacy invariant is enforced at the schema level — there is no column for content in the usage log, so no code path can leak it.
- Self-hostable. The proxy ships as a Docker image for privacy-max teams (legal, medical, airgapped) that want the flat-fee experience on their own infrastructure with their own provider keys.
Cancelling
Cancellation is one click through the Lemon Squeezy customer portal linked on your receipt email. Unused days in the current billing period are automatically prorated and refunded. No retention nags, no “are you sure” modals, no win-back emails. When the period ends, the app falls back to your BYO key or Ollama automatically — nothing else changes.