Discussion about this post

David

This resonated more than I expected. The framing around “human-paced” subscriptions vs agent-paced reality is exactly where the tension shows up in practice. What stood out to me is that most builders aren’t trying to extract unfair value — they’re just letting automation do what it’s explicitly marketed to do: run quietly, repeatedly, and reliably. When that behaviour collides with pricing models built around pauses and intent, the breakage feels less like abuse and more like a design mismatch. It’s encouraging to see this acknowledged openly. The next evolution clearly isn’t just better agents, but pricing and limits that assume AI is becoming infrastructure, not a chat box.

Joe Belial

So people got mad that they got banned for bypassing the TOS?

Back when I was using Crew and AutoGen, the go-to was to do something like this with ChatGPT. Instead of paying for the API, why not just use OAuth and pay the flat rate? It doesn't harm anyone to break the rules, right?

I mean, let's ignore the security implications for a second and say it's all fine and dandy. GPU and RAM time is expensive. Like, incredibly so.

More so than flat-rate plans, I think we need better routing.

I said it in 2024 and I still think it now. Agent programs need to really hammer in "use my local models for X, Y, and Z" and "determine which model is the best to send this prompt to, and which agent or sub-agent to assign."

When your agents stop spamming Opus models with millions of tokens, start with your free and smaller models, and use the super-ultra ones only for the absolute bare minimum, you end up spending so much less. I hope that rather than seeing more flat-rate subs pop up, we see subscriptions to better routing.
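
To make the routing idea concrete, here is a minimal sketch of "cheapest model that is good enough." Everything in it is an assumption for illustration: the model names, the prices, the capability tiers, and the task-to-tier table are all hypothetical, and a real router would likely classify prompts rather than rely on a fixed lookup.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str             # hypothetical label, not a real product name
    cost_per_mtok: float  # assumed USD per million tokens; 0.0 means local/free
    capability: int       # assumed tier: 1 = small/local, 3 = frontier


# Hypothetical catalogue: one local model and two hosted tiers.
MODELS = [
    Model("local-small", 0.0, 1),
    Model("hosted-mid", 3.0, 2),
    Model("hosted-frontier", 15.0, 3),
]

# Hypothetical mapping from task type to the minimum tier it needs.
TASK_TIER = {
    "summarize": 1,
    "classify": 1,
    "extract": 1,
    "code_edit": 2,
    "architecture_review": 3,
}


def route(task: str, default_tier: int = 2) -> Model:
    """Return the cheapest model whose tier meets the task's minimum."""
    needed = TASK_TIER.get(task, default_tier)
    candidates = [m for m in MODELS if m.capability >= needed]
    return min(candidates, key=lambda m: m.cost_per_mtok)


def estimate_cost(jobs: list[tuple[str, int]]) -> float:
    """Sum the assumed cost of (task, token_count) jobs after routing."""
    return sum(
        route(task).cost_per_mtok * tokens / 1_000_000
        for task, tokens in jobs
    )


if __name__ == "__main__":
    jobs = [("summarize", 2_000_000), ("architecture_review", 100_000)]
    for task, _ in jobs:
        print(task, "->", route(task).name)
    print("estimated cost:", estimate_cost(jobs))
```

With these made-up numbers, the bulk summarization work lands on the local model at no metered cost, and only the small review job reaches the expensive tier.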
