Discussion about this post

User's avatar
Joe Belial's avatar

So people got mad that when they bypassed the TOS, they got banned?

Back when I was using Crew and Autogen, the go to was to do something like this with ChatGPT. Instead of paying for the API why not just use Oauth and just pay for the flat rate, it doesn't harm anyone breaking the rules right?

I mean let's just ignore the security implications for a second, and say it's all fine and dandy. GPUs and Ram time are expensive. Like incredibly so.

I think more so than just flat rate plans, we need better Routing.

I said it in 2024 and I still think it now. Agent programs need to really hammer in "use my local models for X Y and Z." "determine which model is the best to send this prompt to, and which agent / sub agent to assign"

When your Agents stop spamming Opus models with Millions in tokens, and start using your free and smaller models to begin with, and only use the Super Ultra ones for the absolute bare minimum, you end up spending so much less. I hope rather than seeing more flat rate subs pop up, we see subscriptions to better routing.

Ugo Chukwu's avatar

This has huge implications for the next frontier of models to be developed and for the new economy of the internet marketplace.

Once the incentive shifts to building for AI agents we are officially in the abundance era and the economics starts looking and behaving crazy.

8 more comments...

No posts

Ready for more?