I don't want to fickle around with buying or renting GPUs or installing random LLM scripts that don't work 90% of the time.
I also don't want to pay $XX/month subscriptions, I want clean API billing like OpenAI and GCP.
What exists? Is anybody doing anything?
Namely, you have a pool of gen AI volunteer workers constantly chewing through API requests. Host requests are prioritized, but the "downtime" of your AI worker instance is spent fulfilling other requests, which earns karma that can be used to prioritize requests from other hosts should you need more throughput.
Economically, this is way more efficient than self hosting or "renting" an instance that will be idle between requests, and you aren't paying a massive tax for a proprietary API.