As a 15 year veteran of the software industry I'm not really interested in building a workflow that involves an online service that will undergo enshittification or just outright fail in a year or two.
I'm running exllamav2 and llama.cpp and I'm doing a lot of cool things with them. I would love to see some content around these tools especially now that pretty much anybody with a gaming GPU can run Gemma 2 9B or anybody with a modern smart phone can run Phi 3.5 Mini Instruct. These are the real future of AI.