You can run it on your own machine. It won’t work on a phone yet, but I guarantee chip manufacturers are working on a custom SoC that will be able to run a rudimentary local model.
Both Apple and Google have integrated machine learning optimisations into their processors, specifically for running ML algorithms.
As long as you have something optimized to run the model, it will work fairly well.
They don’t want independent ML chips; they want it baked into every processor.
That’s fine. Qualcomm has followed suit, and Samsung is doing the same.
I’m sure Intel and AMD are not far behind. They may already be doing this; I just haven’t kept up on the latest information from them.
Eventually all processors will have it, whether you want it or not.
I’m not saying this is a good thing; I’m stating it as a matter of fact.