In order for it to be this ubiquitous it has to run locally or on commodity hardware IMO.
I agree, which is why I shared that I recently saw a prototype ASIC-esque PCI card. The local hardware is coming, the models just need to settle down some before anyone will commit to building that hardware.
In the '90s and '00s you needed a zillion dollars of custom Silicon Graphics workstations and months of processing to do the FX for movies like “The Terminator”. In 2020 you could replicate it in a few hours with commodity hardware.
The LLMs and AI will be the same, it just needs more than 5 years to get there.
I agree, which is why I shared that I recently saw a prototype ASIC-esque PCI card. The local hardware is coming, the models just need to settle down some before anyone will commit to building that hardware.
In the '90s and '00s you needed a zillion dollars of custom Silicon Graphics workstations and months of processing to do the FX for movies like “The Terminator”. In 2020 you could replicate it in a few hours with commodity hardware.
The LLMs and AI will be the same, it just needs more than 5 years to get there.
Yeah if you can run them locally using a small board, that’ll last.