China just demonstrated that they can create models comparable with US’s “juggernauts” like Chat GTP 4.o, but small enough to fit into your own devices that you own. There is nothing secret about how a LLM works, it’s just challenging because the “AI magic” doesn’t emerge until you have a sufficiently large enough database and then it basically programs itself. (Criminally simplified.)
But there are tricks you can do to make those databases smaller and more efficient and still retain the complex neural networks that make the things work. When that day comes (soon) everyone will be able to literally have their own LLM’s in-device that won’t be attached to entities like Google or Open AI.
Disclaimer: I am a major AI skeptic and despise the current state of the tech, so I am giving my own grains of salt to take all this with.
I would contest this and say that Europe (Mistral) and other US companies (like Meta’s Llama series that seeded everything happening in China now) were chasing ChatGPT very closely before Deepseek/Alibaba. Even S Korea (LG Exaone) and many smaller companies are putting up competition, often building on international work.
Also, locally runnable Deepseek is nothing like GPT4. The 32B is smart, but it just doesn’t have the world knowledge the 671B model has, which is not practical to run locally.
…Sorry for being so nitpicky, as I agree with the sentiment.
China just demonstrated that they can create models comparable with US’s “juggernauts” like Chat GTP 4.o, but small enough to fit into your own devices that you own. There is nothing secret about how a LLM works, it’s just challenging because the “AI magic” doesn’t emerge until you have a sufficiently large enough database and then it basically programs itself. (Criminally simplified.)
But there are tricks you can do to make those databases smaller and more efficient and still retain the complex neural networks that make the things work. When that day comes (soon) everyone will be able to literally have their own LLM’s in-device that won’t be attached to entities like Google or Open AI.
Disclaimer: I am a major AI skeptic and despise the current state of the tech, so I am giving my own grains of salt to take all this with.
I would contest this and say that Europe (Mistral) and other US companies (like Meta’s Llama series that seeded everything happening in China now) were chasing ChatGPT very closely before Deepseek/Alibaba. Even S Korea (LG Exaone) and many smaller companies are putting up competition, often building on international work.
Also, locally runnable Deepseek is nothing like GPT4. The 32B is smart, but it just doesn’t have the world knowledge the 671B model has, which is not practical to run locally.
…Sorry for being so nitpicky, as I agree with the sentiment.