This new Chinese AI is outperforming ChatGPT, and it’s open-source & runs locally

Lugh@futurology.today · 3 days ago

dimjim@sh.itjust.works · 3 days ago

Yet, even when heavily compressed, it requires roughly 240GB of memory just to load.

Ah I’ll just pop it in the ol’ Raspberry Pi then, easy peasy.

Nouvellalia@lemmy.world · edit-2 2 days ago

Lol, “runs locally”. I mean, Claude rubs locally too if you’re in the room with the racks.

Edit: I said what I said. Get some lube and go hang out with Claude’s hot, noisy, 5 kW rack. You know what they say: “Once you go stack, you never go back.”

mindbleach@sh.itjust.works · 2 days ago

CRAC will give you one hell of a blow-job.

And tinnitus.

Valmond@lemmy.dbzer0.com · 3 days ago

RPi 6: 256GB RAM

Fluffy Kitty Cat@slrpnk.net · 2 days ago

Basically, they never had any moat to begin with, but no one else seems to know how to fit that much intelligence into less space. It’s possible that it just fundamentally has to take up that much space, which would also imply that future computing gains are going to be more focused on memory than raw computation.

mindbleach@sh.itjust.works · 2 days ago

Genuinely compact models are hitting benchmarks just a couple months behind the big boys. And eventually we’ll get better at decoupling data from processing - so the model can do a regular-ass search of a regular-ass database and pull details into its context as needed. Ideally while also decoupling that context from the prompt, because apparently these things can have a hundred attention heads, and still nobody thought of having two text input fields.

Fluffy Kitty Cat@slrpnk.net · 2 days ago

You’d think they’d have done that by now, and maybe some symbolic AI too

mindbleach@sh.itjust.works · 2 days ago

All focus has been forced onto LLMs and diffusion, even though only diffusion works properly. And those LLMs better iterate on the exact same mechanisms we’ve tweaked for the last six years, because all results will be compared to the state of the art, right the hell now, not a comparable level of development or compute.

eleijeep@piefed.social · 3 days ago

GLM 5.2

saved you a click

Pennomi@lemmy.world · 3 days ago

The really crazy thing is that this model still performs well at one-bit quantization, which shows it’s got a lot of room for improvement on size. It’s within an order of magnitude of being able to be run on consumer hardware, which would be an even more amazing kick in the balls to American AI companies.
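For a rough sense of what "within an order of magnitude" means here, a back-of-the-envelope sketch follows. It takes the roughly 240 GB load figure quoted upthread at face value and compares it against a few consumer memory sizes. Those sizes and the `shrink_factor` helper are illustrative assumptions, not anything from the article.

```python
# Figure quoted upthread: roughly 240 GB just to load the compressed model.
MODEL_GB = 240

# Hypothetical consumer memory sizes in GB (illustrative assumptions only).
CONSUMER_GB = [16, 24, 32, 64, 128]


def shrink_factor(model_gb: float, available_gb: float) -> float:
    """How many times smaller the model must get to fit in available_gb."""
    return model_gb / available_gb


for gb in CONSUMER_GB:
    factor = shrink_factor(MODEL_GB, gb)
    verdict = "within 10x" if factor <= 10 else "more than 10x away"
    print(f"{gb:>4} GB machine: needs a {factor:.1f}x shrink ({verdict})")
```

This only counts the memory needed to load the model. Context and runtime overhead would push the real requirement higher.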

John Richard@lemmy.world · edit-2 3 days ago

Sucks that people lump AI into a single category of whatever cloud-hosted subscription that tech bros from Silicon Valley are pushing.

Fluffy Kitty Cat@slrpnk.net · 2 days ago

Given how memory is the bottleneck, especially at the very low end, it makes me wonder if one-bit quantization of an extremely large model would be better, gigabyte for gigabyte of RAM.
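The arithmetic behind that question can be sketched quickly: at a fixed RAM budget, fewer bits per weight means proportionally more weights. The 8 GB budget and the `params_that_fit` helper below are hypothetical, and the sketch counts weight storage only, ignoring quantization scales, context, and activations.

```python
def params_that_fit(ram_gb: float, bits_per_weight: float) -> float:
    """Number of weights that fit in ram_gb, counting weight storage only."""
    total_bits = ram_gb * 1e9 * 8  # 1 GB taken as 10**9 bytes
    return total_bits / bits_per_weight


RAM_GB = 8  # hypothetical low-end memory budget

for bits in (16, 8, 4, 1):
    billions = params_that_fit(RAM_GB, bits) / 1e9
    print(f"{bits:>2}-bit: ~{billions:.0f}B parameters in {RAM_GB} GB")
```

So the same 8 GB holds 16 times as many one-bit weights as 16-bit weights. Whether the bigger one-bit model is actually smarter than the smaller high-precision one is an empirical question this doesn't answer.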

mindbleach@sh.itjust.works · 2 days ago

You can fit GLM in the headline. It’s three letters.

_haha_oh_wow_@sh.itjust.works · 2 days ago

C’mon, pop!
