The approach hardwires model weights into transistors and uses an older 6nm process. They're targeting 70B model sizes (presumably 16-bit) by year end. It should cost much less than a 140 GB card, but I don't know details.

  • spencerwi@feddit.org · edited · 2 hours ago

    Oh cool, a whole new e-waste industry. Anyone want this old GPT-4.1 chip? I know the latest is GPT-8 and the whole ecosystem has largely moved on in a way that renders most software incompatible, but hey, it’s right here on this PCI-E card so you can’t stick it in a Raspberry Pi either!

    No? Guess I’ll chuck it in the landfill!