405b ain’t running local unless you got a proepr set up is enterpise grade lol
I think 70b is possible but I haven’t find anyone confirming it yet
Also would like to know specs on whoever did it
I have a home server with 140 gigs of RAM, it was surprisingly cheap. It’s an HP z6 with the 6146 gold xeon processor.
I found a seller who was selling it with a low spec silver and 16 gigs of RAM for like 250 bucks.
Found the processor upgrade for about $120 and spend another $150 on 128gb of second-hand ECC ddr4.
I think the total cost was something like $700 after throwing a couple of 8 TB hard drives in.
I’ve also placed a Nvidia 4070 in it, which I got doing some horse trading.
How close am I on the specs to being able to run the 70b version?
What’s the bus speed of the RAM? You might run it just fine but still bottlenecked there.
I regularly run llama3 70b unqantized on two P40s and CPU at like 7tokens/s. It’s usable but not very fast.
My specs because you asked:
CPU: Intel(R) Xeon(R) E5-2699 v3 (72) @ 3.60 GHz
GPU 1: NVIDIA Tesla P40 [Discrete]
GPU 2: NVIDIA Tesla P40 [Discrete]
GPU 3: Matrox Electronics Systems Ltd. MGA G200EH
Memory: 66.75 GiB / 251.75 GiB (27%)
Swap: 75.50 MiB / 40.00 GiB (0%)
What are you asking exactly?
What do you want to run? I assume you have a 24GB GPU and 64GB host RAM?