
raldone01

raldone01@lemmy.world
2 posts • 39 comments

Each card has 24GB, so 48GB of VRAM total. I use Ollama; it fills whatever VRAM is available on both cards and runs the rest on the CPU cores.
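
If you want to control the split yourself, here's a rough sketch against Ollama's local HTTP API (the default port, the model tag, and the layer count are all placeholders): the `num_gpu` option caps how many layers get offloaded to the GPUs, and anything beyond that runs on the CPU. Leaving it out lets Ollama pick the split automatically, which is the behaviour described above.

```python
# Minimal sketch: ask Ollama to offload a fixed number of layers to the GPUs.
# Model tag, prompt, and layer count are assumptions -- adjust to your setup.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "llama3:70b",            # whatever tag you have pulled
        "prompt": "Say hello in one sentence.",
        "stream": False,
        "options": {"num_gpu": 60},       # layers to offload across both P40s
    },
    timeout=600,
)
resp.raise_for_status()
print(resp.json()["response"])
```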


My specs because you asked:

CPU: Intel(R) Xeon(R) E5-2699 v3 (72) @ 3.60 GHz
GPU 1: NVIDIA Tesla P40 [Discrete]
GPU 2: NVIDIA Tesla P40 [Discrete]
GPU 3: Matrox Electronics Systems Ltd. MGA G200EH
Memory: 66.75 GiB / 251.75 GiB (27%)
Swap: 75.50 MiB / 40.00 GiB (0%)

What are you asking exactly?

What do you want to run? I assume you have a 24GB GPU and 64GB host RAM?


I regularly run llama3 70b unquantized on two P40s and the CPU at around 7 tokens/s. It’s usable but not very fast.
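
If anyone wants to reproduce that number, the non-streaming Ollama response already carries the timing fields; a quick sketch (the model tag and prompt are placeholders):

```python
# Rough sketch: eval_count is the number of generated tokens and eval_duration
# is in nanoseconds, so tokens per second falls straight out of the response.
import requests

data = requests.post(
    "http://localhost:11434/api/generate",
    json={"model": "llama3:70b", "prompt": "Explain RAID 5 in two sentences.", "stream": False},
    timeout=1200,
).json()

print(f"{data['eval_count'] / data['eval_duration'] * 1e9:.1f} tokens/s")
```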


True, multiple drives speed up reads significantly. As long as the videos are read sequentially, though, read speeds can be very fast (600MB/s) even on one drive. Results may vary.
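
Easy enough to check on your own setup; a quick-and-dirty sequential read benchmark (the file path is a placeholder, and you'd want to drop the page cache first or the number mostly measures RAM):

```python
# Rough sequential-read benchmark. Drop the OS page cache first
# (e.g. sync; echo 3 > /proc/sys/vm/drop_caches) or the result is inflated.
import time

PATH = "/mnt/media/some_large_video.mkv"   # placeholder: any big file on the drive
CHUNK = 16 * 1024 * 1024                   # 16 MiB reads keep the drive streaming

total = 0
start = time.monotonic()
with open(PATH, "rb", buffering=0) as f:
    while True:
        buf = f.read(CHUNK)
        if not buf:
            break
        total += len(buf)
elapsed = time.monotonic() - start
print(f"{total / elapsed / 1e6:.0f} MB/s over {total / 1e9:.1f} GB")
```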


I have a ~40TB HDD array and Jellyfin is super fast. Just put the database and cache files on an SSD.

For bulk storage of high-bitrate 4K videos, HDDs are way cheaper.
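
One way to do the SSD move, as a rough sketch: relocate the data and cache directories and leave symlinks behind. The paths below are the usual Linux package locations plus an assumed SSD mount point; stop Jellyfin first and adjust to your install.

```python
# Sketch only: move Jellyfin's data (library DB, metadata) and cache to an SSD
# and symlink the old locations to the new ones. Paths are assumptions.
import os
import shutil

MOVES = {
    "/var/lib/jellyfin": "/mnt/ssd/jellyfin/data",
    "/var/cache/jellyfin": "/mnt/ssd/jellyfin/cache",
}

for old, new in MOVES.items():
    os.makedirs(os.path.dirname(new), exist_ok=True)
    shutil.move(old, new)   # copies across filesystems, then removes the original
    os.symlink(new, old)    # Jellyfin keeps using the old path transparently
    # Re-chown the new location to the jellyfin user if ownership was lost in the copy.
```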


Which OS are you running?

Try partitioning it with free space at the end and see if it makes a difference.

Try trimming the drive and see if it speeds up again.

Do you use any disk encryption?
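
A crude way to quantify "speeds up again": time a synced sequential write before and after trimming and compare the numbers (the test path is a placeholder on the drive in question).

```python
# Rough write-speed check. Run once before and once after trimming,
# then compare. TEST_FILE is a placeholder path on the slow drive.
import os
import time

TEST_FILE = "/mnt/suspect_drive/speedtest.bin"
CHUNK = b"\0" * (8 * 1024 * 1024)   # 8 MiB blocks
TARGET = 2 * 1024**3                # write 2 GiB in total

start = time.monotonic()
with open(TEST_FILE, "wb") as f:
    written = 0
    while written < TARGET:
        f.write(CHUNK)
        written += len(CHUNK)
    f.flush()
    os.fsync(f.fileno())            # make sure the data actually hit the disk
elapsed = time.monotonic() - start
os.remove(TEST_FILE)
print(f"{TARGET / elapsed / 1e6:.0f} MB/s write")
```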


Llama3.1 33b would be so cool. It would be a nice middle ground for my machine.
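
Back-of-the-envelope for why ~33B would be a nice middle ground, as approximate weight memory at common precisions (the bits-per-weight figures are rough averages for typical quant formats, and KV cache plus runtime overhead come on top):

```python
# Rough arithmetic only: weight memory for a ~33B-parameter model at a few
# precisions. Bits-per-weight values are approximate; KV cache is extra.
PARAMS = 33e9
for name, bits in [("fp16", 16), ("q8_0", 8.5), ("q4_K_M", 4.8)]:
    gib = PARAMS * bits / 8 / 1024**3
    print(f"{name:>7}: ~{gib:.0f} GiB of weights")
```

That comes out to roughly 62 GiB at fp16, 33 GiB at 8-bit, and 18 GiB at 4-bit, so anything quantized would fit on the two P40s with room left for context.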


At least on Linux, rm is very fast.


I use TubeArchivist. It has a Jellyfin addon, but it could really use some improvements in how it exposes the videos.
