Update on SJW upgrade to v19.3

TheDude@sh.itjust.works · edit-2 2年前

Update on SJW upgrade to v19.3

rowrowrowyourboat@sh.itjust.works · 2年前

Question. Why would you go with 1tb ssds instead of larger hdds? Isn’t the space and price more important than the speed for this use?

You could get double the space (2tb hdd) for the same price as a 1 tb ssd.

Just wondering.

TheDude@sh.itjust.works · 2年前

The biggest consumer of storage on this instance is related to the image hosting which we use an external object storage provider for. The second is the database which is no were near the 2TB capacity. 1TB SSDs are cheaper than 2TB SSDs and I also didn’t want to spend more than I needed. As other mentioned if we need more space or IOPs in the future, I could accomplish this by adding more drives as a quick fix. This server does not support NVME unless I leverage its PCIe ports but I don’t plan on doing that. By the time this instance gets to the point where 10 SSD drives just isn’t cutting it anymore I’ll probably have come across another opportunities on getting a new server with better NVME support.

where_am_i@sh.itjust.works · 2年前

You let us know when you need some help with new NVMes. We’re more than willing to contribute ;)

4am@lemm.ee · edit-2 2年前

Speed is usually the reason. SSDs in general are faster, enterprise SSDs are not only faster but much more write-tolerant and last a very long time in comparison to consumer SSDs.

They can also (in many cases) do write caching at the speed of a DRAM buffer, making the bottleneck the SATA or SAS bus itself (SAS is like enterprise SATA, 12Gb/sec as opposed to 6). NVMe can be even faster. This means that programs (ie Lemmy and its database) that write data aren’t waiting around for the drive to acknowledge the write before that program can move on to other things. Shaving off a few milliseconds per write can make a massive difference when you realize there might be millions of IOPS (Input/Output operations Per Second) under load. The requirement for low latency is everything in servers.

When you are running a public service and requests are coming in constantly and at a high rate, you really really do not want storage latency to bottleneck you, as that is a problem that will compound extremely quickly. This is a big issue with HDDs as well, as even disk seek times add to the problem, let alone caching/buffering writes.

We could talk all day about if four SSDs in a RAID 10 are optimal, but sometimes you have to think about budget and complexity as well. For the load that a popular Lemmy instance might currently draw, I’d make an educated guess that this might be sufficient for now. Room to expand was also mentioned, which is the second most important part of a storage plan.

_cnt0@sh.itjust.works · 2年前

I’d wager raid 5 would be better, but it would require a special storage controller or hog the cpu with 4 ssds.

burrito@sh.itjust.works · 2年前

Software RAID is much faster than you think, even in RAID 5. Many of the algorithms used in software RAID leverage special CPU instructions that can process the parity operations at a very fast rate. Reading the data, which is by far the most common operation in a Lemmy instance, uses even less computational power than writes.

4am@lemm.ee · 2年前

Yeah, ZFS rocks these days. Fast and rock solid for me, even on older hardware. I run my whole array as mirrored vdevs (so, basically a bunch of raid 10) to keep resilver times down when i replace drives. No issues so far!

UNWILLING_PARTICIPANT@sh.itjust.works · 2年前

Probably for faster loading times when playing games on the hardware during off peak hours jk jk lol

SorteKanin@feddit.dk · edit-2 2年前

Speed is important. All else equal, the database will work faster with SSDs. Raid also makes the storage be under heavier load so SSDs make even more sense here as well. You want response times to be as low as possible for a good user experience.

But also SSDs are kinda standard now and you can get a decent amount of storage for not that much higher price. Especially for server hardware that is more or less constantly under load, SSDs just make a lot more sense.

Task	Date	Expected Downtime
Migration to new server	Tuesday February 27 2024 @ 8:00PM ET	90 Minutes
Upgrade to V19.3	Thursday February 29 2024 @ 8:00PM ET	Up to 120 Minutes