Yesterday, the last missing piece of my new fileserver arrived. But so much to do. I have to work. I’m learning something outside my normal stuff at the moment which has neither to do with my existing hobbies (it’s a new hobby in planning for a few years now and beginning of this year i really started it instead of just fooling around¹) or my job and i think i will have to explain my teacher why I didn’t go forward with training. But letting new hardware components lying around is against my nature. Fileserver was more important. But i doubt the teacher will accept it.

The last missing part were two 16 Terabytes WD Red Pro disks. It’s just mindboggling that i acquired lots of A5000 arrays back in 2000 and they still hadn’t the amount of storage this single disk had today. But we didn’t talk about helium-filled disks with energy assisted magnetic recording for home use at that time as well.

Almost RAIDless

My configuration is a little bit strange. I’m not using RAID for the large disks. As I wrote earlier: RAID is not a data protection feature, it’s an availability feature. It can prevent you in a hardware failure situation from using data protection features. But at foremost it’s about being able to work without interruption, because your filesystem is still accessible, even when one disk fail where as with my raidless approach I have to manually change things.

But what if you don’t need availability protection? For me it’s more important to have independent copies of the file system. So, when I delete something by accident, I’m able to recover it. If someone encrypts my data, I’m able to jump back quite fast. I can live with some manual reconfiguration.

My config

At the end I had the following configuration:

There is a single pool with RAID on my Fileserver. It’s called flashpool. It’s the reason why this blog entry isn’t titled “Raidless”. It’s the pool where I’m actively doing any work. Where server processes are doing their work. My Gitea runs on it with its workflow for the blog. There are other tools storing their data on it, too. So, this pool is configured with two 4 TB SSD in RAID 1. I thought a while about configuring them as RAID0, but I don’t think i will need 8 TB flash soon (when I’m honest to myself) and I think this would be the filesystem where it’s most probable that i don’t have the time to recover it, when i really need data from it.

Then there are two additional pools: diskpool_primary and diskpool_secondary. Both pools consist out of a single 16 TB WD Red Pro. Currently both pools have three main dataset:

flashpool_backup
timemachine
dataheap

There are a few replications configured there:

flashpool is replicating all changes to diskpool_primary/flashpool_backup every hour.
flashpool is replicating all changes to diskpool_secondary/flashpool_backup every 12 hours.

The reasoning behind this is, that against hardware errors this pool is protected by RAID1 and I have two copies trailing 1 hour respectively 12 hours. In addition to that I have snapshots of the flashpool.

However, I’m using diskpool_primary for directly storing data on it without using flashpoolbefore. So, I need a backup there.

It’s obvious what I’m storing in timemachine. All users of the system get their own dataset in this pool and then for each system they are backing up to the system their own dataset that is created in the dataset of the user. For example, diskpool_primary/timemachine/joergmoellenkamp/waddledee for my desktop Mac. This is shared with SMB to the outside.

diskpool_primary/dataheap is for data hoarding purposes. Large files, old ISOs of prehistoric Solaris Versions, one of the many copies of “My life in receipts as managed by Devonthink”. Or longtime preservation of older media or Virtualbox VMs of experiments of the past.

Several replications are working here as well.

diskpool_primary/dataheap is replicating all changes at 12:00, 18:00 and 22:00 each day to diskpool_secondary/dataheap. There isn’t a replication after 22:00 and before 12:00 because I don’t think I will change much data after 22:00 and before 6:00. And if this is important. I could always manually trigger a replication.
diskpool_primary/timemachine is replicating all changes once a day. It’s a backup of a backup. I don’t think I need to replicate it much more often than that.

Remote replication

There is an additional replication in place: In my cellar there is another fileserver running in a VM just for receiving a replication from flashpool. This replication is manually triggered one a day when I’m working with that system anyway. It just replicates the flashpool on an externally attached USB drive. This system doesn’t even have the keys for the pool. So, I don’t have problem with the system being in a somewhat public space in a rack (the Rack is locked, but you don’t have to be the Lockpicking Lawyer to open those locks)

Future investments

Perhaps at one point in time I will purchase a 16 TB disk for the remote replication target but I’m currently not that convinced that will need it. But it’s perfectly possible that I will wake up at some point in time in the middle of the night convinced that I need such a third disk at my remote replica

At the moment all this look like a good idea to me. But in the future it’s perfectly possible that i’m opting for a 4-bay NAS and mirror both pools. Would be something north of 1000€ again. But as money has has the nasty property of being limited and ideas to spend it have the annoying property of being unlimited, i decided to move this in the future.

Against best practice

I know, it’s against best practice to have a single disk ZFS pool for both diskpools , but I think because of the use case and the surrounding infrastructure I can accept that risk. Because I’m not actually working on this pool. It’s for archival purposes. If I see a checksum error preventing me from accessing a file, I have other means to recover it by using their replica.

Noise

My WD Red Pro have one annoying feature. When idling they are moving their heads every 5 seconds. As far I was able to find out, it’s meant to prevent problems with the bearing and lube distribution. In sleep this doesn’t happen. So, you want those drive to sleep quite often when the disks are right beside your disks.

I have a budget of 600000 off/on-ramps on a WD Red Pro. Which boils down to 13 off-on ramps per hour (600000/5/365/24) when I plan to retire the disk at the end of their 5 years warranty. Based on my replication schemes and the fact that there is no user traffic on diskpool_secondary there should be much less off/on ramps on diskpool_secondary in a few years time. I will reverse the roles of diskpool_primary and diskpool_secondary roughly when 3/4 of my off/on ramp budget has been used up on diskpool_primary. So far the theory.

In practice, I’ve currently set the sleep period to 5 minutes. I will observe the development of the counter in the next few days and perhaps increase it to 10 minutes. But i don’t think it’s necessary. On the other side I have the impression that they currently wake up much more often than i think it’s feasible considering the load. I think the operating system itself is accessing the pools for whatever reason. I must investigate this. Currently there is no difference between both disks according to smartctl. But with the current trend I would use up 1/3 of the budget at the end of the warranty. I can live with that.

After fooling around for almost two years, I concluded that I need experts help … and I did really a lot of things really the wrong way as I learned now. It’s like the reason why I had to leave the machine typing course at school. I’m trying to unlearn them now. I started to learn it to a large part as a stress relief (as this was in a extremely stressful time with a large project at job ongoing and and otherwise a situation that i was utterly ill-equiped for … in combination I was on a very unhealthy trajectory then). This new attempt of a hobby really helped to switch off my brain and is still doing so. But ‘needing’ to show progress to the teacher each week adds stress … but at least now see tiny, tiny breakthroughs every week. I hope the stress relief comes back. Or I should reevaluate if I keep this hobby. ↩

Almost RAIDless

Almost RAIDless

My config

Remote replication

Future investments

Against best practice

Noise

Joerg Moellenkamp

Double negative

Lost trust

Backup

c0t0d0s0.org

Recent posts

Solaris 11.4 SRU 78: Wildcards for iostat

Würzburg

Menu

Almost RAIDless

Almost RAIDless

My config

Remote replication

Future investments

Against best practice

Noise

Joerg Moellenkamp

You may also like...

Double negative

Lost trust

Backup

Solaris 11.4 SRU 78: Wildcards for iostat

Würzburg