Monday, December 17, 2018

ZFS on enterprise RAID pass-through, and ZFS on FreeBSD root



We've been running ZFS on top of a single hardware RAID on dell poweredge for years. I know most people are against this, but the snapshot/clone, compression and flexible partition of ZFS served us very well. Whenever a drive dies, a dell technician is dispatched (the server is in another State), he will confirm that drive has an amber light, and replace it.




Now we want to take advantage of L2ARC/ZIL caching of ZFS, we are seriously thinking about running ZFS on bare disks. Current dell RAID controllers (PERC H730/H330) do support pass-through. My remaining questions are:




  1. if a drive fails from ZFS, does it display amber light on the front panel? This is important because, the dispatched dell technician may need to confirm the drive is indeed faulty. Otherwise we may have problem with dell.


  2. do any people run ZFS on FreeBSD root? It is in production quality? Any known issues?



Answer



You can control the PERC H730 and H330 using the LSI MegaCLI utility as both of these cards are Dell PERC badged LSI cards.



There is an excellent article and tutorial on how to do this at https://calomel.org/megacli_lsi_commands.html




I know that zfsonlinux has a ZFS Event Daemon (ZED) which you can use to cause particular things to happen on certain events (e.g. use MegaCLI to turn on the amber light for a particular slot when a drive dies).



IIRC, FreeBSD has a ZFSd which can do similar things but I am not an expert on FreeBSD so can not point you to more information other than to say that the FreeBSD forums are full of useful advice and helpful people.



I suspect that the hardest part of doing this will be figuring out what the MegaCLI "slot" number is for a given drive, because ZFS only knows about the device node / name, and doesn't have specific LSI or PERC information. if the device node name is directly related to the card and slot number, it may be a trivial transformation....otherwise, it may be quite difficult.



Even if you have to manually use MegaCLI to turn on the amber light from the shell when a drive dies to satisfy the Dell tech's procedural expectations, you're still better off giving ZFS raw drives rather than overlaying ZFS on top of hardware raid - you're losing most of the important features of ZFS by doing that, and they're the most important features (e.g. error detection and correction for your data).


No comments:

Post a Comment

linux - How to SSH to ec2 instance in VPC private subnet via NAT server

I have created a VPC in aws with a public subnet and a private subnet. The private subnet does not have direct access to external network. S...