#illumos

03:37

jbk

a bit random, but does anyone know offhand if the kernel ever updates /etc/path_to_inst, or is that always done from userland?
03:37

jbk

(or vice versa)
03:39

jbk

(if userland, i wanted to experiment with addign it as a boot module from the local disk and then lofi mount in the running os)
03:39

jbk

so it can be persistent even with a ramdisk based setup
06:17

gitomat

[illumos-gate] 15613 SMB2 Large MTU support -- Gordon Ross <gordon.ross⊙tc>
14:32

jbk

*sigh* one thing that would be nice is if standards included a history for a feature (e.g. 'first appears in version X') so you don't have to chase down older versions to answer that
15:44

sommerfeld

indeed. The python docs at python.org seem to get this right (plus you can pull up the older versions in a couple clicks)
16:03

jbk

(among many other things), I'm hoping to work on some improvements with how ZFS talks to disks
16:03

jbk

e.g. there's an existing 'don't cache' flag for buf(9S) that zfs sets, but doesn't actually get used by anything
16:03

jbk

(B_DONTNEED IIRC)
16:04

jbk

err B_NOCACHE
16:36

jbk

could also set a flag in the READ/WRITE CDBs (at least for SCSI disks) to tell the disk to not cache
16:36

jbk

as well as when zfs says don't retry, have sd.c not retry the failed I/O
16:37

jbk

(our timeout/retry defaults here are horrible and were probably marginally ok for disks 25-30 years ago)
16:38

rmustacc

Unfortunately standards are often that way and just is the nature of the beast.
16:38

jbk

yeah.. the scsi ones are far from the only ones that do it
16:38

rmustacc

I'll be curious to see what you come up with here.
16:40

sommerfeld

looks like B_NOCACHE does get used in bio.c (leads to the buf getting freed sooner). I'd worry about dumb firmware misinterpreting it.
16:40

sommerfeld

("it" being the SCSI equivalent of B_NOCACHE)
16:42

jbk

i'd probably add in a hook to disable it via sd.conf like some of the other behaviors..
16:42

jbk

the bigger one though is if the disk has an uncorrectable media error, it returns EIO to zfs, which AFAICT never attempts self-healing in that instance
16:42

jbk

but sd.c could tell the drive to attempt to remap that block
16:43

jbk

at which point self healing could be used to repair it
16:43

jbk

(or on a write, just retry the write after remapping the block)
16:45

jbk

(also, might be nice to enable background scanning via fmd or such.. it's supposed to at least be non-impacting)
16:48

sommerfeld

hard part would be if as a result of the scan the disk says, unsolicited, "block 8675309 is unreadable". would be hard to find the block pointer that has the checksum that covers that block..
16:48

sommerfeld

scrub at least has that context.
16:50

jbk

yeah.. for bad blocks discovered asynchronously, it might just be use that to trigger a scrub
16:50

sommerfeld

presumably you could tell if it's allocated or free. (free -> just remap it to zeros; alloc -> add to suspect list and schedule a scrub?)
16:51

jbk

maybe.. i need to look at more to see how easily that can be discerned
16:55

sommerfeld

mapping back from lba -> slice -> metaslab might be messy but should presumably be doable but might involve a bunch of io to read the metaslab metadata.
16:57

jbk

at minimum, fma could at least know from a scrub 'yeah, i'm expecting errors on these blocks' based on the bad block list after a media scan
16:57

jbk

(at some point, I'd also like to make the disk & zfs modules a little smarter... right now a bad disk can cause them to step on top of each other)
16:58

jbk

i don't know if it's possible, but like if the disk module thinks the disk is bad, but is a whole disk zfs disk, have a way to tell and defer to the zfs module
16:59

jbk

right now when it retires a disk, it can kinda pull the rug out from zfs
16:59

jbk

and the zfs module might be trying to do things too
18:11

sommerfeld

yep, I've noticed it's quick to mark a disk "removed" requiring what looks like a full rebuild after it returns (and likely messing with the increased robustness that the DTL should give you)
18:12

jbk

yeah, at least some disk manuf have told us it's normal/ok to have a certain # of defects
18:14

sommerfeld

if it's offline for minutes or hours, a DTL-pruned resilver should complete quickly but i don't see that happening. I'd still run a scrub afterwards just to be sure but...

3 years ago

« a day earlier

a day later »

today »