-
Guest61
Hi, I ran into the problem here I believe:
illumos.org/issues/16065 - How can I apply the attached patch? My system still keeps crashing, also with the line in /etc/system. Thanks!
-
fenix
→
BUG 16065: Panic in zfs_retzcbuf with smb2_read_zcopy enabled (In Progress) |
code.illumos.org/c/illumos-gate/+/3164
-
jbk
are you sure it's crashing with the same issue?
-
Guest61
got this: panicstack = unix:die+c0 () | unix:trap+999 () | unix:cmntrap+e9 () | zfs:dmu_xuio_cnt+10 () | zfs:zfs_retzcbuf+16 () | genunix:fop_retzcbuf+71 () | smbsrv:smb_vop_retzcbuf+20 () | smbsrv:smb_fsop_retzcbuf+20 () | smbsrv:smb_xuio_free+70 () | smbsrv:smb2_read+55c () | smbsrv:smb2sr_work+38f () | smbsrv:smb2_tq_work+7a () |
-
Guest61
genunix:taskq_d_thread+1fc () | unix:thread_start+b () |
-
Guest61
and: panicstr = BAD TRAP: type=e (#pf Page fault) rp=fffffe00f72c06a0 addr=4 occurred in module "zfs" due to a NULL pointer dereference
-
jbk
what distro are you running?
-
Guest61
It was OI Hipster updated to newest version from time to time: "The illumos Project illumos-2a6a1d830d December 2023"
-
jbk
after updating /etc/system, have you tried running `bootadm update-archive` and rebooting? AFAIK making the change via /etc/system (persistent) or mdb (non-persistent) should completely disable that code path
-
jbk
running `mdb -ke 'smb2_read_zcopy/D'` should show you the current value
-
tobias61
not yet, did not know i had to do so. The mdb from thread gives me: mdb: failed to open smb2_read_zcopy/W0: No such file or directory
-
tobias61
mdb -ke 'smb2_read_zcopy/D'
-
tobias61
smb2_read_zcopy:
-
tobias61
smb2_read_zcopy:1
-
jbk
yeah, it's not applying the fix
-
danmcd
`mdb -kwe 'smb2_read_zcopy/W0'`
-
danmcd
(Unless '-e' doesn't work with '-w' ?)
-
jbk
it should
-
jbk
the mystery is that /etc/system should be doing that on boot, but appears to not be
-
tobias61
danmcd that does something
-
jbk
(why i was wondering if maybe the boot archive needs to be updated)
-
tobias61
mdb -kwe 'smb2_read_zcopy/W0'
-
tobias61
smb2_read_zcopy:0x1 = 0x0
-
jbk
yeah, that'll disable it, but is non persistent
-
jbk
whenever you're able to reboot, I would try the bootadm command (it can be run anytime -- it just updates the boot archive used during boot) and then rerun the mdb -ke 'smb2_read_zcopy/D' command to see what the value is
-
tobias61
Ok, I tried the bootadm command and rebooted, but it is 1 again after that
-
jbk
oh.. that's strange...
-
jbk
i don't have an OI hipster system handy to test unfortunately..
-
jbk
i don't know why setting it in /etc/system isn't working
-
jbk
(it should basically do the same thing as `mdb -kwe 'smb2_read_zcopy/W0'` for you, but obviously is not)
-
jbk
but that explains why you're still getting the panic
-
jbk
a fix should be coming pretty soon
-
tsoome
set smbsrv: smb2_read_zcopy = 0 in /etc/system ?
-
tobias61
tried to remove the spaces in the /etc/system line for that, ran bootadm and tried again, but it is 1 again after reboot, so this temporary fix will hopefully help me until then. I would also test the patch, if I knew how to apply it :)
-
tsoome
err, without space after colon
-
jbk
tsoome: is the fix in your queue yet?
-
tobias61
ok I'll try that tsoome
-
tsoome
if this does not work, you can try: boot -B disable-smbsrv=true
-
tobias61
but will this disable the whole smb server?
-
tsoome
it is waiting for 16031 and 15985 to land first
-
jbk
i've been a bit out of pocket the past several days (covid :(), so haven't caught up yet
-
tsoome
tobias61 yes, but that allows you to have usable system, so I can mail you the patched binary;)
-
tobias61
set smbsrv:smb2_read_zcopy=0 works
-
jbk
ahh ok..
-
tsoome
yes, because module variables need the modulename: prefix (see system(4))
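
For reference, a minimal sketch of the /etc/system entry that was reported as working above. The `set` line is taken from the conversation; the comment lines are illustrative additions, not text from the ticket.

```
* Workaround for illumos issue 16065: disable SMB2 zero-copy reads.
* smb2_read_zcopy lives in the smbsrv module, so it needs the "smbsrv:" prefix.
set smbsrv:smb2_read_zcopy=0
```

The setting is read at boot, so it only takes effect after a reboot; `mdb -ke 'smb2_read_zcopy/D'` should then report 0.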
-
jbk
i've got the ticket up, i can add that
-
tobias61
perhaps a hint in the issue could have helped me out with that :)
-
tsoome
I'm currently waiting for 16031 and 15985 to get approved, but obviously the zfs hole did eat much of the attention...
-
tobias61
big thanks for your help! will come back in the next few days and check if the fix works for me, once it is rolled out
-
jbk
yeah, i saw the emails from the tracker fly by in between (several) naps over the past several days
-
tsoome
sigh. inaccessible imported data pool is making system unusable:(