13:46:22 hrm.. is there a way to set the signal sent to a parent when a whild process dies? (e.g. equiv of freebsd procctl(P_PID, -, PROC_DEATHSIG_CTL, ..) or linux prctl(PR_SET_PDEATHSIG, xxx) ? 13:47:13 maybe something with contracts? 13:48:29 err i have that backwards.. want child to get signal when parent dies 14:14:27 as a variant: register via atexit() a function to send a signal to the children 15:04:17 jbk: contract should work though I believe you'd get the notification via a fd rather than a signal. A child could also potentially poll with getppid() 17:03:08 hi all, i have no perms for review. i'm interested in why my pers was removed 17:03:25 *permissions 17:06:06 igork: you asked that question about two days ago - did you see jbk's response to you? (have you tried signing in again?) 17:07:03 sommerfeld: yes, i saw, and i tried his proposed solution - it is not working 17:08:24 andyf: have you perhaps seen this kind of ena panic before? https://paste.ec/raw/Zy0f-Jek#znVpMTR+Fgus9ot5tHWFXodlhViZqvj-HY/rKeQnGPZ 17:08:25 i saw some removed of my +1 to richlowe changes with sparc updates by jclulow and after that i have no perms for review on tsoome changes too 17:10:25 jbk: thanks for your proposal but it is not working me for review, i have no permissions and i have no ideas - why ? 17:43:24 daleg - yes illumos 16373 (fenix). 17:43:26 BUG 16373: ena network interface occasionally hangs (In Progress) 17:43:26 ↳ https://www.illumos.org/issues/16373 17:43:52 That is the same - just you ran something like `dladm` or something else that tried to send an admin command after it hung. 17:44:11 I have implemented reset for the driver, which lets it get over that bump but don't know the root cause yet. 17:48:05 andyf: thanks. alright. I have some crash dumps, but have not hit upon a way to reliably reproduce this. It seems to happen in spates - a few panics and reboots due to this, followed by a long period of it not occurring. 17:48:36 Here's the change that adds reset, and fixes some other bugs - https://code.illumos.org/c/illumos-gate/+/3367?usp=search 17:48:37 → CODE REVIEW 3367: 16391 ena driver could support reset 16392 ena driver async event queue stalls (NEW) | https://www.illumos.org/issues/16391 17:49:10 I could spin a hotfix if you're on omnios. I'm working on finding the root cause but no breakthrough yet. 17:50:07 thanks. I'll try those in a local branch and try some more to get this to happen. It's on bsros, so I'll just do my own local application of the patch and try it. 17:52:03 I have never managed to reproduce it myself in AWS, but some CI runners are hitting it fairly regularly. That patch is getting them to run to completion at least while I work on it. 17:54:25 daleg - what's the instance type? In case it's relevant - I'm mostly testing on m7a.medium 17:54:40 yeah I've been trying to get our nitro story in shape and hit upon this using an m5.large. All seems quite fine otherwise. 18:13:25 Happy Patch Tuesday everyone! https://illumos.topicbox.com/groups/developer/T209911d7435d13a1-Mb16ac836483418b3ce160d2c 18:13:52 If anyone here has a modern (Skylake-era or later) Atom processor running any illumos, please contact me. 18:56:44 i'm thinking this is the software trying to use something it shouldn't, but warning on R_AMD64_COPY: file : relocation bound to a symbol with STV_PROTECTED visibility 18:57:01 if i'm wrong, is there something to fix? 18:57:50 (specifically, this is trying to link against pkgsrc libcrypto) 19:02:04 datapoint: I only see one such error in my bulk builds, and it's with ncmpcpp trying to link against libicuuc.so 19:03:34 annoyingly, i think it's the openssl macros itself that are responsible (still digging) 19:03:40 ex: ASN1_TIME_it 19:42:55 [illumos-gate] 16311 ps: Inconsistant formatting of options in usage output -- rigzba21 19:44:21 is that macro part of the published openssl API/ABI? (they've been tightening up on that lately..) 19:46:34 that i'm not sure 19:47:12 though doing some more digging, i guess it's maybe non-fatal (makefiles don't make that clear) 20:12:05 but at least now i have a hopefully working tpm2 binary for testing 22:39:04 so what on earth is nfsfind? 22:39:21 and why is it removed nfs.*? 22:39:25 removing* 22:40:11 https://docs.oracle.com/en/operating-systems/solaris/oracle-solaris/11.4/manage-nfs/removing-hidden-nfs-files.html#GUID-C398F1DA-E0EE-491B-96AC-39714ABCD60D 22:41:55 hmmm, is that necessary? Is there a way to kill it. This particular file system is pretty dense 22:41:58 lots of files 22:44:46 those .nfs files are leftovers, so it is good to have cleanup. it is run via root cron, if you do not have nfs service, you can disable it. 23:30:56 KungFuJesus: unix semantics require that you be able to continue doing I/O to an unlinked file that you have open. NFS doesn't allow for that so NFS clients don't unlink open files -- they rename them, instead. And unlink them later, but later may not happen if the client crashes.