02:56:24 Anything interesting change in 13.1 where null deference would be SIGILL (not 13.1... maybe even <13) vs SIGBUS (13.1) 02:58:33 Could even be amd64 only... Looking at updating www/node14 and running tests on the port there's a new failing test that causes what I think a null dereference. Based off nodejs comments and that this test isn't failing on 12 or 13i386 it appears it should be SIGILL. 02:58:59 But on 13.1 amd64 it's SIGBUS. 03:02:13 it should be SIGSEGV 03:02:40 It's possible I'm reading the nodejs test code incorrectly. 03:03:27 can someone checkout 3e5300e0ed3 from the /usr/src and tell me if you have a Makefile? 03:06:27 fatal: path 'Makefile' exists on disk, but not in '3e5300e0ed3' 03:06:46 (git show 3e5300e0ed3:Makefile) 03:17:58 It would be a SIGSEGV. SIGILL is for a corrupted instruction. Code (text) space versus data space. 03:25:47 crb: that's not an actual freebsd commit 03:26:07 whatever you did, you've managed to run off along openzfs (maybe you were bisecting) 03:26:15 --first-parent is probably your friend 03:35:06 kevans, I am indeed trying to bisect 03:36:17 crb: yeah, so you'll need to restart with --first-parent to avoid accidentally hopping down vendor branches 03:36:39 kevans, thanks a bunch! 03:36:44 yup yup 03:38:06 kevans: that makes huge difference, now I'm building. I've been stuck on that for a week! 03:42:46 crb: ah, you should've said something sooner. :-) 03:47:18 kevans: well I hadn't done a git bisect before and I just figured I was doing something wrong 04:44:39 newbie question - what's the port name of gtk for python? 04:46:24 I think that's part of the gtk port not its own port. 04:46:40 right 04:46:50 so it's possible that Gtk has come unglued 04:47:42 ... it has! O.O 04:48:04 fascinating unix problems... 04:48:37 we're on. merci beaucoup skered ! 05:36:12 I'm trying to get Python scripts working in the `pkg` version of HexChat on 13.1-RELEASE-p2. When I try loading `/usr/local/lib/hexchat/plugins/python.so` I get `Undefined symbol "PyCapsule_Type"`, which suggests to me it can't find the Python library for whatever reason. 05:38:51 I was wondering if someone might know what's going on or how I can probe this further. My hunch is that it's only looking in `/usr/lib` and not `/usr/local/lib`? 05:59:58 That should be part of libpython? python.so links to that? 06:00:49 ldd /usr/local/lib/hexchat/plugins/python.so 06:53:18 libc.so.7 => /lib/libc.so.7 (0x801083000) 06:53:26 And that's it. 08:03:16 fran, Could it be that you one version of python, say 3.10, installed & hexchat was expecting another version, say 3.9? 08:03:31 s/that you one version/that you have one version/ 08:59:43 just had an sdma fail with amdgpu. it hanged the whole local terminal, and Xorg went T 08:59:55 remote computation remained available 10:27:20 bhyve(4) says "bhyve is considered experimental in FreeBSD." Is that still true? 11:10:59 Probably best to ask virtualization@ 11:31:38 sounds like universal disclaimer 11:44:08 I'm somewhat skeptical at my ability to get relevant answers from FreeBSD technical mailing lists, at this point. Last time I got one at all, it answered the question I had specifically said I wasn't asking, then blamed me for pointing that out. I'll stick to IRC. 13:05:01 I only have my doc hat so I can't speak with any authority, but nothing about bhyve feels experimental anymore. 13:05:29 I've been using it in production for quite a number of years, as have many others, so I'd be fine with removing it. 13:05:53 I think it'd be smart to get the OK from someone involved with its development currently, though. 13:11:51 Fair enough. We'll see if one answers it in #bhyve. 14:28:21 aww just updated the cpu on my router to the most modern it can support 14:28:24 and still no vmm :( 14:28:25 CPU: Intel(R) Core(TM)2 Quad CPU Q9650 @ 3.00GHz (2992.63-MHz K8-class CPU) 14:28:26 vmx_modinit: processor does not support desired primary processor-based controls 14:28:30 I really just wanted to update it anyway so it was better at plex so no loss 14:28:31 but was holding out a little hope :) 14:30:47 weirdly though according to intels data sheet it does support virtulization on that chip 14:30:51 https://ark.intel.com/content/www/us/en/ark/products/35428/intel-core2-quad-processor-q9650-12m-cache-3-00-ghz-1333-mhz-fsb.html 14:31:00 wonder what specific processor function its looking for 14:33:02 daemon: man page says it wants VT-x/EPT; while it seems that VT-x is supported, EPT is not: https://community.intel.com/t5/Processors/Is-Intel-VT-x-EPT-not-supported-by-Q9550/m-p/276699 14:33:03 Title: Solved: Is Intel VT-x/EPT not supported by Q9550? - Intel Communities 14:33:17 ah I see 14:33:36 hi daemon 14:33:40 heya wez 14:33:49 How's the BSDing coming along? 14:34:10 pretty well, I mean its BSD once you have a system working with it you never really need to do much :D 14:34:20 but its nice to upgrade from a dual to a quad 14:34:28 Can I virtualise it yet? 14:34:45 you can run freebsd in a vm or as a hypervisor (using bhyve) 14:35:15 AWS? 14:35:21 no reason why not 14:35:32 but not docker? 14:35:52 there are attempts at docker, but I think the general consensus is VM a linux instance and use that 14:35:54 its the easiest way 14:36:19 I made an absolutely microscopic gentoo build specifically for that purpose 14:36:59 I keep meaning to actually publish that config somewhere -_- 14:37:00 ooohh 14:38:28 hmm 14:38:35 might actually be a fun little side project 14:38:46 make a port that initilizes a minature gentoo bhyve vm 14:38:58 and connects a docker shim executable to it 14:39:06 ~ no reason it should not work 14:39:39 I wonder how volume mounts would operate though 14:41:13 daemon: do it and I will use FreeBSD on docker (AWS Fargate) :) 14:41:23 well actally that would be a problem 14:41:37 because on AWS you would be in a vm already 14:41:48 so you could not run bhyve and emulate a linux 14:41:54 as you would not have access to EPT/VT-x etc 14:42:11 daemon: They have bare metal instances 14:42:17 that should work I imagine 14:49:58 at least vmware products can passthrough the required cpu features so you can run hypervisors in vms 14:50:32 (i'm running xenserver for testing on esxi) 14:55:12 interesting, I heard that passing down those cpu functions can cause some unusual behaviour 14:55:15 never tried it myself 18:17:52 phryk: so these disks use MBR instead of GPT. how can i rewrite a modern freebsd bootloader to my disk 18:27:30 gpart add -t freebsd-boot -s 512K -l gptboot0 ada1 18:27:56 gpart bootcode -b /boot/pmbr -p /boot/gptboot -i 1 ada1 18:53:04 should i use GPT or MBR opn a new install on an m.2 SSD? it's a uefi motherboard. the default selection was MBR 18:54:50 GPT 18:58:17 CCFL_Man: for newly formatted disks, use GPT. if you have to keep using an older thing with MBR on top of it (don't wanna lose existing data, OSes, …) I assume you'd use -p /boot/boot for ufs and -p /boot/zfsboot for zfs when calling gpart bootcode 18:59:03 tho if you're on a halfway modern machine and do a new install, gpt + efi is probably the way to go. 19:04:49 btw anyone know if uefi secureboot is possible with freebsd yet? it's like the last big missing thing concerning security-wise for me. 19:05:08 s/concerning// 19:53:39 phryk: thanks! 19:53:56 thanks for the help 19:55:41 phryk, uefi secureboot == no 20:01:35 so when i create a lagg interface during install, it won't link up with the switch 20:02:58 in truth creating interfaces in the install is kind of pointless, as long as you can get into the system after it reboots after install is the time you want to target for setup 20:04:47 that's true 20:05:46 ~ it takes a bit of getting used to but skipping the installer and doing it by hand can be 'handy' as well 20:06:11 the other truth is that when you configure the lagg0 interface, make sure to bring the individual interfaces up or else you will be pulling out what little hair you have left 20:06:37 oh yep 20:06:54 even though its a hateful thing to do what I like doing is triggering a customer script via /etc/rc.local 20:07:06 so rc.conf is almost bare except the essentials 20:07:12 ugly but pretty effective 20:07:37 custom* 20:08:14 I have a lot of daemons that require an ordered start up that is hard to predict due to network latencies and such; sync times 20:08:40 so I pop the 'daemon' application via rc.local to start a perl script that handles all of that stuff 20:09:49 oh, nice 20:09:57 this will just be a basic nas 20:10:56 it always starts that way :) 20:12:35 it certainly does! 20:30:56 is there a general consensus that with zfs and lots of ram, does the swap size need larger than the default 20:35:14 ~ honestly if you are speccing a machine for serious zfs work you may as well shove a SATA/ssd or nvme in there and just use that as swap 20:35:33 but zfs is meant to use 'unused' ram 20:35:46 so there would be no requirement to change the swap allocation 20:55:49 I'm running FreeBSD 12.3 with up-to-date quarterly packages, and my system "appears to" hang almost every time I log out of KDE. It sits on a console screen with a frozen pointer, but I'm still able to SSH in and do a clean reboot from another system. 20:56:06 It seems to be related to PulseAudio. I noticed one message on the console: "Failed to create secure directory (/var/run/user/1001/pulse): No such file or directory" However, that directory does exist, it's owned by me, and it has 700 permissions. 20:56:34 Should I be part of any particular group (like 1 of the 3 "pulse" groups)? Any other ideas? 21:21:27 chriswells0, I am just a FreeBSD newbie who was going to ask my own question but first let me ask you how you are starting KDE? Are you lauching it with xinit, startx, xdm or other X Display Manager? 21:22:18 rwp: I'm using sddm. 21:22:21 Also are there any clues that might be shown in the ~/.xsession-errors file? 21:22:51 And I would also look in the /var/log/Xorg.0.log file looking for the "EE" pattern at the bottom for errors. 21:23:28 I don't have ~/.xsession-errors. 21:23:57 I haven't ever used sddm so I don't know where it logs output. 21:24:58 It seems like every other moment there is a new X Display Manager that is new and different. xdm, gdm, gdm3, kdm, lightdm, slim, sddm, ... well, you get the idea. :-/ 21:25:57 In Xorg.0.log and Xorg.0.log.old, the only EE line I have is: "Failed to load module "ati" (module does not exist, 0)" 21:26:41 And since it starts up we can assume that is an early non-fatal error that it keeps going through. So not of note here. 21:29:10 There don't seem to be any errors in /var/log/sddm.log either. 21:29:34 (searching for EE and error) 21:30:45 The "EE" thing is a convention I have only seen in the /var/log/Xorg.0.log file. Noted tags information at the top of that log file. 21:31:23 sddm.log appears to follow the same convention. 21:32:50 It has mostly II and 2 WW lines (auth info and "could not setup default cursor"). 21:34:26 I don't know the solution to your problem but I might suggest some things that might help. 21:34:34 I'm also able to log in and out without issues using xrdp, which uses xrdp for sound instead of PulseAudio. 21:34:59 I'm open to any suggestions. :) 21:35:05 When X is running are you able to Control-Alt-F1 to return to the main console? Or Control-Alt-F3 to get to vt3? 21:35:20 That would at least get you to a console that you can log into without needing a second system to ssh in from. 21:35:43 And then if that works for you then does that also allow it when your X is stuck? 21:37:00 Since you suspect PA you might try killing just the PA processes. It is possible (likely?) that some process at X exit time is still holding a file descriptor open which is blocking sddm from closing and exiting everything. 21:37:36 I'm able to get to another console from X, but it doesn't work when it's stuck. 21:38:04 Drat. That's a bummer. Because it is nice to be able to self-rescue all on the same keyboard. 21:38:26 If this were another OS and it were me I would run "ps -efH | less" and browse the parent-child hierarchy below the X server and see if processes are still running that should not be running. Then I would try killing those. 21:38:44 I agree, but I'd still be stuck having flashbacks from using Windows. 21:38:50 But I actually do not know the BSD equiv to ps -efH to get a hier sorted ps listing. Do you? 21:39:12 * rwp laughs at that imagery! 21:42:10 I don't know how to list that tree, and I'm also not sure which processes to kill for PA. The only one I know for sure is pulseaudio. 21:43:12 Honestly, I don't recall if I've every killed it after getting stuck. 21:43:16 "htop" can show the processes as a tree. "pstree" port also claims to do the same 21:44:43 "-d" option of "ps" could do that too 21:47:35 Just to show what I see on a SysV style ps output system. https://bsd.to/Jd0K/raw 21:47:36 Title: Jd0K 21:49:10 parv: Thanks. I installed htop. 21:50:03 +1 for htop which I find awesome and much preferred to the traditional top 21:50:39 The best feature is the bar graphs for memory usage. Very useful to check overall health of the system. 21:51:02 OK, I'm going to log out to reproduce it. I'll be back in ~5 minutes since I expect to need to reboot. 21:51:27 Take your time going but hurry back. :-) 21:51:44 Do not be surprised if "htop" shows more memory than could physically be installed in a system: https://github.com/htop-dev/htop/issues/1039 -- eventually port would be updated with the fix 21:51:47 Title: Mem: 16.0Z on FreeBSD · Issue #1039 · htop-dev/htop · GitHub 21:51:47 1039 – two different spellings of "timezone" https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=1039 21:52:33 Good to know. Thanks. 21:53:10 My own newbie question is how do I use Boot Environments to back out from a failed upgrade from 12.3 to 13.1 where I screwed up the upgrade? 21:53:46 * parv would be happy to find a laptop with where he could easily add/change merely 64 GB of RAM let alone 16 ZB 21:55:27 How did you go through the installation/update? 21:55:35 Here is my current state now. https://bsd.to/aATb/raw 21:55:36 Title: aATb 21:56:39 After much searching I don't find any blogs that show anyone actually discarding a boot environment. 21:57:05 They all talk about having forethought to create a new one for the new upgrade. But alas I am now getting the opportunity to learn all about them now. :-) 21:58:00 I think from reading beadm(8) & bectl(8) man pages that I need to create a "newdefault" (or whatever) and then rename it into place. Which requires destroying the current default. 21:58:01 With "bectl(8)": bectl activate . Then "shutdown -r now". 21:58:06 Does that sound about right? 21:58:39 parv, Right. That's how I was able to boot. I can set that to 12.3-RELEASE_2022-04-18_154217 which is the last snapshot which was fully functional. 21:58:57 And then I booted to that environment. And of course everything works there. But now I need to clean up. 21:59:22 I could bectl mount zroot/ROOT/default somewhere and make repairs. I might do that too. 21:59:37 But if I could just discard it and start again that would be a good option for me. 22:00:07 Yeah, forget repair; install anew 22:00:55 "install anew" as in flush and re-install a pristine system? Is that what you are suggesting? 22:02:30 Of course that is always an option but I don't think I have reached that point. Far from it! I expect to fully recover and then upgrade again. 22:02:55 And possibly have a different failure. Recover. And repeat. It's a pretty good system. 22:02:58 This is my first time using boot environments and let me say I think they rock! 22:03:01 Yes|No. I meant that create a new dataset of the current 12.3 root; then (try again to) update that to 13.1. OTOH, blasting everything out, and just installing 13.1 would not cause a broken update 22:04:00 That "anew" did not come out right 22:05:44 It was a little ambiguous. But honestly the documentation is a little fuzzy on these things. 22:06:29 I have given up on snapshot clones (as creates by "bectl") to create a base for updates. I use "zfs send | zfs recv" for that. 22:07:34 I must say that the possibility of creating a new zroot on one system, and then sending that zroot to another system that way, and then booting it, is very sexy. 22:07:43 That way I can destroy the old snapshots whenever I want & do not have to keep them due to being a "clone" 22:08:20 Right. Without needing to "promote" the clone. 22:08:28 Yes 22:09:41 And I guess in 2018 or so pool checkpoints were added. "I didn't know!" Or I would have checkpointed before upgrading. That would have been good forethought too. 22:11:07 To me, having "full" or "thick" root datasets provide better flexibility & ease of management at the obviouls cost of disk space 22:11:22 s/obviouls/obvious/ 22:11:26 Though a pool checkpoint looks to be a very large hammer. It seems to be all or nothing. 22:11:31 So good that I now have the opportunity to learn something about snapshots, clones, promote, and boot environments. :-) 22:12:55 I can see the advantage of your zfs send zfs recv to create a completely deep copy with no entanglements. And likely disk space is not really a concern at that point. 22:14:21 But I didn't have the forethought to avoid my current problem. So I am going to go back and hack on it for a bit and see what I can learn from it. 22:14:28 With an intermediate step of sending the data to a file & creating a dataset from that file provides me with a backup of the dataset on another disk 22:21:57 rwp, Before the update to 13.1, you should also make snapshots of datasets responsible for /usr/local/ & /var/db 22:29:32 This was a default install of 12.0 (I think .0) and so has the default zroot pool layout with /usr/local being part of zroot/ROOT/default I think. 22:30:20 You can confirm that with "df /usr/local" 22:30:20 There is a /usr dataset (that is not mounted) so that /usr/home and such inherit from it. But non-listed /usr/local is on / (I think). 22:30:46 Yes. It's on zroot/ROOT/default and so part of the core dataset. 22:31:32 Before I actually snapshot'd, cloned, promoted, I thought I would explore "bectl mount default /mnt" first. Really cool! 22:31:50 Is that "default" from broken 13.1 update or is the 12.3 dataset the new "default"? 22:32:06 It's default from the broken 13.1 upgrade. 22:33:20 That allowed me to go in there and fix my current snag. I had changed root's shell to bash. But then broke bash by screwing up the upgrade recipe. 22:33:34 I know, I know, I should have avoided it but... With that broken then I had no way to log into the 13.1 system after boot. 22:34:00 And I rather wanted to try it again with the proper upgrade procedure anyway. But let's try to repair this one. 22:34:25 I was then able to edit the passwd database in /mnt the 13.1 environment, now mounted under /mnt. 22:34:31 ( Ok. So "/usr/local" snapshot is covered by the snapshot of the root dataset due to the default ZFS set up during FreeBSD installation) 22:35:06 Yes on /usr/local being covered by the root dataset snapshot. That's the reason for the non-mounted /usr dataset. 22:35:36 I miss not having /etc/passwd and /etc/group and such directly. I don't understand why FreeBSD went to the binary database. A lot of trouble for very little gain. And loss of easy of admin IMNHO. 22:35:41 Install|Comiple static version of the desired shell 22:36:35 But anyway "vipw -d /mnt" allowed me to set the shell to /bin/sh (or csh if one prefers) and automatically runs pwd_mkdb on it. So that worked. 22:36:40 s/Comiple/Compile/ # What the hell was that! 22:37:10 Then "bectl unmount default" unmounted it. It's already marked as the default boot. Rebooted. I can now log into the 13.1 system and all is as it should be. 22:37:42 With a failed binary ports upgrade. But now that I am back logged in again I can now fix that easily enough using the pkg-static executable. 22:38:38 Wait, so the only thing broken in 13.1 update was "bash" package (because it was set as root's shell)? 22:39:01 Yes. Mostly. There was another thing too though. 22:39:27 The other thing that was odd during the freebsd-upgrade install phase was that there were a long list of complaints about /usr/src but I didn't run it under script and lost long list of whatever it was complaining about. 22:39:28 ... along with all the other packages of 12.3 which will need to be reinstalled on 13.1 22:39:37 rwp: FreeBSD got that from BSD at CSRG 22:40:20 I have a git clone of the source from the as documented way to do that in /usr/src and that seemed to confuse things. I wouldn't mind to walk that again and see exactly what was happening there during the freebsd-upgrade install phase. 22:41:05 debdrup, I just don't see the advantage. The /etc/nsswitch.conf allows other databases to be used if one wants an open ended database for accounts. 22:41:45 And for most systems the small /etc/passwd file fits in memory trivially. So it is very fast. It's a convenience tradeoff thing. 22:42:19 But I am making my peace with pwd_mkdb and moving forward. 22:44:28 This is just a guess since I'm too lazy/tired to source spelunk, but I'd be surprised if it doesn't have something to do with it being a lot easier to cache, since it's the sort of thing that gets consulted rather often on a busy system. 22:46:00 I note that it fixes a problem that I experience with, ahem, other systems, that run "nscd" network services caching daemon. On those the nscd hashes entries and the order is hash-shuffled breaking ordering. 22:49:28 Regarding my upgrade failure and recovery... I ran "pkg-static bootstrap -f" to force a reinstall of pkg. Then "pkg update" then "pkg upgrade -f" is now running to force replace all of the ports. That should solve my broken shared library problem I created that broke my ports. 22:51:17 Thanks for the upgrade recovery help! I am really pleased with the FreeBSD robustness and ability to recover. It rocks! :-) 22:51:23 While that is running I think it might be snack time. 22:59:41 rwp: It seems like plasmashell that's hanging. I did have some other processes belonging to that user (keepassxc, ssh-agent), but I was able to kill them. 23:00:28 The keepassxc process (which I suspected as the issue before) wouldn't die once, but I was able to reproduce the issue another time without keepassxc running. 23:04:38 Once in a while some process would not end after sending TERM signal. But after sending the series of STOP, TERM, CONT, the process ends. So how do the series of signal make the process end in that case when a lone TERM would not? 23:08:50 Is it that because STOP signal "cannot be caught or ignored", the process stops; TERM signal is waiting, so on CONT signal the process sees that & ends? 23:11:41 There was also a kwin_x11 process. 23:12:00 I'll reproduce and try STOP, TERM, CONT. 23:12:21 I also found "kquitapp5 plasmashell" online, so I'll try that too. 23:14:43 Was "kwin_x11" process a child of "plasmashell"? 23:44:38 parv, It seems to me that if TERM is handled incorrectly but STOP, KILL, CONT, works, then that is an indication of a programming error. 23:45:08 I can envision ways to produce the bug. But it can't be the way they desired it. That or they were buggy in their thinking of signal handling. 23:45:55 Sorry, 's/STOP, KILL, CONT/STOP, TERM, CONT/' is what I mean. 23:46:52 Upon return from suspend a program can catch SIGCONT to do things like resize and redraw and such. And that might accidentally allow SIGTERM. But SIGTERM should work by itself. 23:53:43 parv: No luck. 23:54:14 I also tried killing kwin_x11 first. It did die, but then plasmashell still wouldn't die.