-
skered
I see people doing it by hand but is there a reason something like iocage or ezjail don't have an arch. option?
-
skered
ex: I want a 32-bit jail on a 64-bit host.
-
meena
skered: both of those are longer maintained
-
skered
ezjail I knew but it works. iocage I thought as the replacement... What's the new jail manager?
-
meena
-
VimDiesel
Title: Containers - FreeBSD Wiki
-
meena
skered: bastille, cbsd, and pot are probably the best candidates on that list
-
meena
but, if you got like 5 jails, instead of 50-500, you might also just consider jail.conf
-
meena
-
VimDiesel
Title: Howto: Setting up Jails
-
skered
Like by hand? yuck.
-
meena
eh, I've done worse
-
meena
like written my infrastructure automaton in Perl and then rewritten it in gnu make
-
meena
but right now, I've literally got four jails, and until i get buildah and podman working, I'll be doing it like this
-
skered
We don't use those words in #freebsd. 🤫
-
Plasmoduck
hey, everythime I installed freebsd 1 on my thinkpad x1 carbon I would get the intel speed shift bug and the system would hang. So I added something to my loader.conf to set it off
-
Plasmoduck
but now the system gets really hot and freezes
-
Plasmoduck
so I disabled it again and now my computer wont even boot
-
Plasmoduck
isnt there a fix for this by now
-
Plasmoduck
13.1-release
-
Plasmoduck
now I have to format and start over again
-
Plasmoduck
sucks to be stuck on 12.*
-
johnjaye
any tips on speeding up the boot time? i need to cut it from 2 minutes in this vm to 1 minute if possible.
-
johnjaye
i put autoboot_delay=1 in /boot/loader.conf and rc_startmsgs=NO into /etc/rc.conf
-
johnjaye
i googled it but i'm not sure which suggestions make sense or don't make sense. one of them said to start commenting out random 'sleep 1' lines from startup scripts
-
mason
johnjaye: A minute seems a bit long. What's taking the time?
-
johnjaye
well it's a vm and it's slow
-
johnjaye
if i switch the arch and enable kvm it should improve. but i don't care about speed just about starting it
-
johnjaye
qemu has this annoyance where even if you suspend the guest inside its monitor it still eats up cpu time on the order of a web browser. i could just suspend the process i suppose
-
mason
What's the host?
-
johnjaye
the raspi4 i'm typing this on. i wanted to start a local freebsd instance on it for a project
-
mason
Oh. Opportunities for things to go slow abound there.
-
johnjaye
indeed. but i'm not familiar enough with how freebsd starts up to go any further
-
johnjaye
the thing is there is not a 'standard' way to start up on arm, so i'd have to google around for the right params for that to work...
-
johnjaye
i can try the freebsd wiki's suggestion and hope for the best though
-
Plasmoduck
this whole intel speed shift bug is rediculous
-
Plasmoduck
I've managed to boot old kernel and disable it again
-
johnjaye
starting devd sure takes awhile...
-
johnjaye
but not as long as the network!
-
doug713705[m]
Plasmoduck: same here with a Lenovo p14s gen1. I read somewhere (can't remember where) that it is fixed in 14.0. I'm waiting, just hoping the fix to be backported to 13.2
-
johnjaye
well it's down to 24 seconds using qemu with -cpu max. very confusing.
-
johnjaye
if i tell it -cpu cortex-a72 then qemu refuses to start at all. but with -cpu host or -cpu max it boots freebsd on aarch64 in 30 seconds. then if i ask freebsd its cpu it says cortex-a72 r0p3
-
johnjaye
well that doesn't make sense but at least it's fast now
-
johnjaye
to be clear the raspi4 has cortex a72's in it... so idk the conflict here
-
Plasmoduck
doug713705[m]: yeah its crazy
-
Plasmoduck
on top of that, something is causing my system to freeze/lock up
-
Plasmoduck
some software I updated before upgrading to 13
-
Plasmoduck
it's driving me crazy
-
zapata
doug713705[m]:
bugs.freebsd.org/bugzilla/show_bug.cgi?id=253288 ? It's already in stable/13 and releng/13.2.
-
VimDiesel
Title: 253288 – hwpstate_intel: modern ThinkPads wedge under any kind of load or during boot
-
doug713705[m]
zapata: Thank you for the good news :)
-
parv
(Took some time to read the PR) zapata, same here
-
pertho
anyone here use node_exporter on FreeBSD for prometheus stats? I am seeing this constantly: node_exporter[13044]: ts=2023-02-25T15:03:20.906Z caller=collector.go:169 level=error msg="collector failed" name=meminfo duration_seconds=8.8323e-05 err="couldn't get memory: cannot allocate memory"
-
Remilia
pertho: is that in syslog?
-
pertho
Remilia: yes
-
Remilia
I do not have anything like that
-
Remilia
using node_exporter, version 1.3.1 (branch: release-1.3, revision: 6)
-
pertho
node_exporter, version 1.5.0 (branch: release-1.5, revision: 1)
-
Remilia
hhmm
-
pertho
I guess I could just add --no-collector.meminfo to the cmdline
-
pertho
prometheus doesn't seem to draw the memory info properly for a freebsd client anyway
-
pertho
adding node_exporter_args="--no-collector.meminfo" to /etc/rc.conf fixes it but disables any memory info.. not that prometheus/grafana seem to show it which is odd
-
pertho
not the first time it broke I guess:
prometheus/node_exporter #1344
-
VimDiesel
Title: meminfo collector broken on FreeBSD · Issue #1344 · prometheus/node_exporter · GitHub
-
VimDiesel
1344 – pstat -i wasn't documented on the man page
bugs.freebsd.org/bugzilla/show_bug.cgi?id=1344
-
pertho
-
VimDiesel
Title: FreeBSD: couldn't get memory: cannot allocate memory · Issue #2593 · prometheus/node_exporter · GitHub
-
VimDiesel
2593 – [small] security hole and nfs compatibility bug in rexecd.c
bugs.freebsd.org/bugzilla/show_bug.cgi?id=2593
-
Remilia
pertho: hmm I updated mine just now to 1.5
-
Remilia
not seeing any errors currently
-
Remilia
but it has only been 5 minutes
-
pertho
1.5.0 is definitely broken.. the fix was committed 2 weeks ago.. but 1.5.0 came out end of November 2022 :/
-
pertho
wonder if that could be backpatched into the FreeBSD port.. hmmm
-
pertho
if portsnap is going away.. what's replacing it?
-
pertho
ah ha.. this may have been fixed in 1.5.0p3
-
pertho
I'm on 1.5.0p1
-
pertho
Remilia: node_exporter-1.5.0_3 fixes it! :)
-
pertho
am I correct in assuming this commit is for -CURRENT (14.0?) and not 13.1?
svnweb.freebsd.org/base?view=revision&revision=362631
-
VimDiesel
Title: [base] Revision 362631
-
meena
pertho: that's two years old. chances are it's in 13.0
-
V_PauAmma_V
Per
freebsd.org/security/unsupported , -current was still 13-current, ie before ... beat me to it.
-
VimDiesel
Title: Unsupported FreeBSD Releases | The FreeBSD Project
-
pertho
still seeing loads of these messages: kernel: linux: jid 0 pid 1456 (VizCompositorTh): possibly incorrect MADV_DONTNEED
-
dch
-
dch
you can look through there for it
-
dch
maybe git blame etc
-
pertho
hmm.. I'm on 13.1-RELEASE-p7
-
meena
-
VimDiesel
Title: FreeBSD / src / 05e842d8294bb4c98664ef8ef689b0caf24b5112 - FreshBSD
-
meena
-
VimDiesel
Title: FreeBSD / src / 362631 - FreshBSD
-
pertho
hmm.. that revision (362631) may not fix the messages
-
pertho
ah well.. wait for 13.2 at the end of next month then :D
-
meena
pertho: test beta / stable now, test current, demand fixes from current
-
Remilia
pertho: oh, makes sense, it was 'node_exporter: 1.3.1_6 -> 1.5.0_3' for me
-
dch
welp
-
dch
`BTX halted` on boot
-
dch
I'm in the middle of migrating cluster node 3/3 to new h/w
-
dch
and thats cluster node 2/3 crapping itself
-
dch
which leaves ... 1 of 3 nodes which is not quorum :-(
-
dch
I have backups, but damn that timing is awkward
-
drobban
:-|
-
drobban
dch: do you have any clue to why?
-
dch
not really. `BTX halted` is ~ probably ~ the first boot sector or two on the SSD mirror
-
dch
but these are UEFI boxes so something happened that has borked both NVMes to the point it cant boot
-
dch
so my guess is a h/w fault, maybe backplane or something
-
drobban
you said new hardware... is it "same" kind of hardware or is the any diff in controllers sata/raid and so on
-
drobban
there*
-
dch
oh so this node has died while I'm migrating / upgrade *another* node
-
dch
completely different h/w / vendors / locations
-
» dch spins up a replacement node
-
dch
gonna turn this bad one off for 10m maybe it just got too warm for its own good
-
drobban
:/ could always hope
-
dvl
I am repeatedly being stumped by a syncoid hosta:system/data localsys/data "cannot create snapshots : permission denied" errors. I keep checking permission, wondering, why, the others work fine. After about 10 minutes of trial and error I remember: the source filesystem is jailed. I must do a 'zfs jailed=off` first....
-
dch
dvl: that is so annoying, every time I run into it
-
dvl
Don't be like me. Think jailed first, then permissions.
-
vext01
what do you folks use to get notifications about zfs failures?
-
vext01
no zed on fbsd it seems
-
vext01
nfsd man page doesnt say anything about notifications
-
vext01
some vague mutterings on the internet about devd something something
-
vext01
silly that `zpool status -x` isn't silent when all is well, or you coudl stick it right into a cron job
-
parv
Well, there is "-n" option, at least in 13.1, to send mail when the command exits with 0 code
-
parv
1 2 * * * -n zfs status
-
parv
Let me try that again. "crontab(5)" has "-n" option, at least in 13.1, to not send mail when the command exits with 0 code
-
otis
there also is zfsd
-
vext01
above when i said nfsd, i meant zfsd, sorry
-
vext01
can it do email notifications? man page is a little sparse
-
vext01
also the man page doesnt document the return status of any of the userspace zfs tools
-
parv
That lack should be filed as (document) bugs
-
vext01
parv: agreed
-
vext01
where to? openzfs i guess
-
meena
-
VimDiesel
Title: zfs/man at master · openzfs/zfs · GitHub
-
meena
I liked thru a few, didn't see any exit codes
-
debdrup
vext01: zfsd will generate system messages which get handled by syslogd just like any other daemon with system logging capability, which has been a thing on Unix-likes for decades.
-
rtprio
vext01: i set a script to run `zfs status` and send me a push notificatin if it differs from the last `zfs status`
-
rtprio
it's pretty simple, i didn't want to wire it into syslog
-
debdrup
You _do NOT_ under any circumstances want to rely on email for notifications about storage (or anything else) failing, because email isn't a reliable delivery protocol.
-
epony
what is more reliable for you then? ;-) irc?
-
epony
(discover the power of syslog)
-
debdrup
There's plenty of tools that're made to solve this problem, including but not limited to zabbix, prometheus, librenms and many others.
-
debdrup
epony: if email isn't reliable, irc sure isn't.
-
rtprio
apis to push services, like pushover or pushbullet aren't bad at a small scale
-
pertho
or.. *shiver* ... Slack
-
debdrup
If you read the RFCs for SMTP, the closest you'll get to "reliability" is that the server shouldn't "lose mail for frivalous reasons" (or something to that effect).
-
debdrup
That's an actual RFC using language _that_ nebulous, so you know it's bad. :P
-
epony
some people use XMPP like sendmail too
-
epony
but store and forward is more reliable than sent and "lost"
-
debdrup
There's no real point in listing all the bad ideas people have had, because they'll keep having them at a rate that far exceeds any reasonable way of enumerating them :P
-
parv
:-)
-
vext01
debdrup: email is more reliable than relying on me to learn any of those tools :P
-
vext01
rtprio: thanks, i like that
-
vext01
might just write a script that sends a diff
-
debdrup
Yes, what do I know, I've only been in storage, network, and systems administration for a few decades.
-
vext01
:)
-
vext01
such noob, wow
-
ngortheone
I think there is a confusion between monitoring and alerting, and their componenets. Nobody argues that dedicated monitoring application like zabbix is a good tool for detecting problems in the system. But how does zabbix do alerting?
-
ngortheone
It can show you the error on the dashboard, but this is not alering. It can send an email, sms, push notification, call an http endpoint..
-
ngortheone
As with any network communication there is a chance of failed delivery
-
ngortheone
But even if the alert has failed to reach the destination, the even will not be lost, it will be stored in zabbix database somewhere and you will see on the dahboard
-
parv
Install a bell & apppoint person to send a courier to hit appropriate person with clue fish
-
ngortheone
Prometheus is not a complete monitoring or alerting solution, so but it can be a part of one
-
ngortheone
parv: yes :) but a courier can get hit by a bus
-
parv
.oO(Man! All these damn exceptions)
-
ngortheone
I'd say that no single channel is reliable. By combining different channels together and using them as fallbacks for each other is the closes you get for relaibility
-
ngortheone
including a bell and a buttler with an evelope on a tray
-
ngortheone
I think what debdrup was trying to say is that "don't just fire off an email in a script that detected a problem"
-
ngortheone
because if this email gets lost you will loose the trace of a problem
-
ngortheone
first - save the event in some sort of database
-
ngortheone
and then let a separate agent worry about delivering an event
-
ngortheone
zabbix does it for you, many other solutions do too
-
meena
I have people on IRC message me when
alpha.pkgbase.live falls over, or its certificate runs out…
-
VimDiesel
Title: Unofficial FreeBSD pkgbase repository
-
meena
that's monitoring and notification outsourced…
-
ngortheone
after certain point this stops serving you well. One of my favorite ways to get AWS fix their shit is to post about broken thing on twitter
-
ngortheone
works better then paid support
-
meena
you might notice that I've burned out on systems administration, but if you haven't, a nice monitoring system (not nagios) will bring you … uh.… something.
-
meena
ngortheone: my twitter accounts are emptied and set private
-
meena
ngortheone: it's just one service, and getting feedback gives me a feeling for how many people use it / rely on it.
-
meena
which i think is ca 5.
-
ngortheone
not the point, I was trying to say that service interrup notification relayed over public channels damage your reputation
-
ngortheone
you may not care about it atm, but when you are a size of Amazon you can't afford it
-
ngortheone
twitter in general is a bad,bad place
-
meena
ngortheone: it's a service used by 5 people, and it has Alpha in the name and title and description
-
ngortheone
sometimes I think that POSIX was written by predators for aliens
-
ngortheone
check this: (man 3 daemon)
-
ngortheone
Unless the argument noclose is non-zero, daemon() will redirect standard input, standard output, and standard error to /dev/null
-
ngortheone
double-negation, unless.. non-zero
-
ngortheone
who writes docs like that?
-
xtile
daemon isn't even in POSIX, that's a FreeBSD man page
-
ngortheone
they could've called the argument "keep_open" and say - "keep_open >=1 keeps file descriptors open, if == 0 then file descriptors are redirected to /dev/null"
-
ngortheone
well, it exists in Linux too
-
xtile
yeah?
-
ngortheone
with the same description and argument tames
-
ngortheone
-
VimDiesel
Title: daemon(3) - Linux manual page
-
xtile
it's a BSD function that GNU/Linux adopted, then. :P Not POSIX.
-
ngortheone
Linux actually has slightly better description
-
ngortheone
"If noclose is zero, daemon() redirects standard input, standard output, and standard error to /dev/null; otherwise, no changes are made to these file descriptors."
-
ngortheone
Â
-
ngortheone
which avoids mental gymnastics required to figure out the double-negated sentence
-
xtile
agreed, this is a better description
-
xtile
the one in the gnu/linux man page
-
ngortheone
well, ok, dis on POSIX was unfounded... this time :D
-
xtile
OpenBSD has removed the double negation, but NetBSD and DragonFlyBSD retain the double negation in their man pages
-
xtile
-
VimDiesel
Title: daemon(3) - OpenBSD manual pages
-
ngortheone
yeah, we can do even better -> rename noclose argument to keepopen
-
ngortheone
then description makes sense: keepopen == 0 -> /dev/null
-
ngortheone
otherwise..well keep those descriptors open
-
xtile
gonna break someone's code? :3
-
ngortheone
unfortunately..
-
» xtile nods.
-
xtile
ngortheone: submit a patch for the man page :D
-
xtile
i wouldn't mind seeing the wording changed to how OpenBSD's has it
-
ngortheone
already on it
-
xtile
it would read better
-
xtile
nice
-
xtile
thank you <3
-
ngortheone
trying to locate the man page
-
xtile
/usr/src/lib/libc/gen/daemon.3
-
xtile
the 'locate' command works well
-
xtile
it's installed to, of course:
-
xtile
/usr/share/man/man3/daemon.3.gz
-
ngortheone
Thanks! Found it
-
ngortheone
-
VimDiesel
Title: man 3 daemon: remove double negation by ngortheone · Pull Request #671 · freebsd/freebsd-src · GitHub
-
xtile
I approve of this, including the 'will redirect' -> 'redirects'
-
ngortheone
openbsd did so too
-
» xtile looks again, and notices now.
-
ngortheone
I took their text verbatim