-
Smithx10
I have a CN with no aggregations and ran lldpneighbors and then i ran it again and go t " lldpneighbors couldn't connect to the OpenLLDP transport socket. Is lldpd running?'
-
Smithx10
-
danmcd
COnfig file oddness?
-
danmcd
`[ERROR] in config file: invalid location_data_format '0'`
-
danmcd
And /var/run/lldpd.sock can't be bound?
-
Smithx10
i never configured lldp*
-
danmcd
Oh...
-
Smithx10
smartos just had it running
-
Smithx10
i figured it was supposed to just work.
-
danmcd
I wonder if lldpneighbors attempts to run the daemon? `pgrep lldp` say anything?
-
Smithx10
253625
-
danmcd
And `svcs -xv lldp/server` ?
-
Smithx10
Is there another command I'm suppoed to use to get ToR switch port number
-
danmcd
(I'll note it's running on my no-aggr systems as well.)
-
Smithx10
State: online since December 15, 2022 at 03:39:27 PM UTC
-
Smithx10
no impact
-
Smithx10
I know it doesn't work over aggregations
-
danmcd
SOrry.
-
Smithx10
but we are just debugging this 1 where we believe we had another cross connection
-
danmcd
I just got up and was going to start testing this week's Triton.
-
Smithx10
No problem not a big deal
-
Smithx10
its all jbk fault :P
-
danmcd
Anyway... `svcs -xv lldp/server` on both your local machine and any appropriate remote ones.
-
neuroserve
Smithx10 same here - lldpneighbors answers once - maybe there's some kind of rate limiting - or it's waiting for the next lldp packet
-
Smithx10
neuroserve: yea, looks like thats the behavior
-
neuroserve
Smithx10 but on some CNs/HNs it answers every time...
-
bahamat
I think that a peer is sending it data that makes it crash.
-
bahamat
I've seen it happen, but not consistently enough to track it down.
-
Smithx10
Probably Cisco
-
Smithx10
Ive got Cisco toR
-
Smithx10
ACI Nexus switches
-
neuroserve
restarting lldpd changes the behaviour...
-
neuroserve
Juniper here...
-
Smithx10
neuroserve: your using ixgbe?
-
neuroserve
nope - igb - oldschool
-
bahamat
I've been dying to replace our lldpd, but the alternatives don't compile on illumos. I started writing one in rust, but I got hung up with it and didn't get very far.
-
bahamat
Though, there is now rust_lldp, so maybe I should look into that again.
-
Smithx10
bahamat: can you send over the url please? I googled it and couldn't find it
-
Smithx10
-
Smithx10
ahh strange when I typed it exactly into google nothing popped up, but just lldp crate poped right up
-
Smithx10
-
bahamat
Yeah, but docs.rs says it doesn't build, and there's literally no documentation for it.
-
Smithx10
yea, very minimal
-
jbk
maybe i should finish up my version :)
-
jbk
though it still doesn't solve the aggr issue, but it'd at least integrate with smf and other bits as needed
-
Smithx10
do we have a /dev/kmsg ?
-
danmcd
no.
-
danmcd
we DO have /dev/msglog though.
-
danmcd
ooops, that's just for the console (but maybe that's what you need?)
-
danmcd
What problem are you trying to solve Smithx10?
-
Smithx10
We got hit by the flapping interface again
-
Smithx10
So I was looking at go libs for parsing the last 15 seconds from it
-
Smithx10
-
Smithx10
think I will just probably read /var/adm/messages
-
Smithx10
and look at the last timestamp *
-
rmustacc
Smithx10: sysevents?
-
danmcd
Yeah, that's the supported interface. NOTE that we collapse repeated messages there, so you may wonder "why am I only seeing one" and then seeing something like:
-
danmcd
Dec 13 23:41:05 neuromancer last message repeated 1 time
-
Smithx10
rmustacc: I'm using prometheus that come around to do a scrape every 15s
-
Smithx10
Is there a counter for a flapping interface, or would I have to subscribe to the events and keep that count myself
-
danmcd
If it's flapping out long enough to make the NIC go "down" from IP's POV, it's possible `route monitor` may suffice.
-
rmustacc
Right now you'd have to keep the counter yourself.
-
jbk
possibly crazy/bad idea... there's a change from robj that was never done for adding an interface to generate ireports in the kernel.. i wonder if link notifications might be an appropriate use case for such a thing
-
jbk
(IIRC, the example there was for something like noting SAS link speed change)
-
rmustacc
jbk: I mean, we already have that part of it there.
-
rmustacc
We are already generating a sysevent.
-
rmustacc
So changing that to an ireport doesn't buy much.
-
jbk
can ireports be used with the fmd modules? it might be nice to be able to mark the interface as degraded or such if it's flapping too frequently (vs. having to scrape logfiles to find out)
-
rmustacc
It can subscribe to the sysevents?
-
jbk
hmm.. probably.. looks like there are modules that already do for other stuff
-
jbk
and fmd doesn't fork() after the initial startup
-
jbk
so don't have to worry about it deadlocking
-
jbk
next time I hit it, I should really file a bug since apparently no one has
-
jbk
to get all the details, but IIRC, if a thread tries to fork() while another is in a door call, it can cause a deadlock and unkillable process
-
rmustacc
OK. If you can write up some details, give me a shout and we can look at it.
-
Smithx10
so /var/log/messages isn't syslog lol
-
Smithx10
at least not 3164 or 5424