-
jaap
Yeah the IP blocking is difficult with AI crawlers. They're run out of large networks so the crude ban hammer blast radius is gonna touch some innocent bystanders
-
jaap
Also not convinced the (free version of the) Anubis solution amounts to much more than security theater
-
jaap
These crawlers are a serious issue with no easy solution right now
-
jaap
No idea about the infra involved but you could give Anubis a try to see if it alleviates some of the load
-
jbk
hrm...
-
sjorge
It’s so bad even on my small stafix
-
sjorge
*static blog too
-
sjorge
Traffic has gone 10x in the past year and it’s all AI crowlers and most just ignore robots.txt or worse, use it to crawl hidden stuff
-
richlowe
it is very much "or worse"
-
Smithx10
I've got a nasty one....
-
Smithx10
connect() is getting EINPROGRESS and doesn't seem to be sending out to the switch. did a packet capture switch side and didn't see anything coming in fronm dig on 2 CNs
-
Smithx10
the process just loops
-
Smithx10
-
rzezeski
Smithx10: maybe check `connstat` and see what state the TCP connection is in, also run snoop on the host and see if SYN packets are going out or not.
-
Smithx10
very strange that snoop on the Server said we sent them, but switch doesn't see them
github.com/Smithx10/debug-images/blob/main/pcap.png. The dig command intermittently gets stuck
-
Smithx10
-
Smithx10
rzezeski: when it hangs it looks like the latest one that was hung is SYN_SENT from what I can tell
gist.github.com/Smithx10/e07c3de9dd5cb5915febac1de182da3d
-
rzezeski
Yep, SYN_SENT is the state I would expect if the host sent a SYN and is still waiting for its ACK. If you see it hitting snoop it should technically have hit the wire, but snoop intercepts before the driver, so it could be the driver or the device rejected to send it for some reason. What is the link it's going out on?
-
Smithx10
i40e in aggr.
-
Smithx10
I have at least 2 CN in this state, behavior is happening in bhyve guests and os zones
-
Smithx10
I am gonna takethe aggr out of the scenario in one of them
-
rzezeski
okay, one thing you might do is take two snapshots of kstat for any mac, aggr, and i40e stats. And look for any counters that are changing that look suspicious, like errors or drops.
-
rzezeski
are other connections on the same host working on that link? like other TCP connections going out over the same aggr?
-
Smithx10
yeah there a bunch of virtual machines
-
Smithx10
internal customers
-
rzezeski
so other VMs are getting new connections established over the link okay?
-
Smithx10
no, everyone is having intermittent issues
-
Smithx10
tcp, udp
-
Smithx10
what kstat dumps you think have the most value?
-
rzezeski
hmmm, okay, I mean it kind of sounds like the i40e Tx freeze issues I fixed years ago, but those would grind the i40e device to a halt (no traffic for anyone) until a reset.
-
rzezeski
I can't think of them off the top of my head, but anything with 'mac', 'i40e', or 'aggr' in the name
-
Smithx10
connect(4, 0x005EAC28, 16, SOV_DEFAULT) Err#150 EINPROGRESS
-
Smithx10
errr
-
Smithx10
whhos
-
Smithx10
whoops*
-
rzezeski
I have to step away for a bit
-
Smithx10
-
sommerfeld
Smithx10: i40e kstats might also be informative (there are a bunch of different reasons for a tx drop)
-
Smithx10
sommerfeld: I think they are in there on that gist should be 3 files i40e, mac, and aggr
-
Smithx10
each gist is a server, I haven't bounced if you'd like to see anything more