00:30:31 it sounds like it was maybe a research thing, so maybe that would come later? 00:31:31 one of these days when there's > 24h in one, I still have a copy of that branch using doors for FUSE on illumos that'd be nice to revive and play with.. i think as long as libfuse keeps the same API, it'd be largely transparent to any fses 11:27:22 So IPv6 still mostly broken on newer lx images it seems :( 11:27:37 I think we're missing some stuff, there seem to be multiple breakages 11:28:01 having a IPv6 NS will break all name resolving and just using IPv4 will still make stuff not connect in a lot of cases 16:33:10 rmustacc: if a driver returns mblks in its tx routine (i.e. couldn't transmit them), do you know if mac always retries later, or if it propagates any 'tx failed' errors upstack? 16:36:36 When someone performs a mac tx it depends on what they pass. 16:36:53 In general for TCP the backpressure will go all the way back up. 16:36:59 And will get to sockfs. 16:37:32 However, this is specifically the MAC_DROP_ON_NO_DESC and related flags that exist. 16:38:18 So for example, the IP fast path mode via dld will get the cookie and set that the queue is full. 16:42:46 I should add MAC will consider the tx queue blocked until it receives a call to mac_tx_update() or the ring equivalent. 20:42:20 the context is a system where we're seeing a spurt of tcpListenDrop events, but by the time we get on the system, they're gone... trying to dig into potential causes, i noticed that tx_err_nodescs for a number of rings (i40e) are non-zero and appear to have incremented (unfortunately, the granularity of checking the kstat isn't sufficient to know if it happened at the same time, only that it might) 20:42:47 so was wondering if that might be a potential cause 20:43:35 esp since during the time period some connections (inbound) were successful while others failed w/o any clear pattern 20:53:22 So the theory would be that the application is trying to accept connections off the queue, but can't because some response that needs to tx out isn't happening? 20:53:42 And therefore subsequent inbound connect attempts are being dropped? 21:16:48 it's a possible theory... but wasn't sure if tx failign like that could actually induce that up stack or not 21:17:30 (trying to think of things we can instrument in case it happens again as well) 21:19:04 I think the main initial question is what's the application doing and is it even trying to accept when this happens. 21:24:22 it's smb, and it appears to.. on that side we see ksocket_accept() failing while tcpListenDrops are incrementing... 21:25:38 You're getting ECONNABORTED? 21:25:55 yeah (130) 22:07:41 jbk: one thing I'd look at is the full descriptor recycling path for that driver. what thread of control handles that, and what does it do with mblk/dblk/etc., and what can it get entangled with in the process that might slow it down?