#illumos

07:14

szilard

Nice article about SMF: davepacheco.net/blog/2026/smf-properties
15:09

danmcd

FreeBSD just removed le(4D) support yesterday. I'm pretty sure we ripped it out during Solaris 10 bringup.
15:11

jbk

oh that's a blast from the past..
15:12

jbk

and the infamous ce
15:13

danmcd

le was the onboard chip for late Sun-3s and the SPARCstations 1 & 2.
15:14

danmcd

IIRC my first UltraSPARC I workstation at Sun was an early model that also had le on it.
15:14

danmcd

illumos.org/opensolaris/bugdb/bug.html#!4942766
15:14

fenix

→ OpenSolaris issue 4942766: Remove le driver from ON10 (Closed)
15:14

jbk

I'm trying to remember what was on the sparc 5 & 20
15:14

jbk

the two sun boxes i ever got access to :)
15:18

danmcd

en.wikipedia.org/wiki/SPARCstation_20 says `le`.
15:18

jbk

ok.. that sounds right..
15:18

jbk

they were running solaris 2.4 at the time and at one point I believe disksuite was installed (uugh)
15:20

jbk

(I greatly disliked the admin interface of disksuite.. while maybe not admin hostile, it was certainly admin indifferent :P)
15:23

jbk

naming is important, and just giving a user an arbitrary number (or these days, guid) is just a giant FU IMO
19:31

richlowe

almost all the "classic" Sun machines were the lance ethernet
19:32

richlowe

and optional weirdness like atm and fddi
19:33

richlowe

I have this vague memory that le v. hme was the difference between the "Ultra 1" and "Ultra 2" and the "Ultra 1 Enterprise" and "Ultra 2 Enterprise"
19:33

richlowe

along with a framebuffer
20:09

alanc

yeah, it was early in S10:
20:09

alanc

PSARC 2003/335 EOL of le Ethernet driver
20:09

alanc

4942766 Remove le driver from ON10
20:10

richlowe

those machines would never run 64bit, so on10 was toxic for them eventually anyway
20:10

richlowe

they had that one bug nobody ever explains
20:16

alanc

and this many years later, there may be no one left who remembers the details
20:27

jbk

was 64-bit the 'prize' for the happy meal? :)
20:34

jbk

heh... and the choice of reusing EBADE instead of just added a new error code for zfs continues to cause confusion :)
21:42

ENOMAD

any HBA-driver experts present? My newly imaged OmniOS host is setting the same problem report as illumos.topicbox.com/groups/develop…55bd13e0d-M68f6ceb0f3e2a1cf3bbeb89d and I'm curious if it is really ignoreable or if I can/need to do something to fix it.
21:43

ENOMAD

In my case the complaint is "Mar 3 12:17:25 fs2 scsi: [ID 107833 kern.warning] WARNING: /pci@95,0/pci8086,352c@5/pci1000,4060@0 (mpt_sas0):#012#011Number of phys reported by HBA SAS IO Unit Page 0 (11) is greater than that reported by the manufacturing information (8). Driver phy count limited to 8. Please contact the firmware vendor about this."
21:43

ENOMAD

value='SAS3808ALLHBA 9500-8i03-50134-01004SPF3001010'
21:43

ENOMAD

value='HBA 9500-8i'
21:43

ENOMAD

value='MPTSAS HBA Driver 00.00.00.24'
21:43

ENOMAD

value='9500-8i Tri-Mode HBA'
21:44

richlowe

it seems like it's saying there's two ways to get that value, and they give different answers, we picked the smaller, but you should ask LSI's desecendents to make it not do that
21:44

richlowe

which seems like it's trying to imply you're ok
21:44

richlowe

I'm not an expert
21:45

» ENOMAD nods
21:45

ENOMAD

My 'concern' (said gently) is the "count limited to 8" part. This host could eventually have up to 36 SAS devices connected. Right now we only have 11.
21:49

jbk

rmustacc: is there a reason we couldn't convert pci_boot.c to use the busra.c interfaces (ndi_ra_XXX)?
21:50

jbk

(it seems like it'd be nicer, and seems like it'd allow most of the code to not care about the PCI segment it's on)
22:45

sommerfeld

so I'm trying to understand where mblks can get queued between a NIC driver (specifically i40e) and a tcp socket. there's the receive ring, then there are soft rings in mac and then squeues entering ip. anywhere else? (I'm trying to figure out how a single active tcp connection that's coming from a 1gbit/s link and being actively read by the reciever can cause the i40e driver to run out of receive buffers after loaning out ~1024 of them to
22:45

sommerfeld

mac and points downstream..)
22:46

sommerfeld

suggests to me that something is causing large packet batches to accumulate somewhere along the pipeline.
22:52

richlowe

rzezeski: this sounds like something you know
22:53

jbk

one thing I've thought about but haven't dug in too deeply to see how difficult it'd be is for mblk_ts going upstack that are being loaned up, to copy and release the original mblk_ts if processing gets deferred for some reason
22:54

jbk

since the loaned up resources are often shared amongst multiple 'streams' (tcp connections/etc), so can potentially hog loaned out resources from down stack
22:55

jbk

i saw this in a bug I was never able to entirely chase down with inter-zone traffic on the same box
22:59

sommerfeld

the specific thing I'm chasing is that with a 4M tcp window, throughput sucks on most connections (~200Mbits/s); with a 500k window it goes at 9xx Mbit/s (gigabit-ish line rate).
23:01

sommerfeld

my working theory is that there are the standing waves building up *somewhere* and when the sender fills the tcp window it gives a chance for the receiver to drain and stay caught up.
23:03

sommerfeld

jbk: I think the hard part is that there are potentially so many places for mblks to get queued that knowing where to look is half the battle..
23:05

sommerfeld

ENOMAD: so how are things cabled up? expanders, or multiple 9500's?
23:07

ENOMAD

sommerfeld, single 9500
23:07

ENOMAD

I presume expanders. I uploaded the prtconf to the ticket I just opened.
23:11

sommerfeld

so it's probably at that point only counting the 8 ports on the 9500 and not the other ports on the expander(s) plugged into some of those ports
23:13

sommerfeld

note that it says "phy count" not something like 'target count'
23:13

ENOMAD

hmm. Interesting.
23:13

ENOMAD

Not sure why the number is odd but it is a potentially reasonable interpretation.
23:15

jclulow

sommerfeld: Which congestion control algorithm are you using?
23:16

jclulow

(I discovered last year that our "cubic" is possibly rubbish)
23:18

sommerfeld

i am in fact using cubic
23:18

sommerfeld

(which seemed to help for long-haul connections which this test was not..)
23:19

jclulow

sommerfeld: Does it improve if you switch back to sunreno
23:19

sommerfeld

trying that now
23:20

jclulow

The conditions where I was seeing this issue were also a speed imbalance: a 10G server into a generally 1G network etc
23:20

sommerfeld

yah, with both sunreno and newreno set as congestion control on the sender I don't see the speed collapse
23:21

jclulow

yeeeeah
23:21

jclulow

sigh
23:21

sommerfeld

this is the reverse situation (1G sender, 10G receiver)
23:24

jclulow

I don't think I got around to filing a bug for this (with my apologies) but it definitely seems like a real issue
23:40

sommerfeld

I still think there's something wrong going on in between driver and tcp on the receiver independent of tcp congestion control
23:43

sommerfeld

(because the trigger for the aforementioned rx_bind_norcb events in i40e is too many buffers on loan from driver to mac)
23:46

rmustacc

ENOMAD: I added that comment. There are basically two different log pages that report that and my memory is in this case we had things that were from other devices.
23:46

rmustacc

ENOMAD: In your case with an LSI 8i you only have 8 actual PHYs on that HBA that can be directly connected anyways.
23:47

rmustacc

jbk: Eventually we will rewrite pci_boot.c, but the main reason no one has yet is because it has a huge amount of testing implications.
23:47

rmustacc

But fundamentally one wants this to mostly be able to look like hotplug after a fashion.;
23:47

rmustacc

And not have multiple divergent paths.
23:48

rmustacc

If I were going to work on that project, I'd go first finish the project to allow me to do arbitrary PCIe briges in propolis so I can fake up all the different corner cases of devices and resources.
23:49

jbk

i ask because getting segments to work, you pretty much are having to touch a _lot_ (and I mean a _lot_) of pci_boot.c
23:49

rmustacc

Doesn't really change what I'd do first.
23:50

rmustacc

Which is have a good way to test arbitrary topologies with a VM configuration that I can automated.
23:50

rmustacc

*automate
23:53

jbk

that's not a very useful answer tbh
23:54

rmustacc

I mean, if I was going to rewrite it I'd want to do that first.
23:54

rmustacc

It may make sense and be the right way.
23:55

rmustacc

But again, how do we test it is the big question that i'd have.
23:55

rmustacc

It's really high risk.
23:55

rmustacc

I've not looked at the ndi_busra stuff in detail, sorry. Keith did that on Oxide.
23:56

rmustacc

So, no, I guess I don't know of a reason, but if it was me, I'd first figure out how to test it all without needing every different hardware config under the sun. There are other ways too to look at it like figuring out how to write it so you can drive it outside of a specific booting config.
23:56

rmustacc

Dunno, happy to talk live or something if that'd be more useful for you. Not sure if I can give you the answer you want.
23:56

rmustacc

Or ask the question again and I'll try to do better.
23:58

rmustacc

I'd probably also see how Rich redid enumeration, which I know has been described and I've forgotten.

19 days ago

« a day earlier

a day later »

today »