-
Smithx10
thanks sommerfeld
-
Reinhilde
next time you reach for an AI tool, remember Jia Tan.
-
Reinhilde
the AI tools *are* Jia Tan.
-
gitomat
[illumos-gate] 17978 want cp/mv/ln -T -- Robert Mustacchi <rm⊙fo>
-
gitomat
[illumos-gate] 17979 ln should support -L and -P -- Robert Mustacchi <rm⊙fo>
-
tsoome
nice!
-
Agnar
I need help with one of my servers. I had to move the disks (and all pcie cards) from my old X4270M2 to a X8-2L. the system boots, but refuses to see the Dual 10G Ethernet (ixgbe) and two of three NVMe devices (one shows up, "SK hynix") while both kingston cards are invisible. however, prtconf -dD does show all the devices - for the ixgbe even all instances and also the driver loads, but dladm show-phys is
-
Agnar
not showing any ixgbe
-
Agnar
(550) spin:/root# prtconf -dD | grep ixgbe pci108e,7b11 (pciex8086,10fb) [Intel Corporation 82599ES 10-Gigabit SFI/SFP+ Network Connection] (driver name: ixgbe) pci108e,7b11 (pciex8086,10fb) [Intel Corporation 82599ES 10-Gigabit SFI/SFP+ Network Connection] (driver name: ixgbe)
-
Agnar
(560) spin:/root# update_drv -a -v -i '"pci108e,7b11"' ixgbe
-
Agnar
exit status = 0
-
Agnar
devfsadm: driver failed to attach: ixgbe
-
Agnar
exit status = 1
-
Agnar
Warning: Driver (ixgbe) successfully added to system but failed to attach
-
Agnar
I do have rebuild /etc/path_to_inst multiple times, but nothing changed
-
tsoome
you get failure in ixgbe_attach(), most of those errors are logged with ixgbe_error() which is using "!%s" (man cmn_err: The message goes only to the system log.)
-
Agnar
messages and syslog have *no* log for ixgbe, and syslog sends kern.debug to messages
-
tsoome
fmadm faulty?
-
Agnar
nothing, I had three entries which I have acquit'ed
-
richlowe
if you evacuated a system like that, I would still be willing to bet it's confused about state
-
richlowe
what I can't do is find a good guess as to what
-
Agnar
richlowe: I agree on the confused part. I just wonder where it persists device data outside of path_to_inst and driver_aliases
-
sommerfeld
do the devices show up in the output of /usr/lib/pci/pcieadm show-devs ?
-
Agnar
oh, AND: if I boot from a OI install stick, all devices are configured correctly
-
tsoome
and are available?
-
Agnar
sommerfeld: yes: af/0/0 PCI -- 82599ES 10-Gigabit SFI/SFP+ Network Connection
-
Agnar
af/0/1 PCI -- 82599ES 10-Gigabit SFI/SFP+ Network Connection
-
Agnar
tsoome: yes
-
sommerfeld
Agnar: did the path_to_inst rebuild include rebuilding the boot archive?
-
sommerfeld
(and then rebooting?)
-
Agnar
sommerfeld: erm, I have deleted it and boot -arvw
-
richlowe
I would probably do an explicit rebuild and double check
-
richlowe
it's usually very good about doing an automatic reboot during boot to keep things from getting confusing
-
richlowe
but I'm not 100% sure if that's clock sensitive at all or anything
-
Agnar
richlowe: could you elaborate on how you would do an explicit rebuild? I'm fighting with this system for four days and did a lot of rain dance already so I don't want to do any mistake
-
richlowe
`bootadm update-archive` `-R` selects an alternate root, `-v` is verbose
-
Agnar
ah, the boot archive
-
Agnar
let me try that
-
Agnar
the old X4270M2 powered off on monday night because of an electrical defect. so I checked ebay for good alternatived...thanks to AI, it doesn't matter if you look for a 15,10 or 5y old server - you basically pay per GB of RAM, regardless of the age.
-
sommerfeld
note that there is other state in /etc/devices/* that could potentially be stale.
-
Agnar
sommerfeld: can I safely delete that?
-
Agnar
ok, so regenerating the boot-archive hasn't changed anything
-
Agnar
I have deleted all files in /etc/devices/, did update the boot archive and rebooting now
-
sommerfeld
Thinking two different paths here. 1) while running off the busted root, try cleanups with devfsadm -v -C. 2) boot off of OI stick, import busted root, install a new BE into that root and make sure it can see devices. Then migrate your new config into the new root BE from the old BE.
-
Agnar
killing /etc/devices was the trick!!
-
Agnar
w00t!!
-
Agnar
ok, I'm going to blog that tomorrow - thanks a lot to you all :))
-
richlowe
I wonder if it was the unit address persistence or what
-
Agnar
richlowe: I would assume that, but killing files called "cache" usually makes sense too :)
-
sommerfeld
cache invalidation is hard.
-
Agnar
indeed, see nscd :)
-
gitomat
[illumos-gate] 17956 decode nvme extended SMART logs -- Robert Mustacchi <rm⊙fo>