-
Guest32
looking for a NUC to install SmartOS, is the 10th gen still not compatible? haven't been able to find any new info regarding this since 2020
-
jbk
they should boot now
-
Guest32
thank you! is there any config needed on install or it works out of the box?
-
jbk
should just be out of the box.. though some bits like the onboard wireless may not work (wired ethernet should be fine though)
-
Guest32
good to know. thanks again
-
Guest9073
hi there, i am looking to upgrade a smartos server and researching an Intel® Xeon® Silver 4310 setup, but see there are some issues with KVM vms, specifically:
-
Guest9073
-
Guest9073
We have a few KVM instances and I was wondering if there is any official documentation on converting these to BHYV or if a rebuild is recommended? My searches didn't turn up much, appreciate any thoughts
-
danmcd
Yeah... I wish I had some Ice Lake HW to reproduce & fix this bug.
-
Guest9073
Could possibly get you access to it if you're interested
-
danmcd
I'm imagining I'd need kmdb at the host AND the guest. (Assuming I couldn't find a possible fix via source-diving... in THAT case such a remote-access scenario becomes more plausible.)
-
Guest9073
I figured you'd need host access. I'll hit you up directly. Thanks
-
tealirc
Hi all. I am deep trouble. We are updating Triton from version 20190912T054018Z to the latest version 20220825T001415Z. We have installed the latest platform image on the headnode and restarted the server. Then we updated sdcadm itself, GZ tools and agents tool. Everything went smoothly, but then we hit a roadblock. We are trying to update the manatee but it failed, and we cannot continue the update process.
-
tealirc
Manatee primary and sync are ok, but async is in failed state. On async zone we have run “svcadm disable manatee-sitter” and “manatee-adm rebuild” command but without any success.
-
bahamat
tealirc: What error are you getting?
-
bahamat
Is the async the only issue you have, or are there other things wrong?
-
tealirc
bahamat: We do not get any errors, but on the other hand nothing happens(?).
-
bahamat
The rebuild process is asynchronous, so maybe it's running already.
-
tealirc
I'm not sure. Async zone is still in failed state. We ran the command “manatee-adm rebuild” five hours ago.
-
bahamat
check your manatee-sitter log
-
bahamat
If the only thing wrong is that your async is failed, then you don't have anything to worry about. The system works fine without an async, you just need to make sure it's fixed before you attempt any upgrades of manatee or other manatee instances fail.
-
tealirc
Indeed. The system works fine, but we cannot update the Triton core components and platform images, because the manatee async zone is failed state.
-
bahamat
OK, well the first thing is that you'll need to figure out what state the rebuild process is in. To do that, you need to check the sitter log file.
-
neuroserve
tealirc : what was the result of "manatee-adm rebuild"? did it finish?
-
tealirc
neuroserve: No, it is still on "infinite" loop.
-
tealirc
I'm just collecting logs.
-
tealirc
-
bahamat
You should definitely not have that nay zfs recv processes running.
-
tealirc
I am not sure if this matters, but manatee primary version is 20220825T001415Z, and sync/ async are version 20190912T054018Z.
-
bahamat
No, that shouldn't particularly matter.
-
bahamat
You'll need to figure out what's going on with zfs there. The sitter is unable to get the status of the recv, so it never sees it as complete.
-
bahamat
One possible reason that it can't get the status is because you have so many zfs recv processes running. There should be exactly 1.
-
bahamat
If you ran `manatee-adm rebuild` 34 times, that wasn't the best choice.
-
tealirc
Could this log point to a problem?
pastebin.com/JXDgJq9Y
-
Smithx10
bahamat: did you find the same issue with piranha? Or perhaps I was on a wrong version?
-
bahamat
Smithx10: travisp was working on that.
-
bahamat
tealirc: That doesn't provide any additional information.
-
akole
did a zone stop/start and it seems to automatically begin the zfs receive operation without manatee-admin rebuild. And it seems that it starts new zfs recv's continuously leaving the previous ones still running
-
akole
there's also a lot of ECONNREFUSED errors all over the log
-
bahamat
That's because the zfs dataset hasn't been fully restored, so postgres is not running yet.
-
bahamat
It will automatically start a recv if there's no data.
-
akole
ah
-
akole
from where is the send supposed to connect?
-
bahamat
It will connect to the manatee-backupserver service on the next upstream peer. In this case, that will be the sync.
-
bahamat
There's another underlying problem though. In the sitter log it says the progress is null. That should never be the case.
-
bahamat
So you've probably got other underlying condition that's breaking zfs.
-
bahamat
that might also explain why there's so many zfs recvs running.
-
bahamat
Some possible places to look:
-
bahamat
* disk full?
-
bahamat
* delegated dataset exists?
-
bahamat
* dataset is properly delegated to the zone?
-
akole
should the /data/manatee exist before receive?
-
akole
the /data is there
-
bahamat
Yes, the dataset needs to exist.
-
akole
ah... there's our problem, I presume
-
bahamat
Use zfs list, don't just look for /data/manatee
-
bahamat
It's mounted at /manatee/pg, not at /data/manatee
-
akole
yeah, I see the uuid and uuid/data sets
-
akole
nothing more
-
bahamat
Use zpool history in the global zone to see if you can figure out who deleted it
-
akole
I see it destroyed -r
-
akole
I wonder if the upgrade script could've done it?
-
bahamat
No, that's not how it works.
-
bahamat
I would stop the zone and use sapiadm from the headnode to create a 4th manatee zone, and just Let It Be.
-
bahamat
It should come up on its own automatically.
-
bahamat
In any case, the sitter should be able to track progress.
-
bahamat
If it works, then use sapi to destroy the defunct zone.
-
bahamat
If it does not work, then you've got a bigger problem on your hands.
-
akole
oki... we shall try that
-
akole
thanks for the help :)