-
gonzosysadm[m]
any freebsd13 bhyve vms floating around?
-
gonzosysadm[m]
sorry, images
-
neuroserve
haven't seen one (only 12)
-
bahamat
gonzosysadm[m]: I don't understand the FreeBSD release cadence very well, so it's hard for me to pick which releases I should make images of.
-
tealirc
Hi there. We are still struggling with Manatee. manatee0 primary and manatee1 sync are ok, but manatee2 async is still in failed state. We created the maantee3 zone with sapiadm command, but this did not help.
-
tealirc
Do you have any tips? Can we destroy manatee1, 2 and 3 and run command “sdcadm post-setup ha-manatee -s SERVER1_UUID -s SERVER2_UUID” again?
-
tealirc
Is this possible?
-
bahamat
I need more information than just "did not work". What does the sitter on the new node report in the log?
-
tealirc
bahamat: The logs are the same in the new manatee3 zone. Lots of "zfs recv" commands but nothing happens. The dataset does not exist.
pastebin.com/g8KszyUa
-
bahamat
that log output doesn't show anything about progress, and that's what I need to see.
-
tealirc
Sorry for my ignorance, but where can I find the detailed information? Headnode/manatee0/sitter, etc.
-
bahamat
In the peer that's attempting to rebuild
-
bahamat
the manatee-sitter log
-
bahamat
Like yesterday the log you showed me had "body: {"restore": null}" in it. It's not supposed to say null there.
-
bahamat
It's supposed to say the total size and the size transferred so far
-
bahamat
But null means something is broken.
-
bahamat
I need to see what that's doing on the new peer.
-
bahamat
If it's null again, then you need to figure out what's getting in the way of your zfs recv.
-
bahamat
That's not normal, and nothing about Triton would cause that to happen, so I don't have a solution for you.
-
bahamat
Maybe it's networking? Maybe your filesystems are full?
-
bahamat
Anyway, you'll need to figure that out.
-
bahamat
Once you figure that out, I can guide you through the rest of restoring manatee.
-
akole
ZfsClient.postRestoreRequest is the function that initiates the zfs send process?
-
akole
ok, might've solved the issue... saw a whole lotta ECONNREFUSED to the manatee0 port 12345 on that function... and sure enough for some reason the backup and snapshotter services were disabled there
-
akole
now restore is happening happily
-
neuroserve
tealirc : have you checked the diskspace of the manatee instance? I had a similar issue two years ago ->
bit.ly/3MOaaAJ
-
tealirc
neuroserve: We did found issue (more precisely, akole found it). Key services on the manatee0 zone were down (backup and snapshotter).
-
neuroserve
cool
-
akole
that said, we have had the diskspace issue in the past as well :)
-
neuroserve
:)
-
tealirc
akole: Then it was a broken hard drive.