-
vetaljclulow: Here is with shared disks between two nodes - (case when only one connection to each node)
-
vetal
-
tsoomeis there sort of "obvious" cause for vmem_hash_delete() panic with "bad free" message?
-
tsoomelike there are 'freeing into free cache' and such....
-
richlowethe address you're trying to free isn't in a vmem segment
-
richlowethat is, we hashed `addr` and walked the bucket, and it wasn't there
-
tsoomeyep. that much was easy to see:)
-
richloweHey, I'm not the one who asked the question
-
tsoomenono, I meant no harm:) but yes, I guess it really has no obvious answers anyhow
-
tsoomeseeing it from reap, so the kmem_cache_free() and friends have not spotted anything wrong
-
sommerfeldby default, kmem_cache_free() doesn't do a lot of checking on the fast path.
-
sommerfeldif it's a reproduceable crash, try the workload with kmem debugging enabled.
-
tsoomeyep, thats the plan. there has been 2 cases, but not sure if it is reliably reproducible.
-
richloweyou could turn the lighter-weight kmem debugging on for longer periods of time, even if the heavier pieces would be a problem, too
-
richlowe(I've mostly found the checking wasn't a major concern, but the logging was problematic)
-
richloweunfortunately, the logging is also often invaluable to fix what the checking has found :)