
do a cyclic scan for orphan objects in zstd memory pool #10969


Merged: 1 commit merged into openzfs:master on Sep 30, 2020

Conversation

@BrainSlayer (Contributor) commented Sep 23, 2020

In non-regular use cases, allocated memory might stay resident in the memory pool. This small patch checks every minute whether there are old objects which can be released from the memory pool.

Right now, with regular use, the pool is checked for old objects on each allocation attempt from this pool, so it is basically polling by use. Now consider what happens if someone writes a lot of files and then stops using the volume, or even unmounts it: the code will no longer check whether objects can be released from the pool, so whatever has been allocated stays in the pool cache. This is no big issue for common use, but someone discovered it while doing tests. Personally, I know this behaviour and I'm aware of it. It's no big issue, just an enhancement. See also #10938
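For illustration, here is a minimal, self-contained C sketch of the "polling by use" behaviour described above. The names (`pool_obj_t`, `pool_alloc`, `pool_reclaim_stale`), the pool layout, and the 60-second threshold are assumptions for the example, not the actual module/zstd code; locking is omitted for brevity. The point is that the stale-object scan only runs inside the allocation path, so once allocations stop, cached buffers are never released.

```c
/*
 * Illustrative sketch only (not the real zfs_zstd.c code): a mempool where
 * stale buffers are reclaimed solely as a side effect of new allocations.
 */
#include <stdbool.h>
#include <stdlib.h>
#include <time.h>

#define	POOL_SIZE	16
#define	STALE_SECONDS	60	/* buffers unused this long are release candidates */

typedef struct pool_obj {
	void	*mem;		/* cached allocation, NULL if slot is empty */
	size_t	size;
	time_t	last_used;	/* timestamp of the last borrow */
	bool	in_use;
} pool_obj_t;

static pool_obj_t pool[POOL_SIZE];

/* Free cached buffers that have not been touched for STALE_SECONDS. */
void
pool_reclaim_stale(void)
{
	time_t now = time(NULL);

	for (int i = 0; i < POOL_SIZE; i++) {
		pool_obj_t *o = &pool[i];

		if (o->mem != NULL && !o->in_use &&
		    now - o->last_used > STALE_SECONDS) {
			free(o->mem);
			o->mem = NULL;
			o->size = 0;
		}
	}
}

/*
 * Borrow a buffer: this is the only place the stale scan runs, so if
 * allocations stop, cached buffers stay resident indefinitely.
 */
void *
pool_alloc(size_t size)
{
	pool_reclaim_stale();	/* "polling by use" */

	for (int i = 0; i < POOL_SIZE; i++) {
		pool_obj_t *o = &pool[i];

		if (!o->in_use && o->mem != NULL && o->size >= size) {
			o->in_use = true;
			o->last_used = time(NULL);
			return (o->mem);
		}
	}
	/* No cached buffer fits; fall back to a fresh allocation. */
	return (malloc(size));
}
```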

covers #10938

Signed-off-by: Sebastian Gottschall [email protected]

Motivation and Context

Description

How Has This Been Tested?

Types of changes

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Performance enhancement (non-breaking change which improves efficiency)
  • Code cleanup (non-breaking change which makes code smaller or more readable)
  • Breaking change (fix or feature that would cause existing functionality to change)
  • Documentation (a change to man pages or other documentation)

Checklist:

@mschilli87 (Contributor)

@BrainSlayer:

In non-regular use cases, allocated memory might stay resident in the memory pool.

Why is that? Do we know? Don't we care?

This small patch checks every minute whether there are old objects which can be released from the memory pool.

This seems a bit hacky IMHO. Wouldn't the proper fix be to investigate why the memory ends up orphaned in the first place and ensure it doesn't?
If you want this as a fallback safeguard for practical reasons, I think it should at least issue a warning when it has to release memory 'forgotten' previously. Otherwise this could end up masking more severe future issues with 'proper' memory release.

@BrainSlayer (Contributor, Author)

@mschilli87 let me explain. Right now, with regular use, the pool is checked for old objects on each allocation attempt from this pool, so it is basically polling by use. Now consider what happens if someone writes a lot of files and then stops using the volume, or even unmounts it: the code will no longer check whether objects can be released from the pool, so whatever has been allocated stays in the pool cache. This is no big issue for common use, but someone discovered it while doing tests. Personally, I know this behaviour and I'm aware of it. It's no big issue, just an enhancement. See also #10938

@mschilli87 (Contributor)

@BrainSlayer: Thank you for the explanation. My two cents:

  1. This explanation should be in the commit message to document the need for the change.
  2. I'd still prefer a fix closer to the root cause. Why not trigger this check when unmounting a volume instead of polling regularly in case it got unmounted? Do we keep polling unused volumes even after freeing the leftover memory? Or could we stop considering a volume after dealing with it once (within a minute of it last being used) and add it back to our list once it gets used again (e.g. in the polling-by-usage code you mentioned)?

@BrainSlayer (Contributor, Author) commented Sep 23, 2020

Copied my explanation into the description above; I will add it to the commit message once the test run is done.

Regarding the check procedure: zstd is just an algorithm. It doesn't know about any higher-level operation and does not know whether a volume is in use, mounted, or unmounted. The pool is also not used per volume; it is used by the algorithm, which can of course be shared by multiple volumes. And yes, the polling remains even after everything is freed, but it only happens once a minute and does not cause any measurable performance impact, since the checking code is very small. Forget about thinking in volumes; just think of a compression algorithm which is available globally for everything you do. The case I mentioned was just an example.

If any volume is in use and you simply don't write to it, or it isn't in use at all, the preallocated RAM stays allocated without this patch. So it's basically just an optimization to reduce memory usage when everything related to zstd is idle. Of course I could stop the thread once nothing is left and restart it on the first new allocation, but that is more complicated handling and may cause a latency spike on the first allocation attempt. Keeping it as simple as possible is the best way in my opinion. Stopping and restarting threads also needs special handling to avoid race conditions in this threaded context; the current simple approach is thread safe.
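To make that concrete, here is a hedged sketch of the idea: a single long-lived reaper that wakes once a minute and runs the same stale-object scan the allocation path performs, so memory is returned even while the pool sits idle. The names are hypothetical, pthreads stand in for the kernel thread the real module would use, and `pool_reclaim_stale()` is the illustrative helper from the sketch in the description; in real code the same lock would also guard the allocation path.

```c
/*
 * Illustrative sketch only: a long-lived reaper thread that frees stale
 * pool buffers once a minute, independent of allocation activity.
 */
#include <pthread.h>
#include <stdbool.h>
#include <unistd.h>

/* Stale-object scan from the earlier sketch (hypothetical helper). */
void pool_reclaim_stale(void);

static pthread_mutex_t pool_lock = PTHREAD_MUTEX_INITIALIZER;
static volatile bool reaper_exit = false;
static pthread_t reaper_thread;

static void *
pool_reaper(void *arg)
{
	(void) arg;
	while (!reaper_exit) {
		sleep(60);			/* cyclic scan interval: one minute */
		pthread_mutex_lock(&pool_lock);
		pool_reclaim_stale();		/* same scan the allocator runs */
		pthread_mutex_unlock(&pool_lock);
	}
	return (NULL);
}

/* Started once at module load and kept running: no start/stop races. */
void
pool_reaper_init(void)
{
	(void) pthread_create(&reaper_thread, NULL, pool_reaper, NULL);
}

/* Tear-down at module unload (may wait for the current sleep to finish). */
void
pool_reaper_fini(void)
{
	reaper_exit = true;
	(void) pthread_join(&reaper_thread, NULL);
}
```

Keeping the thread alive permanently, as argued above, trades a negligible once-per-minute wakeup for avoiding the start/stop races and first-allocation latency spike a dynamically managed thread would introduce.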

@mschilli87 (Contributor)

That's a good enough explanation for me, as long as it makes it into the commit message and/or a comment in the source code itself. Thanks for taking the time to walk me through.

@PrivatePuffin (Contributor) left a comment


Looks good, with a slight nitpick that the note does not refer to the 60 seconds.

Ping for @c0d3z3r0 and @allanjude to check this out if they want.

@behlendorf behlendorf added the Status: Code Review Needed Ready for review and testing label Sep 23, 2020
@richardelling (Contributor)

ISTM that this is the wrong approach. We know when an objectset is removed from the system (umount, destroy, close) and we already have async clean to clear its data from ARC. Why not just extend it to also clear out the zstd memory pool?

@allanjude (Contributor)

ISTM that this is the wrong approach. We know when an objectset is removed from the system (umount, destroy, close) and we already have async clean to clear its data from ARC. Why not just extend it to also clear out the zstd memory pool?

This is a pool of memory allocations used for compression. It is not tied to any specific objset. It is just that after a period of high activity, the amount of memory used by this pool may be high, and it doesn't shrink when there is no activity.

@mschilli87 (Contributor)

@allanjude: That is what confused me as well until @BrainSlayer explained it.
@richardelling: Good to know I am not alone. 😉

I think this whole discussion just stresses that the change suggested here should come with good documentation.

@BrainSlayer (Contributor, Author)

ISTM that this is the wrong approach. We know when an objectset is removed from the system (umount, destroy, close) and we already have async clean to clear its data from ARC. Why not just extend it to also clear out the zstd memory pool?

Circular dependency: no call to zstd code can be made from functions within zfs. zzstd must be loaded before zfs is loaded. Such solutions are only possible with installable hooks.

@BrainSlayer (Contributor, Author)

@allanjude: That is what confused me as well until @BrainSlayer explained it.
@richardelling: Good to know I am not alone. 😉

I think this whole discussion just stresses that the change suggested here should come with good documentation.

According to what he wrote, he understood very well how it works.

@mschilli87

This comment has been minimized.

@BrainSlayer BrainSlayer force-pushed the liberator branch 3 times, most recently from 2b289f3 to be616dc Compare September 24, 2020 12:29
@BrainSlayer (Contributor, Author)

@behlendorf please check the new solution for handling the memory cleanup.

@BrainSlayer BrainSlayer force-pushed the liberator branch 3 times, most recently from 211545a to aa33676 Compare September 29, 2020 10:51
@BrainSlayer (Contributor, Author)

@behlendorf regarding when to reclaim space and when not: the pool allocator is self-managing. Waiting until memory is low, when that is avoidable, is not a good approach from my point of view. In our test case we had about 1.5 GB of unused memory left, which is no small amount. Releasing it when it has not been used for a while avoids such situations, which would otherwise slow down ZFS. So reclaiming early, without a performance impact, is a good approach.

In non-regular use cases, allocated memory might stay resident in the memory
pool. This small patch checks every minute whether there are old objects which
can be released from the memory pool.

Right now, with regular use, the pool is checked for old objects on each
allocation attempt from this pool, so it is basically polling by use. Now
consider what happens if someone writes a lot of files and stops using
the volume or even unmounts it: the code will no longer check whether
objects can be released from the pool. Already allocated objects will
still stay in the pool cache. This is no big issue for common use, but
someone discovered this issue while doing tests. Personally, I know this
behaviour and I'm aware of it. It's no big issue, just an enhancement.

Signed-off-by: Sebastian Gottschall <[email protected]>
@behlendorf behlendorf added Status: Accepted Ready to integrate (reviewed, tested) and removed Status: Code Review Needed Ready for review and testing labels Sep 30, 2020
@behlendorf behlendorf merged commit 8a171cc into openzfs:master Sep 30, 2020
behlendorf pushed a commit that referenced this pull request Oct 1, 2020
In non-regular use cases, allocated memory might stay resident in the memory
pool. This small patch checks every minute whether there are old objects which
can be released from the memory pool.

Right now, with regular use, the pool is checked for old objects on each
allocation attempt from this pool, so it is basically polling by use. Now
consider what happens if someone writes a lot of files and stops using
the volume or even unmounts it: the code will no longer check whether
objects can be released from the pool. Already allocated objects will
still stay in the pool cache. This is no big issue for common use, but
someone discovered this issue while doing tests. Personally, I know this
behaviour and I'm aware of it. It's no big issue, just an enhancement.

Reviewed-by: Brian Behlendorf <[email protected]>
Reviewed-by: Kjeld Schouten-Lebbing <[email protected]>
Signed-off-by: Sebastian Gottschall <[email protected]>
Closes #10938
Closes #10969
jsai20 pushed a commit to jsai20/zfs that referenced this pull request Mar 30, 2021
sempervictus pushed a commit to sempervictus/zfs that referenced this pull request May 31, 2021
Labels
Status: Accepted Ready to integrate (reviewed, tested)
6 participants