Slow mount of PVC for che workspaces - can prevent workspaces from starting. #4079
Comments
Here are the job results with two Angular projects installed. Events from the OpenShift console: Link to the Zabbix graph with the data collected: |
This has one more implication... Normally, when a workspace is deleted, the But when the PVC has lots of files, the mount fails (or takes longer than the timeout for startup of the WDYT @davidfestal ? |
@rhopp could you please test how it works on prod-preview where deployments are used instead of bare pods - eclipse-che/che#10021 (comment) ? |
@ibuziuk On prod-preview, when the failure happens, it happens with a weird message (timeout of 0 milliseconds, even though it fails after a few minutes):
|
@rhopp @slemeur but the message in openshift events is still about the Fail Mount problem, right ? |
Could you please tell me if there's been progress on resolving this? |
@sbose78 Hello. |
@ScrewTSW Fantastic! Thank you.
yeah, this should be OK. |
All starter clusters have been updated this week to OpenShift 3.11. However, yet another requirement for fixing this slow volume mount issue is having the gluster-subvol driver fix [1] deployed to the prod clusters. I will provide an update here when the fix is available on the prod clusters and QA verification can be done. |
I can confirm that e2e tests started passing on this cluster. 12 of 12 runs successful so far. https://ci.centos.org/job/devtools-test-e2e-openshift.io-smoketest-us-east-1a-released/ |
I'm not sure but you commented in this issue on Dec 12, 18:16 UTC |
@rhopp could someone from QA team with the account provisioned against |
PVC mount issue seems to be fixed on I've encountered another (not so critical) issue while testing that... I'll report that as a separate issue. |
The update has now been applied to the rest of OSIO. |
@rhopp closing this issue |
gitlab issue - https://gitlab.cee.redhat.com/dtsd/housekeeping/issues/2169
Mounting a PVC which has lots of files is very slow, and it can even prevent Che workspaces from starting.
To put it into perspective - it's enough to have 3 workspaces with the Angular [1] example to get into a state where workspaces fail to start.
Right now, this may not be such a big issue (but we had people in the past who were affected by this), but this won't scale in the future... Even if we raise the timeout for workspaces to start, it would be unbearable to wait ~10 minutes or more for startup.
Steps to reproduce
Basically fill up the claim-che-workspace PVC with lots of files and try to start a workspace - observe that it either takes too long or fails after ~10 minutes.
OR use a "more real world" approach:
This link will spawn a workspace with the Angular quickstart and immediately start a build - this generates (downloads) a few thousand files.
[1] - Created from factory - https://www.eclipse.org/che/getting-started/cloud-osio/
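For the first reproduction approach (filling the PVC with lots of files), a minimal sketch of a file generator might look like the following. This is not the script used for the job results in this issue; the function name, directory layout and default counts are all hypothetical, and it assumes you run it from a container that has the claim-che-workspace PVC mounted, passing that mount path as `root`.

```python
import os
import tempfile


def fill_with_small_files(root, dirs=10, files_per_dir=100, size=64):
    """Create dirs * files_per_dir small files under root and return the count.

    Hypothetical helper: names and defaults are illustrative only.
    """
    payload = b"x" * size
    created = 0
    for d in range(dirs):
        subdir = os.path.join(root, f"dir_{d:04d}")
        os.makedirs(subdir, exist_ok=True)
        for f in range(files_per_dir):
            path = os.path.join(subdir, f"file_{f:05d}.txt")
            with open(path, "wb") as fh:
                fh.write(payload)
            created += 1
    return created


if __name__ == "__main__":
    # Demo against a temporary directory. To reproduce the issue, point
    # `root` at the path where the PVC is mounted (an assumption about
    # your setup) and raise the counts to a few thousand files or more.
    with tempfile.TemporaryDirectory() as root:
        print(fill_with_small_files(root))
```

Many small files are used on purpose, since the report is about the number of files on the volume rather than their total size.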