Description
While running the e2e integration tests locally, I hit a few scenarios in which the tests fail intermittently.
- Sometimes a test fails on the following check:
  conditions = client.get_job_conditions(name, namespace, job_kind)
  if len(conditions) != 1:
      raise Exception(f"{job_kind} conditions are invalid: {conditions}")
  I think this happens when the container created by the test starts running almost immediately: by the time the check runs, the job already has two conditions instead of one, so the test fails.
- Sometimes a test fails on the following check:
  conditions = client.get_job_conditions(name, namespace, job_kind)
  if len(conditions) != 3:
      raise Exception(f"{job_kind} conditions are invalid: {conditions}")
  In these cases, I found that the running condition was missing from the training job's conditions, so the count did not match and the check failed.
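
One possible way to make both checks less timing-sensitive is to assert on which condition types are present rather than on how many there are. The following is only a minimal sketch, not the project's actual fix: the `Condition` class is a hypothetical stand-in for whatever `client.get_job_conditions` returns, the `type` and `status` fields are assumed, and the condition names `Created`, `Running`, and `Succeeded` are assumptions as well.

```python
from dataclasses import dataclass
from typing import Iterable, Set


@dataclass
class Condition:
    """Hypothetical stand-in for a job condition object (fields are assumed)."""
    type: str
    status: str = "True"


def assert_has_conditions(
    conditions: Iterable[Condition], required: Set[str], job_kind: str
) -> None:
    """Raise if any required condition type is absent or not "True".

    Extra conditions (for example an early "Running") are tolerated,
    which is the point of checking types instead of len(conditions).
    """
    conditions = list(conditions)  # allow generators; we iterate twice
    present = {c.type for c in conditions if c.status == "True"}
    missing = required - present
    if missing:
        raise Exception(
            f"{job_kind} conditions are invalid, "
            f"missing {sorted(missing)}: {conditions}"
        )
```

With a helper like this, the check right after job creation could require only `{"Created"}`, so it would still pass if the container has already started running. The check after completion could require `{"Created", "Succeeded"}`, so it would not depend on the running condition having been recorded.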