[SYCL][E2E] Fix `XFAIL` statements that use negated device features #18641

ayylol · 2025-05-22T18:37:30Z

Because of device specific features we have to perform our own checks to see whether or not a test should XFAIL. However, a similar check is also done in internal lit code, but due to the fact that this bit of code only knows about the non device specific features there can be situations where it incorrectly reports a test as expected to fail (Like negating a device specific feature, like XFAIL: !cpu).

To avoid this we set the XFAIL conditions to an empty list before returning our result, this way expected_to_fail is always false in the code below, and our result is not altered.

llvm/llvm/utils/lit/lit/Test.py

Lines 275 to 292 in d60cd27

    
           def setResult(self, result): 
        
               assert self.result is None, "result already set" 
        
               assert isinstance(result, Result), "unexpected result type" 
        
               try: 
        
                   expected_to_fail = self.isExpectedToFail() 
        
               except ValueError as err: 
        
                   # Syntax error in an XFAIL line. 
        
                   result.code = UNRESOLVED 
        
                   result.output = str(err) 
        
               else: 
        
                   if expected_to_fail: 
        
                       # pass -> unexpected pass 
        
                       if result.code is PASS: 
        
                           result.code = XPASS 
        
                       # fail -> expected fail 
        
                       elif result.code is FAIL: 
        
                           result.code = XFAIL 
        
               self.result = result

sarnex · 2025-05-22T19:11:16Z

sycl/test-e2e/format.py

+        # Set this to empty so internal lit code won't change our result if it incorrectly
+        # thinks the test should XFAIL. This can happen when our XFAIL condition relies on
+        # device features, since the internal lit code doesn't have knowledge of these.
+        test.xfails = []


sorry so which features are known by the general lit infra and which are known only by our custom code?

The features in test.config.available_features are known by the general lit infra, the device specific features which are in test.config.sycl_dev_features are not.

could we add the dev features to available_features before we build/run for each target and remove it after?

I think the issue would be in situations with multiple devices where a single device might be marked as XFAIL, while the rest pass. If we add all the device features to available_features and let the lit infra do their own XFAIL logic, then this test would be expected to XFAIL, while in reality we expect it to pass, while the XFAIL devices are skipped.

The reason I did it this way was mostly to avoid doing this redundant second XFAIL check.

Also in the case that we change the logic for XFAILs in our code to properly account for running on multiple devices, it is likely that we would want this internal XFAIL check turned off in favor of our own logic.

does the xfail check only happen once in the lit infra with multiple devices?

Yes the internal lit infra XFAIL check only happens once after we return the result of our test.

and the result contains all device runs? sorry just making sure we can't use the lit infra

Yeah its a single result for the amalgamation of all the device runs.

i.e.,
PASS -> if all device pass (Some but not all devices may be xfail/unsupported, those are skipped)
FAIL -> if a single non xfail/unsupported device fails
XFAIL/XPASS -> can't happen afaik when running a test on multiple devices
UNSUPPORTED -> if all devices are unsupported, or xfailed

got it, thanks

ayylol · 2025-05-22T19:58:20Z

sycl/test-e2e/Regression/build_log.cpp

-// XFAIL: !arch-intel_gpu_bmg_g21
+// XFAIL: *


Actually fails on BMG, but this statement was an example of one that was being incorrectly processed and marked as XFAIL on BMG.

https://github.com/intel/llvm/actions/runs/15194270458/job/42735337999

sarnex · 2025-05-22T20:57:10Z

sycl/test-e2e/format.py

+        # Set this to empty so internal lit code won't change our result if it incorrectly
+        # thinks the test should XFAIL. This can happen when our XFAIL condition relies on
+        # device features, since the internal lit code doesn't have knowledge of these.
+        test.xfails = []


does our outer lit code handle XFAILs for both device specific features and normal features?

Yep the part just above this

llvm/sycl/test-e2e/format.py

Lines 391 to 395 in e5d5382

if len(triples) == 1 and test.config.test_mode == "build-only":

result.code = map_result(test.config.available_features, result.code)

if len(devices_for_test) == 1:

device = devices_for_test[0]

result.code = map_result(test.config.sycl_dev_features[device], result.code)

For build only if the test is only ran for a single triple we only use the normal features, and on full/run-only if the test is only ran for one device we use test.config.sycl_dev_features[device] which contains both the normal features as well as the device features.

[SYCL][E2E] Fix XFAIL statements that use negated device features

a019bc2

ayylol requested a review from sarnex May 22, 2025 18:37

ayylol requested a review from a team as a code owner May 22, 2025 18:37

ayylol temporarily deployed to WindowsCILock May 22, 2025 18:37 — with GitHub Actions Inactive

ayylol temporarily deployed to WindowsCILock May 22, 2025 18:58 — with GitHub Actions Inactive

ayylol had a problem deploying to WindowsCILock May 22, 2025 18:58 — with GitHub Actions Failure

sarnex reviewed May 22, 2025

View reviewed changes

Change XFAIL in test to XFAIL: *

3ade491

ayylol temporarily deployed to WindowsCILock May 22, 2025 19:55 — with GitHub Actions Inactive

ayylol commented May 22, 2025

View reviewed changes

ayylol temporarily deployed to WindowsCILock May 22, 2025 20:17 — with GitHub Actions Inactive

sarnex reviewed May 22, 2025

View reviewed changes

sarnex approved these changes May 22, 2025

View reviewed changes

sarnex merged commit 1e62398 into intel:sycl May 22, 2025
23 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[SYCL][E2E] Fix `XFAIL` statements that use negated device features #18641

[SYCL][E2E] Fix `XFAIL` statements that use negated device features #18641

Uh oh!

ayylol commented May 22, 2025

Uh oh!

sarnex May 22, 2025

Uh oh!

ayylol May 22, 2025

Uh oh!

sarnex May 22, 2025

Uh oh!

ayylol May 22, 2025

Uh oh!

sarnex May 22, 2025

Uh oh!

ayylol May 22, 2025

Uh oh!

sarnex May 22, 2025

Uh oh!

ayylol May 22, 2025

Uh oh!

sarnex May 22, 2025

Uh oh!

ayylol May 22, 2025

Uh oh!

sarnex May 22, 2025

Uh oh!

ayylol May 22, 2025

Uh oh!

Uh oh!

Uh oh!

	def setResult(self, result):
	assert self.result is None, "result already set"
	assert isinstance(result, Result), "unexpected result type"
	try:
	expected_to_fail = self.isExpectedToFail()
	except ValueError as err:
	# Syntax error in an XFAIL line.
	result.code = UNRESOLVED
	result.output = str(err)
	else:
	if expected_to_fail:
	# pass -> unexpected pass
	if result.code is PASS:
	result.code = XPASS
	# fail -> expected fail
	elif result.code is FAIL:
	result.code = XFAIL
	self.result = result

	if len(triples) == 1 and test.config.test_mode == "build-only":
	result.code = map_result(test.config.available_features, result.code)
	if len(devices_for_test) == 1:
	device = devices_for_test[0]
	result.code = map_result(test.config.sycl_dev_features[device], result.code)

[SYCL][E2E] Fix XFAIL statements that use negated device features #18641

[SYCL][E2E] Fix XFAIL statements that use negated device features #18641

Uh oh!

Conversation

ayylol commented May 22, 2025

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

[SYCL][E2E] Fix `XFAIL` statements that use negated device features #18641

[SYCL][E2E] Fix `XFAIL` statements that use negated device features #18641