Move PJRT Python APIs out of torch_xla.experimental.* #5011
Conversation
@@ -11,14 +11,15 @@
import torch_xla.core.xla_env_vars as xenv
Should we rename these experimental files?
Yes, good catch.
# TODO(wcromar): Detect GPU device too
def device_type() -> Optional[str]:
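For context, a minimal sketch of what this function could look like, assuming the PJRT_DEVICE environment variable convention used by torch_xla (the actual diff reads it through the xenv helpers imported above, which aren't shown here):

```python
import os
from typing import Optional

def device_type() -> Optional[str]:
  """Returns the configured PJRT device type, e.g. 'TPU' or 'CPU', or None."""
  # PJRT_DEVICE is the variable torch_xla uses to select a PJRT device;
  # None signals that PJRT is not configured for this process.
  return os.environ.get('PJRT_DEVICE')
```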
I think we have similar functions in torch_xla.core.xla_model. Do we want to do some cleanup?
I want to take this chance to do some cleanup. I am always confused about which function to call for local ordinal, global ordinal, world_size, etc., and what they really mean in a pod context. If we can restructure those APIs a bit, and maybe have them in this runtime module instead, that would be nice... (random idea, might need more thinking)
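A hedged illustration of the consolidated naming being discussed; these names match the torch_xla.runtime module this PR introduces, though the exact pod semantics are what the thread is debating:

```python
import torch_xla.runtime as xr

xr.world_size()      # total number of participating processes across all hosts
xr.global_ordinal()  # this process's rank across the whole pod
xr.local_ordinal()   # this process's rank within its own host
```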
xla_model has become a kitchen sink and we put random things in it. If we can move all the runtime-related bits into this file, it is actually nicer...
Discussed this offline. We'll start to move APIs that interact directly with the runtime to a new module, and leave any modeling-related APIs in xla_model. I moved the PJRT version of rendezvous back to xla_model, and the old rendezvous will be an alias of that implementation when PJRT is enabled.
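A minimal sketch of that aliasing, assuming a using_pjrt() check; _pjrt_rendezvous and _xrt_rendezvous are hypothetical stand-ins for the two implementations, not the PR's actual code:

```python
import torch_xla.runtime as xr

def rendezvous(tag, payload=b'', replicas=[]):
  # Old entry point: forwards to the PJRT implementation when PJRT is enabled,
  # otherwise falls back to the legacy XRT path (helper names are illustrative).
  if xr.using_pjrt():
    return _pjrt_rendezvous(tag, payload, replicas)
  return _xrt_rendezvous(tag, payload, replicas)
```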
Force-pushed 8fa857f to 660a4cf
Force-pushed 93def62 to de2f117
torch_xla/runtime.py (Outdated)
return

logging.warning(
    'XRT configuration not detected. Defaulting to preview PJRT '
do we want to change preview to stable here?
Good catch. Done.
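To put the corrected message in context, a rough sketch of the fallback logic around that warning; the environment check and exact wording here are assumptions, not the PR's code:

```python
import logging
import os

def _maybe_default_to_pjrt():
  # Assumed check: if the user configured XRT explicitly, leave it alone.
  if any(name.startswith('XRT_') for name in os.environ):
    return
  logging.warning('XRT configuration not detected. Defaulting to stable PJRT '
                  'runtime. Set PJRT_DEVICE to select a device explicitly.')
  os.environ.setdefault('PJRT_DEVICE', 'CPU')
```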
LGTM, but I'd prefer to merge on Monday.
Force-pushed dc1130e to f11be83
Reorganize the experimental PJRT Python APIs:
- Created a new _internal module for APIs that are well-tested but likely to change. I moved device-specific logic here, since I expect to rework it in the near future. All of these functions are mainly used for framework development; in general, users shouldn't have to call them directly.
- Moved the public APIs that interact directly with the runtime into the new torch_xla.runtime module.
- Added a deprecation module to register deprecated aliases for all public functions that are moving out into other parts of torch_xla.

Summary of new modules:
- torch_xla.runtime
- torch_xla._internal.tpu
- torch_xla._internal.gpu
- torch_xla._internal.pjrt
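For callers, the migration implied by this summary looks roughly like this; old names should keep working through the deprecated aliases, just with a warning (using_pjrt is one example of a moved function, assumed here):

```python
# Before this PR: experimental namespace
from torch_xla.experimental import pjrt
pjrt.using_pjrt()

# After this PR: stable public module
import torch_xla.runtime as xr
xr.using_pjrt()
```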