-
-
Notifications
You must be signed in to change notification settings - Fork 32.2k
gh-134160: Improve multi-phase init note on isolation & subinterpreters #134775
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Changes from 1 commit
File filter
Filter by extension
Conversations
Jump to
Diff view
Diff view
There are no files selected for viewing
Original file line number | Diff line number | Diff line change |
---|---|---|
|
@@ -302,8 +302,13 @@ using :c:func:`PyModule_GetState`), or its contents (such as the module's | |
:attr:`~object.__dict__` or individual classes created with :c:func:`PyType_FromSpec`). | ||
|
||
All modules created using multi-phase initialization are expected to support | ||
:ref:`sub-interpreters <sub-interpreter-support>`. Making sure multiple modules | ||
are independent is typically enough to achieve this. | ||
:ref:`sub-interpreters <sub-interpreter-support>`. | ||
Typically, extensions ensure this in one of these ways: | ||
|
||
- :ref:`isolating module instances <isolating-extensions-howto>`, | ||
- :ref:`raising an error on repeated initialization <isolating-extensions-optout>`, or | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. FWIW, I just noticed that the referenced section does not mention opting out by setting the There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. (Please read this in a friendly voice; I can't find a way to reword this so it's clear and doesn't have a terrible sarcastic interpretation.) Yeah. I'm not a good person to write that, as I never fully understood why a slot is better than explicitly raising an exception when you're about to clobber shared state. |
||
- limiting a module to the main interpreter using | ||
:c:data:`Py_mod_multiple_interpreters`. | ||
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. It's a bit weird to present this as a way of supporting sub-interpreters, no? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Yeah. The original wording from PEP 489 would work a bit better: All modules created using multi-phase initialization are expected to support
:ref:`sub-interpreters <sub-interpreter-support>` and multiple
:c:func:`Py_Initialize`/:c:func:`Py_Finalize` cycles correctly.
Typically, extensions ensure this in one of these ways: or should we expand the “correctly“ to something like “work as expected or raise exceptions, rather than crash”? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I would definitely not call this "support". So perhaps something like: Modules created using multi-phase initialization are expected to handle
sub-interpreters in either of the following two ways:
* either they fully support multiple interpreters by
:ref:`isolating module instances <isolating-extensions-howto>`;
* or they signal that multiple interpreters are not supported by
:ref:`raising an error on repeated initialization <isolating-extensions-optout>`
and/or limiting a module to the main interpreter using
:c:data:`Py_mod_multiple_interpreters`.
This way, users will not encounter crashes or undefined behavior when
trying to use these modules from multiple interpreters. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. I guess right thing would be to take step back and not make this so much about subinterpreters: Multi-phase initialization allows creating multiple independent instances of the
module object.
Extensions are expected to handle this in one of the following ways:
* either by :ref:`isolating module instances <isolating-extensions-howto>`;
* and/or by :ref:`raising an error on repeated initialization <isolating-extensions-optout>`;
* and/or limiting a module to the main interpreter using
:c:data:`Py_mod_multiple_interpreters`.
This way, users will not encounter crashes or undefined behavior when
trying to use these modules from :ref:`sub-interpreters <sub-interpreter-support>`.
The first two ways also prevent crashes when reusing the module after
Python runtime reinitialization (:c:func:`Py_Finalize` and :c:func:`Py_Initialize`)
and reimporting a module after it is removed from :py:attr:`sys.modules`. Thanks for suggesting the effects on the users! It helps to make this less of a thou shalt commandment and more like something where you can consider the trade-offs. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. This makes me wonder: why is there a third way if the second is better (and not more difficult to implement)? There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Not that it is critical for this PR, but I've been thinking about something related that might at least a little insight: gh-132861. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. @pitrou: Let's say there's subtle nuance that doesn't really belong on this page of the docs :) AFAIK, the main interpreter is special; sometimes, being loadable once but in any interpreter isn't enough. There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. The extant prior paragraph covers much of this, so here's an alternative: When using per-module state, module instances should be :ref:`isolated
<isolating-extensions-howto>` to ensure that they are free of unwanted
side-effects. If the module still uses global state, or it otherwise has not
yet been isolated, either or both of of the following two options should be used:
* :ref:`raising an error on repeated initialization <isolating-extensions-optout>`
* limiting a module to the main interpreter using
:c:data:`Py_mod_multiple_interpreters`.
This way, users will not encounter crashes or undefined behavior when
trying to use these modules from :ref:`sub-interpreters <sub-interpreter-support>`.
Ensuring a module is properly isolated or raising an explicit error also prevent
crashes when reusing the module after Python runtime reinitialization
(:c:func:`Py_Finalize` and :c:func:`Py_Initialize`) or reimporting a module after
it has been removed from :py:attr:`sys.modules`.
encukou marked this conversation as resolved.
Show resolved
Hide resolved
|
||
|
||
To request multi-phase initialization, the initialization function | ||
(PyInit_modulename) returns a :c:type:`PyModuleDef` instance with non-empty | ||
|
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.