Cleanup trace initialization and type hints #5420

michaelosthege · 2022-01-29T18:36:35Z

This PR refactors the code and type hints in sampling.py that are related to trace backend initialization and related type hints.

I found several type hints that were too loose or even wrong.

In multiple places tune was advertised as int | None but the code inside assumed it to be int.
Similarly, multiple trace kwargs were advertised as BaseTrace | MultiTrace but the code inside called methods that are only available with BaseTrace.
After tightening the input types, I fixed the return type hints in a similar fashion.

Finally, this tightening enabled a consolidation of the trace initialization (_choose_backend() + strace.setup()) such that now all of _iter_sample, _mp_sample, _prepare_iter_population make the same call to _init_trace.

To reduce type confusion.

codecov · 2022-01-29T18:45:38Z

Codecov Report

Merging #5420 (2d87b9d) into main (0dca647) will increase coverage by 0.01%.
The diff coverage is 85.89%.

@@            Coverage Diff             @@
##             main    #5420      +/-   ##
==========================================
+ Coverage   81.39%   81.40%   +0.01%     
==========================================
  Files          82       82              
  Lines       14213    14213              
==========================================
+ Hits        11568    11570       +2     
+ Misses       2645     2643       -2

Impacted Files	Coverage Δ
pymc/distributions/shape_utils.py	`96.73% <75.00%> (-2.14%)`	⬇️
pymc/sampling.py	`86.06% <88.33%> (+0.22%)`	⬆️
pymc/distributions/distribution.py	`91.43% <100.00%> (+0.03%)`	⬆️
pymc/parallel_sampling.py	`87.70% <0.00%> (+0.99%)`	⬆️

The code largely assumed it to be `int`, even though it was often advertised as `int | None`.

With this change, the `_iter_sample` and `_prepare_iter_population` take the `tune` into account for the expected length of traces. This was not the case beforehand and I don't understand why it worked.

Also fixes an incorrectly named kwarg in some logger calls.

michaelosthege · 2022-01-29T23:37:33Z

The changes reduce the number of errors found by mypy -p pymc from 362 to 266.

https://www.diffchecker.com/aHofAXhG

canyon289 · 2022-01-31T00:47:24Z

pymc/distributions/distribution.py

@@ -61,7 +62,7 @@
    "NoDistribution",
 ]

-DIST_PARAMETER_TYPES = Union[np.ndarray, int, float, TensorVariable]
+DIST_PARAMETER_TYPES: TypeAlias = Union[np.ndarray, int, float, TensorVariable]


Im just curiouys, what does TypeAlias do?

This is a type hint itself. The mypy errors and documentation told me that aliases should be annotated like this.

canyon289 · 2022-01-31T00:49:01Z

pymc/sampling.py

@@ -1393,6 +1397,27 @@ def _choose_backend(trace: Optional[Union[BaseTrace, List[str]]], **kwds) -> Bac
    return NDArray(vars=trace, **kwds)


+def _init_trace(


Does this need a test?

It's covered extensively by the existing tests, though I agree that after extraction it is now easier to test it separately.
But my plan is to refactor this into using McBackend. This _init_trace() will then become run.init_chain() and then I'd also invest in the tests.

twiecki · 2022-01-31T20:03:16Z

pymc/sampling.py

@@ -1217,7 +1220,7 @@ def _prepare_iter_population(
    step,
    start: Sequence[PointType],
    parallelize: bool,
-    tune=None,
+    tune: int,


Why remove the optional?

The (only) call to _prepare_iter_population is in a context where tune is already int.

It's passed down into _iter_population which assumed tune: int.

...because its an internal function called in exactly one place, it's safe to make it non-optional. The alternative would be to pick an arbitrary default, which we do already in pm.sample([tune=1000]).

Rename local trace variable to mtrace

6be54ea

To reduce type confusion.

michaelosthege added the maintenance label Jan 29, 2022

michaelosthege self-assigned this Jan 29, 2022

michaelosthege added 3 commits January 29, 2022 20:22

Tighten type hints related to BaseTrace/MultiTrace

8e3ace6

Tighten tune to always be int

40f82c3

The code largely assumed it to be `int`, even though it was often advertised as `int | None`.

Consolidate trace backend initialization

0dd2760

With this change, the `_iter_sample` and `_prepare_iter_population` take the `tune` into account for the expected length of traces. This was not the case beforehand and I don't understand why it worked.

michaelosthege force-pushed the cleanup-trace-init branch from 00c229d to 40f82c3 Compare January 29, 2022 19:29

Add mypy config and more type hints

2d87b9d

Also fixes an incorrectly named kwarg in some logger calls.

michaelosthege force-pushed the cleanup-trace-init branch from 603636d to 2d87b9d Compare January 29, 2022 23:37

michaelosthege marked this pull request as ready for review January 30, 2022 00:21

michaelosthege requested review from ricardoV94 and lucianopaz January 30, 2022 00:25

canyon289 reviewed Jan 31, 2022

View reviewed changes

michaelosthege added this to the v4.0.0b3 milestone Jan 31, 2022

twiecki reviewed Jan 31, 2022

View reviewed changes

twiecki merged commit 2bb0c7c into pymc-devs:main Feb 3, 2022

michaelosthege deleted the cleanup-trace-init branch February 3, 2022 20:17

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Cleanup trace initialization and type hints #5420

Cleanup trace initialization and type hints #5420

Uh oh!

michaelosthege commented Jan 29, 2022 •

edited

Loading

Uh oh!

codecov bot commented Jan 29, 2022 •

edited

Loading

Uh oh!

michaelosthege commented Jan 29, 2022

Uh oh!

canyon289 Jan 31, 2022

Uh oh!

michaelosthege Jan 31, 2022

Uh oh!

canyon289 Jan 31, 2022

Uh oh!

michaelosthege Jan 31, 2022

Uh oh!

twiecki Jan 31, 2022

Uh oh!

michaelosthege Jan 31, 2022 •

edited

Loading

Uh oh!

Uh oh!

		@@ -1393,6 +1397,27 @@ def _choose_backend(trace: Optional[Union[BaseTrace, List[str]]], **kwds) -> Bac
		return NDArray(vars=trace, **kwds)


		def _init_trace(

Uh oh!

Cleanup trace initialization and type hints #5420

Cleanup trace initialization and type hints #5420

Uh oh!

Conversation

michaelosthege commented Jan 29, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

codecov bot commented Jan 29, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

michaelosthege commented Jan 29, 2022

Uh oh!

canyon289 Jan 31, 2022

Choose a reason for hiding this comment

Uh oh!

michaelosthege Jan 31, 2022

Choose a reason for hiding this comment

Uh oh!

canyon289 Jan 31, 2022

Choose a reason for hiding this comment

Uh oh!

michaelosthege Jan 31, 2022

Choose a reason for hiding this comment

Uh oh!

twiecki Jan 31, 2022

Choose a reason for hiding this comment

Uh oh!

michaelosthege Jan 31, 2022 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

michaelosthege commented Jan 29, 2022 •

edited

Loading

codecov bot commented Jan 29, 2022 •

edited

Loading

michaelosthege Jan 31, 2022 •

edited

Loading