
Track model logp (in HMC and NUTS) #3134


Merged: 20 commits merged into pymc-devs:master on Sep 6, 2018

Conversation

eigenfoo
Member

@eigenfoo eigenfoo commented Aug 6, 2018

Close #2971.

WIP. Following up from #3121. I'm opening up this PR so I can get Travis builds/tests/feedback from maintainers 😄

@ColCarroll I've done a first pass at tracking the logp. The sampler stats stack actually goes deeper than you outlined before, since the model logp is actually kept in integration.py for HMC. What I've done is modify the State so it keeps track of the model logp, and percolate that up as a sampler stat to HMC and NUTS.
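The State change described there can be sketched like this (a minimal illustration with made-up field names, not the actual integration.py code): the integrator already evaluates the model's log-probability at every state, so storing it on the state makes reporting it essentially free.

```python
from collections import namedtuple

# Hypothetical sketch (field names do not match PyMC3 internals): the
# leapfrog integrator evaluates the model logp at each state anyway, so
# carrying it on the State tuple lets HMC/NUTS report it as a sampler
# stat with no extra computation.
State = namedtuple("State", ["q", "p", "energy", "model_logp"])

def stats_from_state(state):
    # Percolate the stored logp up as a per-sample sampler stat.
    return {"energy": state.energy, "model_logp": state.model_logp}

s = State(q=[0.1], p=[0.0], energy=2.3, model_logp=-1.5)
```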

@junpenglao currently this PR falls a bit short of what you were expecting; I've only made the model logp a sampling stat for HMC and NUTS. If I'm not mistaken, no other sampler computes the logp (e.g. Metropolis doesn't), so there would be overhead to require these samplers to track the logp. However, if this is a feature the dev team wants, I'd be happy to do that too - probably in a separate PR, though.

I'm also unsure about the CompoundStep changes... states is a list, and cannot be indexed by a string. I must have misunderstood @junpenglao's comments here; I'll take another look in the next few days. For now, the tests for that will fail.

@junpenglao
Member

If I'm not mistaken, no other sampler computes the logp (e.g. Metropolis doesn't), so there would be overhead to require these samplers to track the logp. However, if this is a feature the dev team wants, I'd be happy to do that too - probably in a separate PR, though.

Oh you are right - in Metropolis we only computed the delta_logp. Hmmm, it does complicate things...

@eigenfoo
Member Author

eigenfoo commented Aug 6, 2018

This seems to pass tests now. The one test that fails is due to SMC and appears unrelated. Ready for review!

What I've done for CompoundStep is to remove logp items from all but the last state in states. In other words, only the last state can carry the model logp. Sometimes the last state may not have a logp at all (e.g. if the last sampler is Metropolis), but if the last state has a logp, then it is the model logp. Does that sound reasonable?
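That pruning step can be sketched as follows (a hypothetical illustration; the real states list holds per-step sampler-stat dicts with more fields):

```python
def keep_last_logp(states):
    """Drop 'logp' from all but the last step's stats dict, since only
    the final step method in a CompoundStep reflects the full model
    logp. `states` is a list of stat dicts (hypothetical shape)."""
    for stats in states[:-1]:
        stats.pop("logp", None)  # no-op if the step has no logp stat
    return states

states = [{"logp": -3.0, "accept": 0.9}, {"logp": -2.5}]
keep_last_logp(states)
```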

@junpenglao
Member

Need a new test to check if the logp is saved correctly at least.

Also, add a line to RELEASE-NOTES.md.

I think this is a good first step. Maybe in another PR we can also add it to other samplers. I have an idea for doing this in Metropolis too, where we compute the delta_logp instead of the logp. We can compute the model logp at sample 0, then from sample 1 onward do state.logp = logp_tm1 - delta_logp, which should recover the logp at time t while staying cheap to compute.
Careful tests are of course needed to make sure it is correct, especially for CompoundStep.
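As a rough sketch of that recurrence (the sign of the update depends on how delta_logp is defined, proposed minus current or the reverse, so treat this as illustrative only):

```python
def recover_logps(logp0, deltas):
    """Reconstruct per-sample logps from the initial logp plus per-step
    deltas. Assumes the convention delta_t = logp_t - logp_{t-1}; flip
    the sign if delta_logp is defined the other way round."""
    logps = [logp0]
    for d in deltas:
        logps.append(logps[-1] + d)
    return logps
```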

RELEASE-NOTES.md Outdated
- Track the model log-likelihood as a sampler stat for NUTS and HMC samplers
(accessible as `trace.logp`).

### Fixes

This is more maintenance.

@eigenfoo
Member Author

eigenfoo commented Aug 7, 2018

@junpenglao I'm actually not sure where the tests for sampler stats are. I would've thought test_stats.py, but that's actually to test the pm.stats module... Any pointers?

@junpenglao
Member

pymc3/tests/test_step.py

@eigenfoo
Member Author

Sorry for the lull, I was taking a break from my laptop!

I've added a test to assert the existence of various sampler stats for NUTS, and also assert that their shapes are correct. Let's see if this passes tests.

@eigenfoo
Member Author

Passes tests! @junpenglao ready for review

@junpenglao
Member

Maybe we should rename it to lp__, see arviz-devs/arviz#176 (comment)

@eigenfoo
Member Author

Hm, my concern is how intuitive a name lp__ is... I understand Stan has it as lp__ but with Python there is the convention of having self-descriptive names, even if this means verbosity. I'd be fairly confused at both what lp stands for, and why there are two underscores after it (AFAIK nothing else in PyMC3 has two underscores after it...)

I don't really know much about Arviz though. Is "compatibility" with Stan/Arviz a high priority for the PyMC devs?

@twiecki
Member

twiecki commented Aug 27, 2018

I agree with @eigenfoo, lp__ is not a great name. Compatibility to Arviz is important, but not to Stan. But since we also have control over arviz I would suggest standardizing on a reasonable name choice there (like logp) and here.

@junpenglao
Member

In that case, either logp or model_logp would be a good name choice.
FYI @eigenfoo, in PyMC3 the transformed parameters all have a __ suffix, which is generated internally. These RVs are meant to be "hidden", and we use the double underscore to identify them and hide them when generating summaries and figures.

@ColCarroll
Member

Yeah, agree on using a descriptive name - I think arviz is going to keep trying to decide on a "standard" name for sampler statistics, which may change over time (I'm still in favor of a descriptive name there as well, but it should be a friendly spot for stan users too). So no worries on what's going on there :)

@eigenfoo
Member Author

@junpenglao I think I like model_logp better - it's more explicit, and harder to confuse it with all the other log-probabilities that you could talk about 🙂

I'll need some help with the test though - see my comment above.

@springcoil
Contributor

Awesome work @eigenfoo

@aseyboldt
Member

Maybe better sample_logp? It isn't really a property of the model, and it is different for each sample. Or maybe posterior_logp? Otherwise I think this looks good.

@fonnesbeck
Member

fonnesbeck commented Sep 4, 2018

It gets a little semantic, I suppose. It's the log-probability of the model at a particular point, so I don't think model_logp is necessarily bad. The sample doesn't have a logp without a model!

@aseyboldt
Member

aseyboldt commented Sep 5, 2018

How about value_that_is_proportional_to_the_posterior_logp_given_model_and_dataset_at_the_point_of_that_sample. We have tab completion after all. :-)
Edit: I forgot the parametrization. It depends on that as well.

@aseyboldt
Member

But I guess you have a point. They all probably work. (except that last one)

@ColCarroll
Member

This looks good to me (and something I will use right away) - thanks, @eigenfoo! Is there something specific you would still like review on?

@eigenfoo
Member Author

eigenfoo commented Sep 6, 2018

For the record, the problem that stalled me up was this:

The way CompoundStep works is essentially a glorified for-loop over a list of step methods. This has various implications for sampler stats, since each step method in the list may or may not support stats, may support different stats, and so on. According to these docs, we should return every stat generated by every sampler. If there happen to be name collisions, we stack the numpy arrays along a new axis.

The "bad thing" about this PR is best explained by example:

import pymc3 as pm

with pm.Model():
    x = pm.Normal('x')
    y = pm.Normal('y')
    step1 = pm.NUTS([x])
    step2 = pm.NUTS([y])
    trace = pm.sample(step=[step1, step2])

Here, trace.get_sampler_stats('model_logp').shape is (1000, 2). This may be counter-intuitive, since there should only be one model logp, so it may make sense to insist that model_logp is always a 1-D numpy array.

Eventually, @ColCarroll and I decided that it's better to keep this quirk and document it, rather than having an inconsistent API. This means that if you wanted the overall logp with a CompoundStep, you'd need trace.get_sampler_stats('model_logp')[:, -1], and if you (for whatever reason) wanted one of the intermediate logps, you'd index one of the columns of trace.get_sampler_stats('model_logp').
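The stacking behavior and the indexing trick can be sketched with plain numpy (hypothetical values; only the stack/index pattern mirrors what the trace API returns):

```python
import numpy as np

# Two NUTS steps in a CompoundStep each emit a 'model_logp' stat; on a
# name collision the per-step arrays are stacked along a new last axis,
# giving shape (n_samples, n_steps).
logp_step1 = np.array([-1.0, -1.2, -0.9])
logp_step2 = np.array([-1.1, -1.0, -0.8])
model_logp = np.stack([logp_step1, logp_step2], axis=-1)

# The last step's column holds the full model logp.
overall = model_logp[:, -1]
```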

I think we're going to go ahead and merge this once tests pass.

@twiecki
Member

twiecki commented Sep 6, 2018

Hm yeah that's unfortunate but I agree with your assessment.

@twiecki twiecki merged commit 7e280b1 into pymc-devs:master Sep 6, 2018
@aseyboldt
Member

Thanks @eigenfoo!

@eigenfoo eigenfoo deleted the track-logp branch September 6, 2018 09:22
@ColCarroll
Member

Thanks for this!
