Fixes for data type inconsistencies in quantised RNNs #1171


Merged
merged 3 commits into from
Feb 7, 2025

Conversation

bo3z
Contributor

@bo3z bo3z commented Jan 22, 2025

Description

📝 Various fixes for data type inconsistencies in quantised LSTM / GRU layers (see below for a detailed explanation)
First discovered via issue #1166; further investigation revealed more fundamental issues in the RNN layer implementations for the Vivado / Vitis backends

Type of change

  • Bug fix (non-breaking change that fixes an issue)

Tests

TODO: Add additional tests; so far it has only been confirmed that this fixes issue #1166. However, our current tests do not cover different data types between weights and recurrent weights.

Checklist

  • I have read the guidelines for contributing.
  • I have commented my code, particularly in hard-to-understand areas.
  • I have made corresponding changes to the documentation.
  • My changes generate no new warnings.
  • I have installed and run pre-commit on the files I edited or added.
  • I have added tests that prove my fix is effective or that my feature works.

Description

  1. There was a mismatch between the quantizer parsed from QKeras and the weight type & name for recurrent weights, causing the recurrent weights to always fall back to the global default precision. This has been fixed in hls4ml/converters/keras/qkeras.py. This was the original error discovered by Datatype does not match in HLS converted quantized lstm from QKeras #1166
  2. However, this error further revealed that the HLS implementation assumes that weight and recurrent weight (as well as bias and recurrent bias) have the same data type, which is not necessarily true. QKeras lets us use different quantisers for weights and recurrent weights (although the recurrent bias does not have a custom quantiser). To address this, the HLS template and the corresponding Vivado / Vitis backend pass have been modified.
  3. The final issue discovered along the way is an inconsistency in activation data types. Namely, many of the activation calls had incorrect template parameters (defaulting to data_T or weight_T), when in fact they should be accum_T. I traced all the activations and they should now match the variables passed to them. However, this would be worth double-checking by someone else.
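The mapping problem in point 1 can be illustrated with a minimal sketch (the helper name `get_weight_precision`, the dictionary layout, and the default value are illustrative, not the actual hls4ml converter API): the quantizer parsed from QKeras must be looked up under the recurrent weight's own name, otherwise the layer silently falls back to the global default precision.

```python
# Illustrative sketch of the lookup described in point 1 -- not hls4ml code.
# Before the fix, recurrent weights were effectively looked up under the
# wrong key, so they always received the global default precision.

DEFAULT_PRECISION = 'ap_fixed<16,6>'  # stand-in for the global default


def get_weight_precision(layer_quantizers, weight_name):
    """Return the precision parsed from QKeras for this weight,
    falling back to the global default only when no quantiser exists."""
    q = layer_quantizers.get(weight_name)
    return q if q is not None else DEFAULT_PRECISION


# With the correct key, a separately quantised recurrent kernel resolves:
quantizers = {
    'kernel': 'ap_fixed<8,1>',            # e.g. quantized_bits(8, 0)
    'recurrent_kernel': 'ap_fixed<4,1>',  # e.g. quantized_bits(4, 0)
}
print(get_weight_precision(quantizers, 'recurrent_kernel'))  # ap_fixed<4,1>
print(get_weight_precision(quantizers, 'recurrent_bias'))    # ap_fixed<16,6>
```

The point of the sketch is only the lookup-by-name: the recurrent kernel's quantiser must survive the conversion under its own key instead of being shadowed by the kernel's.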

Additional TODOs:

  • Verify whether these issues are present for oneAPI and Catapult. I ran into some mysterious issues when compiling the code, likely because my environment is not up to date for these backends. It would be great if someone with more oneAPI and Catapult expertise could check this PR and the associated issue.
  • Add a test that confirms the fix works with "heterogeneous" quantisation.
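The template change from points 2 and 3 can be sketched as a simplified config struct. All names and the plain integer stand-in types below are illustrative; the real hls4ml templates generate per-layer `ap_fixed<>` types, which require Vivado headers to compile.

```cpp
#include <cstdint>

// Hypothetical, simplified version of the config change in point 2: weights
// and recurrent weights carry their own typedefs instead of sharing one.
// Plain integers stand in for ap_fixed<> so this compiles standalone.
struct lstm_config {
    typedef std::int8_t  weight_t;            // kernel, e.g. ap_fixed<8,1>
    typedef std::int8_t  bias_t;
    typedef std::int16_t recurrent_weight_t;  // may differ from weight_t (point 2)
    typedef std::int16_t recurrent_bias_t;
    typedef std::int32_t accum_t;             // activations take accum_t,
                                              // not data_T / weight_T (point 3)
};
```

With a single shared `weight_t`, a QKeras model quantising the kernel and recurrent kernel differently could not be represented; splitting the typedefs lets the backend pass propagate each parsed precision independently.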

@bo3z bo3z added the please test Trigger testing by creating local PR branch label Jan 22, 2025
@bo3z bo3z added this to the v1.1.0 milestone Jan 22, 2025
@JanFSchulte
Contributor

pre-commit.ci autofix

@JanFSchulte
Contributor

Looks good to me. Not sure we want to wait for someone to check oneAPI and Catapult before merging this, so with the added test it might be good to go already.

@vloncar vloncar removed the please test Trigger testing by creating local PR branch label Feb 6, 2025
@vloncar vloncar added the please test Trigger testing by creating local PR branch label Feb 6, 2025
@vloncar
Contributor

vloncar commented Feb 6, 2025

Looks good to me, we can follow up with Intel/Catapult PR. Let's see what the tests say before merging.

@JanFSchulte
Contributor

Tests look good, I'll merge.

@JanFSchulte JanFSchulte merged commit 90ada94 into fastmachinelearning:main Feb 7, 2025
9 checks passed