This repository was archived by the owner on Oct 13, 2021. It is now read-only.

Handle time_major in lstm and remove reshape from embedding #457

Merged
merged 2 commits into onnx:master on Apr 24, 2020

Conversation

sonu1-p
Contributor

@sonu1-p commented Apr 23, 2020

This commit makes the following changes:

  1. Embedding was unnecessarily reshaping its input to a rank-2 tensor. This
     is not required and causes the model to diverge from what it is actually
     supposed to do (see the first sketch after this list).
  2. LSTM was not honoring the time_major parameter. No transpose of the input
     and output is required if the input is already time-major, i.e. of shape
     [seq_len, batch_size, input_size] (see the second sketch below).
  3. Simplify the handling of the case where return_sequences is set to true.
  4. Use squeeze and unsqueeze instead of reshape for the dim representing the
     LSTM direction, for simplicity (see the third sketch below).

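For item 1, a minimal numpy sketch of why the rank-2 reshape is unnecessary, assuming the Embedding lookup lowers to an ONNX Gather, which accepts indices of any rank. The names and shapes below are illustrative, not keras2onnx internals.

```python
import numpy as np

# Gather's semantics (mirrored here by np.take along axis 0): the output
# shape is indices.shape + data.shape[1:], so the token-id tensor can be
# looked up at whatever rank it arrives in, with no reshape beforehand.
embedding_matrix = np.random.rand(100, 16)       # [vocab_size, embedding_dim]
token_ids = np.zeros((4, 10), dtype=np.int64)    # [batch_size, seq_len]

out = np.take(embedding_matrix, token_ids, axis=0)
assert out.shape == (4, 10, 16)                  # [batch, seq, embedding_dim]

# The same lookup works for higher-rank indices, e.g. [batch, seq, window],
# which a forced rank-2 reshape would silently break downstream.
nested_ids = np.zeros((4, 10, 3), dtype=np.int64)
assert np.take(embedding_matrix, nested_ids, axis=0).shape == (4, 10, 3, 16)
```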
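For item 2, a shape-only sketch of the conditional transpose. ONNX's LSTM consumes time-major input of shape [seq_length, batch_size, input_size], while Keras LSTM defaults to batch-major [batch_size, seq_len, input_size]; the helper below is hypothetical, not the converter's actual code.

```python
import numpy as np

def to_onnx_lstm_layout(x, time_major):
    """Lay input out as ONNX LSTM expects: [seq_len, batch_size, input_size]."""
    if time_major:
        # Already time-major; inserting a Transpose here would be wrong.
        return x
    # Batch-major [batch, seq, input] -> swap the first two axes.
    return np.transpose(x, (1, 0, 2))

batch_major = np.zeros((32, 10, 8))      # [batch_size, seq_len, input_size]
time_major_in = np.zeros((10, 32, 8))    # [seq_len, batch_size, input_size]
assert to_onnx_lstm_layout(batch_major, time_major=False).shape == (10, 32, 8)
assert to_onnx_lstm_layout(time_major_in, time_major=True).shape == (10, 32, 8)
```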
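And for item 4, a sketch of the direction-axis handling. ONNX LSTM's full-sequence output Y has shape [seq_length, num_directions, batch_size, hidden_size]; for a unidirectional LSTM, num_directions is always 1, so the axis can be dropped with a squeeze (and restored with an unsqueeze) instead of a reshape, which avoids computing a concrete target shape when the sequence and batch dims are dynamic. When return_sequences is true (item 3), it is this full-sequence Y, rather than only the final state Y_h, that feeds the rest of the graph.

```python
import numpy as np

# Y from ONNX LSTM: [seq_length, num_directions, batch_size, hidden_size].
y = np.zeros((10, 1, 32, 16))

# Squeeze (np.squeeze here) removes the size-1 direction axis without
# needing to know seq/batch/hidden concretely, unlike a Reshape.
y_no_dir = np.squeeze(y, axis=1)                # -> [seq, batch, hidden]
assert y_no_dir.shape == (10, 32, 16)

# Unsqueeze (np.expand_dims here) reintroduces the axis when a later
# op expects the direction dim back.
y_back = np.expand_dims(y_no_dir, axis=1)       # -> [seq, 1, batch, hidden]
assert y_back.shape == (10, 1, 32, 16)
```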
sonu1-p and others added 2 commits April 23, 2020 16:10
@jiafatom jiafatom merged commit 79df32d into onnx:master Apr 24, 2020