Closed
Description
I want to pre-train an LSTM policy with some example data. My current approach is to train it like a normal feed-forward network (plugging the observations in at one end and comparing the output with my ground truth), hoping that your LSTM implementation handles the rest (hidden state management) for me. But before I find out that it is not so easy and spend the next two weeks of my life digging through code and managing hidden states, I thought I would simply ask you guys: is there anything I need to keep in mind when I train the LSTM policy directly?
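For context, here is a minimal sketch of the kind of supervised pre-training (behavior cloning) loop I have in mind, written in plain PyTorch rather than against your API. The `LSTMPolicy` class, the dimensions, and the MSE loss are just placeholders for illustration; the point is that I pass whole episodes through the LSTM and let the hidden state reset per batch of episodes:

```python
# Hypothetical sketch, NOT this library's API: supervised pre-training of an
# LSTM policy on expert sequences, resetting the hidden state per episode batch.
import torch
import torch.nn as nn

class LSTMPolicy(nn.Module):
    def __init__(self, obs_dim, act_dim, hidden_dim=64):
        super().__init__()
        self.lstm = nn.LSTM(obs_dim, hidden_dim, batch_first=True)
        self.head = nn.Linear(hidden_dim, act_dim)

    def forward(self, obs_seq, state=None):
        # obs_seq: (batch, time, obs_dim); state carries (h, c) between calls
        out, state = self.lstm(obs_seq, state)
        return self.head(out), state

# Placeholder expert data: a batch of full episodes, shape (batch, time, dim)
obs_dim, act_dim = 8, 2
expert_obs = torch.randn(16, 50, obs_dim)
expert_act = torch.randn(16, 50, act_dim)

policy = LSTMPolicy(obs_dim, act_dim)
optimizer = torch.optim.Adam(policy.parameters(), lr=1e-3)
loss_fn = nn.MSELoss()

for epoch in range(10):
    # state=None resets the hidden state at the start of each episode batch,
    # so the loss is computed over whole sequences, not isolated time steps
    pred_act, _ = policy(expert_obs, state=None)
    loss = loss_fn(pred_act, expert_act)
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```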