Unstable learning with SemiMarkov CRF #117

Hi,

First, thank you for fixing #110 (@da03). The SemiCRF works better now, and I was able to get good results on span extraction tasks. However, I still encounter a training instability: the loss (negative log-probability) becomes negative after several steps, which should not happen for a properly normalized distribution, and the accuracy starts to drop. The same problem occurs with batch_size = 1. The learning curves (training loss and F1 score) are shown below.

(Maybe the bug comes from the masking of spans of the form **(length, length + span_width)**, where **length + span_width > length**, but I am not sure.)

**Edit**: I wrote a test and the masking seems to be correct. Maybe the problem is in the `log_prob` computation or in the `to_parts` function?
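To illustrate why a negative loss points to a mismatch between the gold score and the partition function, here is a minimal brute-force sketch. It is not torch-struct code: it is a toy zeroth-order semi-Markov model (no label transitions) in plain Python, and all names (`enumerate_segmentations`, `span_score`, `nll`, ...) are hypothetical. As long as the gold segmentation is one of the structures summed in log Z, the negative log-likelihood cannot be negative. If the gold structure contains a span that the partition function does not count (here, a span wider than `max_width`), the loss can drop below zero.

```python
import math
import random


def enumerate_segmentations(n, max_width, num_labels, start=0):
    """Yield every labeled segmentation of tokens [start, n) as a list of
    (start, width, label) spans with width <= max_width."""
    if start == n:
        yield []
        return
    for width in range(1, min(max_width, n - start) + 1):
        for label in range(num_labels):
            for rest in enumerate_segmentations(n, max_width, num_labels, start + width):
                yield [(start, width, label)] + rest


def total_score(segmentation, span_score):
    """Score of a segmentation = sum of its span scores (toy model, no transitions)."""
    return sum(span_score[span] for span in segmentation)


def log_partition(n, max_width, num_labels, span_score):
    """log Z by explicit enumeration, using a stable log-sum-exp."""
    scores = [total_score(seg, span_score)
              for seg in enumerate_segmentations(n, max_width, num_labels)]
    top = max(scores)
    return top + math.log(sum(math.exp(s - top) for s in scores))


def nll(gold, n, max_width, num_labels, span_score):
    """Negative log-likelihood: log Z - score(gold)."""
    return log_partition(n, max_width, num_labels, span_score) - total_score(gold, span_score)


random.seed(0)
n, max_width, num_labels = 5, 2, 2

# Random scores for every possible span, including spans wider than max_width.
span_score = {(s, w, c): random.gauss(0.0, 1.0)
              for s in range(n)
              for w in range(1, n - s + 1)
              for c in range(num_labels)}

# Gold structure inside the support of Z: the loss is non-negative.
valid_gold = [(0, 2, 1), (2, 1, 0), (3, 2, 1)]
print(nll(valid_gold, n, max_width, num_labels, span_score) >= 0)

# Gold structure with a span that Z never counts (width 5 > max_width):
# once the model learns a high score for it, the "loss" becomes negative.
span_score[(0, 5, 0)] = 100.0
bad_gold = [(0, 5, 0)]
print(nll(bad_gold, n, max_width, num_labels, span_score) < 0)
```

The same comparison could be run against the library on tiny inputs: if the gold parts passed to `log_prob` contain any entry that the partition function treats as masked or out of range, a negative loss of this kind would be the expected symptom. Whether that is what happens here is only a hypothesis.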


![train_loss](https://user-images.githubusercontent.com/38214774/139147977-6f7b098f-01cc-4cff-96c0-ae0b151009a7.JPG)
![score](https://user-images.githubusercontent.com/38214774/139148014-9fd86f9a-9f41-4b38-b3e6-e7e39f800974.JPG)
