Use custom `serde` deserializer for JinaBERT models #559

alvarobartt · 2025-04-04T13:18:01Z

What does this PR do?

Note

This PR is still being tested and is missing the tests to be included within this PR.

This PR "improves" the handling for the JinaBERT configurations as those share the same model_type as the default BERT models i.e. model_type=bert meaning that those need a specific handling for telling apart JinaBERT and BERT. Apparently, when fine-tuning or re-uploading JinaBERT models with Sentence Transformers, the _name_or_path value is overwritten with the actual path to the origin Hugging Face Hub repository, which means that the serde-based tag strategy to tell those apart will no longer work, as the value won't match the former check. This PR then, adds a custom serde deserializer to make sure that the config.json is deserialized for the correct model not only based on the _name_or_path value but also on the auto_map.AutoConfig value, which in any case will still point to the remote repository with the JinaBERT implementation.

Fixes #556

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you read the contributor guideline, Pull Request section?
Was this discussed/approved via a GitHub issue or the forum? Please add a link to it if that's the case.
Did you make sure to update the documentation with your changes? Here are the documentation guidelines, and here are tips on formatting docstrings.
Did you write any new necessary tests?

Who can review?

@Narsil or @McPatate

alvarobartt · 2025-04-04T13:20:32Z

P.S. To test this PR I've just used the same model as the one reported in the linked issue borgcollectivegmbh/testing-jina-stuff

Narsil

Very nice solution to the problem.

Use custom serde deserializer for JinaBERT models

0cdea33

alvarobartt requested a review from Narsil April 4, 2025 13:18

Narsil approved these changes Apr 4, 2025

View reviewed changes

Narsil merged commit 7f7832e into main Apr 4, 2025
14 checks passed

Narsil deleted the jina-bert-config-fix branch April 4, 2025 14:10

This was referenced Apr 7, 2025

Failed to build docker image due to missing cutlass/cutlass.h #566

Closed

Failing deployment on AWS Sagemaker endpoints #569

Closed

BrewTestBot mentioned this pull request Apr 8, 2025

text-embeddings-inference 1.7.0 Homebrew/homebrew-core#218821

Merged

CoolFish88 mentioned this pull request Apr 10, 2025

Error: could not create backend -> jinaai/jina-reranker-v1-turbo-en #579

Closed

4 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Use custom `serde` deserializer for JinaBERT models #559

Use custom `serde` deserializer for JinaBERT models #559

Uh oh!

alvarobartt commented Apr 4, 2025

Uh oh!

alvarobartt commented Apr 4, 2025

Uh oh!

Narsil left a comment

Uh oh!

Uh oh!

Uh oh!

Use custom serde deserializer for JinaBERT models #559

Use custom serde deserializer for JinaBERT models #559

Uh oh!

Conversation

alvarobartt commented Apr 4, 2025

What does this PR do?

Before submitting

Who can review?

Uh oh!

alvarobartt commented Apr 4, 2025

Uh oh!

Narsil left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Use custom `serde` deserializer for JinaBERT models #559

Use custom `serde` deserializer for JinaBERT models #559