You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
SmolLM2 are models with the Llama architecture that are trained on better data than older models of the same size, especially the 135M variant. It has the minor weakness of having a broken 11th layer, but it is still becoming popular.
Pitch
Add the name/alias for SmolLM2 to the model list.
Checklist
I have checked that there is no similar issue in the repo (required)
The text was updated successfully, but these errors were encountered:
Proposal
Support the SmolLM2 series of models - https://huggingface.co/HuggingFaceTB/SmolLM2-135M etc
Motivation
SmolLM2 are models with the Llama architecture that are trained on better data than older models of the same size, especially the 135M variant. It has the minor weakness of having a broken 11th layer, but it is still becoming popular.
Pitch
Add the name/alias for SmolLM2 to the model list.
Checklist
The text was updated successfully, but these errors were encountered: