That’s not what it means. "-it" just indicates the model is instruction-tuned, i...

DeepYogurt · 2026-04-02T18:45:15 1775155515

What does that mean for a user of the model? Is the "-it" version more direct with solutions or something?

petu · 2026-04-02T21:15:00 1775164500

It means that model was tuned to to act as chat bot. So write a reply on behalf of assistant and stop generating (by inserting special "end of turn" token to signal inference engine to stop generation).

Base model (without instruction/chat tuning) just generates text non stop ("autocomplete on steroids") and text is not necessarily even formatted as chat -- most text in training data isn't dialogue, after all.

BoredomIsFun · 2026-04-03T04:42:21 1775191341

good old illustrtation: https://www.ml6.eu/en/blog/large-language-models-to-fine-tun...

The it- one is the yellow smiling dot, the pt- is the rightmost monster head.

nolist_policy · 2026-04-02T19:17:27 1775157447

Use the it versions. The other versions are base models without post-training. E.g. base models are trained to regurgitate raw wikipedia, books, etc. Then these base models are post-trained into instruction-tuned models where they learn to act as a chat assistant.