2024-06-06

JeMeDemande quelles sont les différences entre les modèles qui terminent par “rien”, par -instruct et par -chat.

JaiLu Noob question: What’s the difference between chat and instruct (or other?) models?/
- JaiLu Language Models: Completion and Chat-Completion - langroid/

This brings us to the heart of the innovation behind the wildly popular ChatGPT: it uses an enhancement of GPT3 that (besides having a lot more parameters), was explicitly fine-tuned on instructions (and dialogs more generally) — this is referred to as instruction-fine-tuning or IFT for short. In addition to fine-tuning instructions/dialogs, the models behind ChatGPT (i.e., GPT-3.5-Turbo and GPT-4) are further tuned to produce responses that align with human preferences (i.e. produce responses that are more helpful and safe), using a procedure called Reinforcement Learning with Human Feedback (RLHF). (from)

JaiLu Difference between “chat”, “instruct”, and “chat-instruct”? : LocalLLaMA

notes.sklein.xyz

Notes éphémères

2024-09-17_1707

2024-09-16_1754

2024-09-15_1518

2024-09-15_1038

2024-09-14_2253

Projets

Projet 13 - "POC Elasticsearch sur un PKM"

Je cherche à convertir en SQL des query de filtre basé sur un système de "tags"

Projet 12 - "Implémentation nodemailer-scaleway-transport"

Projet 11 - "Première version d'un moteur web PKM"

Projet 10 - "Mettre en oeuvre DotTXT AI"

2024-06-06_2257

Vue Graphique

Liens retour