Data Design For Fine-Tuning To Improve Small Language Model Behaviour
cobusgreyling.substack.com
Teaching Small Language Models to Self-Correct & Reason by using creative data formats for fine-tuning data. Via Prompt Erasure & Partial Answer Masking.
Data Design For Fine-Tuning To Improve Small Language Model Behaviour
Data Design For Fine-Tuning To Improve Small…
Data Design For Fine-Tuning To Improve Small Language Model Behaviour
Teaching Small Language Models to Self-Correct & Reason by using creative data formats for fine-tuning data. Via Prompt Erasure & Partial Answer Masking.