Teaching Small Language Models to Self-Correct & Reason by using creative data formats for fine-tuning data. Via Prompt Erasure & Partial Answer Masking.
Share this post
Data Design For Fine-Tuning To Improve Small…
Share this post
Teaching Small Language Models to Self-Correct & Reason by using creative data formats for fine-tuning data. Via Prompt Erasure & Partial Answer Masking.