From Imitation to Preference: A Better Way to Distill Reasoning into Small Models - Quantiphi - Quantiphi