I trained three generations of ColQwen3.5, each with more sophisticated optimization than the last. The most optimized version didn't beat the previous one on the primary benchmark. It just reshuffled which tasks improved and which got worse, with a net difference smaller than seed variance.
