Is this a reasonable SFT methodology for Qwen 3.5 35B A3B using Opus-distilled datasets?

r/LocalLLaMA
Open Source AI AI Research

Recently, I have seen that there are some publicly available datasets distilled from Opus. I am planning to perform SFT using those datasets on Qwen 3.5 35B A3B. My idea is the following: First, perform SFT once using the original English dataset distilled from Opus. Then translate that dataset into another language (matching the target country's language) using either: a larger model, or a model that has already been trained on Opus datasets. After that, train again using both the translated dataset and the original English dataset together.