Update on Gemma 4 having MTP: Reverse engineering effort

r/LocalLLaMA

Hey everyone. In a previous post I mentioned that I had found out Gemma 4 has MTP (multi-token prediction). It turns out I was able to extract the model weights, but now I need help from the community, especially from people who know C++, to reverse engineer the MTP from the compiled TFLite graph files back into a usable PyTorch nn.Module. I have made a repo on HuggingFace with the extracted files, alongside replication steps and the clues I could find, which I have linked here in the post.