Looking for a workflow that can extract parts from an image

I am on the hunt for a workflow that can extract parts from an image like the one shown below. I have reference character art, want the character in a T-pose, and then want to extract the individual parts of the image based on prompts. I already have code that creates the JSON file for the parts, but I'm having trouble getting an extraction that matches the reference image closely enough to be modeled. I tried SAM 3 but was not able to get it to run. I have also tried Qwen Image Edit and Flux 2 Klein. Nano Banana can do it, but it's costly at 15 cents per image, and it charged me about $5 just in testing.