Is there a framework for translating + recreate images?

r/StableDiffusion
Generative AI

I've seen that with tools such as grok or gemini the results are acceptable. How could I do it locally? I own a RTX 3060 What could be the framework? It doesn't matter if it takes 2 minutes while grok/gemini could generate and output like that in seconds. I want to save money generating translated images submitted by /u/Many_Ball_227 [link] [comments]