Wan 2.2 s2v workload getting terrible outputs.

r/StableDiffusion
Generative AI

Trying to generate 19s of lip synced video in wan 2.2. I am using whatever workflow is located in the templates section of comfyui if you search wan s2. I do have a reference image along with the music. I need 19s, so I have 4 batches going at 77 "chunks". I was using the speed loras at 4 steps at first and it was blurry and had all kinds of weird issues Chatgpt made me change my sampler to dpm 2m and scheduler to Karras, set cfg to 4, denoise to.30 and shift scale to 8. the output even with 8 steps was bad.