AI RESEARCH
A Creative Agent is Worth a 64-Token Template
arXiv cs.CV
arXiv:2603.17895v1 Announce Type: new Text-to-image (T2I) models have substantially improved image fidelity and prompt adherence, yet their creativity remains constrained by reliance on discrete natural language prompts. When presented with fuzzy prompts such as "a creative vinyl record-inspired skyscraper", these models often fail to infer the underlying creative intent, leaving creative ideation and prompt design largely to human users.