POET Logo

Supporting Prompting Creativity with Automated Expansion of Text-to-Image Generation

Abstract: Given that creative end-users often operate in diverse, context-specific ways that are often unpredictable, more variation and personalization are necessary. We introduce POET, a real-time interactive tool that (1) automatically discovers dimensions of homogeneity in text-to-image generative models, (2) expands these dimensions to diversify the output space of generated images, and (3) learns from user feedback to personalize expansions. Focusing on visual creativity, POET offers a first glimpse of how interaction techniques of future text-to-image generation tools may support and align with more pluralistic values and the needs of end-users during the ideation stages of their work.
๐Ÿ“‹ How to Use Personalized Redesign
1. Rate Your Satisfaction: Provide a satisfaction score for the current generated images
2. Select Preferences: Choose your most liked and disliked images
3. Save & Iterate: Click "Save Personalized Data" before redesgining your prompt and clicking "Generate"
4. Restart Anytime: Use the "Restart" button to begin a fresh session

๐Ÿ“Š Rate Current Generation

How satisfied are you with the current generated images?
๐Ÿ“ Prompt History

๐ŸŒŸ Examples

Examples
๐ŸŽจ Prompt Image 1 Image 2 Image 3 Image 4