Get in touch

diana@dianamas.com

Looking forward to finding another project I would feel passionate about. Let's get in touch ✌️.
AI studio logo

AI Studio

Apr 2023
AI studio cover image
Role
UI/UX
Platforms
Web
Category
AI, Art

This project is a solo experiment on the topic. As an illustrator, I was mesmerized by the power of generational AI, but somehow, the process never stuck with me. This case uses Vana Art Studio as a starting point but discusses the broader problem of text-to-image generation. I wasn't a company employee at the time.

Investigation

As a designer, I'm always curious and ready to ask 'Why?' When I noticed something felt off while creating self-portraits in Vana Art Studio, I didn't just let it go. I used usability heuristics and personal experience to identify possible issues.From my perspective, one of the designer's duties is always asking why.

Lack of user guidance
The design overly relied on the assumption that users were proficient in crafting effective prompts without providing necessary guidance. This approach overlooked the varying levels of user expertise and missed opportunities to enhance user engagement and success through instructional support.
Inadequate visual support
A couple of examples on the page were text only and quite hard to grasp. Even if you read text prompts, there was still no way to predict results.
No error prevention
While the system identified certain errors, such as omissions in user inputs, it did not actively prevent them.
Suboptimal prompt management
Automatic deletion of prompts after submission disrupted the workflow, particularly when refining ideas.
Investigation

Strategic risk forecast

01
Quality of outcomes. Poorly crafted prompts directly correlate with subpar results, undermining the platform's core value proposition.
02
User perception. The prevalent self-serving bias among users means that failures are often attributed to Vana's platform instead of user error, negatively affecting brand perception.
03
Conversion rates. This misattribution could lower conversion rates from free to paid users as trust and perceived value diminish.
04
User Retention. Disappointed users may seek alternatives, increasing churn rates as they turn to competitors or seek out self-education to improve their results.

Goals

How might we make writing good prompts easy and intuitive?
Goal 1
Goal 2
Goal 3

Best practices in writing prompts

I did a quick research on the topic, and here is what I've found can help.

01
Correct prompt structure (commas, length...)
02
Rich details ( such as type of art, location, color, style, camera settings...)
03
Adding weight (by mentioning the most important details first or adding numbers)
04
Adding "magic" words (such as "highly detailed, "trending on Artstation," "rendered in Unreal Engine," 4k, 8k...)
Best practices in writing prompts

Exploration

During my analysis of market competitors, I focused on identifying and understanding the features that improved user interactions. Notably, best practices among competitors included the integration of text-to-image examples and a detailed, granular approach to crafting prompts.

Exploration 1

In addition to examining the AI art sector, I extended my research to encompass examples from various other industries. I focused particularly on understanding the integration of user-friendly features such as tips, auto-fill functions, and templates.

Exploration 2

Solution

Easy start

My design strategy centers on the use of tag inputs enhanced by contextual tag suggestions. To minimize errors, the initial tag "me" is automatically populated. The contextual suggestions enrich the prompt details, encouraging a deeper engagement through intuitive recognition rather than recall. Additionally, the suggestions dropdown allows for refined search results and provides a hover feature for visual previews, further aiding in intuitive visual exploration. This approach maintains best practices automatically, minimizing cognitive load and eliminating the necessity for additional examples or guides.

Easy start
Recognition rather than recall

Each new word or phrase, entered directly or selected from a dropdown, is transformed into a tag. At the same time, tag suggestions are refined based on new context. Recognizing that only some users are well-versed in the intricacies of visual arts or photography settings, the system is set to enhance recognition, not recall. The hover-enabled visual exploration feature further empowers users to discover and experiment with new styles and phrases, promoting creative exploration. This has a chance to enhance user satisfaction and drive increased studio usage and revenue through a higher volume of transactions.

Recognition rather than recall
Intuitive edit

I used progressive disclosure to avoid overwhelming users. Extra guidance, such as "Drag to reorder," appears only after three tags are added, helping users to redistribute weight inside the prompt. Editing is just a click away on any tag, simplifying adjustments.

Intuitive edit

Final thoughts

I am happy with the trajectory and outcome of this project. The developed concept has proven its potential to make prompt creation easy and intuitive. By enforcing proper prompt structure through automatic comma insertion and enhancing detailed input via tag suggestions, this design leverages recognition over recall. It further supports optimal information hierarchy with a drag-to-reorder feature and employs progressive disclosure to minimize cognitive load effectively. Moreover, it supports visual exploration rather than constant reliance on text.