AgentsGPT-5.5

Model Comparison Test Prompt

A standardized benchmark prompt to compare model outputs side by side.

#benchmark#evaluation

Prompt

Complete this standardized task for model comparison benchmarking.

Task: Write a product changelog entry for a fictional app "Flowboard" that added dark mode and keyboard shortcuts.

Constraints:
- Exactly 150 words
- Include: user benefit, technical note, known limitation
- Tone: friendly but professional
- End with a thank-you to beta testers

Do not mention you are an AI.

Prompt

Related notes

Prompt Improver