1mo ago

LinkedIn post about new product

This experiment compares how different LLMs handle a specific marketing task: writing a LinkedIn launch post for a macOS tool called Oshn Prompt Generator. Each model was evaluated based on three key pillars:

1) Publish-Ready Quality: Would I actually post this as-is, or does it require heavy editing?
2) AI Fingerprint: Does the text feel "robotic" and full of AI cliches, or does it sound like a natural human creator?
3) Platform Optimization: How well does the model leverage LinkedIn-specific mechanics, such as effective hook lines, spacing, relevant hashtags, and viral formatting?

28 experiments, 14 models tested

GPT-5 Mini, GPT-5 Nano, Claude Haiku 4.5, Claude Opus 4.5, GPT-5, GPT-5.2, Claude Sonnet 4.5, Gemini 2.5 Pro, Gemini 3 Flash, Gemini 2.5 Flash, Gemini 2.5 Flash-Lite, GPT-4.1, GPT-5.2 Pro, Gemini 3 Pro

Author's Rankings

Prompt #10: Gemini 2.5 Pro, 903 tokens, 24.5s
Prompt #26: Gemini 3 Pro, 990 tokens, 22.3s
Prompt #27: Gemini 2.5 Pro, 1039 tokens, 27.5s
Prompt #1: GPT-5 Mini, 762 tokens, 10.3s
Prompt #6: GPT-5, 1292 tokens, 25.9s
Prompt #7: GPT-5.2, 339 tokens, 10.7s
Prompt #9: GPT-5, 1363 tokens, 33.4s
Prompt #13: Gemini 2.5 Flash, 799 tokens, 11.7s
Prompt #17: Gemini 2.5 Flash-Lite, 850 tokens, 6.8s
Prompt #20: Gemini 3 Flash, 431 tokens, 5.9s
Prompt #21: GPT-5.2 Pro, 351 tokens, 23.7s
Prompt #22: GPT-5.2 Pro, 313 tokens, 26.2s
Prompt #24: Gemini 2.5 Flash, 425 tokens, 8.8s
Prompt #25: GPT-5 Nano, 1866 tokens, 18.5s
Prompt #2: GPT-5 Nano, 2631 tokens, 25.2s
Prompt #5: Claude Opus 4.5, 440 tokens, 10.3s
Prompt #11: Gemini 3 Flash, 795 tokens, 8.7s
Prompt #14: Gemini 2.5 Flash-Lite, 759 tokens, 4.8s
Prompt #15: GPT-5 Mini, 739 tokens, 10.1s
Prompt #19: GPT-5.2, 330 tokens, 5.9s
Prompt #3: Claude Haiku 4.5, 366 tokens, 4.2s
Prompt #4: Claude Opus 4.5, 493 tokens, 10.1s
Prompt #8: Claude Sonnet 4.5, 441 tokens, 8.8s
Prompt #12: Claude Sonnet 4.5, 418 tokens, 10.1s
Prompt #16: GPT-4.1, 259 tokens, 5.2s
Prompt #18: Claude Haiku 4.5, 457 tokens, 4.1s
Prompt #23: GPT-4.1, 333 tokens, 5.0s
Prompt #28: Gemini 3 Pro, 392 tokens, 14.8s
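
The token counts and timings in the rankings also allow a rough speed comparison, which is separate from the quality ranking. Here is a minimal sketch, assuming the token figure counts generated tokens and the time is total wall-clock time for the response (the page does not say either way). The names `runs` and `tokens_per_second` are illustrative only.

```python
# Token counts and times copied from three entries in the rankings.
# Assumption: tokens = generated tokens, seconds = total response time.
runs = [
    ("Gemini 2.5 Pro", 903, 24.5),   # Prompt #10
    ("GPT-5.2", 339, 10.7),          # Prompt #7
    ("Claude Haiku 4.5", 366, 4.2),  # Prompt #3
]


def tokens_per_second(tokens: int, seconds: float) -> float:
    """Rough throughput estimate: tokens divided by elapsed seconds."""
    return tokens / seconds


for model, tokens, seconds in runs:
    print(f"{model}: {tokens_per_second(tokens, seconds):.1f} tokens/s")
```

Throughput says nothing about whether a post is publish-ready, so it is best read alongside the rankings rather than as a replacement for them.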


Conclusion

Key Findings & Insights:

1) Consistent Performance Across Scales: I was surprised by how similar the results were across models of different sizes. No model completely failed the task, which suggests that basic marketing copywriting has become a baseline capability even for smaller models.

2) Brand "DNA" & Styling: Models from the same provider (e.g., Google or OpenAI) showed strong internal consistency, likely due to shared training data. This highlights a crucial insight: every provider has a distinct "native style." As a user, you must either adapt your prompting to this style or choose the provider whose default "vibe" best fits your specific task.

3) Gemini’s Versatility (S-Tier): Gemini 2.5 Pro and Gemini 3 Pro stood out by providing multiple options tailored to different audiences, along with unsolicited (but helpful) publication advice. I did not ask for extra options, but the quality of the "publish-ready" variants made Gemini the winner for me. It acted less like a simple generator and more like a strategic assistant.

4) OpenAI’s Non-Linear Progress: GPT models performed well, but there was no clear correlation between model "power" (e.g., GPT-5 vs. GPT-5.2) and result quality. Newer or larger didn't always mean better for this specific LinkedIn use case.

5) The Claude "Emoji Trap": Despite being my go-to for coding, Claude struggled here by overusing emojis, which is an immediate "AI-generated" red flag. This is fixable with more instructions, but the experiment exposed Claude’s default tendency toward a more "robotic" marketing tone than the other models.

Final Verdict: Gemini currently leads for professional social media content that feels natural and offers strategic variety, while the other models need more explicit prompting to sound human.
