Ilia

1mo ago

LinkedIn post about new product

This experiment compares how different LLMs handle a specific marketing task: writing a LinkedIn launch post for a macOS tool called Oshn Prompt Generator. Each model was evaluated on three key pillars:

1) Publish-Ready Quality: Would I actually post this as-is, or does it require heavy editing?
2) AI Fingerprint: Does the text feel "robotic" and full of AI cliches, or does it sound like a natural human creator?
3) Platform Optimization: How well does the model leverage LinkedIn-specific mechanics, such as effective hook lines, spacing, relevant hashtags, and viral formatting?

28 experiments, 14 models tested

Models: GPT-5 Mini, GPT-5 Nano, Claude Haiku 4.5, Claude Opus 4.5, GPT-5, GPT-5.2, Claude Sonnet 4.5, Gemini 2.5 Pro, Gemini 3 Flash, Gemini 2.5 Flash, Gemini 2.5 Flash-Lite, GPT-4.1, GPT-5.2 Pro, Gemini 3 Pro

Author's Rankings

S

Prompt #10: Gemini 2.5 Pro (903 tokens, 24.5s)
Prompt #26: Gemini 3 Pro (990 tokens, 22.3s)
Prompt #27: Gemini 2.5 Pro (1039 tokens, 27.5s)

A

Prompt #1: GPT-5 Mini (762 tokens, 10.3s)
Prompt #6: GPT-5 (1292 tokens, 25.9s)
Prompt #7: GPT-5.2 (339 tokens, 10.7s)
Prompt #9: GPT-5 (1363 tokens, 33.4s)
Prompt #13: Gemini 2.5 Flash (799 tokens, 11.7s)
Prompt #17: Gemini 2.5 Flash-Lite (850 tokens, 6.8s)
Prompt #20: Gemini 3 Flash (431 tokens, 5.9s)
Prompt #21: GPT-5.2 Pro (351 tokens, 23.7s)
Prompt #22: GPT-5.2 Pro (313 tokens, 26.2s)
Prompt #24: Gemini 2.5 Flash (425 tokens, 8.8s)
Prompt #25: GPT-5 Nano (1866 tokens, 18.5s)

B

Prompt #2: GPT-5 Nano (2631 tokens, 25.2s)
Prompt #5: Claude Opus 4.5 (440 tokens, 10.3s)
Prompt #11: Gemini 3 Flash (795 tokens, 8.7s)
Prompt #14: Gemini 2.5 Flash-Lite (759 tokens, 4.8s)
Prompt #15: GPT-5 Mini (739 tokens, 10.1s)
Prompt #19: GPT-5.2 (330 tokens, 5.9s)

C

Prompt #3: Claude Haiku 4.5 (366 tokens, 4.2s)
Prompt #4: Claude Opus 4.5 (493 tokens, 10.1s)
Prompt #8: Claude Sonnet 4.5 (441 tokens, 8.8s)
Prompt #12: Claude Sonnet 4.5 (418 tokens, 10.1s)
Prompt #16: GPT-4.1 (259 tokens, 5.2s)
Prompt #18: Claude Haiku 4.5 (457 tokens, 4.1s)
Prompt #23: GPT-4.1 (333 tokens, 5.0s)
Prompt #28: Gemini 3 Pro (392 tokens, 14.8s)

D

(no experiments)

F

(no experiments)

Conclusion

Key Findings & Insights:

1) Consistent Performance Across Scales: I was surprised by how similar the results were across models of different sizes and providers. No model completely failed the task, which suggests that basic marketing copywriting has become a baseline capability even for smaller models.

2) Brand "DNA" & Styling: Models from the same provider (e.g., all Gemini models or all GPT models) showed strong internal consistency, likely due to shared training data. This highlights a crucial insight: every provider has a distinct "native style." As a user, you must either adapt your prompting to this style or choose the provider whose default "vibe" best fits your specific task.

3) Gemini’s Versatility (S-Tier): Gemini 2.5 Pro and Gemini 3 Pro stood out by providing multiple options tailored to different audiences, along with unsolicited (but helpful) publication advice. Extra options weren't explicitly requested, but the quality of the "publish-ready" variants made Gemini the winner for me. It went from a simple generator to a strategic assistant.

4) OpenAI’s Non-Linear Progress: GPT models performed well, but there was no clear correlation between model "power" (e.g., GPT-5 vs. GPT-5.2) and result quality. Newer or larger didn't always mean better for this specific LinkedIn use case.

5) The Claude "Emoji Trap": Despite being my go-to for coding, Claude struggled here by overusing emojis, which created an immediate "AI-generated" red flag. While fixable with more instructions, this experiment exposed Claude’s default tendency toward a "robotic" marketing tone compared to the others.

Final Verdict: Gemini currently leads for professional social media content that feels natural and offers strategic variety, while the other models need more rigorous "human-like" tuning of the prompt.
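As a rough cross-check on the provider-level findings, the tier placements from the Author's Rankings can be tallied per provider. The sketch below is a minimal Python example. The mapping from model-name prefix to provider, and the names `RANKINGS`, `provider_of`, and `tally_by_provider`, are assumptions made for illustration and are not part of the original experiment tooling.

```python
from collections import Counter, defaultdict

# Tier placements copied from the Author's Rankings (prompt number -> model).
RANKINGS = {
    "S": {10: "Gemini 2.5 Pro", 26: "Gemini 3 Pro", 27: "Gemini 2.5 Pro"},
    "A": {
        1: "GPT-5 Mini", 6: "GPT-5", 7: "GPT-5.2", 9: "GPT-5",
        13: "Gemini 2.5 Flash", 17: "Gemini 2.5 Flash-Lite",
        20: "Gemini 3 Flash", 21: "GPT-5.2 Pro", 22: "GPT-5.2 Pro",
        24: "Gemini 2.5 Flash", 25: "GPT-5 Nano",
    },
    "B": {
        2: "GPT-5 Nano", 5: "Claude Opus 4.5", 11: "Gemini 3 Flash",
        14: "Gemini 2.5 Flash-Lite", 15: "GPT-5 Mini", 19: "GPT-5.2",
    },
    "C": {
        3: "Claude Haiku 4.5", 4: "Claude Opus 4.5", 8: "Claude Sonnet 4.5",
        12: "Claude Sonnet 4.5", 16: "GPT-4.1", 18: "Claude Haiku 4.5",
        23: "GPT-4.1", 28: "Gemini 3 Pro",
    },
}


def provider_of(model: str) -> str:
    """Infer the provider from the model-name prefix (an assumption of this sketch)."""
    prefixes = {"Gemini": "Google", "GPT": "OpenAI", "Claude": "Anthropic"}
    for prefix, provider in prefixes.items():
        if model.startswith(prefix):
            return provider
    raise ValueError(f"Unknown provider for model: {model}")


def tally_by_provider(rankings):
    """Count how many experiments each provider placed in each tier."""
    counts = defaultdict(Counter)
    for tier, entries in rankings.items():
        for model in entries.values():
            counts[provider_of(model)][tier] += 1
    return counts


TALLY = tally_by_provider(RANKINGS)
for provider in sorted(TALLY):
    row = ", ".join(f"{tier}: {TALLY[provider][tier]}" for tier in "SABC")
    print(f"{provider}: {row}")
```

With the rankings as listed, Google places 3 experiments in S, 4 in A, 2 in B, and 1 in C; OpenAI places 7 in A, 3 in B, and 2 in C; Anthropic places 1 in B and 5 in C. This is only a count of the author's subjective placements, not an independent quality measure.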
