o3-mini-high: creating/destroying
Added 2025-02-03 02:09:54 +0000 UTC
o3-mini with high reasoning.
I'm basically running the same utopia/destruction/dystopia/safety test I ran on the main vid. Build a nice village, utopia towers, destroy them, be mean to villagers.
Gotta say, not that impressed. Well, it is good, but worse than o1, I would say. It's even more minimal, gets confused all the time, and feels very brittle. Easily willing to be mean to villagers, but I did get one refusal.
From elsewhere online it looks extremely good, but I am not seeing that here. The formatting is probably confusing it, and the prompts could be tweaked.
Comments
yeah. I also found o1-mini basically unusable.
Max Robinson
2025-02-03 02:18:30 +0000 UTC
I had thought these “mini” models would be specifically good at narrow domains. Not surprised to see it struggle here.
Braydon Dymm
2025-02-03 02:13:48 +0000 UTC