17 GPT 5.4 vs GPT 5.2: Which One is Stronger? Let's start with the conclusion: although GPT 5.2 is sufficient, this GPT 5.4 is different—it's stronger in some strange ways. First, let's look at the scores on my exclusive evaluation leaderboard: Runner-up, still impressive, meaning it's in the same tier as 5.3-Codex and significantly ahead of various others. Next, the main focus is on the differences. The first difference is the ability to output progress for intermediate sub-tasks in segments, so users don't get anxious watching it. In fact, because of the frequent reporting, you might feel GPT 5.4 is faster than previous GPT versions.
17 GPT 5.4 vs GPT 5.2: Which One is Stronger? Let's start with the conclusion: although GPT 5.2 is sufficient, this GPT 5.4 is different—it's stronger in some strange ways. First, let's look at the scores on my exclusive evaluation leaderboard: Runner-up, still impressive, meaning it's in the same tier as 5.3-Codex and significantly ahead of various others. Next, the main focus is on the differences. The first difference is the ability to output progress for intermediate sub-tasks in segments, so users don't get anxious watching it. In fact, because of the frequent reporting, you might feel GPT 5.4 is faster than previous GPT versions and takes less time. But I actually timed it and found no significant improvement; it's just an illusion from frequent reporting, and people fall for it 😂 The image below shows the working process of GPT 5.2 xhigh; the process output is continuous, making it hard to track progress: The one below is GPT 5.4, the process...
No comments yet. Be the first to share your thoughts.