

The second screenshot goes to a chart where the Y axis is labelled:
"Task duration (for humans) where logistic regression of our data predicts the AI has a 50% chance of succeeding"
So they’re just extrapolating an exponential, not actually measuring it.
Surprise level: zero
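
For anyone wondering what that axis label amounts to in practice, here is a rough sketch. It is not the chart authors' code, and the data, function names, and settings are all made up for illustration. The idea: fit a logistic curve of AI success against log task duration, then solve for the duration where the fitted curve crosses 50%. The plotted number is an output of the fitted model, not a directly observed quantity.

```python
import math

# Invented toy data: (human task duration in minutes, 1 if the AI succeeded else 0).
RUNS = [(1, 1), (2, 1), (4, 1), (8, 1), (8, 0),
        (16, 1), (16, 0), (32, 0), (64, 0), (128, 0)]


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


def fit_logistic(runs, steps=5000, lr=0.1):
    """Fit P(success) = sigmoid(a + b * (log2(minutes) - center)).

    Plain gradient ascent on the mean log-likelihood; the predictor is
    centered on its mean to keep the optimisation well behaved.
    """
    xs = [math.log2(minutes) for minutes, _ in runs]
    ys = [outcome for _, outcome in runs]
    center = sum(xs) / len(xs)
    a = b = 0.0
    for _ in range(steps):
        grad_a = grad_b = 0.0
        for x, y in zip(xs, ys):
            err = y - sigmoid(a + b * (x - center))
            grad_a += err
            grad_b += err * (x - center)
        a += lr * grad_a / len(xs)
        b += lr * grad_b / len(xs)
    return a, b, center


def horizon_50(a, b, center):
    """Duration where the fitted curve hits 50%: a + b * (log2(t) - center) = 0."""
    return 2 ** (center - a / b)


a, b, center = fit_logistic(RUNS)
print(f"50% horizon: {horizon_50(a, b, center):.1f} minutes")
```

The toy data is deliberately symmetric around 2^3.5 minutes, so the fitted horizon lands at roughly 11 minutes. Change a couple of outcomes and the number moves, even though nobody timed anything new.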