GPT-5.4 Scores 0.62 F1 on Understanding Handwritten Edits in Dickens (dorrit.pairsys.ai)
2 points by svcrunch a month ago