dr_dshiv 2 days ago

This is really beautiful content. I’m assuming it comes from the fact that there are Google teams tasked with digitizing old manuscripts?

I work with a library (Biblioteca Philosophica Hermetica) in Amsterdam that has thousands of manuscripts from the renaissance to the early modern period… all very esoteric. We really want to get the renaissance into model training! Over 75% of books (1450-1700) are unscanned — and the manuscripts are in even worse shape.

Curious if anyone knows if there any new handwriting recognition benchmarks? I’ve noticed the main model providers have plateaued in the past year on their ability to read manuscripts / modern handwriting… I think the lack of well-designed competitive benchmarks is the issue…

I love positive examples of the intersection of AI and the humanities.

chrismorgan 2 days ago

Seeing the mention of Round Hand reminded me of Gilbert & Sullivan’s description in H.M.S. Pinafore of the importance of handwriting if, like Sir Joseph Porter K.C.B., you want to rise to the top of the tree:

  As office boy I made such a mark
  That they gave me the post of a junior clerk.
  I served the writs with a smile so bland,
  And I copied all the letters in a big round hand—
      I copied all the letters in a hand so free,
      That now I am the Ruler of the Queen’s Navee!
  • casey2 2 days ago

    The satire here is clear, that naval skill isn't necessary to run the navy but rather skill in some trivial and unrelated clerical task (along with political connections)

    The problem comes then when people see this, don't recognize it as corruption, and waste their time learning useless skills. In a large org productivity losses from this are non trivial.

    • IAmBroom 2 days ago

      I read a very serious article about how the path to promotion in today's US Armed Forces is skill in Powerpoint, ubiquitously used for daily reports up the chain.

      • psunavy03 2 days ago

        As a retired officer, this is at best tangentially true for warfighting roles. The more senior you get, the more you end up in staff and headquarters roles where plans and reporting become more of your role than direct operations.

        But to get there you have to compete and succeed in a stack-ranked promotion system where your tactical and operational skills are what set you apart.

      • bluGill 2 days ago

        The us has been enough 'i can't believe it isn't a wars' lately that competence has some bearing, though powerpoint likely is valued too much.

navigate8310 2 days ago

What a horrible over engineered UI. Texts were illegible when behind a white background and constantly shifting image perspective. No proper way to zoom in on the image and anchoring everything on the scrollbar was the cherry on cake.

  • fkyoureadthedoc 2 days ago

    > Please don't complain about tangential annoyances—e.g. article or website formats, name collisions, or back-button breakage. They're too common to be interesting.

    bike shedding web design comment is always on my HN bingo card

    • triyambakam 2 days ago

      In this case it very directly relates to the content's focus and is hardly tangential.

      • fkyoureadthedoc 2 days ago

        It almost never is, and isn't in this case. It's just someone complaining about scrolljacking. Convo has been had thousands of times on HN alone. Just click the arrow at the bottom, problem solved.

  • nchmy 2 days ago

    yeah, its an absolute monstrosity of a site. Many of the images didnt even load for me.

    • whamlastxmas 2 days ago

      Brave in iOS and yeah. Pinch to zoom better on images sucked, the auto zoom effect cut off the part it was trying to highlight on my smaller phone, and various other jumpy problems

einpoklum 2 days ago

> The handwriting in the early 1100s is known as Carolingian Miniscule. It was easy to write, very legible and ideal for writing Latin.

It was not very legible. Its legibility is weak due to aspects such as:

* Non decomposability into rectangles bounding letters

* Variation in letter forms depending on preceding and succeeding letters

* "extended serifs" or edges of letters which may vary at the author's pleasure.

* Some pen strokes being extremely thin to the point of near-invisibility (like the middle line of 'e' characters)

* non-uniform vertical height of letters (even ignoring ascenders)

* non-uniform horizontal baseline

  • IAmBroom 2 days ago

    I interpret that sentence to mean "when compared to predecessors".

qingcharles 2 days ago

I always like throwing these old handwritten documents into LLMs to see how well they do. GPT5 did nicely on the quitclaim deed in Anglicana, which is very hard to read.

gx 2 days ago

It should've included another 100 years to catalogue and showcase the devolvement towards kindergarten handwriting.

valleyer 3 days ago

Slightly off-topic:

1. When I loaded the page, it bombarded me with a banner asking me, "Interested in sports?" (Yes, I am, but I came here to read about English handwriting. Go away.)

2. At the end, it presented me with a "badge" for finishing a whole "book"! Yeah, maybe people's attention spans would be better if they weren't bombarded with little banners at the beginning.

  • 0xEF 2 days ago

    I did not get either banner looking at the page using Firefox focus on mobile, but this parallax scrolling experience used on this site (and many others)that today's webdevs seem hot to trot over is abysmal. I'm on an 8" tablet and still have to find the sweet spot where the text does not obfuscate the image, so my eyes are unable to quickly dart between both as I read. It's absurd that people think this is a good way to present information. Another commenter replying to you mentioned why not just ditch the JS, and for a site like this I say, right, why not? I'd much rather have the static text and image positions afforded with simple and elegant HTML and CSS tags as opposed to this finicky BS that was clearly tested on maybe one viewport and called good.

  • FearNotDaniel 2 days ago

    I'm torn on point 2... on the one hand, I'm insulted to be "rewarded" for my tenacity in managing to "read" what is basically a very short article with a lot of pictures. On the other hand, I'm working daily with people who refuse to read messages on Teams chat if they are longer than five words and respond either with an answer that proves they didn't make it past the first sentence or with the even worse "quick call?" So perhaps anything that rewards primary-school-level literacy among adults is a positive step after all.

    • drivers99 2 days ago

      Yeah I didn’t think I needed to be called names (“bookworm”) for reading a whole article on a topic I’m fascinated by. I get enough of that in real life already.

  • eadmund 3 days ago

    And when I visited the page with JavaScript disabled, it displayed all the text but no images.

    HTML has the img tag. There’s no need for JavaScript to add images to the DOM!

    • cyberax 3 days ago

      Sorry. That was an oversight. Now the text is also generated by JavaScript manipulating the DOM tree.

      /S

  • coldpie 2 days ago

    We really need to start considering these things in the same category as pop-up windows. Browsers need to start killing these things, just like they did pop-up windows in the 90s and early 2000s. Remember when pop-up blocking was a huge feature? Imagine if Firefox implemented that today to block all this crap.

    • kelvinjps 2 days ago

      Apply Adguard - annoyances filter list on ublock

      • Timwi 21 hours ago

        I already have that on and it didn't catch this. It doesn't really seem to catch any pop-ups much.

joshcsimmons 2 days ago

This is phenomenal. I’ve recently gotten into fountain pens and have been interested in learning some of these older styles.

teekert 2 days ago

For me it's in Dutch, it must detect my language and adapt (since HN block non-English content)?

  • zarq 2 days ago

    There's a blue button at the bottom center to revert to the original language.

  • FearNotDaniel 2 days ago

    Presumably they are doing that based on browser or OS language settings rather than IP-based location? As an English-speaker living in a German-speaking land nothing infuriates me more than websites that assume I would rather have German language content simply based on my IP address rather than checking my system language.

    But yes, to confirm your assumption - I followed the link above and got the English version.

fatihkocnet 2 days ago

Frist time seeing this website. Excellent!