Text rendering hates you (2019)

(faultlore.com)

111 points | by andsoitis 6 days ago

14 comments

  • Karliss 5 hours ago
    A few more, about editing rather than just rendering:

    The style change mid-ligature has a related problem. While it might be reasonable not to support a style change in the middle of a ligature, you still want to be able to select individual letters within ligatures like "ff", "ffi" and "fl". The problem, just as with the color change, is that neither the text shaper nor the program rendering the text knows where each individual letter is positioned within the ligature glyph. The font typically lacks this information.

    From what I have seen, most programs that support it use an approximation similar to the one Firefox uses for coloring: split the ligature into equal parts. That works well enough for something like "fi" or "fl", not so much for some of the ligatures in programming fonts that combine >= into ≥.
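
    A minimal sketch of that equal-split approximation, assuming the shaper only reports the ligature's total advance width and how many characters it covers. The function names are made up for illustration and do not come from any particular editor or browser.

```python
def ligature_caret_offsets(advance: float, char_count: int) -> list[float]:
    """Approximate caret x-offsets inside one ligature glyph.

    The glyph's advance width is split into equal parts, one per
    source character, because the real letter boundaries are unknown.
    """
    if char_count < 1:
        raise ValueError("a ligature covers at least one character")
    step = advance / char_count
    return [step * i for i in range(char_count + 1)]


def char_index_at(x: float, advance: float, char_count: int) -> int:
    """Map an x position inside the glyph to the character it is assumed to hit."""
    step = advance / char_count
    # Clamp so positions on or past the edges still map to a valid character.
    return min(max(int(x // step), 0), char_count - 1)


# An "ffi" ligature that is 12 units wide gets carets every 4 units.
offsets = ligature_caret_offsets(12.0, 3)
hit = char_index_at(5.0, 12.0, 3)
```

    For ">=" drawn as a single "≥" the same arithmetic still runs, but the caret in the middle of the glyph corresponds to nothing visible, which is the failure described above.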

    There are even worse edge cases in scripts for other languages. There are ligatures which look roughly like the 2 characters which formed it side by side but in reverse order. There are also some ligatures in CJK fonts which combine 4 characters in a square.

    Backspace erases characters at finer granularity than it's possible to select them.

    With regard to LTR/RTL selection weirdness, I recently discovered that some editors display small flag on the cursor displaying current position direction when it's in mixed-direction text.

    • gudzpoz 2 minutes ago
      > some editors display small flag on the cursor displaying current position direction

      I was amazed to see IDEA/RustRover doing exactly this [1] when I added BIDI texts to my code to test things out.

      [1] https://i.imgur.com/Qqlyqpc.png (image taken from IDEA issue tracker)

    • tomcam 2 hours ago
      I cannot imagine a use case where I would want to do a style change mid ligature. Can someone smarter than I am give a reasonable example of doing so?
      • jfengel 1 hour ago
        The user may not think of the letters as connected. Suppose the user wanted to write "stuffing" and bold the letters "ing". The user may well not realize that the font thinks of "ffi" as anything other than three separate letters.
        • tomcam 1 hour ago
          Excellent example! Thanks.
  • tomcam 2 hours ago
    > Text is complicated

    So true!

    > and english is bad at expressing these nuances.

    I think English is a terrible shitpile of grammar and syntax. I'm very impressed that anyone who speaks another language natively can get good at it.

    But I'm interested in the notion that it lacks nuance to describe the intricacies of text rendering. Can someone tell me where that would apply?

    • tenacious_tuna 1 hour ago
      > the notion that it lacks nuance to describe the intricacies of text rendering

      I took this to mean that any non-domain-specific language may be bad at describing that domain, e.g. why physicists, mathematicians, chemists, etc. have a common symbology for the discipline, or why programming languages exist. i.e., not so much that English is uniquely bad among written human language for conveying these topics, but just that any non-specialized language may be.

      Though, I think the author did a fair job, but I lack the domain experience to guess at where the misconceptions might lie.

      • tomcam 1 hour ago
        I had much the same conclusions. The author did a perfectly good job of explaining the issues.
    • globalnode 1 hour ago
      As a native English speaker, I did try to learn German but eventually gave up. A language sprinkled with "learn by rote" gender prefixes for every item is just not worth learning. I did have an issue with the numbers being back to front once you get to the unit value, but then someone pointed out English does that too for the values 13-19... so there ya go.
  • gnabgib 6 days ago
    (2019) Popular in:

    2023 (290 points, 119 comments) https://news.ycombinator.com/item?id=36478892

    2022 (399 points, 154 comments) https://news.ycombinator.com/item?id=30330144

    2019 (542 points, 170 comments) https://news.ycombinator.com/item?id=21105625

  • jesse__ 6 hours ago
    The ligatures part of this article gets me every time I re-read it. I think reading this article may have been the first time I realized that even large, well-funded projects are still done by people who are just regular humans, and sometimes settle for something that's good enough.
  • thot_experiment 6 hours ago
    I've tried to ask this before in various contexts and I've never been able to find an answer but maybe commenters on a post like this would know.

    I like the way that the CJK fonts render without anti-aliasing on Windows. I want to know why, and how to cause Windows to render a non-CJK font of my choosing in this aliased style. I am not opposed to hex-editing or otherwise modifying the font if that's necessary. I've never been able to find information about the mechanism or how it's triggered.

  • charcircuit 4 hours ago
    >But if the transform is an animation this will actually look even worse

    I wish they provided an example video of this since I can't visualize it. My natural thinking is subpixel antialiasing should look fine.

    >the characters will jiggle as each glyph bounces around between different subpixel snappings and hints on each frame.

    This shouldn't be a big issue unless your animation is slow and your subpixels are big.

    • akdor1154 3 hours ago
      The issue (I think) is that the animation is done post-rasterizing. So a translate of integer pixels is fine, but scale? Skew? Suddenly you have really visible colour fringing appearing out of nowhere.
  • Dwedit 2 hours ago
    "Subpixel offsets break glyph caches"

    I once resolved that by keeping a vertically shrunken but really wide glyph around in a cache. Just resample it for a different horizontal offset.
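
    A rough sketch of how such a cache entry could be resampled, assuming the glyph was rasterized once at several times the horizontal resolution and that a plain box filter is good enough. The function name and the 4x oversampling factor are illustrative assumptions, not details from the comment.

```python
def resample_row(wide_row: list[float], oversample: int, offset: int) -> list[float]:
    """Downsample one row of a horizontally oversampled glyph.

    wide_row   coverage samples at `oversample` times the final width
    offset     horizontal shift in sub-samples, 0 <= offset < oversample
    """
    if not 0 <= offset < oversample:
        raise ValueError("offset must be smaller than the oversampling factor")
    # Shift right by `offset` sub-samples, then pad to a whole number of pixels.
    padded = [0.0] * offset + list(wide_row)
    while len(padded) % oversample:
        padded.append(0.0)
    # Box filter: each output pixel is the mean of its sub-samples.
    return [
        sum(padded[i:i + oversample]) / oversample
        for i in range(0, len(padded), oversample)
    ]


# One fully covered pixel followed by one empty pixel, stored at 4x width.
cached = [1.0, 1.0, 1.0, 1.0, 0.0, 0.0, 0.0, 0.0]
aligned = resample_row(cached, 4, 0)   # glyph sits on the pixel grid
shifted = resample_row(cached, 4, 2)   # glyph moved right by half a pixel
```

    The cache then needs one entry per glyph and size rather than one per subpixel offset, at the cost of a small resampling pass on every draw.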

    • mananaysiempre 27 minutes ago
      The AGG (“Anti-Grain Geometry”) library does something similar[1], from what I understand.

      Also, I had (though never tested) the impression that in the Windows world ClearType uses 3x the horizontal resolution internally (I vaguely remember that being mentioned in the horror novel^W^W Raster Tragedy[2] somewhere?..). Given many font designers’ testing process for their hinting bytecode seems to be to run it through ClearType and check if it looks OK (not unlike firmware programmers...), we all, including Microsoft, are essentially stuck with that choice forever (or at least until people with painfully low-res displays become rare enough that the complaining about blurry text can be disregarded). So I’d expect 1/3 of a pixel to be the natural resolution for a glyph cache, not 1/4? Or have things changed in the transition from GDI to GDI+ to DirectWrite?

      [1] https://agg.sourceforge.net/antigrain.com/research/font_rast...

      [2] http://rastertragedy.com/

  • xg15 5 hours ago
    > Don’t ask about the code which line-breaks partial ligatures though.

    Wondered about this. All the circular dependencies sound like you could feasibly get some style/layout combinations that lead to self-contradictory situations.

    E.g. consider a ligature that's wider than the characters' individual glyphs. If the ligature is at the end of the box, it could trigger a line break. But that line break would also break up the ligature and cause the characters to be rendered as individual glyphs, reducing their width - which would undo the line break. But without the line break, the ligature would reconnect, increase the width and restore the line break, etc etc...

    • bfgeek 3 hours ago
      Blink's (Chromium) text layout engine works the following way.

      1. Lay out the entire paragraph of text as a single line.

      2. If this doesn't fit into the available width, bisect to the nearest line-break opportunity which might fit.

      3. Reshape the text up until this line-break opportunity.

      4. If it fits, great! If not, go to 2.

      This converges as it always steps backwards, and avoids the contradictory situations.
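
      A toy sketch of the four steps above. This is not Blink's code: the "shaper" is a made-up width function in which an "ffi" ligature is one unit wider than its three letters, break opportunities are just spaces, and the search walks backwards linearly instead of bisecting.

```python
def shaped_width(text: str) -> int:
    """Toy shaper: 1 unit per character, plus 1 extra unit per 'ffi' ligature."""
    return len(text) + text.count("ffi")


def first_line_end(paragraph: str, available: int) -> int:
    """Return the index at which the first line of `paragraph` should end."""
    # Step 1: shape everything as a single line.
    if shaped_width(paragraph) <= available:
        return len(paragraph)
    # Break opportunities: the position just after each space.
    candidates = [i + 1 for i, ch in enumerate(paragraph) if ch == " "]
    # Steps 2 to 4: only ever move backwards, reshaping the prefix each time.
    for end in reversed(candidates):
        if shaped_width(paragraph[:end].rstrip()) <= available:
            return end
    # Nothing fits: overflow with the shortest possible line.
    return candidates[0] if candidates else len(paragraph)


def layout(paragraph: str, available: int) -> list[str]:
    """Break a paragraph into lines, one backwards search per line."""
    lines = []
    rest = paragraph
    while rest:
        end = first_line_end(rest, available)
        lines.append(rest[:end].rstrip())
        rest = rest[end:]
    return lines


lines = layout("stuffing a muffin", 9)
```

      Because each candidate prefix is reshaped on its own and the search never moves forward again, a width change caused by reshaping cannot push the break back to a position that was already rejected.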

      HarfBuzz also provides points along the section of text which is safe to reuse, so reshaping typically involves only a small portion of text at the end of the line, if any. https://github.com/harfbuzz/harfbuzz/issues/224

      This approach is different to how many text layout engines approach this problem e.g. by adding "one word at a time" to the line, and checking at each stage if it fits.

      • nicoburns 3 hours ago
        > This approach is different to how many text layout engines approach this problem e.g. by adding "one word at a time" to the line, and checking at each stage if it fits.

        Do you know why Chrome does it this way?

        • bfgeek 1 hour ago
          We found it was roughly on par performance-wise for simple text (Latin), and faster for more complex scripts (Thai, Hindi, etc.). It is also more correct when there is kerning across spaces, hyphenation, etc.

          For the word-by-word approach to be performant you need a cache for each word you encounter. The shape-by-paragraph approach we found was faster for cold-start (e.g. the first time you visit a webpage). But this is also more difficult to show in standard benchmarks as benchmarks typically reuse the same renderer process.

  • socalgal2 5 hours ago
    And the companion article: https://lord.io/text-editing-hates-you-too/

    (posted in other threads too)

  • tankenmate 4 hours ago
    Hmm I use Firefox and the rendering I see in Firefox looks nothing like the render the author gets in Firefox; in fact the text rendering I get looks very similar to the "Chrome" rendering. Obviously this must depend on the libraries linked during the build process.
    • Denvercoder9 4 hours ago
      The article is from 2019, things might also simply have changed since then.
    • kg 4 hours ago
      Depending on your OS Firefox will select from multiple rendering backends based on your GPU, driver etc.

      On Windows it may or may not be using DirectWrite for text rasterization as a general thing, and in some cases text might be rasterized using a different fallback path if DirectWrite can't handle the font, I think.

      IIRC this was/is true for Chrome as well, where in some cases it software rasterizes text using Skia instead of calling through to the OS's font implementation.

      • nicoburns 3 hours ago
        IIRC, Chrome now uses CoreText/DirectWrite for system fonts on macOS/Windows, and Skrifa (FreeType rewritten in Rust) outlines rasterized with Skia for everything else (system fonts on Linux, web fonts on all platforms).

        I believe Firefox leans on the system rasterizers a little more heavily (using them for everything they support), and also still uses FreeType on Linux.

  • lovich 4 hours ago
    How did they get the exact effect to show what they want in the text here instead of say, me seeing the exact same visuals for each browser as I am reading it from a single browser?
    • zerocrates 4 hours ago
      You mean in the parts that say "Here's what they look like in Safari" and so on? Those are just .pngs.
      • lovich 2 hours ago
        I missed some UI improvement in browsers then as I can copy and paste them as text, and even the italic emoji example carried over the italic information when I tried copying it into various editors.
        • namibj 1 hour ago
          It's just transparent text over png background.
  • shmerl 3 hours ago
    > So subpixel-AA is a really neat hack that can significantly improve text legibility, great! But, sadly, it’s also a huge pain in the neck!

    Especially when you have a monitor with unusual subpixel layout, which is very common for OLEDs that don't have any standard for it. In practice, developers of common font libraries like FreeType simply didn't bother with trying to support all that. And that trickles down to toolkits like Qt. Surprising the article doesn't mention this major problem with modern displays.

    > Retina displays really don’t need it

    Assuming this means high resolution displays - unfortunately that's not always what you end up using. So subpixel antialiasing can still be useful, if it can work. But as above, it's often just broken on OLEDs.

    • namibj 1 hour ago
      Arguably monitors that are not mere TVs ought to allow control of each distinct pixel they drive internally and communicate their layout and if needed distinct brightness/color coordinates to the host.

      Exceptions can apply if the consumers of the screen can't resolve details finer than "emulated sRGB pixels" anyways.

      • shmerl 1 hour ago
        Something like that should maybe be done in EDIDs, but you would still need to support a ton of different layouts in the end. LCD monitors are a lot more limited in that sense.
  • djaouen 3 hours ago
    Good. I hated it first!
  • casey2 5 hours ago
    The real takeaway from the article is that you can rathole forever on ill-defined problems. Decide upfront whether you care about actual humans and their usecases or hypothetical humans and their hypothetical usecases.
    • PKop 4 hours ago
      Or even, which subset of humans' use cases you wish to concern yourself with, as you can't always please everyone or tackle everyone's problems. If one only cared about a single language everything becomes much easier.
      • nicoburns 3 hours ago
        > If one only cared about a single language everything becomes much easier.

        Yes. Let's be thankful that isn't the case for browsers and major GUI toolkits though.