Codepoints.net
banner
codepoints.typo.social.ap.brid.gy
Codepoints.net
@codepoints.typo.social.ap.brid.gy
� We love Unicode, characters, lettering and language. Questions about #Unicode? Just ask!

Previously @CodepointsNet on Twitter.

[bridged from https://typo.social/@codepoints on the fediverse by https://fed.brid.gy/ ]
Don't leave the screen reader hungry

https://htmhell.dev/adventcalendar/2025/17/

The #htmhelladventcalendar entry 17 by @gerireid explores fascinating differences between visible and read out text on websites. And #Unicode makes a guest appearance in form of emoji names.
Don't leave the screen reader hungry - HTMHell
A collection of bad practices in HTML, copied from real websites.
htmhell.dev
December 19, 2025 at 10:07 AM
Reposted by Codepoints.net
using emojis in subtitles to indicate saying a word twice in different ways is so wildly modern that I am left in shock
December 2, 2025 at 10:22 AM
Reposted by Codepoints.net
There are a lot of cursed computers things but this is the most cursed computers thing

https://benjojo.co.uk/u/benjojo/h/h4N78m1PjXYsYfzkGV
November 27, 2025 at 6:06 PM
Reposted by Codepoints.net
For anyone who's struggled with variable spelling practices before modern printing, there's a summary of what's "interchangeable" in Thomas Wright's Dictionary of Obsolete and Provincial English. 2 vols. London, 1857.
#orthography #English #language #dialect #manuscripts #medieval
November 16, 2025 at 2:58 PM
Reposted by Codepoints.net
RE: https://hachyderm.io/@SnoopJ/109921008429679600

Happy #unicode Mouse day to those who celebrate
hachyderm.io
November 9, 2025 at 11:56 PM
RTL Styling 101 – An extensive guide on how to style for right-to-left text in #css

https://rtlstyling.com/posts/rtl-styling/

by @shadeed9
Right to Left Styling 101
An extensive guide on how to style for RTL in CSS
rtlstyling.com
October 29, 2025 at 8:16 AM
Do I get this right? The obfuscation technique that is used in the GlassWorm virus is basically using variation selectors instead of a-z0-9 for base64-encoding the payload?

https://www.koi.ai/blog/glassworm-first-self-propagating-worm-using-invisible-code-hits-openvsx-marketplace
GlassWorm: First Self-Propagating Worm Using Invisible Code Hits OpenVSX Marketplace | Koi Blog
www.koi.ai
October 22, 2025 at 9:45 AM
This is a written word:

https://www.youtube.com/shorts/uKyiBwrFgjg

Turns out there is actually text written on the Mexican flag.
October 21, 2025 at 8:13 PM
Reposted by Codepoints.net
Glad to announce the October 2025 FontForge release.

Some interesting stuff:
* Show CNC fonts in Font View
* Harfbuzz integration
* GTK3 support kick-off

#fontforge #typography #foss #gtk #gtk3
October 10, 2025 at 9:16 AM
Reposted by Codepoints.net
The layout of this paper is expert level trolling!

https://journals.uc.edu/index.php/vl/article/view/5765/4629
View of Optimal Line Length in Reading — A Literature Review
journals.uc.edu
October 10, 2025 at 9:28 AM
What does a lowercase emoji look like?

https://www.youtube.com/shorts/MQp2HRZHFBA
September 15, 2025 at 11:19 AM
Reposted by Codepoints.net
Android will *not* be getting most of the Unicode 17 updates.

Some of its fonts are over a decade out of date - and Google refuses to re-use its own Noto font stack.

I've raised the issue at:
https://issuetracker.google.com/issues/366415133

If you're a Googler please ask someone to prioritise […]
Original post on mastodon.social
mastodon.social
September 10, 2025 at 10:58 AM
Reposted by Codepoints.net
@Edent

SET NAMES utf8mb4;

-- helper: safe code point to utf8mb4 string using JSON unescape
DELIMITER //

DROP FUNCTION IF EXISTS unichar//
CREATE FUNCTION unichar(cp INT)
RETURNS VARCHAR(4) CHARSET utf8mb4
DETERMINISTIC
BEGIN
DECLARE v INT;
DECLARE […]
Original post on infosec.exchange
infosec.exchange
September 5, 2025 at 7:54 AM
If you ever wondered about the difference between Sheriff and Sans-Sherif, @alvaromontoro has got you covered:

https://comicss.art/comics/190/

Part of his comiCSS series.
comiCSS #190: Sheriff
cartoon with two panels. The first one shows a sheriff with a curly mustache, curly cowboy hat, curly eyebrows, and additional shadows that make it 'fancy'; it is titled Sheriff. The second one is the same sheriff, but the mustache is straight also the eyebrows and the cowboy hat; it is titled Sans-Sheriff.
comicss.art
July 22, 2025 at 7:04 AM
Oh no!

https://corp.unicode.org/pipermail/unicode/2025-July/011467.html

@babelstone died four days ago. Rest in Peace and thank you for all your work!

Thank you @Evertype for being the messenger of such sad news!
Andrew C. West 魏安 1960–2025
corp.unicode.org
July 14, 2025 at 6:11 PM
[swearwords]

This is the reason why the site is so slow currently. Fucking useless spiders.
June 27, 2025 at 2:22 PM
Currently the site is a bit slow. This is due to high server load. I’m at that problem and will hopefully figure it out soon.

Thanks for the patience and sorry for the inconvenience!
June 25, 2025 at 2:54 PM
The Rarest Sounds in Language…
https://www.youtube.com/shorts/aO9wVq5aUos

a short by human1011 on YouTube.
May 23, 2025 at 8:26 AM
The #Unicode v17 beta review period has opened:
https://blog.unicode.org/2025/05/unicode-170-beta-review-open.html

New scripts that will be encoded in this version:

> Beria Erfe is a modern-use script used in central Africa.
>
> Chisoi is a modern-use script used in northeast India.
>
> Tolong […]
Original post on typo.social
typo.social
May 21, 2025 at 10:38 AM
Reposted by Codepoints.net
Detecting malicious Unicode
In a recent educational trick, curl contributor James Fuller submitted a pull-request to the project in which he suggested a larger cleanup of a set of scripts. In a later presentation, he could show us how not a single human reviewer in the team nor any CI job had spotted or remarked on one of the changes he included: he replaced an ASCII letter with a Unicode alternative in a URL. This was an eye-opener to several of us and we decided we needed to up our game. We are the curl project. We can do better. ## GitHub The replacement symbol looked identical to the ASCII version so it was not possible to visually spot this, but the diff viewer knows there is a difference. In this GitHub website screenshot below I reproduced a similar case. The right-side version has the Latin letter ‘g’ replaced with the Armenian letter co. They appear to be the same. GitHub shows a diff. But what is actually the difference? The diff viewer says there is a difference but as a human it isn’t possible to detect what it is. Is it a flaw? Does it matter? If done “correctly”, it would be done together with a _real_ and expected fix. The impact of changing one or more letters in a URL can of course be devastating depending on conditions. When I flagged about this rather big omission to GitHub people, I got barely no responses at all and I get the feeling the impact of this flaw is not understood and acknowledged. Or perhaps they are all just too busy implementing the next AI feature we don’t want. ## Warnings When we discussed this problem on Mastodon earlier this week, Viktor Szakats provided me with an example screenshot of doing a similar stunt with Gitea which quite helpfully highlights that there is something special about the replacement: Gitea warns that the replacement is using “ambiguous Unicode characters” I have been told that some of the other source code hosting services also show similar warnings. As a user, I would actually like to know even more than this, but at least this warns about the proposed change clearly enough so that if this happens I would get the code manually and investigate before accepting such a change. ## Detect While we wait for GitHub to wake up and react (which I have no expectation will actually happen anytime soon), we have implemented checks to help us poor humans spot things like this. _To detect malicious Unicode._ We have added a CI job that scans all files and validates every UTF-8 sequence in the git repository. In the curl git repository most files and most content are plain old ASCII so we can “easily” whitelist a small set of UTF-8 sequences and some specific files, the rest of the files are simply not allowed to use UTF-8 at all as they will then fail the CI job and turn up red. In order to drive this change home, we went through all the test files in the curl repository and made sure that all the UTF-8 occurrences were instead replaced by other kind of escape sequences and similar. Some of them were also used more or less by mistake and could easily be replaced by their ASCII counterparts. The next time someone tries this stunt on us it could be someone with less good intentions, but now ideally our CI will tell us. ## Confusables There are plenty of tools to find similar-looking characters in different Unicode sets. One of them is provided by the Unicode consortium themselves: https://util.unicode.org/UnicodeJsps/confusables.jsp ## Reactive This was yet another security-related fix _reacting_ on a demonstrated problem. I am sure there are plenty more problems which we have not yet thought about nor been shown and therefore we do not have adequate means to detect and act on automatically. We want and strive to be proactive and tighten everything _before_ malicious people exploit some weakness somewhere but security remains this never-ending race where we can only do the best we can and while _the other side_ is working in silence and might at some future point attack us in new creative ways we had not anticipated. That future unknown attack is a tricky thing.
daniel.haxx.se
May 16, 2025 at 7:10 AM
Reposted by Codepoints.net
If anyone wants a horrible python program that will write integers in cuneiform sexegesimal (base sixty) using unicode so you can paste it all over the place in emails, documents and text messages I have just the thing. Use with https://en.wikipedia.org/wiki/Plimpton_322

For maximum amusement. […]
Original post on sauropods.win
sauropods.win
April 10, 2025 at 3:39 PM
Quick hieroglyphs YouTube short:

This Ancient Hieroglyph is ADORABLE

https://www.youtube.com/shorts/-nnW8xrxp4w
April 9, 2025 at 10:53 AM