if you enjoy some good old fashioned "you are definitely using the wrong programing language" code, here's UTF-8 encoding in pure VCL. which is essentially supposed to be a http config language. i am especially pleased with this
community.fastly.com/t/utf-8-enco...
community.fastly.com/t/utf-8-enco...
UTF-8 encoding in pure VCL
Just for fun, here’s UTF-8 encoding in pure VCL. I wrote this as part of our language tests when we introduced bitwise operators a while ago, but I think it’s also a good example for passing argument...
community.fastly.com
November 7, 2025 at 10:23 AM
if you enjoy some good old fashioned "you are definitely using the wrong programing language" code, here's UTF-8 encoding in pure VCL. which is essentially supposed to be a http config language. i am especially pleased with this
community.fastly.com/t/utf-8-enco...
community.fastly.com/t/utf-8-enco...
Fish 4.2 introduces history-based multi-line autosuggestions, sets UTF-8 as default encoding, adds tab title control, and improves build stability with bug fixes buff.ly/UbVUM9t
November 11, 2025 at 9:00 PM
Fish 4.2 introduces history-based multi-line autosuggestions, sets UTF-8 as default encoding, adds tab title control, and improves build stability with bug fixes buff.ly/UbVUM9t
Trottelbot kann kein UTF-8...
November 1, 2025 at 7:43 AM
Trottelbot kann kein UTF-8...
Also, debating if I should support UTF-16 input, or just say fuck it and only work with UTF-8 and anything else needs to be re-encoded.
November 2, 2025 at 2:18 PM
Also, debating if I should support UTF-16 input, or just say fuck it and only work with UTF-8 and anything else needs to be re-encoded.
🧩 Day 9 of my #PDFwithPython journey
Today I worked with line spacing — a key formatting option when rendering longer text blocks in PDFs. Increasing line spacing improves readability, especially for paragraphs and multi-line content.
#Python #PDF #LearnPython #100DaysOfCode
Today I worked with line spacing — a key formatting option when rendering longer text blocks in PDFs. Increasing line spacing improves readability, especially for paragraphs and multi-line content.
#Python #PDF #LearnPython #100DaysOfCode
October 30, 2025 at 8:19 PM
🧩 Day 9 of my #PDFwithPython journey
Today I worked with line spacing — a key formatting option when rendering longer text blocks in PDFs. Increasing line spacing improves readability, especially for paragraphs and multi-line content.
#Python #PDF #LearnPython #100DaysOfCode
Today I worked with line spacing — a key formatting option when rendering longer text blocks in PDFs. Increasing line spacing improves readability, especially for paragraphs and multi-line content.
#Python #PDF #LearnPython #100DaysOfCode
echo mb_convert_case(ucwords($foo), MB_CASE_TITLE, "UTF-8");
October 29, 2025 at 9:32 AM
echo mb_convert_case(ucwords($foo), MB_CASE_TITLE, "UTF-8");
<?xml version="1.0" encoding="utf-8"?><jerry><no /></jerry>
October 27, 2025 at 10:02 PM
<?xml version="1.0" encoding="utf-8"?><jerry><no /></jerry>
🚀 Released path-security v0.2.0 🦀🛡️
Rust library for a path validation - protects against 62+ traversal techniques used in real-world exploits.
✅ URL & UTF-8 overlong encoding
✅ Unicode homoglyphs / zero-width tricks
✅ NTFS streams, UNC paths
✅ 95.81% test coverage
📦 crates.io/crates/path-security
Rust library for a path validation - protects against 62+ traversal techniques used in real-world exploits.
✅ URL & UTF-8 overlong encoding
✅ Unicode homoglyphs / zero-width tricks
✅ NTFS streams, UNC paths
✅ 95.81% test coverage
📦 crates.io/crates/path-security
crates.io: Rust Package Registry
crates.io
October 25, 2025 at 9:54 PM
🚀 Released path-security v0.2.0 🦀🛡️
Rust library for a path validation - protects against 62+ traversal techniques used in real-world exploits.
✅ URL & UTF-8 overlong encoding
✅ Unicode homoglyphs / zero-width tricks
✅ NTFS streams, UNC paths
✅ 95.81% test coverage
📦 crates.io/crates/path-security
Rust library for a path validation - protects against 62+ traversal techniques used in real-world exploits.
✅ URL & UTF-8 overlong encoding
✅ Unicode homoglyphs / zero-width tricks
✅ NTFS streams, UNC paths
✅ 95.81% test coverage
📦 crates.io/crates/path-security
🤖 IA só lida bem com ASCII !!! 🤣
Notei esse problema em dois serviços diferentes.
O Claude apontava erros ortográficos num texto UTF-8 porque tava lendo os byres das letras acentuadas como caracteres separados!
Agora no Substack na Web, a IA lendo artigo em inglês pronunciou FAÇADE como FA-ADE!!
Notei esse problema em dois serviços diferentes.
O Claude apontava erros ortográficos num texto UTF-8 porque tava lendo os byres das letras acentuadas como caracteres separados!
Agora no Substack na Web, a IA lendo artigo em inglês pronunciou FAÇADE como FA-ADE!!
October 22, 2025 at 11:35 PM
🤖 IA só lida bem com ASCII !!! 🤣
Notei esse problema em dois serviços diferentes.
O Claude apontava erros ortográficos num texto UTF-8 porque tava lendo os byres das letras acentuadas como caracteres separados!
Agora no Substack na Web, a IA lendo artigo em inglês pronunciou FAÇADE como FA-ADE!!
Notei esse problema em dois serviços diferentes.
O Claude apontava erros ortográficos num texto UTF-8 porque tava lendo os byres das letras acentuadas como caracteres separados!
Agora no Substack na Web, a IA lendo artigo em inglês pronunciou FAÇADE como FA-ADE!!
CUI時代からjgawkを愛用しています。多くのOSに移植されており、日本語処理が可能(UTF-8対応)なので。
ちょっとしたデータの抽出や加工なら、jgawkだけでできますしね😀
ちょっとしたデータの抽出や加工なら、jgawkだけでできますしね😀
今日のQiitaトレンド
AWKの簡単な使い方
AWKは、テキストデータ処理に特化したプログラミング言語であり、AWKスクリプトを処理するインタプリタです。
主にターミナル上でワンライナーとして使用され、空白やカンマなどで区切られたデータを行ごとに処理します。
基本的な実行形式は`awk 'パターン {アクション}'`であり、データの検索、抽出、置換などが行えます。
AWKは、他のスクリプト言語が利用できない環境でも手軽に使える利点があります。
複雑な処理よりも、コマンドのように簡潔に使うことが推奨されます。
AWKの簡単な使い方
AWKは、テキストデータ処理に特化したプログラミング言語であり、AWKスクリプトを処理するインタプリタです。
主にターミナル上でワンライナーとして使用され、空白やカンマなどで区切られたデータを行ごとに処理します。
基本的な実行形式は`awk 'パターン {アクション}'`であり、データの検索、抽出、置換などが行えます。
AWKは、他のスクリプト言語が利用できない環境でも手軽に使える利点があります。
複雑な処理よりも、コマンドのように簡潔に使うことが推奨されます。
AWKの簡単な使い方 #Bash - Qiita
はじめに 前回書いた記事でAWKをほんの少しだけ使用したが、AWKとは何ですか?とごく少数の人に聞かれたので自分の知っている範囲で記載します GNU awkであることを前提としています AWKとは AWKとはAWKスクリプトを処理するインタプリタで、プログラミン...
qiita.com
October 23, 2025 at 1:04 AM
CUI時代からjgawkを愛用しています。多くのOSに移植されており、日本語処理が可能(UTF-8対応)なので。
ちょっとしたデータの抽出や加工なら、jgawkだけでできますしね😀
ちょっとしたデータの抽出や加工なら、jgawkだけでできますしね😀
O_O
Um…I'm not sure I'm gonna waste time blogging about Atlas.
This is literally the first thing I did with it.
I even hinted it had prompt injection.
Don't use it.
Don't use any "AI" powered browser.
Ever.
Um…I'm not sure I'm gonna waste time blogging about Atlas.
This is literally the first thing I did with it.
I even hinted it had prompt injection.
Don't use it.
Don't use any "AI" powered browser.
Ever.
October 23, 2025 at 8:25 AM
O_O
Um…I'm not sure I'm gonna waste time blogging about Atlas.
This is literally the first thing I did with it.
I even hinted it had prompt injection.
Don't use it.
Don't use any "AI" powered browser.
Ever.
Um…I'm not sure I'm gonna waste time blogging about Atlas.
This is literally the first thing I did with it.
I even hinted it had prompt injection.
Don't use it.
Don't use any "AI" powered browser.
Ever.
O Claude é tão tosco que não sabe lidar com UTF-8.
Sobe um texto acentuado em UTF-8 para revisar. Diga que é UTF-8. Mesmo assim, ele repetidamente alucina erros que só existem porque está lendo o texto byte a byte, e não entende que em UTF-8 um caractere acentuado é feito de dois bytes!
Ridículo.
Sobe um texto acentuado em UTF-8 para revisar. Diga que é UTF-8. Mesmo assim, ele repetidamente alucina erros que só existem porque está lendo o texto byte a byte, e não entende que em UTF-8 um caractere acentuado é feito de dois bytes!
Ridículo.
October 20, 2025 at 6:50 PM
O Claude é tão tosco que não sabe lidar com UTF-8.
Sobe um texto acentuado em UTF-8 para revisar. Diga que é UTF-8. Mesmo assim, ele repetidamente alucina erros que só existem porque está lendo o texto byte a byte, e não entende que em UTF-8 um caractere acentuado é feito de dois bytes!
Ridículo.
Sobe um texto acentuado em UTF-8 para revisar. Diga que é UTF-8. Mesmo assim, ele repetidamente alucina erros que só existem porque está lendo o texto byte a byte, e não entende que em UTF-8 um caractere acentuado é feito de dois bytes!
Ridículo.
Попри невимовну самобутність, VBA все ж привчає до оголошення змінних для обʼєктів та їх призначення цим змінним.
Але нормальну підтримку XML могли би і зробити, не кажучи вже про UTF-8 🤷🏻♀️
Але нормальну підтримку XML могли би і зробити, не кажучи вже про UTF-8 🤷🏻♀️
Хостинг на Git то ще нормально, от я зараз схрещую VBA та XML
October 20, 2025 at 4:19 PM
Попри невимовну самобутність, VBA все ж привчає до оголошення змінних для обʼєктів та їх призначення цим змінним.
Але нормальну підтримку XML могли би і зробити, не кажучи вже про UTF-8 🤷🏻♀️
Але нормальну підтримку XML могли би і зробити, не кажучи вже про UTF-8 🤷🏻♀️
Normale Mails an einem Freitag von der IT.
Es gibt nichts zu sehen.
Bitte weitergehen.
Nein, über UTF-8 reden wir jetzt nicht(!).
Es gibt nichts zu sehen.
Bitte weitergehen.
Nein, über UTF-8 reden wir jetzt nicht(!).
October 17, 2025 at 10:47 AM
Normale Mails an einem Freitag von der IT.
Es gibt nichts zu sehen.
Bitte weitergehen.
Nein, über UTF-8 reden wir jetzt nicht(!).
Es gibt nichts zu sehen.
Bitte weitergehen.
Nein, über UTF-8 reden wir jetzt nicht(!).
UTF-8 範囲外の文字がLinux kernel のコミットタイトルに出てくることがあるので気をつけよう hikalium.hatenablog.jp/entry/2025/0... 👀
gitのコミットメッセージってエンコーディング情報を持ってるのか。知らなかった。
ISO-8859-*やCP125xをサポートして、CJKのマルチバイトエンコーディングをサポートしていないのは、公平でないように感じてはしまうが。
gitのコミットメッセージってエンコーディング情報を持ってるのか。知らなかった。
ISO-8859-*やCP125xをサポートして、CJKのマルチバイトエンコーディングをサポートしていないのは、公平でないように感じてはしまうが。
UTF-8 範囲外の文字がLinux kernel のコミットタイトルに出てくることがあるので気をつけよう - /var/log/hikalium
みんなが知ってるそのスマホにも密かに流れる、そうLinux kernel. 最近はそのコミットを眺める機会が多いのですが、色々調べていたら面白い点を見つけたので紹介します。 UTF-8として有効ではないコミットタイトル いまから137万コミットほど前のこと、2006年のコミットタイトルに、UTF-8の範囲外の文字が含まれていました。(don<B4>tと書いてある部分がそれです。) commit d...
hikalium.hatenablog.jp
October 18, 2025 at 12:45 AM
UTF-8 範囲外の文字がLinux kernel のコミットタイトルに出てくることがあるので気をつけよう hikalium.hatenablog.jp/entry/2025/0... 👀
gitのコミットメッセージってエンコーディング情報を持ってるのか。知らなかった。
ISO-8859-*やCP125xをサポートして、CJKのマルチバイトエンコーディングをサポートしていないのは、公平でないように感じてはしまうが。
gitのコミットメッセージってエンコーディング情報を持ってるのか。知らなかった。
ISO-8859-*やCP125xをサポートして、CJKのマルチバイトエンコーディングをサポートしていないのは、公平でないように感じてはしまうが。
En media hora voy a participar en una Ruby meet-up online en Argentina, donde hablaré de strings y encoding en español 😱🇪🇸 Todavía no he decidido si UTF-8 lo diré como iu-ti-ef-eit o u-te-efe-ocho 🤔
ruby.com.ar#highlighted
ruby.com.ar#highlighted
RubySur Argentina
Conectate con la comunidad de Ruby en Argentina y mirá las charlas pasadas.
ruby.com.ar
October 15, 2025 at 7:29 PM
En media hora voy a participar en una Ruby meet-up online en Argentina, donde hablaré de strings y encoding en español 😱🇪🇸 Todavía no he decidido si UTF-8 lo diré como iu-ti-ef-eit o u-te-efe-ocho 🤔
ruby.com.ar#highlighted
ruby.com.ar#highlighted
Forget* about Python 3.14, the cool kids are on 3.15.0a1! 🚀
🔬 Dedicated profiling package for Python profiling tools
💬 UTF-8 as default encoding
🌊 New C API to create bytes object
⚠️ Better error messages
discuss.python.org/t/python-3-1...
* Don't forget about 3.14
#Python #Python315 #release
🔬 Dedicated profiling package for Python profiling tools
💬 UTF-8 as default encoding
🌊 New C API to create bytes object
⚠️ Better error messages
discuss.python.org/t/python-3-1...
* Don't forget about 3.14
#Python #Python315 #release
Python 3.15 alpha 1
This is an early developer preview of Python 3.15 Major new features of the 3.15 series, compared to 3.14 Python 3.15 is still in development. This release, 3.15.0a1, is the first of seven planned ...
discuss.python.org
October 14, 2025 at 4:07 PM
Forget* about Python 3.14, the cool kids are on 3.15.0a1! 🚀
🔬 Dedicated profiling package for Python profiling tools
💬 UTF-8 as default encoding
🌊 New C API to create bytes object
⚠️ Better error messages
discuss.python.org/t/python-3-1...
* Don't forget about 3.14
#Python #Python315 #release
🔬 Dedicated profiling package for Python profiling tools
💬 UTF-8 as default encoding
🌊 New C API to create bytes object
⚠️ Better error messages
discuss.python.org/t/python-3-1...
* Don't forget about 3.14
#Python #Python315 #release
Ok who thought it'd be a great idea to have std::filesystem to use different char type on Windows?
I don't care that some nerd bet wrong on UTF-16 thirty years ago, we shouldn't have to deal with this nonsense in a 21st century standard abstraction.
Any 2010+ API should be UTF-8.
I don't care that some nerd bet wrong on UTF-16 thirty years ago, we shouldn't have to deal with this nonsense in a 21st century standard abstraction.
Any 2010+ API should be UTF-8.
October 3, 2025 at 1:11 PM
Ok who thought it'd be a great idea to have std::filesystem to use different char type on Windows?
I don't care that some nerd bet wrong on UTF-16 thirty years ago, we shouldn't have to deal with this nonsense in a 21st century standard abstraction.
Any 2010+ API should be UTF-8.
I don't care that some nerd bet wrong on UTF-16 thirty years ago, we shouldn't have to deal with this nonsense in a 21st century standard abstraction.
Any 2010+ API should be UTF-8.
Three weeks into building OpenGraph.tools and the hardest part isn't the code—it's handling edge cases. Broken UTF-8, weird redirects, sites that block bots. The web is beautifully messy.
#BuildInPublic #IndieHacker #Dev #SaaS #DevTools #OpenGraph
#BuildInPublic #IndieHacker #Dev #SaaS #DevTools #OpenGraph
October 1, 2025 at 8:27 PM
Three weeks into building OpenGraph.tools and the hardest part isn't the code—it's handling edge cases. Broken UTF-8, weird redirects, sites that block bots. The web is beautifully messy.
#BuildInPublic #IndieHacker #Dev #SaaS #DevTools #OpenGraph
#BuildInPublic #IndieHacker #Dev #SaaS #DevTools #OpenGraph
I grabed the diamond PNG here! This tool is so useful for grabbing utf-8 symbols.
symbol.so
symbol.so
Text Symbols Copy and Paste - Cool Symbols ✿ ➹ © ✔ ✦ ❥
Copy and paste cool text symbols such as arrows →, copyright ©, hearts ❤, stars ★, letters Ⓐ, and more with their Unicode, HTML, CSS, Alt code, etc., easily.
symbol.so
September 28, 2025 at 12:19 AM
I grabed the diamond PNG here! This tool is so useful for grabbing utf-8 symbols.
symbol.so
symbol.so
Understanding the Odin Programming Language 1.8 is out! The biggest change is an overhauled strings chapter. It explains UTF-8 and runes in a much clearer way.
Release notes are here: odinbook.com#release_notes
Also, see this thread for some free goodies related to the update.
Release notes are here: odinbook.com#release_notes
Also, see this thread for some free goodies related to the update.
Understanding the Odin Programming Language
Learn the Odin Programming Language and demystify low-level programming
odinbook.com
September 25, 2025 at 6:17 PM
Understanding the Odin Programming Language 1.8 is out! The biggest change is an overhauled strings chapter. It explains UTF-8 and runes in a much clearer way.
Release notes are here: odinbook.com#release_notes
Also, see this thread for some free goodies related to the update.
Release notes are here: odinbook.com#release_notes
Also, see this thread for some free goodies related to the update.
Danke, hab schon vermutet, dass das der gleiche UTF-8-Sonderzeichentrick ist wie auf anderen Plattformen.
Hilft den nicht grafisch lesenden halt so überhaupt nicht mit Screenreader und/oder Braillezeile.
Hilft den nicht grafisch lesenden halt so überhaupt nicht mit Screenreader und/oder Braillezeile.
September 25, 2025 at 1:18 PM
Danke, hab schon vermutet, dass das der gleiche UTF-8-Sonderzeichentrick ist wie auf anderen Plattformen.
Hilft den nicht grafisch lesenden halt so überhaupt nicht mit Screenreader und/oder Braillezeile.
Hilft den nicht grafisch lesenden halt so überhaupt nicht mit Screenreader und/oder Braillezeile.