echo: golded
to: Nicholas Boel
from: Vitaliy Aksyonov
date: 2024-03-03 16:28:00
subject: Re: Need volonteers to te

Hello Nicholas.

03 Mar 24 10:33, you wrote to Michiel van der Vlist:

 MvdV>> Yes, they are translated to multi (usually two for most
 MvdV>> characters used in Fidonet) byte characters. Only the ASCII
 MvdV>> characters (0-127) are not translated and so remain one byte.

 NB> Thanks for the explanation. While reading this, I did check the ASCII
 NB> table and the characters I'm referring to are all above 127. This also
 NB> is kind of reflected while using Golded, if I widen my screen more
 NB> than 160 characters, less of those lines are wrapped to the next line.

 NB> However, it doesn't seem like I can widen my window enough to keep
 NB> them all on one line, so I'm guessing when they are translated to
 NB> utf-8 they are more than 2 bytes, since I'm well over double the width
 NB> of an 80 character screen - which the original stat message was made
 NB> for.

Pseudo-graphics symbols are encoded as three bytes each in UTF-8. That's why doubling the width is not enough.
So you either ignore it or make the window at least three times wider than that message. It might sound silly and like overkill, but it should do the trick. :)
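
To see where the factor of three comes from, here is a minimal Python sketch that compares character count with UTF-8 byte count. It is only an illustration, not anything GoldED itself does; the sample line and the helper name `utf8_len` are made up for this example.

```python
# Illustration only: compare the number of characters with the number of UTF-8 bytes.
def utf8_len(text: str) -> int:
    """Return how many bytes `text` occupies when encoded as UTF-8."""
    return len(text.encode("utf-8"))

# A made-up 80-column line of pseudo-graphics (box-drawing) characters:
# U+2554, then 78 x U+2550, then U+2557.
line = "\u2554" + "\u2550" * 78 + "\u2557"

print(len(line))                  # 80 characters
print(utf8_len(line))             # 240 bytes: three per character
print(len(line.encode("cp437")))  # 80 bytes: one per character in CP437

print(utf8_len("A"))       # 1 byte  (ASCII stays one byte)
print(utf8_len("\u0416"))  # 2 bytes (a Cyrillic letter)
print(utf8_len("\u2550"))  # 3 bytes (a box-drawing character)
```

An editor that wraps by bytes rather than characters would treat that 80-column line as 240 columns, which matches the "at least three times wider" estimate.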

 NB> So, at this point it's basically working and displaying properly, but
 NB> then comes in the 'characters' vs 'bytes' thing that Golded isn't
 NB> supporting, so it is wrapping what it thinks is double, triple, or
 NB> even quadruple the amount of 'characters' that are there.

 MvdV>> To put it simple: if you want to encode CP437 and CP866, you
 MvdV>> could put CP437 OR CP866 in the first byte, but you need at
 MvdV>> least one bit more information which one it is; CP437 or CP866.
 MvdV>> That is not exactly how UTF-8 works but it should give you an
 MvdV>> idea of why just one byte can not be enough.

 NB> Thank you for the explanation. This definitely helps me to understand
 NB> what is happening.

Unicode is very complex. It's even worse than you may think. For example, some displayed symbols take one character cell on screen, but others take two.
Also, one letter is not necessarily one Unicode "symbol" (code point): it may be built from several, such as a base letter plus a combining accent. There are also non-printable characters, and much more.
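
A small Python sketch of those three cases, using only the standard `unicodedata` module. The `rough_width` helper is hypothetical and deliberately crude (real terminals use more detailed width tables), so treat it as an approximation under those assumptions.

```python
import unicodedata

# One letter, two code points: "e" followed by a combining acute accent.
decomposed = "e\u0301"
composed = unicodedata.normalize("NFC", decomposed)  # single code point U+00E9
print(len(decomposed), len(composed))  # 2 1

# Wide characters: East Asian Width "W" or "F" usually means two screen cells.
print(unicodedata.east_asian_width("\u4e2d"))  # W  (a CJK ideograph)
print(unicodedata.east_asian_width("A"))       # Na (narrow)

# Non-printable: ZERO WIDTH SPACE has general category Cf (format).
print(unicodedata.category("\u200b"))  # Cf

def rough_width(text: str) -> int:
    """Crude estimate of screen cells; hypothetical helper, not a real wcwidth."""
    width = 0
    for ch in text:
        if unicodedata.combining(ch) or unicodedata.category(ch) in ("Mn", "Me", "Cf"):
            continue  # combining marks and format characters take no cell
        width += 2 if unicodedata.east_asian_width(ch) in ("W", "F") else 1
    return width

print(rough_width(decomposed))      # 1
print(rough_width("\u4e2d\u6587"))  # 4
print(rough_width("a\u200bb"))      # 2
```

So the byte count, the code point count, and the number of screen cells can all differ for the same piece of text.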

Vitaliy

... C????o ?a ??epa??ee, ?o ?e ?o??? ?epe? ?e?...
--- GoldED+/LNX 1.1.5-b20240223
                                                                                         
* Origin: Aurora, Colorado (1:104/117)

SOURCE: echomail via QWK@pharcyde.org
