Relative Content

Tag Archive for character-encoding

How to detect client character encoding?

I programmed a telnet server using C as programming language but I have a problem to send characters with emphases (é, è, à …). The character encoding is different between the telnet clients (windows, linux, putty, …).

How do you face decoding issues?

For what I understand, given a sequence of bytes without any further information, it’s not generally possible to understand which encoding we are talking about. Of course we can guess (e.g. perl’s Encode::Guess and similar tools), but sometimes this is just not feasible.