ports/textproc/uchardet/pkg/DESCR

13 lines
565 B
Text
Raw Normal View History

2023-08-16 22:26:55 +00:00
uchardet is a C language binding of the original C++ implementation of
the universal charset detection library by Mozilla.
uchardet is an encoding detector library, which takes a sequence of bytes
in an unknown character encoding without any additional information, and
attempts to determine the encoding of the text.
The original code of universalchardet is available at
http://lxr.mozilla.org/seamonkey/source/extensions/universalchardet/
Techniques used by universalchardet are described at
http://www.mozilla.org/projects/intl/UniversalCharsetDetection.html