-
[C/C++]
zhongwen.rar
C++ input is normally not handled as wide characters. When doing word segmentation in C++, the input can still be read as narrow characters; converting it to wide characters and running the substring checks on the wide-character string largely solves the segmentation problem.
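To illustrate why substring checks are easier on wide characters, here is a minimal sketch. It is not taken from zhongwen.rar: it assumes the narrow input is UTF-8 (the archive may well use GBK), and the helper names `decode_utf8` and `take_chars` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <string>

// Decode a UTF-8 byte string into one code point per element.
// Assumption: the input is well-formed UTF-8; nothing is validated here.
std::u32string decode_utf8(const std::string& bytes) {
    std::u32string out;
    std::size_t i = 0;
    while (i < bytes.size()) {
        unsigned char lead = static_cast<unsigned char>(bytes[i]);
        char32_t cp;
        std::size_t len;
        if (lead < 0x80)      { cp = lead;        len = 1; }
        else if (lead < 0xE0) { cp = lead & 0x1F; len = 2; }
        else if (lead < 0xF0) { cp = lead & 0x0F; len = 3; }
        else                  { cp = lead & 0x07; len = 4; }
        // Fold in the continuation bytes, six payload bits each.
        for (std::size_t k = 1; k < len && i + k < bytes.size(); ++k) {
            cp = (cp << 6) | (static_cast<unsigned char>(bytes[i + k]) & 0x3F);
        }
        out.push_back(cp);
        i += len;
    }
    return out;
}

// Return `count` characters starting at character index `start`.
// Indices count whole characters, so a Chinese character is never cut in half.
std::u32string take_chars(const std::u32string& text,
                          std::size_t start, std::size_t count) {
    assert(start <= text.size());  // substr would throw past the end
    return text.substr(start, count);
}
```

With the narrow string, a two-character Chinese word occupies six bytes, so byte offsets do not line up with characters. After decoding, `take_chars(text, 0, 2)` returns exactly the first two characters, and `text.find(word)` reports a position counted in characters.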
-
[Visual C++ (VC++)]
utf8_gbk.rar
Implements character set conversion for documents. It converts text from UTF-8 to GBK (the GB national-standard encoding), which is convenient for fixing garbled characters displayed in an interface.
-
[C/C++]
cipinbijiao.rar
Extracts the words that appear before and after personal names in the Peking University corpus. A threshold controls which of them are selected.
-
[Visual C++ (VC++)]
SimpleSplit.rar
A simple word segmentation program I wrote myself. It can identify Chinese and English text, punctuation, numbers, and so on. The speed is not ideal, but the ideas in it may be useful for your reference!
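For readers who want a feel for this kind of splitter, here is a minimal sketch. It is not the code in SimpleSplit.rar: it assumes the text has already been decoded to one code point per element, treats only the CJK Unified Ideographs range U+4E00 to U+9FFF as Chinese, and the names `CharClass`, `Token`, `classify`, and `split` are hypothetical.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

enum class CharClass { Chinese, Letter, Digit, Space, Punct };

// Classify one code point. Anything that is not recognized is treated
// as punctuation, which is a simplification.
CharClass classify(char32_t c) {
    if (c >= 0x4E00 && c <= 0x9FFF) return CharClass::Chinese;
    if ((c >= U'a' && c <= U'z') || (c >= U'A' && c <= U'Z')) return CharClass::Letter;
    if (c >= U'0' && c <= U'9') return CharClass::Digit;
    if (c == U' ' || c == U'\t' || c == U'\n' || c == U'\r') return CharClass::Space;
    return CharClass::Punct;
}

struct Token {
    CharClass kind;
    std::u32string text;
};

// Split text into runs of the same character class.
// Whitespace is dropped; each punctuation mark becomes its own token.
std::vector<Token> split(const std::u32string& text) {
    std::vector<Token> tokens;
    std::size_t i = 0;
    while (i < text.size()) {
        CharClass kind = classify(text[i]);
        std::size_t j = i + 1;
        if (kind != CharClass::Punct) {
            while (j < text.size() && classify(text[j]) == kind) ++j;
        }
        assert(j > i);  // every pass consumes at least one character
        if (kind != CharClass::Space) {
            tokens.push_back({kind, text.substr(i, j - i)});
        }
        i = j;
    }
    return tokens;
}
```

This version keeps a run of Chinese characters together as one token. Cutting such a run into words would need a dictionary or a statistical model, which is beyond this sketch.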
-
[C/C++]
member.rar
A small address book program for DOS, intended as a learning reference. It stores reference numbers, telephone numbers, and names, and can save them to a file so they can be viewed at any time. Chinese is not supported.
-
[Visual C++ (VC++)]
IR_Lib.rar
XPDF: a library that converts PDF files to text documents; for Chinese language support, download the Chinese language pack from the official website. HTM2TXT: a library that converts HTML files to text files. ICTCLAS: a Chinese word segmentation library. PS2TXT: the ...
-