Encoding values

This page lists the encoding types that you can use for your search box and results page. An encoding defines how text is represented as bytes when it is stored and transmitted over a network.

UTF-8, which is the default encoding type, works best in the vast majority of cases. In fact, many text issues in the results page can be resolved by keeping the UTF-8 value. The only time you need to change the encoding value for your results page and search box is when the hosting webpage is not in UTF-8. The encoding for Programmable Search Engine must match the encoding of your webpage.

You can define the encoding in either the control panel or the context file. In the Basics tab of the control panel, select the encoding from the Search engine encoding drop-down. In the context file, set the value of the encoding attribute of the CustomSearchEngine element, as in the following example:

<CustomSearchEngine volunteers="false"
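As a minimal sketch of where the attribute sits, the fragment below declares windows-1251 for a search engine embedded in a page served in that encoding. The choice of windows-1251 is only an illustration, and any other attributes or child elements that your context file needs are omitted here.

```xml
<!-- Illustrative fragment: the encoding value must match the hosting webpage. -->
<!-- windows-1251 is an assumed value picked from the table of supported encodings. -->
<CustomSearchEngine volunteers="false" encoding="windows-1251">
  <!-- The rest of the context file goes here (omitted in this sketch). -->
</CustomSearchEngine>
```

If the hosting page is in UTF-8, the attribute can be set to UTF-8 or left out, since UTF-8 is the default.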

The following table lists the values you can use with the encoding attribute.

Note: If you do not specify an encoding type, Programmable Search Engine will use UTF-8 as the default value.

| Encoding Type | Value |
| --- | --- |
| Unicode (UTF-8) | UTF-8 |
| Arabic (Windows-1256) | windows-1256 |
| Central European Latin-2 (ISO-8859-2) | ISO-8859-2 |
| Central European (Windows-1250) | windows-1250 |
| Central European (CP852) | cp852 |
| Chinese Simplified (GB2312) | GB2312 |
| Chinese Simplified (GB18030) | GB18030 |
| Chinese Traditional (Big5) | big5 |
| Cyrillic (ISO-8859-5) | ISO-8859-5 |
| Cyrillic (KOI8-R) | KOI8-R |
| Cyrillic (Windows-1251) | windows-1251 |
| Cyrillic/Russian (CP-866) | cp-866 |
| Greek (ISO-8859-7) | ISO-8859-7 |
| Hebrew (ISO-8859-8-I) | ISO-8859-8-I |
| Hebrew (Windows-1255) | windows-1255 |
| Japanese (Shift_JIS) | Shift_JIS |
| Japanese (EUC-JP) | EUC-JP |
| Japanese (ISO-2022-JP) | ISO-2022-JP |
| Korean (EUC-KR) | EUC-KR |
| Nordic Latin-6 (ISO-8859-10) | ISO-8859-10 |
| South European Latin-3 (ISO-8859-3) | ISO-8859-3 |
| Turkish Latin-5 (ISO-8859-9) | ISO-8859-9 |
| Turkish (Windows-1254) | windows-1254 |
| Vietnamese (Windows-1258) | windows-1258 |
| West European Latin-1 (ISO-8859-1) | ISO-8859-1 |
| West European Latin-9 (ISO-8859-15) | ISO-8859-15 |

