This page lists the encoding types that you can use for your search box and results page. Encoding tells computers how to store and transmit text over the network.
UTF-8, which is the default encoding type, works best in the vast majority of cases. In fact, many text issues in the results page can be resolved by keeping the UTF-8 value. The only time you need to change the encoding value for your results page and search box is when the hosting webpage is not in UTF-8. The encoding for Programmable Search Engine must match the encoding of your webpage.
You can define the encoding either in the control panel or in the context file. In the Basics tab of the control panel, select the encoding from the Search engine encoding drop-down. In the context file, set the value of the encoding attribute of the CustomSearchEngine element, as in the following example:
<CustomSearchEngine volunteers="false" language="en" visible="false" encoding="UTF-8"> </CustomSearchEngine>
The following table lists the values you can use with the encoding attribute.
Note: If you do not specify an encoding type, Programmable Search Engine uses UTF-8 as the default value.
Encoding Type | Value |
---|---|
Unicode (UTF-8) | UTF-8 |
Arabic (Windows-1256) | windows-1256 |
Central European Latin-2 (ISO-8859-2) | ISO-8859-2 |
Central European (Windows-1250) | windows-1250 |
Central European (CP852) | cp852 |
Chinese Simplified (GB2312) | GB2312 |
Chinese Simplified (GB18030) | GB18030 |
Chinese Traditional (Big5) | big5 |
Cyrillic (ISO-8859-5) | ISO-8859-5 |
Cyrillic (KOI8-R) | KOI8-R |
Cyrillic (Windows-1251) | windows-1251 |
Cyrillic/Russian (CP-866) | cp-866 |
Greek (ISO-8859-7) | ISO-8859-7 |
Hebrew (ISO-8859-8-I) | ISO-8859-8-I |
Hebrew (Windows-1255) | windows-1255 |
Japanese (Shift_JIS) | Shift_JIS |
Japanese (EUC-JP) | EUC-JP |
Japanese (ISO-2022-JP) | ISO-2022-JP |
Korean (EUC-KR) | EUC-KR |
Nordic Latin-6 (ISO-8859-10) | ISO-8859-10 |
South European Latin-3 (ISO-8859-3) | ISO-8859-3 |
Turkish Latin-5 (ISO-8859-9) | ISO-8859-9 |
Turkish (Windows-1254) | windows-1254 |
Vietnamese (Windows-1258) | windows-1258 |
West European Latin-1 (ISO-8859-1) | ISO-8859-1 |
West European Latin-9 (ISO-8859-15) | ISO-8859-15 |
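As an illustration of a non-default value from the table, the sketch below shows how the context file might look for a search box hosted on a page that is served in Shift_JIS. It reuses the attributes from the earlier example. The language="ja" value is an assumption made for this illustration, and the rest of the context file is omitted.

```xml
<!-- Hypothetical context file fragment for a page served in Shift_JIS. -->
<!-- The encoding value is taken from the Value column of the table; language="ja" is assumed for illustration. -->
<CustomSearchEngine volunteers="false" language="ja" visible="false" encoding="Shift_JIS">
</CustomSearchEngine>
```

For this to work as intended, the hosting page itself would need to be served in Shift_JIS, since the two encodings must match.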