ShapefileDataSet.CodePage Property
Gets or sets the code page to use for attribute names and string attribute values.
Syntax
public System.Int32 CodePage { get; set; }
Property Value
Default: 0
The code page in which attribute names and string attribute values are encoded.
This property can only be modified before the dataset has been initialized. Attempting to modify it after initialization will cause an exception to be thrown.
Remarks
Shapefiles are not always encoded with UTF-8. The code page that has been used should detectable via metadata in the files, but the information is sometimes missing.
The default value of the CodePage property tells Carmenta Engine to detect the code page automatically from metadata. In this case, Carmenta Engine will first try to read the code page from the .cpg file. If the .cpg file is missing or could not be parsed, Carmenta Engine will try to deduce the code page from the Language Driver ID in the .dbf file. Failing that, the code page 65001 for UTF-8 will be used.
So you need to set an explicit CodePage value only if the metadata information is missing and the encoding is not UTF-8, or if the metadata information is present but wrong.
If the CodePage has been set to the special value −1, the active code page of the system will be used. The −1 usually occurs only in configuration files that Carmenta Studio has migrated automatically from some Carmenta Engine version earlier than 5.8, where using the active code page was the default behavior. So in this case, the automatic migration has been careful to preserve the pre-5.8 behavior of the configuration, but it will usually be better to manually replace −1 by the default value.
A few common code pages are mentioned in the table below:
Code page | Description |
---|---|
437 | DOSLatinUS |
850 | DOSLatin1 |
852 | DOSLatin2 |
855 | DOSCyrillic |
866 | DOSCyrillicRussian |
874 | DOSThai |
1250 | WinLatin2 |
1251 | WinLatinCyrillic |
1252 | WinLatin1 |
1257 | WinBaltic |
65001 | UTF-8 |
Platforms
Windows, Linux, Android