===PUA characters===
[[Wikipedia:Private Use Area|Private Use Area]] (PUA) characters are in three ranges of code points (<code>U+E000</code>–<code>U+F8FF</code> in the [[Wikipedia:Plane (Unicode)#Basic Multilingual Plane|BMP]], and in [[Wikipedia:Plane (Unicode)#Private Use Area planes|planes 15 and 16]]). PUA characters should normally be avoided, but they are sometimes used when they are found in common fonts.
Where PUA characters cannot be replaced with non-PUA Unicode characters, they should be converted to their (hexa)decimal code values (that is, &#...; or &#x...;). However, whenever a PUA character has a Unicode equivalent, it should instead be replaced with that equivalent (Unicodified). The equivalent may be obvious when text is copied and pasted from a document that uses the PUA for bullets or similar characters in Latin text. Similar substitutions occur with punctuation and emoticons in documents using Japanese and other scripts, where an editor familiar with those scripts may be needed. Such PUA characters, which are sometimes found on the UniWiki in references and footnotes, should not be substituted with their (hexa)decimal values, as that will lock in the illegible character. If you are moderately familiar with the script, an internet search of the surrounding text will often locate a fully Unicode version of the text, which can be used to correct the UniWiki article.
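The conversion to hexadecimal code values can be sketched in Python. This is a minimal illustration, not an established UniWiki tool: the names <code>PUA_PATTERN</code>, <code>find_pua</code> and <code>pua_to_hex_references</code> are hypothetical. It assumes an editor has already checked that the characters have no Unicode equivalent and are not illegible leftovers from another script, since those cases call for replacement rather than conversion.

```python
import re

# The three Private Use Area ranges: U+E000-U+F8FF in the BMP,
# plus the private-use code points of planes 15 and 16.
PUA_PATTERN = re.compile(
    "[\uE000-\uF8FF\U000F0000-\U000FFFFD\U00100000-\U0010FFFD]"
)


def find_pua(text):
    """Return (index, 'U+XXXX') pairs so an editor can review each PUA character."""
    return [
        (index, f"U+{ord(char):04X}")
        for index, char in enumerate(text)
        if PUA_PATTERN.match(char)
    ]


def pua_to_hex_references(text):
    """Replace each PUA character with a hexadecimal reference such as &#xE000;."""
    return PUA_PATTERN.sub(lambda m: f"&#x{ord(m.group()):X};", text)
```

Running <code>find_pua</code> first gives a list to review by hand; <code>pua_to_hex_references</code> should only be applied to text whose PUA characters genuinely cannot be Unicodified.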
Because browsers do not know which fonts to use for PUA characters, the UniWiki must specify them. The fonts should be specified through HTML markup, as in the example below. Note that if no font is specified, or if none of the specified fonts is installed, readers will see only a numbered box in place of the PUA character.
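One way to produce such markup is sketched below in Python. It assumes that an inline <code>&lt;span&gt;</code> with a CSS <code>font-family</code> list is an acceptable form of font specification; the function name <code>wrap_pua_with_fonts</code> and the font names in the usage note are placeholders, not fonts the UniWiki is known to use.

```python
import re

# Runs of one or more Private Use Area characters (BMP, plane 15, plane 16).
PUA_RUN = re.compile(
    "[\uE000-\uF8FF\U000F0000-\U000FFFFD\U00100000-\U0010FFFD]+"
)


def wrap_pua_with_fonts(text, fonts):
    """Wrap each run of PUA characters in a span that names candidate fonts.

    The characters themselves are written as hexadecimal references so they
    stay visible in the wikitext even where no suitable font is installed.
    """
    font_list = ", ".join(f"'{name}'" for name in fonts)

    def wrap(match):
        refs = "".join(f"&#x{ord(char):X};" for char in match.group())
        return f'<span style="font-family: {font_list};">{refs}</span>'

    return PUA_RUN.sub(wrap, text)
```

For example, <code>wrap_pua_with_fonts(text, ["Example Font A", "Example Font B"])</code> lists two placeholder fonts in order of preference; a reader with neither installed would still see the numbered box described above.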
== See also ==