
Note however that some characters are not displayed in Internet Explorer: Use a link to a special character listed under the edit box to insert that character.There are four ways to enter a non-ASCII character into the wikitext: This page contains recommendations for which characters are safe to use and how to use them. Many characters not in the repertoire of standard ASCII will be useful-even necessary-for projects in a non-latin alphabet language. To find out which character set applies in a project, use the browser's "View Source" feature and look for something like this: Note that Special:Export exports using UTF-8 even if the database is encoded in ISO 8859-1, at least that was the case for the English Wikipedia, already when it used version 1.4. the most common special characters, such as é, are in the character set, so code like é, although allowed, is not needed.Unavailable characters which are copied into the edit box are first displayed as the character, and automatically converted to their decimal codes on Preview or Save.
The edit box shows the entered code, the webpage the resulting character. for the character € the named character reference € and the decimal character reference € and the hexadecimal character reference €.
special characters that are not available in the limited character set are stored in the form of a multi-character code there are usually two or three equivalent representations, e.g. special characters, including CJK characters, can be treated like normal ones not only the webpage, but also the edit box shows the character in addition it is possible to use the multi-character codes they are not automatically converted in the edit box. a variable number of bytes per character. It is also invisible to reusers as Wikimedia now uses XML dumps rather than database dumps. This conversion on load process is invisible to users. Edits made since the upgrade will be stored as UTF-8 in the database. Pre-upgrade wikitext in their databases remains stored in Windows-1252 and is converted on load (some of it may also have been converted by gradual changes in the way history is stored). Until the end of June 2005, when this new version came into use on Wikimedia projects, the English, Dutch, Danish, and Swedish Wikipedias used windows-1252 (they declared themselves to be ISO-8859-1 but in reality browsers treat the two as synonymous and the MediaWiki software made no attempt to prevent use of stuff from windows-1252). Overviewįrom MediaWiki 1.5, all projects use Unicode ( UTF-8) character encoding.