Sophie

Sophie

distrib > Mageia > 4 > x86_64 > by-pkgid > f800694edefe91adea2624f711a41a2d > files > 9090

php-manual-en-5.5.7-1.mga4.noarch.rpm

<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN" "http://www.w3.org/TR/html4/loose.dtd">
<html>
 <head>
  <meta http-equiv="content-type" content="text/html; charset=UTF-8">
  <title>Basics of Japanese multi-byte encodings</title>

 </head>
 <body><div class="manualnavbar" style="text-align: center;">
 <div class="prev" style="text-align: left; float: left;"><a href="mbstring.encodings.html">Summaries of supported encodings</a></div>
 <div class="next" style="text-align: right; float: right;"><a href="mbstring.http.html">HTTP Input and Output</a></div>
 <div class="up"><a href="book.mbstring.html">Multibyte String</a></div>
 <div class="home"><a href="index.html">PHP Manual</a></div>
</div><hr /><div id="mbstring.ja-basic" class="chapter">
 <h1>Basics of Japanese multi-byte encodings</h1>

 <p class="para">
  Japanese characters can only be represented by multibyte encodings, 
  and multiple encoding standards are used depending on platform and 
  text purpose. To make matters worse, these encoding standards 
  differ slightly from one another. In order to create a web 
  application which would be usable in a Japanese environment, a 
  developer has to keep these complexities in mind to ensure that the
  proper character encodings are used.
 </p>
 <p class="para">
  <ul class="itemizedlist">
   <li class="listitem">
    <span class="simpara">Storage for a character can be up to six bytes</span>
   </li>
   <li class="listitem">
    <span class="simpara">
     Most Japanese multibyte characters appear twice as wide as
     single-byte characters. These characters are called 
     &quot;zen-kaku&quot; in Japanese, which means 
     &quot;full width&quot;. Other, narrower, characters are called 
     &quot;han-kaku&quot;, which means &quot;half width&quot;. The 
     graphical properties of the characters, however, depends upon 
     the type faces used to display them.
    </span>
   </li>
   <li class="listitem">
    <span class="simpara">
     Some character encodings use shift(escape) sequences defined
     in ISO-2022 to switch the code map of the specific code area
     (<em>00h</em> to <em>7fh</em>).
    </span>
   </li>
   <li class="listitem">
    <span class="simpara">
     ISO-2022-JP should be used in SMTP/NNTP, and headers and entities
     should be reencoded as per RFC requirements. Although those are not
     requisites, it&#039;s still a good idea because several popular user
     agents cannot recognize any other encoding methods.
    </span>
   </li>
   <li class="listitem">
    <span class="simpara">
     Web pages created for mobile phone services such as
     <a href="http://www.nttdocomo.com/services/imode/" class="link external">&raquo;&nbsp;i-mode</a>
     or <a href="http://www.au.kddi.com/english/service/ezweb/index.html" class="link external">&raquo;&nbsp;EZweb</a>
     are supposed to use Shift_JIS.
    </span>
   </li>
   <li class="listitem">
    <span class="simpara">
      As of PHP 5.4.0, the pictogram characters 
      used for mobile phone service such as 
      <a href="http://www.nttdocomo.com/services/imode/" class="link external">&raquo;&nbsp;i-mode</a>
      or <a href="http://www.au.kddi.com/english/service/ezweb/index.html" class="link external">&raquo;&nbsp;EZweb</a>
      are supported.
    </span>
   </li>
  </ul>
 </p>
</div>
<hr /><div class="manualnavbar" style="text-align: center;">
 <div class="prev" style="text-align: left; float: left;"><a href="mbstring.encodings.html">Summaries of supported encodings</a></div>
 <div class="next" style="text-align: right; float: right;"><a href="mbstring.http.html">HTTP Input and Output</a></div>
 <div class="up"><a href="book.mbstring.html">Multibyte String</a></div>
 <div class="home"><a href="index.html">PHP Manual</a></div>
</div></body></html>