Character encoding topics †
Links †
Setting the encoding with a META tag †
Common mistakes †
- Mistakes in what the double quotes enclose.
Each of the following is wrong.
- Quotes around the wrong range
<META HTTP-EQUIV="Content-Type" CONTENT="text/html;" charset="euc-jp">
charset is not a parameter of the META tag; it is part of the value of the CONTENT parameter.
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=euc-jp">
As in the corrected tag above, the whole string text/html; charset=euc-jp must be enclosed in a single pair of quotes as the value of the CONTENT parameter.
- No quotes at all
<META HTTP-EQUIV="Content-Type" CONTENT=text/html; charset=euc-jp>
Because the value is not enclosed in double quotes, CONTENT= and CHARSET= are interpreted as two separate parameters.
- Missing closing quote
<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=euc-jp>
The extent of the CONTENT value becomes undefined. The page will probably be garbled because the encoding cannot be identified properly.
- Wrong encoding name
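The quoting mistakes above can be checked mechanically. The following is a minimal sketch in Python, assuming the standard library's `html.parser` as a stand-in for a browser's tag parser (real browsers may behave differently). The names `MetaAttrs` and `meta_attrs` are hypothetical helpers introduced only for this illustration.

```python
from html.parser import HTMLParser


class MetaAttrs(HTMLParser):
    """Collect the attributes of every META tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.found = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names.
        if tag == "meta":
            self.found.append(dict(attrs))


def meta_attrs(html):
    """Return one attribute dict per META tag found in the given HTML."""
    parser = MetaAttrs()
    parser.feed(html)
    parser.close()
    return parser.found


correct = meta_attrs(
    '<META HTTP-EQUIV="Content-Type" CONTENT="text/html; charset=euc-jp">')
split_quotes = meta_attrs(
    '<META HTTP-EQUIV="Content-Type" CONTENT="text/html;" charset="euc-jp">')
unquoted = meta_attrs(
    '<META HTTP-EQUIV="Content-Type" CONTENT=text/html; charset=euc-jp>')

for label, result in (("correct", correct),
                      ("split quotes", split_quotes),
                      ("unquoted", unquoted)):
    print(label, result)
```

In the two broken forms the parser reports `charset` as a separate attribute, so the CONTENT value no longer carries the encoding name.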
ASCII code †
- A single-byte character code. Because it is expressed in the 7 bits 0x00 to 0x7f, it is also called a 7-bit code.
Most other character codes are modeled on it.
It consists of 128 characters in total: control characters at 0x00 to 0x1f, alphanumerics and symbols at 0x20 to 0x7e,
and the DEL control character at 0x7f.
Depending on the interpretation, the space (blank character) at 0x20 is treated as a control character.
JIS codes †
JIS X 0201 (ANK code) †
- As the name ANK (Alphabetic, Numeric and Kana) suggests,
this is a single-byte (7-bit and 8-bit) character set made up of half-width alphanumerics, half-width symbols, and half-width kana.
Very roughly, ANK code is ASCII plus half-width kana.
- 7-bit part: the range 0x00 (0000 0000) to 0x7f (0111 1111), expressible in 7 bits.
It covers the ASCII range of alphanumerics and so on, and is treated as the Roman mode.
However, the backslash is defined as the yen sign and the tilde as the overline.
- 8-bit part: the range 0x80 (1000 0000) to 0xff (1111 1111), which needs the eighth bit.
Within it, 0xa1 to 0xdf is assigned to half-width kana, and it is treated as the kana mode.
JIS X 0202 (ISO/IEC 2022, ISO-2022-JP, JIS kanji) †
- The character code usually called "JIS code" in Japan.
- Used as the character encoding for outgoing e-mail.
- An escape sequence declares which character set applies to the characters that follow, so the text can switch between, for example, Roman and Japanese.
This lets different characters that occupy the same code range be recognized correctly.
- ESC here means the escape code, character code 0x1b.
For example, ESC$B switches to JIS X 0208,
ESC(I switches to the JIS X 0201 8-bit code (half-width kana), and ESC(J switches to JIS X 0201 Roman.
| Character set | Year | Meaning | Sequence |
|---|---|---|---|
| ASCII (ISO/IEC 646 IRV) | | Half-width alphanumerics | ESC(B |
| JIS X 0201 Roman | | (old) JIS Roman | ESC(H |
| JIS X 0201 Roman | 1976 | JIS Roman | ESC(J |
| JIS X 0201 half-width kana | 1976 | JIS half-width kana | ESC(I |
| JIS X 0208 | 1978 | (old) basic kanji | ESC$@ |
| JIS X 0208 | 1983 | Basic kanji | ESC$B |
| JIS X 0212 | 1990 | JIS supplementary kanji | ESC$(D |
| JIS X 0213 plane 1 | 2000 | (old) JIS extended kanji | ESC$(O |
| JIS X 0213 plane 1 | 2004 | JIS extended kanji | ESC$(Q |
| JIS X 0213 plane 2 | 2004 | JIS extended kanji | ESC$(P |
| Shift in | | Half-width kana control | 0x0f |
| Shift out | | Half-width kana control | 0x0e |
- ISO-2022-JP, which covers JIS X 0201 (Roman) and JIS X 0208, has since been extended;
later variants add JIS X 0212 (ISO-2022-JP-1) and JIS X 0213 (ISO-2022-JP-2004).
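The switching described above can be observed directly in an encoded byte string. This is a small sketch that assumes Python's built-in `iso2022_jp` codec as the encoder; the variable names are illustrative only.

```python
ESC = b"\x1b"

# "ABC" is ASCII; "あ" belongs to JIS X 0208.
encoded = "ABCあ".encode("iso2022_jp")
print(encoded)

# ESC $ B announces JIS X 0208 before the 2-byte character,
# and ESC ( B returns to ASCII at the end of the data.
to_kanji = encoded.find(ESC + b"$B")
to_ascii = encoded.find(ESC + b"(B")
print("switch to JIS X 0208 at offset", to_kanji)
print("switch back to ASCII at offset", to_ascii)

# Every byte stays within 7 bits, which is why the format suits e-mail.
seven_bit_only = all(byte < 0x80 for byte in encoded)
print("7-bit only:", seven_bit_only)
```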
JIS X 0208 (JIS basic kanji, level-1 and level-2 kanji) †
- The set of ordinary full-width characters.
- It contains hiragana, katakana, symbols, the frequently used level-1 kanji,
and the less common but still useful level-2 kanji.
JIS X 0212 (JIS supplementary kanji) †
- A set of rarely used characters that are not in JIS X 0208.
- It contains supplementary kanji beyond levels 1 and 2, and uncommon symbols.
JIS X 0213 (JIS extended kanji, level-3 and level-4 kanji) †
- A character set that supplies the kanji missing from JIS X 0208.
- It contains the level-3 and level-4 kanji.
Some of its kanji overlap with JIS X 0212.
JIS X 0221 (UCS, ISO/IEC 10646) †
- A standard that aims to collect the characters of the whole world into a single 16-bit character code.
Because it points in the same direction as Unicode, it is incorporated as UCS-2.
- See #Unicode.
Shift JIS (MS Kanji code) †
- The character code used on Microsoft operating systems.
- A character code that starts from the ANK code (half-width alphanumerics, symbols, and kana)
and extends it to handle 2-byte kanji.
The ranges left unused by ANK, 0x81 to 0x9F and 0xE0 to 0xFC, serve as the first byte of a 2-byte character.
Types of mojibake †
- The second byte of a 2-byte character partly overlaps the single-byte range 0x00 to 0x7F.
It is therefore easily mistaken for \ (0x5C), which introduces escape sequences.
- In programs, as in \t \n \2 \3, the single character that follows \ is
treated as having a special meaning (an escape sequence).
\n is replaced by a newline, \t by a tab, and so on;
when the character after \ has no special meaning, the pair is usually replaced by that character alone, so \! becomes !.
- Take a Shift JIS string such as "ウソ800":
| Bytes | 83 | 45 | 83 | 5C | 38 | 30 | 30 |
|---|---|---|---|---|---|---|---|
| As 1-byte characters | ? | E | ? | \ | 8 | 0 | 0 |
| As 2-byte characters | ウ | | ソ | | 8 | 0 | 0 |

(Characters that cannot be displayed are shown as ?.)
The 5C 38 part is interpreted as the escape sequence \8 and converted to 8.
As a result the string becomes something like "ウ・00".
After replacement:

| Bytes | 83 | 45 | 83 | 38 | 30 | 30 |
|---|---|---|---|---|---|---|
| As 1-byte characters | ? | E | ? | 8 | 0 | 0 |
| As 2-byte characters | ウ | | ? | | 0 | 0 |
- To avoid mojibake, data should be written in EUC, UTF-8, or a similar encoding.
If Shift JIS must be used anyway, insert a \ after each character whose second byte is the problem, as in "ウソ\800", so that the bytes contain \\ internally.
The escape sequence \\ is replaced by \, so the text is displayed correctly without mojibake.
| Bytes | 83 | 45 | 83 | 5C | 5C | 38 | 30 | 30 |
|---|---|---|---|---|---|---|---|---|
| As 1-byte characters | ? | E | ? | \ | \ | 8 | 0 | 0 |
| As 2-byte characters | ウ | | ソ | | \ | 8 | 0 | 0 |

After replacement:

| Bytes | 83 | 45 | 83 | 5C | 38 | 30 | 30 |
|---|---|---|---|---|---|---|---|
| As 1-byte characters | ? | E | ? | \ | 8 | 0 | 0 |
| As 2-byte characters | ウ | | ソ | | 8 | 0 | 0 |
- 2-byte characters that contain \ (0x5C) as their second byte

| Char | Code | Char | Code | Char | Code |
|---|---|---|---|---|---|
| | | 申 | 0x90 0x5C | 濬 | 0xE0 0x5C |
| ― | 0x81 0x5C | 曾 | 0x91 0x5C | 畚 | 0xE1 0x5C |
| | | 箪 | 0x92 0x5C | 秉 | 0xE2 0x5C |
| ソ | 0x83 0x5C | 貼 | 0x93 0x5C | 綵 | 0xE3 0x5C |
| Ы | 0x84 0x5C | 能 | 0x94 0x5C | 臀 | 0xE4 0x5C |
| | | 表 | 0x95 0x5C | 藹 | 0xE5 0x5C |
| | | 暴 | 0x96 0x5C | 觸 | 0xE6 0x5C |
| Ⅸ | 0x87 0x5C | 予 | 0x97 0x5C | 軆 | 0xE7 0x5C |
| | | 禄 | 0x98 0x5C | 鐔 | 0xE8 0x5C |
| 噂 | 0x89 0x5C | 兔 | 0x99 0x5C | 饅 | 0xE9 0x5C |
| 浬 | 0x8A 0x5C | 喀 | 0x9A 0x5C | 鷭 | 0xEA 0x5C |
| 欺 | 0x8B 0x5C | 媾 | 0x9B 0x5C | | |
| 圭 | 0x8C 0x5C | 彌 | 0x9C 0x5C | | |
| 構 | 0x8D 0x5C | 拿 | 0x9D 0x5C | 偆 | 0xED 0x5C |
| 蚕 | 0x8E 0x5C | 杤 | 0x9E 0x5C | 砡 | 0xEE 0x5C |
| 十 | 0x8F 0x5C | 歃 | 0x9F 0x5C | | |
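The failure and the workaround described in this section can be reproduced byte for byte. The sketch below uses Python's built-in `shift_jis` codec; `naive_unescape` and `protect_trail_backslash` are hypothetical helpers that imitate, in simplified form, an escape processor that knows nothing about Shift JIS lead bytes. It only handles the "drop the backslash" case, not \n or \t.

```python
def naive_unescape(data: bytes) -> bytes:
    """Byte-level unescape: turn every "\\x" pair into "x".

    Hypothetical and deliberately unaware of 2-byte characters.
    """
    out = bytearray()
    i = 0
    while i < len(data):
        if data[i] == 0x5C and i + 1 < len(data):
            out.append(data[i + 1])  # keep only the byte after the backslash
            i += 2
        else:
            out.append(data[i])
            i += 1
    return bytes(out)


def protect_trail_backslash(text: str) -> bytes:
    """Encode to Shift JIS and add 0x5C after each 2-byte character ending in 0x5C."""
    out = bytearray()
    for ch in text:
        encoded = ch.encode("shift_jis")
        out += encoded
        if len(encoded) == 2 and encoded[1] == 0x5C:
            out.append(0x5C)
    return bytes(out)


raw = "ウソ800".encode("shift_jis")
broken = naive_unescape(raw)                 # 5C 38 collapses to 38
safe = protect_trail_backslash("ウソ800")     # 83 5C becomes 83 5C 5C
restored = naive_unescape(safe)              # 5C 5C collapses back to 5C

for label, data in (("raw", raw), ("broken", broken),
                    ("safe", safe), ("restored", restored)):
    print(f"{label:9}", data.hex(" "))
```

The `restored` bytes match `raw`, which is the effect the extra \ is meant to achieve.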
EUC †
- A character code used on UNIX-like operating systems.
Even within EUC, the characters assigned differ from country to country.
EUC-JP †
- The variant known as Japanese EUC.
Its single-byte range contains ASCII, but half-width kana are expressed as 2-byte characters.
Everything other than ASCII is expressed as a multibyte character (2 to 3 bytes).
Unlike Shift JIS, the bytes of multibyte characters do not overlap the single-byte range, so mojibake is less likely.
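The byte layout described above can be inspected with Python's built-in `euc_jp` codec. This is only an illustrative sketch; the sample characters and the dictionary names are chosen here, not taken from the original page.

```python
samples = {
    "ascii": "A",
    "hiragana": "あ",
    "half-width kana": "ｱ",
    "katakana so": "ソ",   # its Shift JIS form ends in 0x5C
}

encoded = {name: ch.encode("euc_jp") for name, ch in samples.items()}
for name, data in encoded.items():
    print(f"{name:16}", data.hex(" "))

# In every multibyte character, all bytes have the high bit set,
# so none of them can be mistaken for an ASCII byte such as 0x5C.
multibyte_is_high = all(
    all(byte >= 0x80 for byte in data)
    for data in encoded.values()
    if len(data) > 1
)
print("multibyte bytes all >= 0x80:", multibyte_is_high)
```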
Unicode †
- A character code designed to represent the different languages of the whole world unambiguously.
- Code points are written in the form U+0061.
- There are UCS-2, which uses 2 bytes per character, UCS-4, which uses 4 bytes, and others.
These are not compatible with existing character code systems,
but some encoding schemes, such as UTF-8, keep compatibility with ASCII.
Group, plane, row, cell †
- A code is divided octet by octet (byte by byte) into a group octet, a plane octet, a row octet, and a cell octet.
- UCS-4 is divided into a 7-bit group and an 8-bit plane, row, and cell;
UCS-2 is fixed to group 0, plane 0 and divided into an 8-bit row and cell;
Unicode is divided into group 0, planes 0 to 16, and an 8-bit row and cell.
| Character code | Group | Plane | Row | Cell | Representable range | Implemented range |
|---|---|---|---|---|---|---|
| Unicode | 0 | 0 to 16 | 0 to 255 | 0 to 255 | 1,114,112 | Same |
| UCS-2 | 0 | 0 | 0 to 255 | 0 to 255 | 65,536 | 63,488 (excluding the 2,048 reserved code points) |
| UCS-4 | 0 to 127 | 0 to 255 | 0 to 255 | 0 to 255 | 2,147,483,648 | 1,114,112 (limited to the Unicode range) |
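- UCS-2 is too small to represent all of Unicode.
- For UCS-4 it has been decided that no characters will ever be assigned to plane 17 and beyond of group 0, which lie outside the Unicode range.
In other words a great deal of space is left unused, and the range UCS-4 actually uses is the same as that of Unicode.
- In each of them, the first plane, plane 0, is called the Basic Multilingual Plane (BMP).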
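The group, plane, row, and cell of a code point follow directly from its four octets. A minimal sketch, assuming the 7-bit group and 8-bit plane, row, and cell layout from the table; `decompose` is a hypothetical helper name.

```python
def decompose(code_point: int) -> tuple:
    """Split a UCS-4 value into (group, plane, row, cell)."""
    group = (code_point >> 24) & 0x7F   # 7 bits: 0 to 127
    plane = (code_point >> 16) & 0xFF   # 8 bits: 0 to 255
    row = (code_point >> 8) & 0xFF      # 8 bits: 0 to 255
    cell = code_point & 0xFF            # 8 bits: 0 to 255
    return group, plane, row, cell


print(decompose(ord("a")))     # U+0061, in the BMP
print(decompose(ord("あ")))    # U+3042, in the BMP
print(decompose(0x10FFFF))     # last code point of plane 16

# Sizes quoted in the table:
unicode_size = 17 * 256 * 256          # planes 0 to 16
ucs2_size = 256 * 256                  # plane 0 only
ucs4_size = 128 * 256 * 256 * 256      # 7-bit group, three full octets
print(unicode_size, ucs2_size, ucs4_size)
```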
BMP (Basic Multilingual Plane) †
- The character set assigned to group 0, plane 0, which represents the main characters in 16 bits.
Most commonly used characters are assigned here.
- UCS-2 covers exactly the BMP range.
BOM (Byte Order Mark) †
- The identifying code 0xfeff placed at the start of UTF-16 data.
Big-endian data stores it as the byte sequence 0xfe 0xff,
and little-endian data, with the order reversed, stores it as 0xff 0xfe,
so a BOM at the start of the data reveals the endianness.
- UTF-8 has no endianness problem, but it can also carry a 3-byte BOM, 0xef 0xbb 0xbf.
UTF-8 without a BOM is sometimes called UTF-8N.
Saving as UTF-8 in Windows Notepad appears to insert a BOM.
- It is also sometimes used simply as a header indicating that the data is Unicode.
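BOM detection amounts to comparing the first bytes of the data against the known sequences. The following sketch uses the BOM constants from Python's `codecs` module; `sniff_bom` is a hypothetical helper, and it deliberately ignores UTF-32, whose little-endian BOM also begins with 0xff 0xfe.

```python
import codecs


def sniff_bom(data: bytes):
    """Return an encoding name suggested by a leading BOM, or None."""
    if data.startswith(codecs.BOM_UTF8):        # ef bb bf
        return "utf-8"
    if data.startswith(codecs.BOM_UTF16_BE):    # fe ff
        return "utf-16-be"
    if data.startswith(codecs.BOM_UTF16_LE):    # ff fe
        return "utf-16-le"
    return None


with_utf8_bom = "abc".encode("utf-8-sig")       # codec that writes a BOM
big_endian = codecs.BOM_UTF16_BE + "abc".encode("utf-16-be")
little_endian = codecs.BOM_UTF16_LE + "abc".encode("utf-16-le")
no_bom = "abc".encode("utf-8")                  # the so-called UTF-8N

for data in (with_utf8_bom, big_endian, little_endian, no_bom):
    print(data.hex(" "), "->", sniff_bom(data))
```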
UCS-2 (Universal Multiple-Octet Coded Character Set in 2 octets) †
- A character code that represents Unicode with 2 bytes (16 bits) per character.
It corresponds exactly to the BMP character set.
- There are two kinds, big-endian and little-endian.
- It is not compatible with ASCII; byte values in the control-character range 0x00 to 0x1F are also used as part of characters.
UCS-4 (Universal Multiple-Octet Coded Character Set in 4 octets) †
- A character code that represents Unicode with 4 bytes (32 bits) per character.
It contains UCS-2 (the BMP) and adds the characters that UCS-2 lacks.
- It is not compatible with ASCII; byte values in the control-character range 0x00 to 0x1F are also used as part of characters.
UTF-8 (Unicode Transformation Format-8, UCS Transformation Format 8) †
- An encoding scheme used to represent Unicode.
Each character begins with an 8-bit unit that works like an index for deriving the character.
- It covers the whole Unicode range, not only UCS-2, expressing ASCII in 8 bits and everything else in 16 to 32 bits.
- There is a format defined by Unicode, Unicode Transformation Format-8,
and one defined by UCS, UCS Transformation Format 8; both are called UTF-8.
The former was designed for sequences of up to 4 bytes and the latter for up to 6 bytes,
but in 2006 the latter was also restricted to 4 bytes, so the two can be regarded as the same thing.
- The character code is derived from the 8-bit index.
The 7-bit values 0x00 to 0x7F correspond to ASCII,
and the values 0x80 to 0xFF belong to multibyte sequences of 2 to 6 bytes for Japanese and other languages' characters and symbols.
- The leading bits of the first byte give the length of the sequence:
0xxx xxxx is ASCII
110x xxxx is a 2-byte code
1110 xxxx is a 3-byte code
1111 0xxx is a 4-byte code
1111 10xx is a 5-byte code
1111 110x is a 6-byte code
If the leading bit is 0 the byte is a 1-byte ASCII code;
otherwise the leading bits show the number of bytes, for example 110x xxxx for 2 bytes.
- A byte that starts with 10xx xxxx is one of the 1 to 5 data bytes that follow the first byte.
10xx xxxx is a code for the second or a later byte
To represent the longest, 6-byte character, 1111 110x is followed by five 10xx xxxx bytes:
1111 110x 10xx xxxx 10xx xxxx 10xx xxxx 10xx xxxx 10xx xxxx
- Placing a BOM in the first 3 bytes indicates that the data is Unicode in UTF-8.
Without a BOM it is sometimes called UTF-8N.
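The bit patterns above translate into a short encoder. This is a sketch limited to the 1- to 4-byte forms that remain in use; `utf8_encode` is a hypothetical name, and unlike Python's built-in codec it does not reject surrogate code points.

```python
def utf8_encode(cp: int) -> bytes:
    """Encode one code point by hand, following the leading-bit patterns."""
    if cp < 0x80:                       # 0xxx xxxx
        return bytes([cp])
    if cp < 0x800:                      # 110x xxxx 10xx xxxx
        return bytes([0xC0 | (cp >> 6),
                      0x80 | (cp & 0x3F)])
    if cp < 0x10000:                    # 1110 xxxx 10xx xxxx 10xx xxxx
        return bytes([0xE0 | (cp >> 12),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    if cp <= 0x10FFFF:                  # 1111 0xxx followed by three 10xx xxxx
        return bytes([0xF0 | (cp >> 18),
                      0x80 | ((cp >> 12) & 0x3F),
                      0x80 | ((cp >> 6) & 0x3F),
                      0x80 | (cp & 0x3F)])
    raise ValueError("code point outside the Unicode range")


for ch in "aéあ😀":
    by_hand = utf8_encode(ord(ch))
    bits = " ".join(f"{byte:08b}" for byte in by_hand)
    print(f"U+{ord(ch):04X}", by_hand.hex(" "), bits)
```

Each result can be compared with `ch.encode("utf-8")` to confirm that the hand-built bytes agree with the built-in codec.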
UTF-16 †
- An encoding scheme used to represent Unicode.
Each character begins with a 16-bit unit that works like an index for deriving the character.
- It represents the UCS-2 range in 16 bits, and uses the surrogate pair scheme to represent the UCS-4 characters missing from UCS-2 as a pair of 16 + 16 bits.
- It can be written in big-endian or little-endian order,
and a BOM in the first 2 bytes identifies the endianness.
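The surrogate pair for a character outside the BMP can be computed with a few bit operations. A minimal sketch follows; `to_surrogates` is a hypothetical helper, and the result is compared with Python's built-in `utf-16-be` codec.

```python
def to_surrogates(cp: int) -> tuple:
    """Return the (high, low) 16-bit surrogates for a code point above U+FFFF."""
    if not 0x10000 <= cp <= 0x10FFFF:
        raise ValueError("only code points outside the BMP need a pair")
    offset = cp - 0x10000               # 20 significant bits remain
    high = 0xD800 + (offset >> 10)      # upper 10 bits
    low = 0xDC00 + (offset & 0x3FF)     # lower 10 bits
    return high, low


high, low = to_surrogates(0x1F600)      # U+1F600, an emoji outside the BMP
print(f"{high:04X} {low:04X}")

# Big-endian byte order: each 16-bit unit is stored high byte first.
by_hand = high.to_bytes(2, "big") + low.to_bytes(2, "big")
built_in = "\U0001F600".encode("utf-16-be")
print(by_hand.hex(" "), built_in.hex(" "))
```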
UTF-32 †
- An encoding scheme used to represent Unicode.
- It represents UCS-4 (31-bit) characters in a fixed length of 32 bits.
It can be regarded as the same as UCS-4.
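Types of newline codes †
- The newline character differs from OS to OS.
To exchange data between several operating systems, differences in newline characters must be taken into account
in addition to differences in character encoding.

| OS | Characters | Code |
|---|---|---|
| Microsoft operating systems | CR+LF | 0x0D 0x0A |
| Macintosh OS (classic Mac OS) | CR | 0x0D |
| UNIX-like operating systems | LF | 0x0A |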
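When data arrives from an unknown system, one common approach is to normalize all three conventions to a single one before processing. The helper below is a hypothetical sketch that converts everything to LF; the order of the two replacements matters, because CR+LF must be handled before a lone CR.

```python
def normalize_newlines(data: bytes) -> bytes:
    """Convert CR+LF and lone CR to LF."""
    return data.replace(b"\r\n", b"\n").replace(b"\r", b"\n")


windows_text = b"one\r\ntwo\r\n"
classic_mac_text = b"one\rtwo\r"
unix_text = b"one\ntwo\n"

for sample in (windows_text, classic_mac_text, unix_text):
    print(sample, "->", normalize_newlines(sample))
```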
Binary mode and text mode on MS systems †
- File operations on Microsoft operating systems have two modes, "binary" and "text".
- Binary mode reads and writes the file as data, exactly as it is.
Other operating systems basically work this way.
- Text mode automatically converts each CR+LF in the file to LF when reading.
When writing, it converts each LF in the data to CR+LF before writing to the file.
Perl on Windows uses text mode by default,
so it can silently rewrite the newline bytes inside image or music files and ruin the data.
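The difference between the two modes can be shown on any platform. The sketch below uses Python rather than Perl (in Perl, `binmode` is the call that switches a file handle to binary mode). Passing `newline="\r\n"` is an assumption made here to imitate the Windows text-mode behaviour explicitly, and the file name is arbitrary.

```python
import os
import tempfile

text = "line1\nline2\n"

with tempfile.TemporaryDirectory() as tmp:
    path = os.path.join(tmp, "sample.txt")

    # Text mode, imitating Windows: each LF is written as CR+LF.
    with open(path, "w", encoding="ascii", newline="\r\n") as f:
        f.write(text)

    # Binary mode: the bytes come back exactly as stored.
    with open(path, "rb") as f:
        raw = f.read()

    # Text mode with universal newlines: CR+LF is converted back to LF.
    with open(path, "r", encoding="ascii", newline=None) as f:
        round_trip = f.read()

print(raw)
print(repr(round_trip))
```

Image, audio, and other non-text files should always be opened in binary mode so that bytes that happen to look like newlines are left alone.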