{"id":302,"date":"2011-03-19T20:30:24","date_gmt":"2011-03-20T03:30:24","guid":{"rendered":"http:\/\/glyphs.webfoot.com\/blog\/?p=302"},"modified":"2011-04-02T22:22:38","modified_gmt":"2011-04-03T05:22:38","slug":"gaiji","status":"publish","type":"post","link":"http:\/\/glyphs.webfoot.com\/blog\/2011\/03\/19\/gaiji\/","title":{"rendered":"Gaiji"},"content":{"rendered":"<div id=\"attachment_344\" style=\"width: 110px\" class=\"wp-caption alignleft\"><a href=\"http:\/\/glyphs.webfoot.com\/blog\/wp-content\/uploads\/2011\/03\/hryvnia.png\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-344\" class=\"size-full wp-image-344\" title=\"hryvnia\" src=\"http:\/\/glyphs.webfoot.com\/blog\/wp-content\/uploads\/2011\/03\/hryvnia.png\" alt=\"\" width=\"100\" height=\"115\" \/><\/a><p id=\"caption-attachment-344\" class=\"wp-caption-text\">Hryvnia<\/p><\/div>\n<p>Many writing systems have a finite set of glyphs; you can write down a complete list and there are no others, except for the rare invention of new characters. But some writing systems have an open-ended set of glyphs; no matter how many you write down, it is routine to discover or invent a new one. <a href=\"http:\/\/glyphs.webfoot.com\/blog\/2011\/01\/08\/chinese-traditional-chinese\/\">Chinese script<\/a> and its variants are prime examples of such open-ended scripts.<\/p>\n<p>This leads to a big issue of writing technology. If you are writing by hand, you don&#8217;t care: you just draw any glyph you need. But movable type, and later digital fonts, have essentially finite glyph complements. It is not easy, especially for an end user, to add a new glyph to a font.<\/p>\n<p>To make matters worse, a font may not contain all the glyhs in a writing system. The reference dictionaries of Chinese and <a href=\"http:\/\/glyphs.webfoot.com\/blog\/2011\/03\/06\/kanji-600-ad-japan\/\">Japanese<\/a> contain some 50-70,000 characters; common Chinese and Japanese digital fonts contain some 20,000 characters today, and 15 years ago the standard was more like 8,000. (By comparison, a Japanese high-school graduate is required to know less than 2,000 characters, enough for adult communication.)<\/p>\n<p>There were several causes for the limitations on font size. Many computer architectures were developed in the USA and other Latin-script territories, and initially addressed only Latin script needs. The capability for Chinese, Japanese, Cyrillic, Arabic, and other scripts were added later (and are still being added). Also, computer storage and processing were hideously expensive, and computer hardware and software developers could only afford to meet the most common needs of their their largest customers, not the complete needs of every potential user.<\/p>\n<p>Hence publishers, and recently computer users, find sometimes that they want to use a character which is valid in their writing system but is not in their font. In Japanese these characters are called <em>gaiji<\/em> (meaning &#8220;outside character&#8221;)<em><\/em>.<\/p>\n<p>Gaiji are not just a problem for Japanese and Chinese (and <a href=\"http:\/\/glyphs.webfoot.com\/blog\/2011\/03\/10\/gugyeol-950-ad-korea\/\">Korean<\/a>, to a lesser extent): this is a significant problem for any scholars writing about extinct logographic writing systems like <a href=\"http:\/\/glyphs.webfoot.com\/blog\/2011\/03\/13\/chu-nom-1200-ad-vietnam\/\">Chu nom<\/a> or <a href=\"http:\/\/glyphs.webfoot.com\/blog\/2011\/01\/02\/sumerian-cuneiform\/\">Sumerian<\/a>.<\/p>\n<p>If you are a computer geek, you might say, &#8220;ah, but Unicode solves that problem&#8221;.\u00a0 Unfortunately, Unicode doesn&#8217;t. Unicode encodes many characters; that is, it defines numbers to represent many characters in a string. It doesn&#8217;t make the font stretch to represent those characters.\u00a0 It also does not encode all characters, past and future.<\/p>\n<p>This problem is less common in European scripts, because glyphs are rarely added to these scripts. But &#8220;rarely&#8221; does not mean &#8220;never&#8221;.\u00a0 The glyph adorning this post is the <a href=\"http:\/\/en.wikipedia.org\/wiki\/Ukrainian_hryvnia\">Hryvnia<\/a>, the Ukranian currency symbol, introduced in 2004.\u00a0 You might scoff at the Hryvnia, and figure that it isn&#8217;t that important to support the Hryvnia symbol, but I doubt you would say that about the <a href=\"http:\/\/en.wikipedia.org\/wiki\/Euro_sign\">Euro<\/a> symbol, introduced in 1996.<\/p>\n<p>NB: This is an issue of great interest to my family; my beloved husband&#8217;s team at Adobe System came up with a publishers&#8217; tool for the gaiji problem, known as the <a title=\"Link to WIkipedia article on OpenType and SING\" href=\"http:\/\/en.wikipedia.org\/wiki\/OpenType#SING_gaiji_solution\">SING Gaiji Architecture<\/a>, available in Adobe Systems software between 2005 and 2010. (Disclaimer: I asked my husband to edit this post, and he did.)\u00a0 Adobe&#8217;s developer information page on &#8220;<a title=\"Link to Adobe developer info on gaiji\" href=\"http:\/\/www.adobe.com\/devnet-archive\/opentype\/archives\/gaiji.html\">Gaiji \u2014 Supplemental Characters\/Glyphs<\/a>&#8221; has links to good overview papers.\u00a0 Many technologies for supporting gaiji have been tried, including\u00a0 <a href=\"http:\/\/www.macintouch.com\/gaiji.html\">stroke-based fonts<\/a>, and <a title=\"Link to Wenlin Institute's page on Character Description Language\" href=\"http:\/\/www.wenlin.com\/cdl\/\">ideograph decomposition<\/a>.<\/p>\n<p>Links: <a href=\"http:\/\/en.wikipedia.org\/wiki\/Kanji#Gaiji\">Wikipedia<\/a>, &#8220;<a title=\"Link to DeLaHunt 2002 paper on Gaiji\" href=\"http:\/\/partners.adobe.com\/public\/developer\/en\/opentype\/Gaiji_DeLaHunt_IUC22.pdf\">Gaiji: Characters, Glyphs, Both, or Neither?<\/a>&#8221; paper (2002), &#8220;<a title=\"Link to DeLaHunt 2004 paper on SING Gaiji Architecture\" href=\"partners.adobe.com\/public\/developer\/en\/opentype\/SING_Intro_IUC26.pdf\">SING: Adobe&#8217;s New Gaiji Architecture<\/a>&#8221; paper (2004), <a href=\"http:\/\/inamidst.com\/whits\/2008\/gaiji\">Sean Palmer essay<\/a>, another <a href=\"http:\/\/lifecs.likai.org\/2010\/01\/solving-gaiji-problem-portably.html\">ideograph decomposition<\/a> proposal.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Many writing systems have a finite set of glyphs; you can write down a complete list and there are no others, except for the rare invention of new characters. But some writing systems have an open-ended set of glyphs; no &hellip; <a href=\"http:\/\/glyphs.webfoot.com\/blog\/2011\/03\/19\/gaiji\/\">Continue reading <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[26,27],"tags":[],"class_list":["post-302","post","type-post","status-publish","format-standard","hentry","category-commentary","category-whoa"],"_links":{"self":[{"href":"http:\/\/glyphs.webfoot.com\/blog\/wp-json\/wp\/v2\/posts\/302","targetHints":{"allow":["GET"]}}],"collection":[{"href":"http:\/\/glyphs.webfoot.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"http:\/\/glyphs.webfoot.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"http:\/\/glyphs.webfoot.com\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"http:\/\/glyphs.webfoot.com\/blog\/wp-json\/wp\/v2\/comments?post=302"}],"version-history":[{"count":0,"href":"http:\/\/glyphs.webfoot.com\/blog\/wp-json\/wp\/v2\/posts\/302\/revisions"}],"wp:attachment":[{"href":"http:\/\/glyphs.webfoot.com\/blog\/wp-json\/wp\/v2\/media?parent=302"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"http:\/\/glyphs.webfoot.com\/blog\/wp-json\/wp\/v2\/categories?post=302"},{"taxonomy":"post_tag","embeddable":true,"href":"http:\/\/glyphs.webfoot.com\/blog\/wp-json\/wp\/v2\/tags?post=302"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}