Unicode Explained - Jukka K. Korpela

Unicode Explained

Buch | Softcover
800 Seiten
2006
O'Reilly Media (Verlag)
978-0-596-10121-3 (ISBN)
53,85 inkl. MwSt
Possessing what readers need to grasp Unicode, this book takes on a guide through the character world. It is suitable for both beginning and seasoned programmers.
Fundamentally, computers just deal with numbers. They store letters and other characters by assigning a number for each one. At one time, there were hundreds of different encoding systems for assigning these numbers - but that was before Unicode. Unicode enables a single software product or website to be targeted across multiple platforms, languages and countries without re-engineering. It's no wonder that industry giants like Apple, Hewlett-Packard, IBM and Microsoft have all adopted Unicode. Containing everything you need to understand Unicode, this comprehensive reference from O'Reilly takes you on a detailed guide through the complex character world. For starters, it explains how to identify and classify characters - whether they're common, uncommon, or exotic. It then shows you how to type them, utilize their properties, and process character data in a robust manner. The book is broken up into three distinct parts. The first few chapters provide you with a tutorial presentation of Unicode and character data. It gives you a firm grasp of the terminology you need to reference various components, including character sets, fonts and encodings, glyphs and character repertoires.
The middle section offers more detailed information about using Unicode and other character codes. It explains the principles and methods of defining character codes, describes some of the widely used codes, and presents code conversion techniques. It also discusses properties of characters, collation and sorting, line breaking rules and Unicode encodings. The final four chapters cover more advanced material, such as characters in HTML and XHTML. You simply can't afford to be without the nuggets of valuable information detailed in "Unicode Explained".

Jukka Korpela is a consultant who specializes in character codes, localization, orthography, usability, and accessibility. After graduating from Helsinki University of Technology, he taught these subjects in the university's Computer Science department and worked on localization and accessibility issues at TIEKE before becoming a full-time author and consultant. His previous books on CSS and XHTML were published in Finland by Docendo press.

Preface Part I. 1.Characters as Data Introduction to Characters and Unicode What 's in a Character? Variation of Writing Systems Glyphs and Fonts Definitions of Character Repertoires Numbering Characters Encoding Characters as Octet Sequences Working with Encodings Working with Fonts Summaries 2.Writing Characters Method Varieties Keyboard Variation and Settings Virtual Keyboards Program Commands Character Maps Replacements on the Fly Special Techniques Escape Sequences Specialized Editors Exercise 3.Character Sets and Encodings Good Old ASCII ISO 8859 Codes Windows Latin 1 and Other Windows Codes Other 8-bit Codes Unicode and UTF-8 Encodings for East Asian Languages Converters and Transcoding Using Character Codes Part II. 4.The Structure of Unicode Design Principles Versions of Unicode Coding Space Unicode Terms Guide to the Unicode Standard Unicode and Fonts Criticism of Unicode Questions and Answers 5.Properties of Characters Character Classification An Overview of Properties Compositions and Decompositions Normalization Case Properties Collation and Sorting Text Boundaries Directionality Line Breaking Properties Unicode Conformance Requirements Effects on Choosing Characters 6.Unicode Encodings Unicode Encodings in General UTF-32 and UCS-4 UTF-16 and UCS-2 UTF-8 Byte Order Conversions Between Unicode Encodings Other Encodings Auto-Detecting the Encoding Choosing an Encoding Part III. 7.Characters and Languages Writing Systems and IT Character Requirements of Languages Transliteration and Transcription Language Metadata Languages and Fonts 8.Character Usage Basics of Character Usage ASCII (Basic Latin) Latin-1 Supplement (ISO 8859-1) Other Latin Letters Other European Alphabetic Scripts Diacritic Marks Letterlike Symbols General Punctuation Line Structure Control Mathematical and Technical Symbols Other Blocks 9.The Character Level and Above Levels of Text Representation and Processing Characters and Markup Media Types for Text 10.Characters in Internet Protocols Information about Encoding Characters in MIME Content Negotiation and Multilingual Sites Characters in Protocol Headers Characters in Domain Names and URLs 11.Characters in Programming Characters in Computer Languages Character and String Data The Preparedness Principle Character Input and Output Processing Form Data Identifiers,Patterns,and Regular Expressions International Components for Unicode (ICU) Using Locales A.Tables for Writing Characters Index

Erscheint lt. Verlag 25.7.2006
Reihe/Serie O'Reilly Ser.
Verlagsort Sebastopol
Sprache englisch
Themenwelt Mathematik / Informatik Informatik Software Entwicklung
Informatik Theorie / Studium Algorithmen
ISBN-10 0-596-10121-X / 059610121X
ISBN-13 978-0-596-10121-3 / 9780596101213
Zustand Neuware
Haben Sie eine Frage zum Produkt?
Mehr entdecken
aus dem Bereich
IT zum Anfassen für alle von 9 bis 99 – vom Navi bis Social Media

von Jens Gallenbacher

Buch | Softcover (2021)
Springer (Verlag)
29,99
Interlingua zur Gewährleistung semantischer Interoperabilität in der …

von Josef Ingenerf; Cora Drenkhahn

Buch | Softcover (2023)
Springer Fachmedien (Verlag)
32,99