From: Chris Hanson Date: Mon, 7 Jan 2019 00:32:57 +0000 (-0800) Subject: Clean up wording around bitless characters. X-Git-Tag: mit-scheme-pucked-10.1.9~3^2~14 X-Git-Url: https://birchwood-abbey.net/git?a=commitdiff_plain;h=f066d1521e59c25520193f1267af4a1b59d2b78b;p=mit-scheme.git Clean up wording around bitless characters. --- diff --git a/doc/ref-manual/characters.texi b/doc/ref-manual/characters.texi index 3deff38ef..5e0084759 100644 --- a/doc/ref-manual/characters.texi +++ b/doc/ref-manual/characters.texi @@ -443,9 +443,9 @@ it returns @code{#f}. @end deffn @deffn procedure unicode-char? object -Returns @code{#t} if @var{object} is any ``bitless'' character -corresponding to a Unicode code point, except for those with general -category @code{other:surrogate} or @code{other:not-assigned}. +Returns @code{#t} if @var{object} is any character corresponding to a +Unicode code point, except for those with general category +@code{other:surrogate} or @code{other:not-assigned}. @end deffn @deffn procedure char-general-category char @@ -495,8 +495,12 @@ Returns the Unicode general category of @var{char} (or MIT/GNU Scheme's character-set abstraction is used to represent groups of characters, such as the letters or digits. A character set may -contain any ``bitless'' character. Alternatively, a character set can -be treated as a set of code points. +contain any character. Alternatively, a character set can be treated +as a set of code points. + +Implementation note: MIT/GNU Scheme allows any ``bitless'' character +to be stored in a character set; operations that accept characters +automatically strip their bucky bits. @deffn procedure char-set? object @cindex type predicate, for character set @@ -549,7 +553,7 @@ identical. An @var{element} can take several forms, each of which specifies one or more characters to include in the resulting character set: a -(bitless) character includes itself; a string includes all of the characters it +character includes itself; a string includes all of the characters it contains; a character set includes its members; or a code-point range includes the corresponding characters. diff --git a/doc/ref-manual/strings.texi b/doc/ref-manual/strings.texi index c6af04fbe..a5e979b6a 100644 --- a/doc/ref-manual/strings.texi +++ b/doc/ref-manual/strings.texi @@ -95,7 +95,8 @@ allow characters U+0001 to U+00FF (the Latin-1 repertoire without Implementation note: MIT/GNU Scheme allows any ``bitless'' character to be stored in a string. In effect this means any character with a -Unicode code point, including surrogates. +Unicode code point, including surrogates. String operations that +accept characters automatically strip their bucky bits. It is an error to pass such a forbidden character to @code{make-string}, @code{string}, @code{string-set!}, or