[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [StrongED] Re: S & R query
In message <29a4bab856.Alan.Adams@xxxxxxxxxxxxxxxxxxxxxx>
Alan Adams <alan@xxxxxxxxxxxxxxxx> wrote:
> In message <56b8548384Lists@xxxxxxxxxxx>
> "Richard Torrens (lists)" <Lists@xxxxxxxxxxx> wrote:
>
> > The current definition of white Space certainly is not unusual - it is
> > the html default. So StrongED is not out of step.
>
> The confusing part is the definition - '\t\x0A' expanding to be space,
> tab, newline, carriage return and others. \x0A ia specifically one
> character, so the rest must be the expansion of \t. In which case,
> what is the key for tab?
'\t\x0A ' is not the definition of White. It's just the equivalent set,
albeit an incomplete one as recently discovered. The correct set that it
corresponds with is: '\x09-\x0D\x20\xA0'
Note that other equivalance sets in the manual are not complete either.
However that's done on purpose as some sets are large. Alpha for example
matches not only 'A-Za-z' but also all accented characters.
StrongED uses the TerritoryManager's character property tables for its
predefined sets. To see which characters are in a particular set you can
use this BASIC program:
REM Property code | StrongED
REM ------------------------------
REM 0 = Control code | Ctrl
REM 1 = Uppercase | Upper
REM 2 = Lowercase | Lower
REM 3 = Alphabetic | Alpha
REM 4 = Punctuation | Punct
REM 5 = White space | White
REM 6 = Digit | Digit
REM 7 = Hex digit | Hex
property% = 0 :REM set to 0-7 according to above table
SYS "Territory_CharacterPropertyTable",-1,property% TO table%
FOR c% = 0 TO 255
byte% = c% DIV 8 : bit% = 1<<(c% MOD 8)
IF (table%?byte%) AND bit% > 0 THEN PRINT c%
NEXT c%
Cheers,
Fred.
--
StrongED Developer
http://www.stronged.iconbar.com/
--
To unsubscribe send a mail to StrongED+unsubscribe@xxxxxxxxxxx
List archives and instructions at
http://diy.Torrens.org/RO/StrongED/index.html