W3C-XPOINTERS(2)                                 W3C-XPOINTERS(2)

     NAME
          w3c-xpointers - parser for XPointers framework including
          XPath

     SYNOPSIS
          include "xpointers.m";

          xpointers := load Xpointers Xpointers->PATH;
          Xpath, Xstep: import xpointers;

          # special operators ('+', '-', etc represent themselves)
          One, Ole, Oge, Omul, Odiv, Omod, Oand, Oor, Oneg,
          Onodetype, Onametest, Ofilter, Opath: con ...;

          # axis types
          Aancestor,
          Aancestor_or_self,
          Aattribute,
          Achild,
          Adescendant,
          Adescendant_or_self,
          Afollowing,
          Afollowing_sibling,
          Anamespace,
          Aparent,
          Apreceding,
          Apreceding_sibling,
          Aself: con iota;

          Xstep: adt {
             axis:  int;  # Aancestor, ... (above)
             op:    int;  # Onametest or Onodetype
             ns:    string;
             name:  string;
             arg:   string;  # optional parameter to processing-instruction
             preds: cyclic list of ref Xpath;

             text:  fn(nil: self ref Xstep): string;
             axisname:  fn(i: int): string;
          };

          Xpath: adt {
             pick{
             E =>
                op:    int;
                l, r:  cyclic ref Xpath;
             Fn =>
                ns:    string;
                name:  string;
                args:  cyclic list of ref Xpath;

     Page 1                       Plan 9            (printed 12/22/24)

     W3C-XPOINTERS(2)                                 W3C-XPOINTERS(2)

             Var =>
                ns:    string;
                name:  string;
             Path =>
                abs:   int;
                steps: list of ref Xstep;
             Int =>
                val:   big;
             Real =>
                val:   real;
             Str =>
                s:     string;
             }
             text: fn(nil: self ref Xpath): string;
          };

          framework:  fn(s: string):
                         (string, list of (string, string, string), string);

          # predefined schemes
          element:    fn(s: string): (string, list of int, string);
          xmlns:      fn(s: string): (string, string, string);
          xpointer:   fn(s: string): (ref Xpath, string);

     DESCRIPTION
          Xpointers implements a parser for the World-Wide Web
          Consortium's XPointers framework, including a parser for
          XPath expressions.

          Init must be called before any other operation in the mod-
          ule.

          Framework parses a string s according to the grammar for the
          XPointers framework, and returns a tuple
          (short, pointers, err).  On an error, the string err gives a
          diagnostic and the other two values are nil.  Otherwise, if
          short is non-nil, the XPointer was a `shorthand pointer',
          with the given value; pointers will be nil.  If a scheme-
          based pointer is used, short is nil and pointers has a list
          of tuples (ns, scheme, data), each representing one pointer
          value.  Ns is the XML name space for the given scheme; the
          default name space is represented by nil.  Scheme is the
          XPointer scheme name within that name space; and data is the
          actual pointer value following the rules of that scheme.
          (They all have completely different syntax.)

          Three common schemes are directly supported by the module,
          by functions named after the scheme.  All of them follow the
          convention of returning a tuple in which the last element is
          a diagnostic string.  On an error, all but that last element
          of the tuple will be nil, and the last element will be a
          non-nil string with a diagnostic.

     Page 2                       Plan 9            (printed 12/22/24)

     W3C-XPOINTERS(2)                                 W3C-XPOINTERS(2)

          Xmlns parses an XML name space definition of the form
          ns=uri, and returns its components.

          Element parses a value of the XPointer element scheme, given
          by the grammar:

               selector ::= name child*  |  child+
               child ::= '/' [1-9][0-9]*

          The optional name is an XPointer `shorthand pointer'.  Each
          child number selects the child with that index (origin 1) at
          the corresponding level of the XML tree beneath the node
          selected by the name, or starting at the root of the XML
          tree.  Element returns a tuple ((name, path), err) where
          name is the top element name or nil if none was specified,
          and path is a list of int giving the path of child indices.

          The most complex scheme is xpointer, because its syntax is
          that of XML's elaborate XPath expression.  Xpointer parses
          such an expression and returns a tuple (e, err) where e
          refers to an Xpath value that represents the abstract syntax
          of the XPath Expr.  Xpointer checks only the syntax of s,
          and does not check that functions are limited to those spec-
          ified by the xpointer scheme (that is consistent with it
          being a parse of s, rather than an XPointer or XPath evalua-
          tor).

          Xpath and Xstep together represent an abstract syntax of the
          XPath grammar.

          Xstep represents the XPath Step grammar rule, with all
          abbreviations expanded to their full form:

               Step ::= AxisName '::' NodeTest Predicate*
               NodeTest ::= NameTest | NodeType '(' ')'
               NameTest ::= '*' | NCName ':' '*' | (NCName ':')? NCName
               Predicate ::= '[' Expr ']'

          The correspondence is as follows:

          s.axis    Represents the AxisName by one of the constants
                    Aancestor to Aself.

          s.op      Onametest or Onodetype to say which rule is repre-
                    sented

          s.ns      For a NameTest, gives the XML name space; can be
                    `*' for `any name space' or nil for the default
                    name space.  For a NodeType, gives the type:
                    comment, node, processing-instruction, or text.

     Page 3                       Plan 9            (printed 12/22/24)

     W3C-XPOINTERS(2)                                 W3C-XPOINTERS(2)

          s.name    Gives the name for a NameTest; can be `*' for `any
                    name'.

          s.arg     The optional literal parameter to a NodeType that
                    is a processing-instruction.

          s.preds   A list of Xpath values representing the optional
                    sequence of Predicate expressions

          s.text()  Returns a string representing the Xstep in textual
                    form.

          s.axisname(a)
                    Returns the printable text for axis code a (ie,
                    one of Aancestor to Aself)

          Xpath values represent an abstract syntax for an XPath
          expression.  Briefly, an expression follows the grammar
          below (see the XPath specification for the full concrete
          syntax).

               e ::=  e 'or' e
                  |   e 'and' e
                  |   e ('=' | '!=') e
                  |   e ('<' | '<=' | '>=' | '>') e
                  |   e ('+' | '-') e
                  |   e ('*' | 'div' | 'mod') e
                  |   '-' e
                  |   e '|' e
                  |   filter
                  |   path
               filter ::= primary predicate* (('/' | '//') relpath)?
               primary ::= '$' QName | '(' e ')' | Literal | Number | FunctionName '(' (e (',' e)*) ')'
               path ::= '/' relpath | relpath
               relpath ::= relpath '/' relpath | relpath '//' relpath | Step

          Most of e is represented by a binary tree using the pick
          Xpath.E(op, l, r) where op is an operator symbol (either the
          character itself or one of the constants One, Odiv, etc. for
          compound symbols), and l and r represent the operands.  The
          only unary operator Oneg has its operand in l. A filter uses
          the binary operator Xpath.E(Ofilter, e, pred) to apply each
          predicate to the preceding primary or predicate. A filter
          also uses Xpath.E(Opath, e, relpath) to apply the optional
          relpath (represented by a value of Xpath.Path) to the pre-
          ceding part of the filter expression.

          The other cases in the pick adt correspond to the various
          choices of path and primary. Integer and real numbers are
          distinguished.  Literal is represented by Xpath.Str; vari-
          able references (ie, $QName) are represented by XPath.Var,
          where ns gives the optional XML name space of the name. Path

     Page 4                       Plan 9            (printed 12/22/24)

     W3C-XPOINTERS(2)                                 W3C-XPOINTERS(2)

          is represented by Xpath.Path(abs, steps) where abs is non-
          zero if and only and steps lists the Xstep values corre-
          sponding to the slash-separated Steps in the grammar.
          Abbreviated forms such as `//' are converted by xpointer to
          their full internal form in terms of `/', as defined by the
          specification, so there is no need to distinguish the delim-
          iters in this representation.

     SOURCE
          /appl/lib/w3c/xpointers.b

     SEE ALSO
          ``XML Path Language (XPath) Version 1.0'',
          http://www.w3.org/TR/xpath
          ``XPointer framework'', http://www.w3.org/TR/xptr-framework/
          ``XPointer element() scheme'',
          http://www.w3.org/TR/xptr-element/
          ``XPointer xmlns() scheme'',
          http://www.w3.org/TR/xptr-xmlns/
          ``XPointer xpointer() scheme'',
          http://www.w3.org/TR/xptr-xpointer/

     Page 5                       Plan 9            (printed 12/22/24)