NAME

lib/lpeg.pir - Parsing Expression Grammar for Lua, version 0.9

lpeg.match (pattern, subject [, init])
lpeg.print (pattern)
lpeg.span (string)
lpeg.type (value)
lpeg.version ()
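As a rough illustration of the matching functions listed above, the following sketch assumes the library loads with require "lpeg" and behaves like the reference LPeg implementation. The pattern name digits is only illustrative.

```lua
-- Assumption: the module loads as "lpeg" and follows the reference LPeg API.
local lpeg = require "lpeg"

-- One or more decimal digits (illustrative pattern, not from the library).
local digits = lpeg.R("09")^1

-- A match is anchored at the start position. On success, a pattern without
-- captures yields the index of the first character after the match.
local after     = lpeg.match(digits, "123abc")     -- 4
local missed    = lpeg.match(digits, "abc")        -- nil: "a" is not a digit
local from_init = lpeg.match(digits, "abc123", 4)  -- 7: matching starts at index 4

-- lpeg.type returns the string "pattern" for patterns and nil otherwise.
local kind        = lpeg.type(digits)              -- "pattern"
local not_pattern = lpeg.type("123")               -- nil

print(after, missed, from_init, kind, not_pattern)
```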

Basic Constructions

The following operations build patterns. All operations that expect a pattern as an argument may also receive strings, tables, numbers, booleans, or functions, which are translated to patterns according to the rules of function lpeg.P.
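The conversion rules of lpeg.P can be sketched as follows. This is a minimal illustration, assuming the behaviour of the reference LPeg implementation (strings match literally, a non-negative number n matches exactly n characters, a negative number -n succeeds only when fewer than n characters remain, and booleans always succeed or always fail). All variable names are illustrative.

```lua
-- Assumption: the module loads as "lpeg" and follows the reference LPeg API.
local lpeg = require "lpeg"

-- A string is matched literally.
local literal   = lpeg.match(lpeg.P("key"), "keyword")  -- 4
-- A non-negative number n matches exactly n characters.
local three     = lpeg.match(lpeg.P(3), "abcd")         -- 4
local too_short = lpeg.match(lpeg.P(3), "ab")           -- nil
-- A negative number -n succeeds only if fewer than n characters remain.
local at_end    = lpeg.match(lpeg.P(-1), "")            -- 1
-- true always matches without consuming input; false never matches.
local always    = lpeg.match(lpeg.P(true), "x")         -- 1
local never     = lpeg.match(lpeg.P(false), "x")        -- nil
-- Operands of pattern operators are converted the same way.
local implicit  = lpeg.match(lpeg.P("a") * 1 * "c", "abc")  -- 4

print(literal, three, too_short, at_end, always, never, implicit)
```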

lpeg.P (value)
lpeg.R ({range})
lpeg.S (string)
lpeg.V (v)
locale ([table])
#patt
-patt
patt1 + patt2
patt1 - patt2
patt1 * patt2
patt^n
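The constructors and operators listed above can be combined as in the sketch below. It assumes the reference LPeg semantics (sequence for *, ordered choice for +, difference for the binary -, and repetition for ^), and the pattern names are invented for the example.

```lua
-- Assumption: the module loads as "lpeg" and follows the reference LPeg API.
local lpeg = require "lpeg"
local P, R, S = lpeg.P, lpeg.R, lpeg.S

local letter = R("az", "AZ")   -- any ASCII letter
local digit  = R("09")         -- any decimal digit

-- Sequence (*), ordered choice (+), and repetition (^0: zero or more).
local identifier = (letter + "_") * (letter + digit + "_")^0
-- ^-1 means "at most one"; ^1 means "one or more".
local integer = S("+-")^-1 * digit^1
-- Difference: any run of characters that are not a newline.
local rest_of_line = (P(1) - "\n")^0
-- -patt succeeds only where patt fails, so -P(1) matches only at the end.
local whole_integer = integer * -P(1)
-- #patt checks that patt matches here without consuming any input.
local starts_with_digit = #digit

local id_end   = lpeg.match(identifier, "foo_1 bar")   -- 6
local int_end  = lpeg.match(integer, "-42")            -- 4
local line_end = lpeg.match(rest_of_line, "ab\ncd")    -- 3
local strict   = lpeg.match(whole_integer, "-42x")     -- nil: trailing "x"
local peek     = lpeg.match(starts_with_digit, "7up")  -- 1: nothing consumed

print(id_end, int_end, line_end, strict, peek)
```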

Grammars

With the use of Lua variables, it is possible to define patterns incrementally, with each new pattern using previously defined ones. However, this technique does not allow the definition of recursive patterns. For recursive patterns, we need real grammars.

LPeg represents grammars with tables, where each entry is a rule.

The call lpeg.V(v) creates a pattern that represents the nonterminal (or variable) with index v in a grammar. Because the grammar still does not exist when this function is evaluated, the result is an open reference to the respective rule.

A table is fixed when it is converted to a pattern (either by calling lpeg.P or by using it where a pattern is expected). At that point, every open reference created by lpeg.V(v) is resolved to refer to the rule indexed by v in the table.

When a table is fixed, the result is a pattern that matches its initial rule. The entry with index 1 in the table defines its initial rule. If that entry is a string, it is assumed to be the name of the initial rule. Otherwise, LPeg assumes that the entry 1 itself is the initial rule.

As an example, the following grammar matches strings of a's and b's that have the same number of a's and b's:

 equalcount = lpeg.P{
  "S";   -- initial rule name
  S = "a" * lpeg.V"B" + "b" * lpeg.V"A" + "",
  A = "a" * lpeg.V"S" + "b" * lpeg.V"A" * lpeg.V"A",
  B = "b" * lpeg.V"S" + "a" * lpeg.V"B" * lpeg.V"B",
 } * -1
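The grammar can be exercised as in the sketch below, which repeats the definition so that it runs on its own. It also shows the second way of choosing the initial rule, where entry 1 is the rule itself; the balanced pattern is an illustrative addition, and the code assumes the reference LPeg behaviour.

```lua
-- Assumption: the module loads as "lpeg" and follows the reference LPeg API.
local lpeg = require "lpeg"
local V = lpeg.V

-- The grammar from the example above, repeated so this sketch is self-contained.
local equalcount = lpeg.P{
  "S";   -- initial rule name
  S = "a" * V"B" + "b" * V"A" + "",
  A = "a" * V"S" + "b" * V"A" * V"A",
  B = "b" * V"S" + "a" * V"B" * V"B",
} * -1

-- Second form: entry 1 is itself the initial rule, referenced as V(1).
-- Illustrative grammar: one pair of parentheses, possibly with nested pairs.
local balanced = lpeg.P{
  "(" * ((1 - lpeg.S("()")) + V(1))^0 * ")"
}

local even    = lpeg.match(equalcount, "abab")    -- 5: two a's, two b's
local empty   = lpeg.match(equalcount, "")        -- 1: zero of each
local uneven  = lpeg.match(equalcount, "aab")     -- nil: counts differ
local nested  = lpeg.match(balanced, "(a(b)c)")   -- 8
local unclosed = lpeg.match(balanced, "(a(b")     -- nil

print(even, empty, uneven, nested, unclosed)
```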

Captures

Captures specify what a match operation should return (the so-called semantic information). LPeg offers several kinds of captures, which produce values based on matches and combine them to produce new values.

A capture pattern produces its values every time it succeeds. For instance, a capture inside a loop produces as many values as matched by the loop. A capture produces a value only when it succeeds. For instance, the pattern lpeg.C(lpeg.P"a"^-1) produces the empty string when there is no "a" (because the pattern "a"? succeeds), while the pattern lpeg.C("a")^-1 does not produce any value when there is no "a" (because the pattern "a" fails).
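The difference between the two patterns in this paragraph can be checked with a short sketch. It assumes the reference LPeg behaviour, in which lpeg.match returns the position after the match when no capture value is produced. The variable names are illustrative.

```lua
-- Assumption: the module loads as "lpeg" and follows the reference LPeg API.
local lpeg = require "lpeg"
local C, P, R = lpeg.C, lpeg.P, lpeg.R

-- Capture around an optional match: the capture itself always succeeds.
local capture_optional = C(P"a"^-1)
-- Optional capture: the capture is skipped when "a" fails.
local optional_capture = C("a")^-1

local got  = lpeg.match(capture_optional, "b")   -- "": the empty string is captured
local none = lpeg.match(optional_capture, "b")   -- 1: no capture, so a position is returned

-- A capture inside a loop produces one value per iteration.
local digits = { lpeg.match(C(R("09"))^0, "123") }   -- { "1", "2", "3" }

print(got == "", none, #digits)
```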

Usually, LPeg evaluates all captures only after (and if) the entire match succeeds. At match time it only gathers enough information to produce the capture values later. As a particularly important consequence, most captures cannot affect the way a pattern matches a subject. The only exception to this rule is the so-called match-time capture. When a match-time capture matches, it forces the immediate evaluation of all its nested captures and then calls its corresponding function, which tells whether the match succeeds and also what values are produced.
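A match-time capture can be sketched as follows. The example assumes the reference LPeg convention for lpeg.Cmt: the function receives the subject, the position after the match, and the nested capture values, and it returns either a new position plus any produced values, or false to make the match fail. The byte pattern is an illustrative name.

```lua
-- Assumption: the module loads as "lpeg" and follows the reference LPeg API.
local lpeg = require "lpeg"

-- Nested capture: the text of one or more digits.
local digits = lpeg.C(lpeg.R("09")^1)

-- Accept the digits only when their value fits in one byte (0..255).
local byte = lpeg.Cmt(digits, function(subject, pos, text)
  local n = tonumber(text)
  if n <= 255 then
    return pos, n   -- succeed at the current position and produce n
  end
  return false      -- make the whole capture fail at match time
end)

local ok  = lpeg.match(byte, "200")   -- 200
local bad = lpeg.match(byte, "300")   -- nil: the function rejected the match

print(ok, bad)
```

Because the function runs during matching, its verdict can steer the match itself, which ordinary captures cannot do.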

lpeg.C (patt)
lpeg.Carg (n)
lpeg.Cb (name)
lpeg.Cc ({value})
lpeg.Cf (patt, func)
lpeg.Cg (patt [, name])
lpeg.Cp ()
lpeg.Cs (patt)
lpeg.Ct (patt)
patt / string
patt / table
patt / function
lpeg.Cmt (patt, function)
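A few of the capture constructors listed above are combined in the sketch below. It assumes the reference LPeg semantics for function captures, table captures, fold captures, substitution captures, and position captures. The names number and list are illustrative.

```lua
-- Assumption: the module loads as "lpeg" and follows the reference LPeg API.
local lpeg = require "lpeg"
local Cf, Cp, Cs, Ct, P, R = lpeg.Cf, lpeg.Cp, lpeg.Cs, lpeg.Ct, lpeg.P, lpeg.R

-- patt / function: pass the matched text to tonumber.
local number = R("09")^1 / tonumber
-- A comma-separated list of numbers, producing one value per number.
local list = number * ("," * number)^0

-- Ct collects all values produced by the pattern into a table.
local as_table = lpeg.match(Ct(list), "10,20,30")   -- { 10, 20, 30 }

-- Cf folds the produced values with a function, left to right.
local sum = lpeg.match(Cf(list, function(acc, n) return acc + n end), "10,20,30")  -- 60

-- Cs returns the matched text with nested captures substituted
-- (patt / string replaces each "a" with "b").
local replaced = lpeg.match(Cs((P"a" / "b" + 1)^0), "banana")   -- "bbnbnb"

-- Cp produces the current position without consuming input.
local pos = lpeg.match(P"ab" * Cp(), "abc")   -- 3

print(#as_table, sum, replaced, pos)
```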

Some Examples

http://www.inf.puc-rio.br/~roberto/lpeg.html#ex
