Basic Syntax
PIR has a relatively simple syntax. Every line is a comment, a label, a statement, or a directive. Each statement or directive stands on its own line. There is no end-of-line symbol (such as a semicolon in C).
Comments
A comment begins with the #
symbol,
and continues until the end of the line.
Comments can stand alone on a line or follow a statement or directive.
# This is a regular comment. The PIR # interpreter ignores this.
PIR also treats inline documentation in Pod format as a comment.
An equals sign as the first character of a line marks the start of a Pod block.
A =cut
marker signals the end of a Pod block.
=head2 This is Pod documentation, and is treated like a comment. The PIR interpreter ignores this. =cut
Labels
A label attaches a name to a line of code so other statements can refer to it. Labels can contain letters, numbers, and underscores. By convention, labels use all capital letters to stand out from the rest of the source code. It's fine to put a label on the same line as a statement or directive:
GREET: say "'Allo, 'allo, 'allo."
Labels on separate lines improve readability, especially when outdented:
GREET: say "'Allo, 'allo, 'allo."
Statements
A statement is either an opcode or syntactic sugar for one or more opcodes. An opcode is a native instruction for the virtual machine; it consists of the name of the instruction followed by zero or more arguments.
say "Norwegian Blue"
PIR also provides higher-level constructs, including symbolic operators:
$I1 = 2 + 5
These special statement forms are just syntactic sugar for regular opcodes. The +
symbol corresponds to the add
opcode, the -
symbol to the sub
opcode, and so on. The previous example is equivalent to:
add $I1, 2, 5
Directives
Directives resemble opcodes, but they begin with a period (.
). Some directives specify actions that occur at compile time. Other directives represent complex operations that require the generation of multiple instructions. The .local
directive, for example, declares a named variable.
.local string hello
Literals
Integers and floating point numbers are numeric literals. They can be positive or negative.
$I0 = 42 # positive $I1 = -1 # negative
Integer literals can also be binary, octal, or hexadecimal:
$I1 = 0b01010 # binary $I2 = 0o72 # octal $I3 = 0xA5 # hexadecimal
Floating point number literals have a decimal point and can use scientific notation:
$N0 = 3.14 $N2 = -1.2e+4
String literals are enclosed in single or double-quotes.See the section on Strings in Chapter 4 for an explanation of the differences between the quoting types.
$S0 = "This is a valid literal string" $S1 = 'This is also a valid literal string'
Variables
PIR variables can store four different kinds of values—integers, numbers (floating point), strings, and objects. Parrot's objects are called PMCs, for "PolyMorphic Container".
The simplest kind of variable is a register variable. The name of a register variable always starts with a dollar sign ($
), followed by a single character which specifies the type of the variable -- integer (I
), number (N
), string (S
), or PMC (P
) -- and ends with a unique number. You need not predeclare register variables:
$S0 = "Who's a pretty boy, then?" say $S0
PIR also has named variables; the .local
directive declares them. As with register variables, there are four valid types: int
, num
, string
, and pmc
. You must declare named variables; otherwise they behave exactly the same as register variables.
.local string hello hello = "'Allo, 'allo, 'allo." say hello
Constants
The .const
directive declares a named constant. Named constants are similar to named variables, but the values set in the declaration may never change. Like .local
, .const
takes a type and a name. It also requires a literal argument to set the value of the constant.
.const int frog = 4 # integer .const string name = "Superintendent Parrot" # string .const num pi = 3.14159 # floating point
You may use a named constant anywhere you may use a literal, but you must declare the named constant beforehand. This example declares a named string constant hello
and prints the value:
.const string hello = "Hello, Polly." say hello
Keys
A key is a special kind of constant used for accessing elements in complex variables (such as an array). A key is either an integer or a string; and it's always enclosed in square brackets ([
and ]
). You do not have to declare literal keys. This code example stores the string "foo" in $P0 as element 5, and then retrieves it.
$P0[5] = "foo" $S1 = $P0[5]
PIR supports multi-part keys. Use a semicolon to separate each part.
$P0['my';'key'] = 472 $I1 = $P0['my';'key']
Control Structures
Rather than providing a pre-packaged set of control structures like if
and while
, PIR gives you the building blocks to construct your own.PIR has many advanced features, but at heart it is an assembly language. The most basic of these building blocks is goto
, which jumps to a named label.This is not your father's goto
. It can only jump inside a subroutine, and only to a named label. In this code example, the say
statement will run immediately after the goto
statement:
goto GREET # ... some skipped code ... GREET: say "'Allo, 'allo, 'allo."
Variations on the basic goto
check whether a particular condition is true or false before jumping:
if $I0 > 5 goto GREET
You can construct any traditional control structure from PIR's built-in control structures.
Subroutines
A PIR subroutine starts with the .sub
directive and ends with the .end
directive. Parameter declarations use the .param
directive; they resemble named variable declarations. This example declares a subroutine named greeting
, that takes a single string parameter named hello
:
.sub 'greeting' .param string hello say hello .end
That's All Folks
You now know everything you need to know about PIR. Everything else you read or learn about PIR will use one of these fundamental language structures. The rest is vocabulary.