| parrotcode: Parrot Exceptions | |
| Contents | Documentation | 

docs/pdds/pdd23_exceptions.pod - Parrot Exceptions

This document defines the requirements and implementation strategy for Parrot's exception system.

$Revision$

Exceptions are indications by running code that something unusual -- an "exception" to the normal processing -- has occurred. When code detects an exceptional condition, it throws an exception object. Before this occurs, code can register exception handlers, which are functions (or closures) which may (but are not obligated to) handle the exception. Some exceptions permit continued execution immediately after the throw; some don't.
Exceptions transfer control to a piece of code outside the normal flow of control. They are mainly used for error reporting or cleanup tasks.
(A digression on terminology: In a system analysis sense, the word "exception" usually refers to the exceptional event that requires out-of-band handling. However, in Parrot, "exception" also refers to the object that holds all the information describing the exceptional condition: the nature of the exception, the error message describing it, and other ancillary information. The specific type (class) of an exception object indicates its category.)

These are the opcodes relevant to exceptions and exception handlers:
newclosure operation on the given subroutine,
and pushes the resulting closure.throw is a one-way trip (unless you have made other arrangements) because Parrot does not take a continuation after this opcode.
(But see throwcc below.)parrot;exception class,
described below.throwcc is just like throw.
This opcode is useful for exceptions that are more like warnings or notices than errors.throwcc opcode by executing the handled opcode,
and then invoking the given continuation which they receive as a parameter.
That continuation must be invoked with no parameters; in other words,
throwcc never returns a value.die opcode throws an exception of type exception;death with a payload of MESSAGE.
If MESSAGE is a string register,
the exception payload is a String PMC containing MESSAGE; if MESSAGE is a PMC,
it is used directly as the exception payload.exit(1).exception;exit with a payload of EXITCODE,
which defaults to zero,
as an Integer PMC.exit(EXITCODE).
When throw or throwcc is called, for all active exception handlers, in LIFO order:
If the handler returns without calling handled:
throwcc),
invoke the continuation (resume execution).
Whether to resume or die when an exception isn't handled is determined by the severity of the exception.die.When running an embedded Parrot interpreter, the interpreter does not immediately terminate on an unhandled exception, it merely returns control to the embedding program and stores the unhandled exception so that it may be queried by the embedding program. The embedding program may choose to handle the exception and continue execution by invoking the exception's continuation.
When the handled opcode is called:


All of Parrot's standard exceptions provide at least the following interface. It is STRONGLY RECOMMENDED that all classes intended for throwing also provide at least this interface as well.
String,
but you should still be able to stringify and print it.get_inner_exception() method returns that previous exception,
else it returns null.
Parrot's standard exceptions provide some additional methods beyond the three universal exception methods shown above. The additional methods are:
exception;errno is an integer.
In addition,
some exceptions don't require payloads,
thus:set_inner_exception() method to store that previous exception as part of the exception object.
Parrot comes with a small hierarchy of classes designed for use as exceptions. Parrot throws them when internal Parrot errors occur, but any user code can throw them too.
isa testing.errno.
Payload is an integer.
Message is the return value of the standard C function strerror().find_lex or store_lex operation failed because a given lexical variable was not found.
Payload is an array: [0] the name of the lexical variable that was not found,
[1] the LexPad in which it was not found.
Exceptions have been incorporated into built-in opcodes in a limited way.
For the most part,
they're used when the return value is either impractical to check (perhaps because we don't want to add that many error checks in line),
or where the output type is unable to represent an error state (e.g.
the output I register of the ord opcode).
The div,
fdiv,
and cmod opcodes throw exception;math;division_by_zero.
The ord opcode throws exception;domain when it's passed an empty argument or a string index that's outside the length of the string.
Payload is an array,
first element being the string 'ord'.
The classoffset opcode throws exception;domain when it's asked to retrieve the attribute offset for a class that isn't in the object's inheritance hierarchy.
Payload is an array: [0] string 'classoffset',
[1] object in question,
[2] ID of class not found.
The find_charset opcode throws exception;domain if the charset name it's looking up doesn't exist.
Payload is an array: [0] string 'find_charset',
[1] charset name that was not found.
The trans_charset opcode throws exception;domain on "information loss" (presumably,
this means when one charset doesn't have a one-to-one correspondence in the other charset).
Payload is an array: [0] string 'trans_charset',
[1] source charset name,
[2] destination charset name,
[3] untranslatable code point.
The find_encoding opcode throws exception;domain if the encoding name it's looking up doesn't exist.
Payload is an array: [0] string 'find_encoding',
[1] encoding name that was not found.
The trans_encoding opcode throws exception;domain on "information loss" (presumably,
this means when one encoding doesn't have a one-to-one correspondence in the other encoding).
Payload is an array: [0] string 'trans_encoding',
[1] source encoding name,
[2] destination encoding name,
[3] untranslatable code point.
Parrot's default version of the LexPad PMC throws exception;lexical for some error conditions,
though other implementations can choose to return error values instead.
By default,
the find_lex and store_lex opcodes throw an exception (exception;lexical) when the given name can't be found in any visible lexical pads.
However,
this behavior is only a default,
as provided by the default Parrot lexical pad PMC LexPad.
If a given HLL has its own lexical pad PMC,
its behavior may be very different.
(For example,
in Tcl,
store_lex is likely to succeed every time,
as creating new lexicals at runtime is OK in Tcl.)
{{ TODO: List any other opcodes that currently throw exceptions and general categories of opcodes that should throw exceptions. }}
Other opcodes respond to an errorson setting to decide whether to throw an exception or return an error value.
find_global throws an exception (or returns a Null PMC) if the global name requested doesn't exist.
find_name throws an exception (or returns a Null PMC) if the name requested doesn't exist in a lexical,
current,
global,
or built-in namespace.
{{ TODO: "errorson" as specified is dynamically rather than lexically scoped; is this good? Probably not good. Let's revisit it when we get the basic exceptions functionality implemented. }}
It's a little odd that so few opcodes throw exceptions (these are the ones that are documented,
but a few others throw exceptions internally even though they aren't documented as doing so).
It's worth considering either expanding the use of exceptions consistently throughout the opcode set,
or eliminating exceptions from the opcode set entirely.
The strategy for error handling should be consistent,
whatever it is.
[I like the way LexPads and the errorson settings provide the option for exception-based or non-exception-based implementations,
rather than forcing one or the other.]
{{ NOTE: There are a couple of different factors here. One is the ability to globally define the severity of certain exceptions or categories of exceptions without needing to define a handler for each one. (e.g. Perl 6 may have pragmas to set how severe type-checking errors are. A simple "incompatible type" error may be fatal under one pragma, a resumable warning under another pragma, and completely silent under a third pragma.) Another is the ability to "defang" opcodes so they return error codes instead of throwing exceptions. We might provide a very simple interface to catch an exception and capture its payload without the full complexity of manually defining exception handlers (though it would still be implemented as an exception handler internally). Something like:
  .local pmc error_code
  .capture_start error_code
  $P1 = find_lex 'foo'
  .capture_end
  # error_code contains what would have been the "error" return value
This could eliminate the need for "defanging" because it would be almost as easy to use as error codes. It could be implemented once for all exceptional opcodes, instead of needing to be defined for each one. And, it still keeps the error information out-of-band, instead of mixing the error in with normal return values. }}

Exceptions thrown by standard Parrot opcodes (like the one thrown by find_global above or by the throw opcode) are always resumable, so when the exception handler function returns normally it continues execution at the opcode immediately after the one that threw the exception. Other exceptions at the run-loop level are also generally resumable.
  $P0 = new String
  $P0 = "something bad happened"
  $P1 = new ['parrot';'exception'], $P0  # create new exception object
  throw $P1                              # throw it

None.

None.

  src/ops/core.ops
  src/exceptions.c
        
|  |   |