Defining modules

A module is defined by four kinds of subclauses which, if present, always appear in the same order.

import - a reference to a shared module definition; repeatable
module - a nested module definition; repeatable
macros - an exported list of macro definitions
symbols - an exported list of text values

The lexical name given to a module definition must be an identifier. However, it must not begin with a $--this is reserved for system-defined bindings like $ion.

Internal environment

The body of a module tracks an internal environment by which macro references are resolved. This environment is constructed incrementally by each clause in the definition and consists of:

the module bindings, a map from identifier to module definition
the exported symbols, an array containing symbol texts
the exported macros, an array containing name/macro pairs

Before any clauses of the module definition are examined, each of these is empty.

Each clause affects the environment as follows:

An import declaration retrieves a shared module from the implementation’s catalog and binds a name to it, making its macros available for use. An error must be signaled if the name already appears in the module bindings.
A module declaration defines a new module and binds a name to it. An error must be signaled if the name already appears in the module bindings.
A macros declaration defines the exported macros.
A symbols declaration defines the exported symbols.

Resolving Macro References

Within a module definition, macros can be referenced in several contexts using the following macro-ref syntax:

qualified-ref      ::= module-name '::' macro-ref

macro-ref          ::= macro-name | macro-addr

macro-name         ::= unannotated-identifier-symbol

macro-addr         ::= unannotated-uint

Macro references are resolved to a specific macro as follows:

An unqualified macro-name is looked up in the following locations:
1. in the macros already exported in this module's macros
2. in the default_module
3. in the system module
If it maps to a macro, that’s the resolution of the reference. Otherwise, an error is signaled due to an unbound reference.
An anonymous local reference (macro-addr) is resolved by index in the exported macro array. If the address exceeds the array boundary, an error is signaled due to an invalid reference.
A qualified reference (qualified-ref) resolves solely against the referenced module. First, the module name must be resolved to a module definition.
- If the module name is in the module bindings, it resolves to the corresponding module definition.
- If the module name is not in the module bindings, resolution is attempted recursively upwards through the parent scopes.
- If the search reaches the top level without resolving to a module, an error is signaled due to an unbound reference.
Next, the name or address is resolved within that module definition’s exported macro table.

`import`

import             ::= '(import ' module-name catalog-key ')'

module-name        ::= unannotated-identifier-symbol

catalog-key        ::= catalog-name catalog-version?

catalog-name       ::= string

catalog-version    ::= int // positive, unannotated

An import binds a lexically scoped module name to a shared module that is identified by a catalog key—a (name, version) pair. The version of the catalog key is optional—when omitted, the version is implicitly 1.

In Ion 1.0, imports may be substituted with a different version if an exact match is not found. In Ion 1.1, however, all imports require an exact match to be found in the reader's catalog; if an exact match is not found, the implementation must signal an error.

`module`

The module clause defines a new local module that is contained in the current module.

inner-module ::= '(module' module-name import* macro-table? symbol-table? ')'

Inner modules automatically have access to modules previously declared in the containing module using module or import. The new module (and its exported symbols and macros) is available to any following module, symbols, and macros clauses in the enclosing container.

See local modules for full explanation.

`macros`

The macros clause assembles a list of macro definitions for the module to export. It takes any number of arguments. All modules have a macro table, so when a module has no macros clause, the module has an empty macro table.

Most commonly, a macro table entry is a definition of a new macro expansion function, following this general shape:

//  ┌─── `macro` keyword
//  │   ┌─── macro name
//  │   │     ┌─── signature (s-expression of parameters)
//  │   │     │         ┌─── template (TDL expression)
(macro foo (x y z) (.values (%x) (%y) (%z))

(See the Defining macros for details.)

When the value null is given for the macro name, this defines an anonymous macro that can be referenced by its numeric address (that is, its index in the enclosing macro table). Inside the defining module, that uses a local reference like 12.

The signature defines the syntactic shape of expressions invoking the macro; see Macro Signatures for details. The template defines the expansion of the macro, in terms of the signature’s parameters; see Template Expressions for details.

Imported macros must be explicitly exported if so desired. Module names and export clauses can be intermingled with macro definitions inside the macros clause; together, they determine the bindings that make up the module’s exported macro array.

The module-name export form is shorthand for referencing all exported macros from that module, in their original order with their original names.

An export clause contains a single macro reference followed by an optional alias for the exported macro. The referenced macro is appended to the macro table.

tip

No name can be repeated among the exported macros, including macro definitions. Name conflicts must be resolved by exports with aliases.

Processing

When the macros clause is encountered, the reader constructs an empty list. The arguments to the clause are then processed from left to right.

For each arg:

If the arg is a macro clause, the clause is processed and the resulting macro definition is appended to the end of the macro table being constructed.
If the arg is an export clause, the clause is processed and the referenced macro definition is appended to the end of the macro table being constructed.
If the arg is the name of a module, the macro definitions in that module's macro table are appended to the end of the macro table being constructed.
If the arg is anything else, the reader must raise an error.

A macro name is a symbol that can be used to reference a macro, both inside and outside the module. Macro names are optional, and improve legibility when using, writing, and debugging macros. When a name is used, it must be an identifier per Ion’s syntax for symbols. Macro definitions being added to the macro table must have a unique name. If a macro is added whose name conflicts with one already present in the table, the implementation must raise an error.

`macro`

A macro clause defines a new macro. When the macro declaration uses a name, an error must be signaled if it already appears in the exported macro array.

`export`

An export clause declares a name for an existing macro and appends the macro to the macro table.

If the reference to the existing macro is followed by a name, the existing macro is appended to the exported macro array with the latter name instead of the original name, if any. In this way, an anonymous macro can be given a name. An error must be signaled if that name already appears in the exported macro array.
If the reference to the existing macro is followed by null, the macro is appended to the exported macro array without a name, regardless of whether the macro has a name.
If the reference to the existing macro is anonymous, the macro is appended to the exported macro array without a name.
When the reference to the existing macro uses a name, the name and macro are appended to the exported macro
array. An error must be signaled if that name already appears in the exported macro array.

Module names in `macros`

A module name appends all exported macros from the module to the exported macro array. If any exported macro uses a name that already appears in the exported macro array, an error must be signaled.

`symbols`

A module can define a list of exported symbols by copying symbols from other modules and/or declaring new symbols.

symbol-table       ::= '(symbols' symbol-table-entry* ')'

symbol-table-entry ::= module-name | symbol-list

symbol-list        ::= '[' ( symbol-text ',' )* ']'

symbol-text        ::= symbol | string

The symbols clause assembles a list of text values for the module to export. It takes any number of arguments, each of which may be the name of visible module or a list of symbol-texts. The symbol table is a list of symbol-texts by concatenating the symbol tables of named modules and lists of symbol/string values.

Where a module name occurs, its symbol table is appended. (The module name must refer to another module that is visible to the current module.) Unlike Ion 1.0, no symbol-maxid is needed because Ion 1.1 always requires exact matches for imported modules.

tip

When redefining a top-level module binding, the binding being redefined can be added to the symbol table in order to retain its symbols. For example:

// Define module `foo`
$ion::
(module foo
    (symbols ["b", "c"]))

// Redefine `foo` in terms of its former definition
$ion::
(module foo
    (symbols
        ["a"]
        foo // The old definition of `foo` with symbols ["b", "c"]
        ["d"]))

// Now `foo`'s symbol table is ["a", "b", "c", "d"]

Where a list occurs, it must contain only non-null, unannotated strings and symbols. The text of these strings and/or symbols are appended to the symbol table. Upon encountering any non-text value, null value, or annotated value in the list, the implementation shall signal an error.
To add a symbol with unknown text to the symbol table, one may use $0.

All modules have a symbol table, so when a module has no symbols clause, the module has an empty symbol table.

Symbol zero `$0`

Symbol zero (i.e. $0) is a special symbol that is not assigned text by any symbol table, even the system symbol table. Symbol zero always has unknown text, and can be useful in synthesizing symbol identifiers where the text of the symbol is not known in a particular operating context.

All symbol tables (even an empty symbol table) can be thought of as implicitly containing $0. However, $0 precedes all symbol tables rather than belonging to any symbol table. When adding the exported symbols from one module to the symbol table of another, the preceding $0 is not copied into the destination symbol table (because it is not part of the source symbol table).

It is important to note that $0 is only semantically equivalent to itself and to locally-declared SIDs with unknown text. It is not semantically equivalent to SIDs with unknown text from shared symbol tables, so replacing such SIDs with $0 is a destructive operation to the semantics of the data.

Processing

When the symbols clause is encountered, the reader constructs an empty list. The arguments to the clause are then processed from left to right.

For each arg:

If the arg is a list of text values, the nested text values are appended to the end of the symbol table being constructed.
- When $0 appears in the list of text values, this creates a symbol with unknown text.
- The presence of any other Ion value in the list raises an error.
If the arg is the name of a module, the symbols in that module's symbol table are appended to the end of the symbol table being constructed.
If the arg is anything else, the reader must raise an error.

Example

(symbols         // Constructs an empty symbol table (list)
  ["a", b, 'c']       // The text values in this list are appended to the table
  foo                 // Module `foo`'s symbol table values are appended to the table
  ['''g''', "h", i])  // The text values in this list are appended to the table

If module foo's symbol table were [d, e, f], then the symbol table defined by the above clause would be:

["a", "b", "c", "d", "e", "f", "g", "h", "i"]

This is an Ion 1.0 symbol table that imports two shared symbol tables and then declares some symbols of its own.

$ion_1_0
$ion_symbol_table::{
  imports: [{ name: "com.example.shared1", version: 1, max_id: 10 },
            { name: "com.example.shared2", version: 2, max_id: 20 }],
  symbols: ["s1", "s2"]
}

Here’s the Ion 1.1 equivalent in terms of symbol allocation order:

$ion_1_1
$ion::(import m1 "com.example.shared1" 1)
$ion::(import m2 "com.example.shared2" 2)
$ion::
(module _
  (symbols m1 m2 ["s1", "s2"])
)

Keyboard shortcuts

The Ion 1.1 Draft Specification