torque-asm - Lightweight meta-assembler

	Commit message (Collapse)	Author	Age
*	Defer evaluation of block arguments	Ben Bridle	35 hours
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	When a block value is evaluated, the address tracker is immediately incremented for each word in the evaluated block. This was causing an issue when a block value was passed as an argument to a macro invocation, because argument values would all be evaluated before the macro body was evaluated. This meant that the value of every label in the macro body would be pushed forward by the size of the block argument, even if the block argument is inserted later in the macro. Furthermore, the address tracker would be incremented exactly once for each word in the block, even if the block is never inserted into the macro or is inserted multiple times. To fix this, we now defer evaluation of a block argument until exactly when that block argument is inserted into a macro. Because a block argument could be an invocation with a set of arguments passed to it, we also store a copy of the current environment alongside the block value so that it can be evaluated correctly at any time in the future.
*	Move parse_macro_definition_body function closer to calling site	Ben Bridle	35 hours
\| \| \| \|	This simplifies the next commit.
*	Improve 'expected block token in macro definition' error message	Ben Bridle	35 hours
\| \| \| \| \| \| \| \| \| \| \| \| \|	When a macro definition body contains more than one token, every token must be a block token. When a non-block token is included in one of these macro definitions, an error points to the offending token and says that a block token was expected in this macro definition. The change that this commit makes is to also point to the top of the macro definition to make it clearer that this macro definition is part of the problem, not just the errant integer or list by itself. Without this, the error message could be read to say that the integer or list itself is a macro definition, which could be confusing.
*	Allow macros with different arities to share argument names	Ben Bridle	2025-12-04
\| \| \| \| \| \| \| \| \| \| \|	This commit includes the arity of a parent macro in the namespace of a macro argument, so that the assembler library can recognise that two macro implementations with different arities are not the same. This fixes an error where two implementations of a macro with the same name but different arity could not share argument names, because the arity of the parent macro wasn't being taken into account when checking for redefinitions.
*	Convert single-element lists to integers when an integer is required	Ben Bridle	2025-12-04
\| \| \| \| \| \|	This allows an integer invocation with arguments to be wrapped in `[]` square brackets so that it can be passed to another invocation as an argument.
*	Use invocation source for integer type error, not macro source	Ben Bridle	2025-11-26
\|
*	Remove semantic::ListLiteral::Invocation	Ben Bridle	2025-11-26
\| \| \| \| \| \|	This variant was never needed or constructed, I would have added it to match the invocation variants of the block and integer tokens (which do have a use).
*	Allow quote characters to be used in identifiers	Ben Bridle	2025-10-27
\| \| \| \| \| \|	The single-quote character is used in register names on some processor architectures. The double-quote character is also allowed to be used in identifiers for consistency, it could be confusing otherwise.
*	Raise an error if recursion depth exceeds a maximum value	Ben Bridle	2025-10-15
\| \| \| \| \| \|	This is preferable to having the assembler crash with a stack overflow error, because the user can now see which invocation caused the overflow.
*	Fix incorrect first segment address	Ben Bridle	2025-10-15
\| \| \| \| \| \|	Previously, if the first intermediate token in a program was a pinned address, the address was being discarded and address zero was being pinned instead.
*	Remove string-style argument syntax from error message	Ben Bridle	2025-10-15
\| \| \| \| \| \|	The string-style argument syntax is no longer supported, it has been replaced with a list-style syntax (using square brackets instead of double-quotes).
*	Implement new bytecode stage	Ben Bridle	2025-10-14
\| \| \| \| \|	This completes the Torque version 3 rewrite, other than some extensive testing that is yet to be done.
*	Implement new intermediate stage	Ben Bridle	2025-10-14
\| \| \| \| \| \| \| \|	Massive improvement. Label references can be used anywhere in the program, with the program being assembled repeatedly until all labels have stabilised. The bytecode stage will just be a tiny stage tacked onto the end, rather than the old bytecode stage that would resolve labels and expressions.
*	Track more information with SymbolParser	Ben Bridle	2025-10-14
\| \| \| \| \| \| \| \| \|	To be able to re-use the symbol parser code in the intermediate stage in order to quickly find all symbol definitions and invocations in the program (in particular now that macro definitions are indistinguishable from label definitions, since they can both follow references), the symbol parser now tracks additional information about each symbol (argument count, torque-specific role, and parent macro definition).
*	Implement lists as a first-class type	Ben Bridle	2025-10-14
\| \| \| \| \| \| \| \| \| \|	Previously, strings were implemented as lists of integers, but there was no way to create an arbitrary list of integers to pass to a macro invocation. This commit extends the []-delimited expression syntax, treating expressions that contain only integers and invocations as lists of integers. Strings are implemented as a sub-type of list.
*	Change string argument syntax to list argument syntax	Ben Bridle	2025-10-14
\| \| \| \| \|	Changes string arguments from "name" to [name], to prepare for generic lists in the language. Doesn't compile.
*	Allow a macro invocation to precede the macro definition	Ben Bridle	2025-10-14
\| \| \| \| \|	There is no longer any reason to enforce this restriction. This will allow the use of more complex macro recursion by programs.
*	Include argument count in the signature of a symbol	Ben Bridle	2025-10-14
\| \| \| \| \|	This is done by suffixing the argument count to the symbol name using the ':' separator character, which can't normally be used in a symbol.
*	Update wording of syntactic and semantic errors	Ben Bridle	2025-10-14
\|
*	Rename labels/sublabels to global/local labels	Ben Bridle	2025-10-13
\| \| \| \|	This terminology has been used in the manual for a long time now.
*	Fix initial indentation when printing semantic tokens	Ben Bridle	2025-10-13
\| \| \| \| \| \|	Al non-macro-definition semantic tokens were being printed starting from indentation level zero, instead of from the provided indentation value.
*	Add a specific error for nested macro definitions	Ben Bridle	2025-10-13
\| \| \| \| \| \|	This case was previously being reported with the message "The macro definition was not terminated, add a ';' character to terminate", which was not very helpful.
*	Allow label references in pinned address calculations	Ben Bridle	2025-04-26
\| \| \| \| \| \| \| \| \| \| \| \| \| \| \| \|	This is a relaxation of the rule where a label reference could not be used in any context that could change the length of an assembled program. We implement this in the bytecode stage by naively calculating an initial address for each label as before. If a pinned address is calculated from a label reference, some of the calculated addresses could be incorrect. We then attempt to run the bytecode stage, which will calculate a more accurate address for each label based on how pinned addresses are calculated. If the address of any label was changed by running this stage, we re-run the stage up to three more times until all labels stabilise. If the labels fail to stabilise, we return an error.
*	Implement first-class string literals	Ben Bridle	2025-04-26
\| \| \| \| \| \| \| \| \| \| \|	This feature promotes strings to a first-class type in the language. If a string is passed to an invocation via the new string-type argument, the string will be passed as a whole value. String arguments can still be passed to an invocation via an integer-type argument, in which case they'll be broken apart into individual characters with the macro being invoked once per character. String-type macro arguments are declared like "name".
*	Fix misleading error message	Ben Bridle	2025-04-26
\| \| \| \| \| \|	The code here is really expecting a block value, but has received an integer value. This catches the case where an integer value is incorrectly included in a macro definition body alongside block tokens.
*	Allow a macro to invoke itself safely	Ben Bridle	2025-04-18
\| \| \| \| \| \| \|	A macro can now invoke itself if the invocation is inside a conditional block that will eventually return false. The assembler stack can still overflow if the macro recurses too deeply, or if a macro calls itself without a conditional block.
*	Allow a macro to invoke itself	Ben Bridle	2025-04-18
\| \| \| \| \|	This will currently cause the assembler to hang in all situations where it is used.
*	Implement <len> operator for expressions	Ben Bridle	2025-04-18
\| \| \| \| \|	The <len> operator returns the width of an integer in bits, using the same calculation as for packing an integer into a bit field.
*	Implement octal literals	Ben Bridle	2025-04-17
\| \| \| \|	Octal literals use the prefix '0o'.
*	Fix width checks for negative integers	Ben Bridle	2025-04-12
\| \| \| \| \| \| \| \| \| \| \| \|	The width of a negative integer was previously being counted in the same way as for a positive integer, by inverting the bits to make it a positive integer and then finding the placement of the highest-order 1 bit. The actual width of a negative integer will always be one greater than this value however, because the highest-order 1 bit of an inverted negative integer will always have directly above it a significant 0 bit used as the sign bit.
*	Implement negative integer literals	Ben Bridle	2025-04-12
\| \| \| \| \|	Negative literals take the forms -29, -0x1D, and -0b11101 for decimal, hexadecimal, and binary.
*	Report token without prefix in invalid literal errors	Ben Bridle	2025-03-23
\| \| \| \| \| \| \|	Previously, the token quoted in the error message for an invalid literal error included the radix prefix. This is already visible in the highlighted source report, and implied by the named radix in the error message.
*	Update assembler dependency	Ben Bridle	2025-03-18
\| \| \| \|	torque-asm now uses the Compiler type provided by the assembler library.
*	Tidy cody	Ben Bridle	2025-03-18
\| \| \| \| \| \| \|	- Rename tokens field on SyntacticMacroDefinition to body - Rename push_err! macro to err! - Create macros for character-matching logic in syntactic parsing - Replace .as_bytes().to_vec() pattern with .into_bytes()
*	Rewrite entire assembler	Ben Bridle	2025-03-11
	The language is now more general, the code is better structured, error reporting is more detailed, and many new language features have been implemented: - conditional blocks - first-class strings - more expression operators - binary literals - negative values - invocations in constant expressions