Advent of Prism: Part 13 - Calls (part 1)

Dec 13, 2023

This blog series is about how the prism Ruby parser works. If you’re new to the series, I recommend starting from the beginning. This post is about call nodes.

We are now halfway through this blog series, and it’s high time we talked about the heart of Ruby programming: calling methods. Method calls take many different forms. The next four posts will show all of their various incantations. For today, we’ll be giving you the lay of the land so that you know what you’re getting into.

Method calls in Ruby consistent of four things:

A receiver (implicit or explicit)
A name
The number of arguments (known as argc)
A set of flags

This is the hardest node to get right in the whole AST, because it is so foundational to Ruby’s interpretation. We want to provide as much information as possible in as concise a format as possible. We also want to ensure all basic¹ method calls can be succinctly handled by this one singular node.

We’ll go through each form of method call in turn.

Identifiers

When a plain identifier is found in Ruby, it first looks to see if that identifier maps to name of a visible local variable. If it does not, then it is considered a method call. It’s important to note that this determination happens at parse time, which means if someone were to define a local variable later the earlier identifier would still map to a method call. For example:

foo # calls method `foo`
foo = 1
foo # returns the local variable

Filling in our four fields for method calls looks like:

Receiver - the receiver is implicitly the current value of self
Name - the name is the same as the value of the identifier
argc - 0, there are no arguments
Flags - a special flag called variable_call which changes the error from a NoMethodError to a NameError

The AST for foo looks like:

Method names

If an identifier is found that cannot be a local variable, then it is always a method call. This happens, for example, when the identifier has a ! or ? suffix. For example:

foo?

In this case it does not check if a local is defined by that name. Filling in our fields:

Receiver - the receiver is implicitly the current value of self
Name - the name is the same as the value of the identifier
argc - 0, there are no arguments
Flags - none

The AST for foo? looks like:

Identifiers with parentheses

If an identifier is followed by parentheses, it becomes a method call and it is not checked against the local table. Note that the number of spaces following the identifier matters. foo() is a method call where the parentheses wrap an empty set of arguments. foo () is a method call where the first argument is nil (the equivalent of ()). For example:

foo = 1
foo() # method call to foo, even though foo is a local

Filling in our fields:

Receiver - the receiver is implicitly the current value of self
Name - the name is the same as the value of the identifier
argc - 0
Flags - none

Here is the AST for foo():

Identifiers with arguments

If an identifier is immediately followed by arguments then it becomes a method call and it is not checked against the local table. For example:

foo = 1
foo 1 # method call to foo, even though it is also a local

Arguments are represented with an ArgumentsNode. These nodes and all of the other possible arguments will be covered in a later post. Filling in our fields for the snippet above:

Receiver - the receiver is implicitly the current value of self
Name - the name is the same as the value of the identifier
argc - 1, the integer 1
Flags - none

The AST for foo 1 looks like:

It’s important to note that there is a large difference between foo(1) and foo 1 in terms of the parser, but not in turns of the compiler. foo 1 is a statement, and can only appear in places that support statements. foo(1) is an expression, and can appear in many more places. This is, of course, a simplification, but in general you can think of statements as exclusively being the children of StatementsNode nodes.

Identifiers with blocks

If an identifier is immediately followed by a block then it becomes a method call and it is not checked against the local table. For example:

foo = 1
foo {} # method call to foo, even though it is a local

Filling in our fields:

Receiver - the receiver is implicitly the current value of self
Name - the name is the same as the value of the identifier
argc - 0 (blocks do not count to argc)
Flags - none

The AST for foo {} looks like:

Don’t worry, we’ll cover blocks at a later date.

Constants

All of the above except for the plain identifiers also work with constants. Here are a couple of examples:

Foo?
Foo()
Foo 1
Foo {}

All four of these lines are method calls, even though they are using constants as their name. You don’t see this often (it violates every style guide I could find to define these methods) but there are a couple of Kernel methods that fit this pattern that you might see in the wild, namely: Kernel#Integer, Kernel#Float, Kernel#Rational, and Kernel#Complex.

Filling in our fields:

Receiver - the receiver is implicitly the current value of self
Name - the name is the same as the value of the constant
argc - 0
Flags - none

Here is the AST for Foo():

`call` shorthand

Identifiers and constants alike have a special shorthand for calling the call method. For example:

foo.()
Foo.()

Both of these are method calls to #call on their receivers. Filling in our fields:

Receiver - the left-hand side of the . operator
Name - call
argc - the number of arguments within the parentheses
Flags - none

Here is what the AST looks like for foo.():

Explicit receivers

All of the above examples have implicit receivers. If you explicitly specify a receiver, then it is known to be a method call. For example:

1.to_s

You can also use the :: operator to indicate a method call, though there are some nuances when you use method calls that look like constants. Filling in our fields for the above example:

Receiver - the expression on the left-hand side of the . operator
Name - the name immediately following the . operator
argc - 0 in this case
Flags - none

Here is the AST for 1.to_s:

You can also use the &. operator to indicate a method call. This operator is slightly different if its receiver resolves to nil. If it does, then nothing is evaluated on the right-hand side of the operator (including arguments!). For example:

foo&.bar

Receiver - the expression on the left-hand side of the &. operator
Name - the name immediately following the &. operator
argc - 0 in this case
Flags - safe_navigation

Here is what the AST looks like for foo&.bar:

Unary operators

Unary operators in Ruby trigger method calls. For example:

!foo
~foo
+foo
-foo

These are all method calls to the !, ~, +@, and -@ methods respectively. Filling in our fields:

Receiver - the expression on the right-hand side of the unary operator
Name - the name of the operator if it cannot be binary, otherwise the name of the operator and @
argc - 0
Flags - none

Here is what the AST looks like for !foo:

`not`

There is a special not keyword that breaks down to a method call as well. For example:

not foo

This is a method call to the ! method. The only difference is at the parser level: not foo is a statement, and not(foo) is an expression. The fields are the same as for the ! operator. The AST for not foo looks like:

Note that the only difference between this AST and the one for !foo is the message_loc field. This also illustrates the difference between the message and name methods on CallNode. name is derived, whereas message is the actual source of the method call.

Binary operators

Many binary operators in Ruby trigger method calls. For example:

1 + 2

This is a method call to the Integer#+ method. This form has implications for parsing (namely operator precedence) but once compiled do not have any impact on execution. For example, these are almost entirely equivalent to 1.+(2). Filling in our fields:

Receiver - the receiver is the expression on the left-hand side of the operator
Name - the name is the same as the operator
argc - this is always 1, the expression on the right-hand side of the operator
Flags - none

Here is the AST for 1 + 2:

Indexing

Indexing is a special form of method call. It is a method call to the [] method. For example:

foo[1]

Filling in our fields:

Receiver - the receiver is the expression on the left-hand side of the [] operator
Name - the name is always []
argc - this is the number of arguments inside the [] operator (which can be 0!)
Flags - none

Here is what the AST looks like for foo[1]:

Assignment

When a method call is immediately followed by a =, it changes the name of the method call by appending a = and appends the right-hand side of the operator as another argument. For example:

foo.bar = 1 # a method call to bar= with one argument
foo[:bar] = 1 # a method call to []= with two arguments

Filling in our fields:

Receiver - the receiver is the expression on the left-hand side of either the ., ::, &., or [] operators
Name - the name is the same as the call before the = was found, with = appended
argc - one more than the number of arguments before the = was found
Flags - none

Here is what the AST looks like for foo.bar = 1:

Note that because the [] form can have other arguments, this includes a block. For example:

foo.bar[&baz] = 1

This means to call bar= with a single argument (1) and a block that is the result of calling #to_proc on the result of the baz method call. We’ll cover this more when we get to blocks. For now, here is the AST for the above snippet:

Wrapping up

Wow, that was a lot of calls. This is just the first of many posts about method calls and believe it or not, these were the simplest. Suffice to say, there are many ways to express method calls in Ruby. Here are a couple of things to remember from today:

Method calls are everywhere in Ruby, even in some syntax you might not expect.
Constants aren’t necessarily constants — be sure to check for arguments!
All arithmetic operators are method calls, and subject to redefinition.

Tomorrow we’ll continue our exploration of method calls by looking at some of the most complicated ones as well as the super keyword. See you then!

“Basic” has a loose definition here. There are method calls that can be expressed in Ruby that are quite complicated. We split up the method calls into other nodes in those cases. You’ll see why when we get to them in the coming days. ↩

← Back to home

Advent of Prism: Part 13 - Calls (part 1)

Identifiers

Method names

Identifiers with parentheses

Identifiers with arguments

Identifiers with blocks

Constants

call shorthand

Explicit receivers

Safe navigation

Unary operators

not

Binary operators

Indexing

Assignment

Wrapping up

`call` shorthand

`not`