Volodymyr Gubarkov

Stand With Ukraine

Adding parameterized goals to makesure

March 2023

What is makesure?

Makesure is a task/command runner that I am developing. It is somewhat similar to the well-known make tool, but without most of its idiosyncrasies (and with a couple of unique features!).

The need for parameterized goals

When you write large build files, you inevitably start getting some copy-paste in there. Typical use case: you want to test your code against multiple versions of some software / programming language / compiler.

To illustrate the point, let’s imagine that you’ve written a Python script and want to make sure it runs correctly with both Python 2 and Python 3 (this example is a bit artificial, but good for explaining).

@goal pythons_tested
@depends_on python2_tested python3_tested

@goal python2_installed
@depends_on soft_folder_created
@reached_if [[ -x ./soft/python2 ]]
  wget "https://software-repository/python2.tar.gz"
  tar xzvf python2.tar.gz --directory ./soft
  rm python2.tar.gz 
 
@goal python2_tested 
@depends_on python2_installed
  if ./soft/python2 script.py
  then
    echo "SUCCESS"
  else
    echo "FAIL"
    exit 1
  fi

@goal python3_installed
@depends_on soft_folder_created
@reached_if [[ -x ./soft/python3 ]]
  wget "https://software-repository/python3.tar.gz"
  tar xzvf python3.tar.gz --directory ./soft
  rm python3.tar.gz 
 
@goal python3_tested 
@depends_on python3_installed
  if ./soft/python3 script.py
  then
    echo "SUCCESS"
  else
    echo "FAIL"
    exit 1
  fi
  
@goal soft_folder_created
@reached_if [[ -d soft ]]
  mkdir soft

I guess you see the problem.

Parameterized goals to the rescue!

@goal pythons_tested
@depends_on python_tested @args '2'
@depends_on python_tested @args '3'

@goal python_installed @params VERSION
@depends_on soft_folder_created
@reached_if [[ -x ./soft/python$VERSION ]]
  local tgz="python$VERSION.tar.gz"
  wget "https://software-repository/$tgz"
  tar xzvf "$tgz" --directory ./soft
  rm "$tgz" 
 
@goal python_tested @params VERSION 
@depends_on python_installed @args VERSION
  if ./soft/python$VERSION script.py
  then
    echo "SUCCESS"
  else
    echo "FAIL"
    exit 1
  fi

@goal soft_folder_created
@reached_if [[ -d soft ]]
  mkdir soft

Much better!


For a more realistic example let’s consider the build file of my project intellij-awk - “the missing IntelliJ IDEA language support plugin for AWK”. There, starting line 119 I had a rather repetitive set of goals. They were responsible for downloading the HTML documentation files from the GNU Awk online manual, processing them, and compiling the resulting file std.awk. This file is then used to provide documentation popups inside the IDE:

documentation popup inside IDE

And this is what the same goals look like after refactoring them using goal parameterization.

Pretty impressive, huh? To make it more prominent just check the diff of the change.


I want to mention another case for parameterized goals. You see, makesure happened to have a very simple yet powerful facility @reached_if. A goal can declare a condition that it has already been reached. In this case, the goal body (the corresponding shell script) will no longer be executed. This simple mechanism makes it very convenient and declarative to express the idempotent logic of work. In other words, to speed up the build, since what has already been done will not be repeated. This feature was inspired by the ideas from Ansible.

No wonder people started using Makesure as a very simple Ansible replacement. But at that time it lacked parameterized goals, and so they again suffered from repetitive code with no easy way to reuse.


Next, I want to talk a bit about how I designed this function and what principles I followed.

Design principles

I design the tool very minimalistic. In accordance with the principle worse is better, all else being equal, I prefer not to add a feature to the project than to add it. In other words, the necessity of a feature must be absolutely outstanding to justify its addition.

In general, when developing a product or library, it is very important to implement the minimum possible functionality, and exactly the one that users need now. Quite often, developers are tempted to add some obvious improvements and features that are not critical and/or are redundant, simply because it seems simple. Moreover, for the same reason, it is often useful to explicitly exclude certain features/use cases. Because you can always add them later if there is an explicit request from users. Removing an unsuccessful feature can be much more problematic.

So I used to have this piece in the documentation:

Omitted features

It appears that actually what we didn’t want to support was calling goals with arguments from the CLI. So this part has been slightly reformulated.

By themselves, parameterized goals turned out to be a really necessary feature, as shown in the examples above.

Design considerations

I resisted adding this feature for a long time. First, because of the increasing complexity that would not be needed in a majority of typical makesure usage scenarios. But mostly because it’s tricky to do while preserving the declarative semantics of dependencies. You see, dependency of one goal on another is fundamentally different from a function call.

Why is that? You see, the dependency tree is resolved by makesure before running the goals! This is why, for example, it is possible to report a cycle in dependencies as an error, rather than falling into an infinite execution loop. In addition, the run-only-once semantics for reaching the goals:

@goal a
@depends_on b c

@goal b
@depends_on c

@goal c
  echo "Reaching c ..." # must be printed only once

But it wasn’t impossible. For example, just does have parameterized goals.

I needed some time to think about the problem in depth. Adding the feature needed lots of thorough considerations to:

Implementation process

The idea of parameterized goals lived in my head for quite some time. The first design attempt was this. But in this form, it turned out to be a dead end (simply, I didn’t like the result), so it was discarded.

A lot of time has passed. Russia started an aggressive and genocidal full scale war against my country 🇺🇦Ukraine.

I started the new design by drafting the basic design points in the document.

Obviously, I started with designing the syntax and quickly came up with a solution with two complementary keywords @params and @args (this was inspired by async + await from JavaScript):

@goal greet @params H W
  echo "$H $W!"
  
@goal default
@depends_on greet @args 'hello' 'world' 
@depends_on greet @args 'hi' 'there' 

I also considered the syntax:

@goal greet(H, W)
  echo "$H $W!"
  
@goal default
@depends_on greet('hello', 'world') 
@depends_on greet('hi', 'there') 

Although the latter is more natural for a programmer, I settled on the former for the following reasons:

  1. It’s much easier to parse. Remember, “worse is better”.
  2. The Makesurefile syntax is specifically designed to be a valid shell syntax (although the semantics may differ). This gives free syntax highlighting in the IDEs and on GitHub.

Makesurefile highlighting in IDE

Further, in the same design document I analyzed other aspects of the feature in the form of Q and A. Mostly I strove to answer “No.” to most of them in order to keep the feature as limited as possible, but at the same time useful for existing use cases. Some or all of them may be added later if necessary. But it’s absolutely important to start with something really simple but practical.

Finally I thought through the interaction of a new feature with existing ones.


Implementation-wise I had an idea how it should be done. It is better to explain with an example. Given this parameterized goal and its usage:

@goal greet @params WHO
  echo "Hello $WHO!"
  
@goal greet_all
@depends_on greet @args 'world' 
@depends_on greet @args 'Jane' 
@depends_on greet @args 'John' 

we will have some pre-processing step before the execution to “materialize” (or in other words, de-parameterize the goals):

@goal 'greet@world'
  WHO=world
  echo "Hello $WHO!"
@goal 'greet@Jane'
  WHO=Jane
  echo "Hello $WHO!"
@goal 'greet@John'
  WHO=John
  echo "Hello $WHO!"

@goal greet_all
@depends_on 'greet@world' 
@depends_on 'greet@Jane' 
@depends_on 'greet@John'

You see, it’s the same logic, but no more parameterized goals. Instead, “materialized” goals are generated according to their usages. I believe the more scientific term for this is monomorphization.

This was a key step because it allowed the reuse of an existing execution model that could already handle the non-parameterized representation.

Since the problem was now very clear and self-contained, it was logical to solve it separately, apart from the main code. I did this in separate playground repository. Once this part was done and tested, it was integrated into the core of the tool.

Result

The parameterized goals feature has been implemented and delivered in the latest release 0.9.20.

As I mentioned above, the requirement to keep the tool small was very important. Before the addition the tool weighed around 20 KB. I expected this feature to be quite complex to implement, but I felt it should not take more than 5 KB to add it. Surprisingly, it became only 2 KB bigger! This is due to some minification tricks I’ve added in this release.

In addition to the refactoring opportunities mentioned above this also allowed to improve a bit the project’s own Makesurefile! Yes, we eat our own dog food.

It’s also interesting to note that the development and debugging of this new feature revealed a bug in the latest version of GAWK (already fixed, thanks to Arnold Robbins).


If you found this article interesting and would like to try the tool for your project, please find the installation instructions.


If you noticed a typo or have other feedback, please email me at xonixx@gmail.com