Imperative Programming - Algol

January 21, 2019 6 minute read

Haskell is a dialect of Algol!

Modernized Algol (MA) - revised by Robert Harper

MA = PCF with a modality that distinguishes expressions from commands.

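$$
\begin{array}{lrcl}
\text{types} & τ & = & \text{things in PCF} \mid \mathsf{cmd}(τ) \\
\text{expressions} & e & = & \text{things in PCF} \mid \mathsf{cmd}(m) \\
\text{commands} & m & = & \mathsf{ret}(e) \\
 & & \mid & \mathsf{bnd}(e, x.m) \\
 & & \mid & \mathsf{dcl}(e, a.m) \quad (\mathsf{dcl}\ a := e\ \mathsf{in}\ m) \\
 & & \mid & \mathsf{set}[a](e) \quad (a := e) \\
 & & \mid & \mathsf{get}[a] \quad (\mathsf{get}\ a)
\end{array}
$$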

The $a$’s are assignables, not variables! The $x$’s are variables! An assignable is not a form of expression of its type. An assignable is a location in memory whose contents have a type, which we write $a_{1} \sim τ_{1}$ (not $a_{1} : τ_{1}$). Assignables are really indices to a family of $\mathsf{get}$ and $\mathsf{set}$ operations; they are not values, they are not passed as arguments, and they are not evaluated. They are just indices, and $\mathsf{get}[a]$ and $\mathsf{set}[a]$ are just the capabilities to get and set $a$. We can define a reference, i.e. &a in a real programming language, as a pair $\langle \mathsf{get}_{a}, \mathsf{set}_{a} \rangle$, which is just a thing that gives you access to the capabilities of getting and setting $a$.
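To make the "assignables are indices" idea concrete, here is a minimal Python sketch. It is only an illustration under assumptions of mine: memory is a dictionary, the names `memory`, `get`, `set_` and `ref_a` are hypothetical, and contents are plain integers.

```python
# Memory maps assignable names (the indices) to their current contents.
memory = {"a": 0}

def get(name):
    """The get capability indexed by the assignable `name`."""
    return lambda: memory[name]

def set_(name):
    """The set capability indexed by the assignable `name`."""
    def run(value):
        memory[name] = value
        return value  # like set[a](e), the command returns the value it stored
    return run

# A reference to `a` is just the pair of capabilities for `a`.
ref_a = (get("a"), set_("a"))

getter, setter = ref_a
setter(41)
setter(getter() + 1)
print(getter())  # 42
```

Note that the name `"a"` is never itself a value of the stored type; it only selects which pair of capabilities you get.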

Types and expressions are “pure” (they don’t depend on memory), whereas commands are “impure”.

Statics

$$Γ ⊢_{Σ} e : τ$$

where $Σ$ gives the types of the assignables, i.e. $a_{1} \sim τ_{1}, \dots, a_{n} \sim τ_{n}$.

$$Γ ⊢_{Σ} m \sim: τ$$

This means $m$ is a well-formed command whose return value has type $τ$.

$$\frac{Γ ⊢_{Σ} m \sim: τ}{Γ ⊢_{Σ} \mathsf{cmd}(m) : \mathsf{cmd}(τ)}$$

The above is the introduction rule for $\mathsf{cmd}$.

$$\frac{Γ ⊢_{Σ} e : \mathsf{cmd}(τ) \quad Γ, x : τ ⊢_{Σ} m' \sim: τ'}{Γ ⊢_{Σ} \mathsf{bnd}(e, x.m') \sim: τ'}$$

The above is the elimination rule for $\mathsf{cmd}$.

$$\frac{Γ ⊢_{Σ} e : τ}{Γ ⊢_{Σ} \mathsf{ret}(e) \sim: τ}$$

$$\frac{Γ ⊢_{Σ} e : τ \quad Γ ⊢_{Σ, a \sim τ} m' \sim: τ' \quad τ \;\mathsf{mobile} \quad τ' \;\mathsf{mobile}}{Γ ⊢_{Σ} \mathsf{dcl}(e, a.m') \sim: τ'}$$

This rule declares an assignable: I declare $a$, initialize it to $e$, and run the command $m'$ in its scope. A type is $\mathsf{mobile}$ if a value of that type can be pulled out of the scope of an assignable. Examples of mobile types: eager natural numbers, and pairs/sums of mobile types. Examples of types that are not mobile: functions (because the body of a function can use assignables even if its ultimate return value is a $\mathsf{nat}$) and commands. This will be explained in a later section, once we have the dynamics.
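The mobility judgment described above can be sketched as a recursive check over a tiny type syntax. This is a rough Python illustration, assuming eager naturals; the class names and `is_mobile` are hypothetical, not part of MA.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Nat:
    pass

@dataclass(frozen=True)
class Prod:
    left: object
    right: object

@dataclass(frozen=True)
class Sum:
    left: object
    right: object

@dataclass(frozen=True)
class Arrow:
    dom: object
    cod: object

@dataclass(frozen=True)
class Cmd:
    result: object

def is_mobile(ty) -> bool:
    """Eager nat is mobile; pairs and sums are mobile when both parts are."""
    if isinstance(ty, Nat):
        return True
    if isinstance(ty, (Prod, Sum)):
        return is_mobile(ty.left) and is_mobile(ty.right)
    # Functions and commands can still mention assignables, so they are not mobile.
    return False

print(is_mobile(Prod(Nat(), Sum(Nat(), Nat()))))  # True
print(is_mobile(Arrow(Nat(), Nat())))             # False
```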

$$\frac{}{Γ ⊢_{Σ, a \sim τ} \mathsf{get}[a] \sim: τ}$$

$$\frac{Γ ⊢_{Σ, a \sim τ} e : τ}{Γ ⊢_{Σ, a \sim τ} \mathsf{set}[a](e) \sim: τ}$$

Exercise: We have the following [pre-]monad defined:

$$T(a) : \mathsf{type} \qquad r : a \to T(a) \qquad b : T(a) \to (a \to T(b)) \to T(b)$$

Show that you can define $r$ and $b$ for $T(a) = \mathsf{cmd}(a)$.

The important fact is that you start with the modality; a pre-monad can then be formed from it.
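One candidate answer to the exercise is $r ≜ λx.\,\mathsf{cmd}(\mathsf{ret}(x))$ and $b ≜ λc.\,λf.\,\mathsf{cmd}(\mathsf{bnd}(c, x.\mathsf{bnd}(f(x), y.\mathsf{ret}(y))))$. The Python sketch below mirrors that shape under an assumption of mine: an encapsulated command is modeled as a function from a memory dictionary to a result. The helpers `get` and `set_` are hypothetical stand-ins for $\mathsf{cmd}(\mathsf{get}[a])$ and $\mathsf{cmd}(\mathsf{set}[a](v))$.

```python
def r(x):
    """r : a -> T(a). Plays the role of cmd(ret(x))."""
    return lambda memory: x

def b(c, f):
    """b : T(a) -> (a -> T(b)) -> T(b). Run c, feed its result to f, run that."""
    def run(memory):
        x = c(memory)
        return f(x)(memory)
    return run

def get(a):
    return lambda memory: memory[a]

def set_(a, v):
    def run(memory):
        memory[a] = v
        return v
    return run

memory = {"a": 20}
# Read a, store a + 1, then return twice the stored value.
program = b(get("a"), lambda x: b(set_("a", x + 1), lambda y: r(2 * y)))
result = program(memory)
print(result, memory)  # 42 {'a': 21}
```

Nothing happens when `program` is built; effects only occur when it is run against a memory, which is the point of keeping $\mathsf{cmd}(m)$ a pure value.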

Dynamics

$$e \;\mathsf{val}_{Σ}$$

$$e \mapsto_{Σ} e'$$

$μ \parallel m$ means a command $m$ in memory $μ$. The notation is designed to connect with concurrency. The idea is that we have a concurrent composition of a main program $m$ running simultaneously with threads that govern the contents of each location.

$$μ \parallel m \;\mathsf{final}_{Σ}$$

$$μ \parallel m \mapsto_{Σ} μ' \parallel m'$$

$$\frac{}{\mathsf{cmd}(m) \;\mathsf{val}_{Σ}}$$

$$\frac{e \mapsto_{Σ} e'}{μ \parallel \mathsf{ret}(e) \mapsto_{Σ} μ \parallel \mathsf{ret}(e')}$$

$$\frac{e \;\mathsf{val}_{Σ}}{μ \parallel \mathsf{ret}(e) \;\mathsf{final}_{Σ}}$$

$$\frac{e \mapsto_{Σ} e'}{μ \parallel \mathsf{bnd}(e, x.m_{1}) \mapsto_{Σ} μ \parallel \mathsf{bnd}(e', x.m_{1})}$$

$$\frac{μ \parallel m \mapsto_{Σ} μ' \parallel m'}{μ \parallel \mathsf{bnd}(\mathsf{cmd}(m), x.m_{1}) \mapsto_{Σ} μ' \parallel \mathsf{bnd}(\mathsf{cmd}(m'), x.m_{1})}$$

$$\frac{e \;\mathsf{val}_{Σ}}{μ \parallel \mathsf{bnd}(\mathsf{cmd}(\mathsf{ret}(e)), x.m_{1}) \mapsto_{Σ} μ \parallel [e / x] m_{1}}$$

$$\frac{}{μ \otimes a ↪ e \parallel \mathsf{get}[a] \mapsto_{Σ, a \sim τ} μ \otimes a ↪ e \parallel \mathsf{ret}(e)}$$

Exercise: define the rules for $\mathsf{set}$.
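One possible answer, following the pattern of the $\mathsf{ret}$ and $\mathsf{get}$ rules: first evaluate the argument, then overwrite the contents of $a$ and return the stored value. Here $e_{0}$ is just a name I introduce for the old contents of $a$.

$$\frac{e \mapsto_{Σ, a \sim τ} e'}{μ \parallel \mathsf{set}[a](e) \mapsto_{Σ, a \sim τ} μ \parallel \mathsf{set}[a](e')}$$

$$\frac{e \;\mathsf{val}_{Σ, a \sim τ}}{μ \otimes a ↪ e_{0} \parallel \mathsf{set}[a](e) \mapsto_{Σ, a \sim τ} μ \otimes a ↪ e \parallel \mathsf{ret}(e)}$$

Returning $\mathsf{ret}(e)$ agrees with the statics, where $\mathsf{set}[a](e)$ is a command of type $τ$.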

We have something called the stack discipline, invented by Dijkstra. The idea is that the assignables in Algol are stack allocated. When I do a $\mathsf{dcl}(e, a.m')$, I declare an assignable whose scope is $m'$; I can get it and set it in $m'$. When $m'$ is finished, it’s deallocated.

$$\frac{e \;\mathsf{val}_{Σ} \quad μ \otimes a ↪ e \parallel m \mapsto_{Σ, a \sim τ} μ' \otimes a ↪ e' \parallel m'}{μ \parallel \mathsf{dcl}(e, a.m) \mapsto_{Σ} μ' \parallel \mathsf{dcl}(e', a.m')}$$

To rephrase the above in English, start from the lower left: I have a value $e$, which is the initializer of the assignable $a$, and I want to execute $m$ in the presence of that assignable. What can I do? I go above the line: extend the memory $μ$ with $a$ having the contents $e$, and take one step of $m$. In the process, that step may modify some outer assignables (turning $μ$ into $μ'$), may modify the inner assignable (giving $a$ the contents $e'$ instead of the original $e$, for instance through a $\mathsf{set}$), and leaves a new command $m'$.

The resulting state, on the lower right, is the updated outer memory $μ'$ together with a declaration whose initializer is no longer what it was ($e$) but what it became as a result of the execution step ($e'$), and whose body is $m'$. Execution then proceeds from there.

$$\frac{e \;\mathsf{val}_{Σ} \quad e' \;\mathsf{val}_{Σ}}{μ \parallel \mathsf{dcl}(e, a.\mathsf{ret}(e')) \mapsto_{Σ} μ \parallel \mathsf{ret}(e')}$$

To rephrase the above in English, start from the lower left: I declare an assignable $a$, initialize it to $e$, and the body has finished with $\mathsf{ret}(e')$. What do I do next? The idea of the stack discipline is: when you finish executing the body of a $\mathsf{dcl}$, you get out of it, and $a$ is gone!
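The two $\mathsf{dcl}$ rules, together with $\mathsf{get}$, $\mathsf{set}$ and $\mathsf{bnd}$, can be sketched as a small-step interpreter. This is a rough Python illustration under simplifying assumptions of mine: expressions are already values (plain integers), commands are tuples, the body of a $\mathsf{bnd}$ is a Python function, and declared names are distinct from names already in memory.

```python
def step(mem, m):
    """One transition  mem || m  |->  mem' || m'."""
    tag = m[0]
    if tag == "get":                      # get[a] returns the contents of a
        _, a = m
        return mem, ("ret", mem[a])
    if tag == "set":                      # set[a](v) overwrites a and returns v
        _, a, v = m
        return {**mem, a: v}, ("ret", v)
    if tag == "bnd":
        _, m1, body = m
        if m1[0] == "ret":                # bnd(cmd(ret(v)), x.m) |-> [v/x]m
            return mem, body(m1[1])
        mem2, m1_next = step(mem, m1)     # otherwise step the encapsulated command
        return mem2, ("bnd", m1_next, body)
    if tag == "dcl":
        _, v, a, body = m
        if body[0] == "ret":              # body finished: leave the scope of a
            return mem, body
        # Step the body in memory extended with a, then split a back out.
        inner, body_next = step({**mem, a: v}, body)
        outer = {k: x for k, x in inner.items() if k != a}
        return outer, ("dcl", inner[a], a, body_next)
    raise ValueError(f"no transition for {m!r}")

def run(mem, m):
    """Iterate step until the command is a finished ret."""
    while m[0] != "ret":
        mem, m = step(mem, m)
    return mem, m[1]

# dcl a := 1 in (x <- get a; _ <- a := x + 41; get a)
prog = ("dcl", 1, "a",
        ("bnd", ("get", "a"), lambda x:
         ("bnd", ("set", "a", x + 41), lambda _:
          ("get", "a"))))

final_mem, result = run({"b": 7}, prog)
print(result, final_mem)  # 42 {'b': 7}
```

After every step the current contents of `a` are stored back into the `dcl` node as its new initializer, and `a` never appears in the outer memory, which is the stack discipline.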

Some issues with type safety

In the rule above, the lower left requires $e'$ to have type $τ'$ in the context of $Σ, a \sim τ$ (where $e$ has type $τ$), but the lower right requires $e'$ to have type $τ'$ under $Σ$ alone. In traditional Algol we can only return a $\mathsf{nat}$, so type safety needs the following: a numeric value that type checks with the assignable present ($Σ, a \sim τ$) must also type check with the assignable absent (only $Σ$). That holds only under some conditions!

So here is the question: Under what condition is the following statement true?

If $e$ is a value of type $\mathsf{nat}$ that type checks with the assignable $a$, then it also type checks at type $\mathsf{nat}$ without $a$.

Answer: only if successors are evaluated eagerly! If successors are lazy, the argument of a successor is an unevaluated expression (it is not necessarily a value), so it can still mention $a$. Algol only makes sense if the successor constructor is eager!

Here is a (clever) example of a successor value of type $\mathsf{nat}$ that doesn’t type check outside the scope of $a$ if successors are lazy. I want the successor of something, an expression, that uses an assignable $a$:

$$\mathsf{s}((λ x : \mathsf{cmd}(\mathsf{nat}).\ \mathsf{z})(\mathsf{cmd}(\mathsf{get}[a])))$$

Explanation: we have a constant function that takes an encapsulated command and returns zero ($\mathsf{z}$). To type check the argument $\mathsf{cmd}(\mathsf{get}[a])$ we eventually have to type check $a$, even though the argument doesn’t affect the ultimate return value of the function, because the function is constant!

This is a perfect example of the interaction between language features. We might think that whatever happens at the PCF level, lazy or eager, doesn’t matter to the command level, because the two are separated. This is wrong! The hard part of language design is making everything fit together in a coherent way. This is why language design is hard!

Traditional Algol makes things work by restricting the return value to $\mathsf{nat}$ and making $\mathsf{nat}$ eager. A better idea is to demand that the result type of $\mathsf{dcl}$ be $\mathsf{mobile}$! We also need to forbid assigning values that aren’t $\mathsf{mobile}$, because the contents of an assignable can be read back out and returned. This is the explanation promised in the statics, where the $\mathsf{dcl}$ rule requires both $τ$ and $τ'$ to be mobile.
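The mobility restriction can be seen in a small Python sketch of a stack-disciplined declaration. This is only an analogy under assumptions of mine: `dcl` here is a hypothetical helper that allocates a dictionary entry, runs a body, and deallocates the entry, and a Python closure stands in for a function value.

```python
def dcl(memory, a, init, body):
    """Stack-disciplined declaration: allocate a, run body, deallocate a."""
    memory[a] = init
    try:
        return body(memory)
    finally:
        del memory[a]

memory = {}

# A nat result is mobile: it is a finished value that no longer mentions a.
n = dcl(memory, "a", 3, lambda mem: mem["a"] + 1)

# A function result is not mobile: its body still wants to read a.
f = dcl(memory, "a", 3, lambda mem: (lambda: mem["a"] + 1))

try:
    f()
    escaped_ok = True
except KeyError:
    escaped_ok = False  # a was deallocated when the dcl finished

print(n, escaped_ok)  # 4 False
```

The statics with $\mathsf{mobile}$ rules out the second declaration before it ever runs, instead of failing at the call to `f`.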

MA - Scoped Assignables

References

We can define $\mathsf{ref}(τ)$ (immobile), as we mentioned briefly before, in the following ways.

Concretely:

$$\mathsf{ref}(τ) ≜ \mathsf{cmd}(τ) \times (τ \to \mathsf{cmd}(τ))$$

where $\mathsf{cmd}(τ)$ is just the getter and $τ \to \mathsf{cmd}(τ)$ is the setter.

Then we can have:

$$\mathsf{getref}(\langle g, s \rangle) \mapsto g \qquad \mathsf{setref}(\langle g, s \rangle) \mapsto s$$

The problem is that the type $\mathsf{ref}(τ)$ only says there is a getter and a setter; it doesn’t say the getter and setter are for the same assignable! So you can have a getter for $a$ paired with a setter for $b$, and you won’t know the difference. So we can define it in another way.
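The mismatch is easy to reproduce. In the Python sketch below (an illustration with hypothetical names; commands are modeled as functions from a memory dictionary to a result), the pairs `honest` and `mixed` have exactly the same type, yet `mixed` reads `a` and writes `b`.

```python
from typing import Callable, Dict, Tuple

Memory = Dict[str, int]
Command = Callable[[Memory], int]
# ref(nat) as a pair: (getter : cmd(nat), setter : nat -> cmd(nat)).
Ref = Tuple[Command, Callable[[int], Command]]

def getter(a: str) -> Command:
    return lambda mem: mem[a]

def setter(a: str) -> Callable[[int], Command]:
    def with_value(v: int) -> Command:
        def run(mem: Memory) -> int:
            mem[a] = v
            return v
        return run
    return with_value

honest: Ref = (getter("a"), setter("a"))
mixed: Ref = (getter("a"), setter("b"))  # same type, different assignables

mem = {"a": 1, "b": 2}
mixed[1](10)(mem)        # "set" through the reference: writes b
result = mixed[0](mem)   # "get" through the reference: reads a
print(result, mem)  # 1 {'a': 1, 'b': 10}
```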

Abstractly:

The type is still $\mathsf{ref}(τ)$, but we have the following elimination rules:

$$\mathsf{getref}(\&a) \mapsto \mathsf{get}[a] \qquad \mathsf{setref}(\&a, e) \mapsto \mathsf{set}[a](e)$$

MA - (Scope) Free Assignables

All types are mobile. Previously the stack discipline caused us trouble with mobility, and we went back and added mobility conditions to the statics. But there is another way to fix this: change the statics so that every type is mobile, and make the dynamics fit that. Here are the new dynamics (also called scope-free dynamics; in other words, assignables are heap allocated).

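$$γ\, Σ \{ μ \parallel m \} \mapsto γ\, Σ' \{ μ' \parallel m' \}$$

Note that the above transition is unlabelled: the signature $Σ$ is now part of the state instead of a subscript on the arrow.

$$\frac{e \;\mathsf{val}_{Σ}}{γ\, Σ \{ μ \parallel \mathsf{dcl}[τ](e, a.m) \} \mapsto γ\, Σ, a \sim τ \{ μ \otimes a ↪ e \parallel m \}}$$

$$\frac{e \;\mathsf{val}_{Σ}}{γ\, Σ \{ μ \parallel \mathsf{ret}(e) \} \;\mathsf{final}}$$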
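For contrast with the stack discipline, here is a minimal Python sketch of scope-free declaration. It rests on assumptions of mine: memory is a dictionary, `dcl` and `make_counter` are hypothetical helpers, and fresh names come from a counter so that the set of allocated assignables only ever grows.

```python
import itertools

_fresh = itertools.count()

def dcl(memory, init, body):
    """Scope-free declaration: allocate a fresh assignable and never free it."""
    a = f"a{next(_fresh)}"   # fresh name, so earlier assignables are untouched
    memory[a] = init
    return body(memory, a)

def make_counter(mem, a):
    """Return a function whose body keeps using the assignable a."""
    def tick():
        mem[a] = mem[a] + 1
        return mem[a]
    return tick

memory = {}
tick = dcl(memory, 0, make_counter)   # a function escapes the declaration
first, second = tick(), tick()
print(first, second, memory)  # 1 2 {'a0': 2}
```

Because the assignable outlives the declaration, returning a function that uses it is harmless, which is why every type can be treated as mobile here.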

FPC (PCF with recursive types $\mathsf{rec}$) + the commands above with free assignables $\approx$ Haskell :)

Issues

In Algol, we have a clear distinction between expressions and commands; they are completely separated. There are many benefits to doing that. But in the “real world”, the separation costs us a lot of benign effects. For example, for efficiency, we lose:

Yanxi Chen
