<h1>A Long Overdue Update on Swift 5.0 For Raspberry Pi Zero/1/2/3</h1>
<p><em>Umberto Raimondi (me@uraimo.com), published 2019-04-11 at <a href="https://www.uraimo.com/2019/04/11/A-long-overdue-update-on-swift-5-0-for-raspberry-pi-zero-1-2-3">uraimo.com</a></em></p>
<p><em>The current status of Swift 5.0 on ARM-based boards like the Raspberry Pis.</em></p>
<p><strong>Update 3/2020:</strong> <em>5.1.5 binaries <a href="https://github.com/uraimo/buildSwiftOnARM/releases/tag/5.1.5">are now available</a>.</em></p>
<p>It has been nearly a year since the last update on the state of Swift on the Raspberry Pi and other ARM boards so let’s start with a short recap of what happened since the release of Swift 4.1.2.</p>
<p>TLDR: After a bloodbath with 4.2, for the first time we got Swift 5.0 running with relatively minimal effort and you can already download it now, <a href="#prebuilt-binaries">see below</a>.</p>
<p>If you have kept an eye on updates to <a href="https://github.com/uraimo/buildSwiftOnARM">buildSwiftOnARM</a> you already know that by the end of October we had a working Swift 4.2, albeit with a small lingering bug.</p>
<p>This was the result of the work (a huge amount of it, really) done by <a href="https://github.com/kaiede">Kaiede</a> (you can buy him a cup of coffee <a href="https://ko-fi.com/kaiede">here</a> to show your appreciation), which led to multiple PRs opened for Swift and related libraries and took a few months to get to something that could be considered stable.</p>
<p>And it goes without saying that the community built around the <a href="https://slackpass.io/swift-arm">swift-arm</a> Slack channel was responsible for ironing out a plethora of other additional issues and testing on various SBCs.</p>
<p>A few additional small issues were found and fixed in the <a href="https://forums.swift.org/t/announcing-swift-4-2-2-and-monthly-swift-4-2-x-dot-releases-for-linux/20148">monthly 4.2.x Linux-only releases</a> and 4.2.4 now compiles without patches even on 32-bit ARM.</p>
<p>Considering what we are used to when a new major release comes out, getting 5.0 to compile was a breeze (in part thanks to some preemptive work done by Kaiede before the final release).</p>
<h2 id="current-status">Current Status</h2>
<p>Right now a few patches are required to compile Swift 5.0 but all the issues identified have a fix or a workaround.</p>
<p>A new temporary patch that limits the number of parallel jobs launched by SPM has been added by <a href="https://twitter.com/hpux735">hpux735</a>, since from this release SPM spawns a number of compiler threads equal to the number of CPU cores every time you build something. This behaviour, not really optimal for platforms with a small amount of RAM, cannot be changed yet, and so for now we are limiting the number of jobs used by SPM to 1. A proper fix should come in one of the future monthly Linux releases.</p>
<p>The Swift binaries provided below contain this hotfix if needed.</p>
<p>Feel free to open new issues on <a href="https://github.com/uraimo/buildSwiftOnARM/issues">buildSwiftOnARM</a> if you spot additional problems and crashes.</p>
<h2 id="prebuilt-binaries">Prebuilt binaries</h2>
<table>
<thead>
<tr>
<th>OS</th>
<th>Architecture</th>
<th>Boards</th>
<th>Download</th>
</tr>
</thead>
<tbody>
<tr>
<td>Raspbian Stretch</td>
<td>ARMv6</td>
<td>RaspberryPi Classic, All versions of Pi Zero</td>
<td><a href="https://github.com/uraimo/buildSwiftOnARM/releases/download/4.2.3/swift-4.2.3-RPi01-RaspbianStretch.tgz">4.2.3</a> - <a href="https://github.com/uraimo/buildSwiftOnARM/releases/download/5.0/swift-5.0-RPi01-RaspbianStretch.tgz">5.0</a></td>
</tr>
<tr>
<td>Raspbian Stretch</td>
<td>ARMv7</td>
<td>All versions of RaspberryPi 2/3</td>
<td><a href="https://github.com/uraimo/buildSwiftOnARM/releases/download/4.2.3/swift-4.2.3-RPi23-RaspbianStretch.tgz">4.2.3</a> - <a href="https://github.com/uraimo/buildSwiftOnARM/releases/download/5.0/swift-5.0-threads_hotfix-RPi23-RaspbianStretch.tgz">5.0-hotfix1</a></td>
</tr>
<tr>
<td>Ubuntu 16.04</td>
<td>ARMv7</td>
<td>All versions of RaspberryPi 2/3</td>
<td><a href="https://github.com/uraimo/buildSwiftOnARM/releases/download/4.2.3/swift-4.2.3-RPi23-Ubuntu1604.tgz">4.2.3</a> - <a href="https://github.com/uraimo/buildSwiftOnARM/releases/download/5.0/swift-5.0-threads_hotfix-RPi23-Ubuntu1604.tgz">5.0-hotfix1</a></td>
</tr>
<tr>
<td>Ubuntu 18.04</td>
<td>ARMv7</td>
<td>All versions of RaspberryPi 2/3</td>
<td><a href="https://github.com/uraimo/buildSwiftOnARM/releases/download/4.2.3/swift-4.2.3-RPi23-Ubuntu1804.tgz">4.2.3</a> - <a href="https://github.com/uraimo/buildSwiftOnARM/releases/download/5.0/swift-5.0-threads_hotfix-RPi23-Ubuntu1804.tgz">5.0-hotfix1</a></td>
</tr>
<tr>
<td>Ubuntu 18.10</td>
<td>ARMv7</td>
<td>All versions of RaspberryPi 2/3</td>
<td><a href="https://github.com/uraimo/buildSwiftOnARM/releases/download/4.2.3/swift-4.2.3-RPi23-Ubuntu1810.tgz">4.2.3</a> - <a href="https://github.com/uraimo/buildSwiftOnARM/releases/download/5.0/swift-5.0-threads_hotfix-RPi23-Ubuntu1810.tgz">5.0-hotfix1</a></td>
</tr>
</tbody>
</table>
<p>Just decompress the tgz archive and both <code class="inlinecode">swiftc</code> and <code class="inlinecode">swift</code> will be available under <code class="inlinecode">./usr/bin</code>. Use the former to compile Swift files directly, or the <code class="inlinecode">swift</code> binary to access additional tools like SPM (as usual, the REPL will not be available).</p>
<p>If you have patience and a few hours to spare, check out the updated <a href="https://github.com/uraimo/buildSwiftOnARM">buildSwiftOnARM</a> scripts to build your own 5.0 binaries.</p>
<p>And for those interested in IoT projects, <a href="https://github.com/uraimo/SwiftyGPIO">SwiftyGPIO</a> has been verified with Swift 5.0, which is now the recommended Swift release.</p>
<h2 id="swift-and-aarch64">Swift and AArch64</h2>
<p>AArch64 should not require additional patches now (other than small patches to select the proper linker on some releases of Ubuntu). <a href="https://twitter.com/futurejonesapps">Neil Jones</a> provides downloadable prebuilt binaries <a href="https://github.com/futurejones/swift-arm64/releases">here</a>, or the same binaries via more practical <a href="https://github.com/futurejones/swift-arm64">apt packages</a>.</p>
<h1>All about Concurrency in Swift - Part 2: The Future</h1>
<p><em>Published 2017-07-22 at <a href="https://www.uraimo.com/2017/07/22/all-about-concurrency-in-swift-2-the-future">uraimo.com</a></em></p>
<p><em>Discussions on how concurrency should be handled natively in Swift will soon start: new paradigms will be introduced and a swifty approach to concurrency will be defined. This article is an introduction to these topics; it could be useful if you plan to contribute to swift-evolution or even if you just want to experiment with something new using the recommended open source libraries.</em></p>
<p>In the <a href="/2017/05/07/all-about-concurrency-in-swift-1-the-present/">first part</a> of this series we have seen what you’ll have to deal with when working with concurrency, how to reason about some typical multithreading problems, what you’ll need to pay special attention to, and we’ve also looked into some of the traditional instruments we have at our disposal when writing Swift applications.</p>
<p>This time we’ll look into what the future could have in store for us in one of the next major releases of Swift.</p>
<p>We’ll analyze some popular paradigms and abstractions used in other programming languages highlighting their strengths and weaknesses, trying to envision how and if they could fit in the overall design of the Swift language.</p>
<p><img src="/imgs/concurr2.jpg" srcset="/imgs/concurr2@2x.jpg 2x" alt="A future" /></p>
<p>This article should give you the introduction you’ll need to take part in the discussions on <em>swift-evolution</em> that will revolve around introducing native concurrency functionalities in a future release of Swift and give you a few pointers to start experimenting with new paradigms using the open source libraries that are already available.</p>
<p>Some of these paradigms will still tackle directly the problems related to sharing resources in a multithreaded environment, while others will address the same underlying issues proposing a better way to handle asynchronous tasks guaranteeing that each one will operate without external side effects.</p>
<p>Most languages provide a basic set of common concurrency primitives you’ll find everywhere (locks, semaphores, conditions, etc…) but then select a few specific concurrency paradigms to define their own idiomatic approach.</p>
<p>Some of the paradigms that will be described here could be suitable to be included in the Swift’s standard library sooner or later, while others will probably never be.</p>
<p>Each concurrency model has its supporters and detractors, and the community is usually very polarized. This article will not dwell on my personal preferences; I’ll try to give you a good overview of the most popular models, and I recommend you invest some time testing them to see how they fare in the real world, maybe by building a throwaway application.</p>
<p>I’ll make frequent references to other languages throughout the article and will include some code samples in languages you may not be familiar with, but don’t worry, I’ll explain syntax quirks when necessary.</p>
<h3 id="contents">Contents</h3>
<ul>
<li><a href="#memory-model-and-standard-library">Memory Model and Standard Library</a>
<ul>
<li><a href="#concurrent-data-structures">Concurrent Data Structures</a></li>
<li><a href="#stream-processing-and-parallelism">Stream Processing and Parallelism</a></li>
<li><a href="#coroutines-and-green-threads">Coroutines and Green Threads</a></li>
</ul>
</li>
<li><a href="#futures-and-promises">Futures and Promises</a></li>
<li><a href="#asyncawait">Async/Await</a></li>
<li><a href="#stm-software-transactional-memory">STM: Software Transactional Memory</a></li>
<li><a href="#the-actor-model">The Actor Model</a></li>
<li><a href="#communicating-sequential-processes-channels">Communicating Sequential Processes: Channels</a></li>
<li><a href="#swift-where-are-we-now">Swift: Where are we now</a></li>
<li><a href="#a-few-words-on-kotlin">A few words on Kotlin</a></li>
<li><a href="#closing-thoughts">Closing Thoughts</a></li>
</ul>
<h2 id="memory-model-and-standard-library">Memory Model and Standard Library</h2>
<p>In this section we’ll take a look at some areas of the Swift language and its runtime that could be improved from the point of view of concurrency without introducing new paradigms.</p>
<p>As described <a href="/2017/05/07/all-about-concurrency-in-swift-1-the-present/#language-guarantees">in the first part</a>, Swift does not provide substantial concurrency guarantees and it hasn’t really defined a <a href="https://en.wikipedia.org/wiki/Memory_model_(programming)">memory model</a> similar to what you could find elsewhere.</p>
<p>We know that global variables are initialized atomically, for example, but concurrent read/write or multiple writes to the same variable are still classified as undefined behaviour; the developer will have to guard those critical sections to avoid concurrent variable modifications.</p>
<p>The proposal <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0176-enforce-exclusive-access-to-memory.md">SE-0176</a> addresses this issue, proposing that modifications to a variable must be <em>exclusive</em> and should not happen at the same time as other accesses to that variable. Enforcing exclusivity would allow only reads to happen at the same time, while forcing writes to be performed in isolation, without other threads trying to read or modify the same variable halfway through the original write.</p>
<p>Exclusivity could be guaranteed statically during compilation, checking your code for possible invalid accesses to shared variables, or could be enforced dynamically while your program is running keeping track of the access state of your variables.</p>
<p>The proposal describes this problem and discusses what exclusivity entails for different types, but it’s also part of a bigger effort to make Swift’s handling of memory more safe and predictable, described in the <a href="https://github.com/apple/swift/blob/master/docs/OwnershipManifesto.md">Ownership Manifesto</a>.</p>
<p>I will not try to dissect this complex document here; if you are interested in knowing more, I recommend putting aside some time (and maybe grabbing a cup of coffee or tea) and slowly delving into it.</p>
<p>The functionalities described in the manifesto will likely lead to the definition of a memory model that, as described in part one of the series, is essential, among other things, to show how the compiler will be allowed to evolve, defining which new optimization strategies will be practicable and which will not.</p>
<p>Now, let’s consider how the data structures of the runtime could be improved and then we’ll talk about managing the data flow with streams and about some lightweight threading functionalities that could help to improve scalability.</p>
<h3 id="concurrent-data-structures">Concurrent Data Structures</h3>
<p>Some languages have a rich set of data structures included in the default runtime that provide both thread-safe implementations and faster unsafe variants.</p>
<p>The developer will then have to choose the right implementation each time, depending on how the object will be used.</p>
<p>Let’s take a quick look, just for this section, at Java, which more or less ten years ago extended its basic data structures to provide thread-safe variants.</p>
<p>This was part of a larger effort to give Java something more than the extremely basic concurrency functionalities it had at the time, an effort that resulted in the definition of the <a href="https://en.wikipedia.org/wiki/Java_concurrency">JSR-166</a> specification and its implementation in Java 5.</p>
<p>This extension to the language contained things like locks, semaphores and thread pools that you’ll find in libdispatch or Foundation but also concurrent collections, atomic objects and thread-safe queues that are not among the default data structures available from Swift. The majority of those new features were implemented in a lock-less fashion.</p>
<p>While you could implement a thread-safe queue on your own, or extend arrays or dictionaries to support concurrency, it could still make sense to include efficient implementations of some of these data structures in the standard library.</p>
<p>In addition to the expected lists, maps and sets that perform modifications atomically guaranteeing that all read operations will always complete correctly (using fine-grained locking) and that we’ll always be able to iterate on them even in presence of concurrent changes, Java also provides <em>Blocking Queues and Double Ended Queues</em>.</p>
<p>Blocking queues are fixed-size FIFO queues with additional methods that allow waiting for the queue to become non-empty when we want to remove a value, or waiting for the queue to have some free space when we are adding an element to a full queue. And these queues also have add and remove methods that fail if their preconditions are not met, either right away or after a specific amount of time.</p>
<p>This data structure is well suited to producer/consumer scenarios, where multiple producer threads generate data that consumer threads will process.</p>
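<p>To make the failing and timed variants concrete, here is a small sketch using Java’s <code class="inlinecode">ArrayBlockingQueue</code> (the <code class="inlinecode">TimedOps</code> class name is just demo scaffolding):</p>

```java
import java.util.concurrent.ArrayBlockingQueue;
import java.util.concurrent.BlockingQueue;
import java.util.concurrent.TimeUnit;

public class TimedOps {
    public static void main(String[] args) throws InterruptedException {
        BlockingQueue<String> q = new ArrayBlockingQueue<>(1); // capacity 1

        System.out.println(q.offer("a")); // true: there was room
        // Queue is full: give up after waiting 100ms instead of blocking forever
        System.out.println(q.offer("b", 100, TimeUnit.MILLISECONDS)); // false

        System.out.println(q.poll()); // "a"
        // Queue is empty: returns null after the timeout elapses
        System.out.println(q.poll(100, TimeUnit.MILLISECONDS)); // null
    }
}
```

<p>Compare this with <code class="inlinecode">put</code>/<code class="inlinecode">take</code>, used below, which block indefinitely until the operation can proceed.</p>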
<p>A simple scenario can be implemented in fifty or so lines of Java:</p>
<pre><code class="java">
import java.util.concurrent.*;

class Producer implements Runnable {
    private final BlockingQueue<String> queue;
    Producer(BlockingQueue<String> q) { queue = q; }
    public void run() {
        try {
            while (true) {
                // Create a new string, blocking if the queue is full
                queue.put(produce());
            }
        } catch (InterruptedException ex) {}
    }
    String produce() {
        // Do something and return a string
        return "";
    }
}

class Consumer implements Runnable {
    private final BlockingQueue<String> queue;
    Consumer(BlockingQueue<String> q) { queue = q; }
    public void run() {
        try {
            while (true) {
                // take() blocks until a value is available
                consume(queue.take());
            }
        } catch (InterruptedException ex) {}
    }
    void consume(String data) {
        // Do something
    }
}

public class Main {
    public static void main(String... args) {
        BlockingQueue<String> q = new ArrayBlockingQueue<String>(10);
        Producer p1 = new Producer(q);
        Producer p2 = new Producer(q);
        Consumer c1 = new Consumer(q);
        Consumer c2 = new Consumer(q);
        new Thread(p1).start();
        new Thread(p2).start();
        new Thread(c1).start();
        new Thread(c2).start();
    }
}
</code></pre>
<p>If you can get past the verbosity of the language (and all those semicolons), this implementation is quite an improvement over a classic lock-based one. The data structure hides away all the gritty details and provides an extremely simple interface.</p>
<p>As we’ll see in a few sections, the basic idea of the blocking queue with finite size will come back again, in another, better, form.</p>
<h3 id="stream-processing-and-parallelism">Stream Processing and Parallelism</h3>
<p>Now, let me introduce briefly an approach that you could already be sort of familiar with if you have ever used the <a href="/2015/10/08/Swift2-map-flatmap-demystified/">map and flatMap</a> methods that Swift’s sequences provide.</p>
<p>Swift adds some <a href="/2015/11/12/experimenting-with-swift-2-sequencetype-generatortype/">useful methods to sequences</a> that allow you to process their content in a more functional way, turning the series of iterations typical of procedural code into a sequence of transformations on some input data:</p>
<pre><code class="swift">
func getData() -> [String] {
    var ret = [String]()
    for i in (0..<10000).reversed() {
        ret.append("i" + String(i))
    }
    return ret
}

let res = getData()
    .lazy
    .map { $0[$0.index(after: $0.startIndex)..<$0.endIndex] }
    .flatMap { Int($0) }
    .filter { $0 < 100 }

for n in res.prefix(5) {
    print(n)
}
</code></pre>
<p>Each element contained in the data sequence will be lazily processed, first removing the initial character from each string and then converting the remaining substring to an integer. Values of 100 or more will be filtered out.</p>
<p>Since we have enclosed the initial sequence in a lazy collection with <code class="inlinecode">lazy</code>, and no operation in the sequence requires the complete output from the previous step to produce a result, only as many elements as needed will actually be processed to produce the 5 integers we’ll print.</p>
<p>Normally, on a non-lazy sequence, these operations would have worked through the whole sequence.</p>
<p><strong>Stream processing</strong> is a way to perform multiple sequential computations on a series of values, either consuming a set of data (of any kind), available since the beginning of the processing, or a flow of data that will be progressively available over time (with new values coming in every now and then) processed <em>reactively</em>.</p>
<p>Some languages, like Java 8 and Scala, provide the streaming functionality of Swift’s sequences and much more through more practical <code class="inlinecode">Stream</code> types, which can be obtained by wrapping common sequence or collection types, but which can also be used to implement streams of data whose elements become available <em>asynchronously</em> and are processed in a non-blocking way.</p>
<p>Streams offer all the functional operators you are familiar with, like map, filter and reduce, and just as we did with sequences, multiple steps, each one producing a Stream of intermediate results, are pipelined to obtain the final output.</p>
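<p>As a sketch of how this looks in practice, here is roughly the same pipeline as the Swift example above, expressed with a Java <code class="inlinecode">Stream</code> (the input values are made up for the demo):</p>

```java
import java.util.List;
import java.util.stream.Collectors;
import java.util.stream.Stream;

public class StreamDemo {
    public static void main(String[] args) {
        List<Integer> res = Stream.of("i3", "i250", "i42", "i7", "i100", "i99")
            .map(s -> s.substring(1))   // drop the leading character
            .map(Integer::parseInt)     // convert to an integer
            .filter(n -> n < 100)       // keep values below 100
            .limit(5)                   // short-circuits: no more than 5 elements pass
            .collect(Collectors.toList());
        System.out.println(res); // [3, 42, 7, 99]
    }
}
```

<p>Like Swift’s lazy sequences, Java streams evaluate intermediate operations lazily, so <code class="inlinecode">limit</code> stops the pipeline as soon as enough elements have been produced.</p>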
<p><strong>Parallelism</strong> can easily be introduced during stream processing for those operations like <em>map</em> that do not require the whole content of the stream to perform their duty but just execute the same task on each element, without side-effects.</p>
<p>And this is maybe where stream processing shines: you will not need to break your computation into blocks to separately run the phases that can take advantage of parallel execution on systems with multiple CPUs.</p>
<p>Some stream operations will behave differently when parallel execution is requested, and all of this just requires some planning from the developer about how the data will be shared between these multiple steps.</p>
<p>I will not include full examples of parallel execution (since I would have to do it in Java, which has the simplest API for streams), but you just need to know that most libraries wrap streams in a specific <em>parallel stream</em> object that alters the behavior of some operators <em>transparently</em>.</p>
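<p>To make this concrete anyway, a minimal Java sketch: the only difference between the sequential and the parallel pipeline is a single <code class="inlinecode">parallel()</code> call, and since the <code class="inlinecode">map</code> step has no side effects both produce the same result:</p>

```java
import java.util.stream.IntStream;

public class ParallelDemo {
    public static void main(String[] args) {
        // Sequential sum of squares of 1..1000
        long seq = IntStream.rangeClosed(1, 1000)
            .mapToLong(i -> (long) i * i)
            .sum();
        // Same pipeline, distributed across the available cores
        long par = IntStream.rangeClosed(1, 1000)
            .parallel()
            .mapToLong(i -> (long) i * i)
            .sum();
        System.out.println(seq == par); // true: the map step is side-effect free
    }
}
```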
<p>While there don’t seem to be traditional stream-oriented Swift libraries (likely because of the overlap with what the language already provides), the most popular libraries that use reactive stream processing are probably the principal Swift reactive frameworks, <a href="https://github.com/ReactiveX/RxSwift">RxSwift</a> and <a href="https://github.com/ReactiveCocoa/ReactiveSwift">ReactiveSwift</a>, which under the hood use streams of data or events to handle the asynchronous flow typical of mobile/desktop applications.</p>
<h3 id="coroutines-and-green-threads">Coroutines and Green Threads</h3>
<p>From the first part of this series, we already know that the concept of executing multiple tasks concurrently predates the general availability of platforms with multiple physical execution units.</p>
<p>But how can we handle the execution of multiple tasks on a single CPU, giving the illusion of simultaneous execution, in a multitasking system?</p>
<p>The software layer responsible for the execution of multiple tasks needs to implement the machinery necessary to share the available CPUs between the currently running tasks to increase utilization, for example to temporarily pause tasks waiting for I/O that leave the CPU idle and execute some other task.</p>
<p>There are two main flavors of multitasking systems, which adopt different approaches to CPU allocation.</p>
<p><strong>Cooperative</strong> multitasking allows each running task to decide when to relinquish the CPU, giving complete freedom to the developer on how the resources should be used but increasing the chance that the whole system hangs because one or more tasks are monopolizing the available execution units.</p>
<p><strong>Preemptive</strong> multitasking instead takes this responsibility away from the developer and delegates the job of deciding which task should run to a specific <em>Scheduler</em> component, which schedules tasks depending on their priority but normally allocates the same slice of CPU time to each one, performing the necessary <em>context switch</em> when a task has used up its time slice, is waiting for I/O, or is otherwise idle.</p>
<p>The preemptive approach is nowadays used in every modern OS kernel (with wildly different scheduling strategies) to allow the execution of multiple threads and processes, making ample use of interrupts to guarantee good performance when handling I/O.</p>
<p>As we already know, the kernel can manage multiple threads and we can use user-space APIs to make use of these threads in our applications.</p>
<p>Multitasking doesn’t have to be limited to the OS layer; nothing stops us from implementing user-space threads if we are so inclined.</p>
<p>But does it make sense to do so? It turns out that, <em>yes, it does</em>.</p>
<p>Kernel-level threads need to guarantee data integrity when performing the context switch, making this operation way more complex than one could initially imagine (multiple steps are involved, from saving a well defined set of registers to adjusting the processor privilege level). Furthermore, the initial setup required when creating a new thread and the need for a per-thread stack make the creation of a new thread a relatively costly operation.</p>
<p>While with a small number of threads the costs of creation and context switch are negligible, this does not hold true when we start to have hundreds or thousands of active threads simultaneously.</p>
<p>The requirement for a per-thread stack alone would pretty quickly limit the number of threads you could create in a real world scenario.</p>
<p>For these reasons, we can’t use classic OS threads when we plan to have hundreds of thousands (or more) of concurrent threads in our application (we’ll see in the next sections why we could need such a huge number of threads).</p>
<p>The common solution to this problem, which you’ll see implemented in many languages, consists of creating <strong>lightweight user-space threads</strong>.</p>
<p>These threads don’t need the full context switch required by kernel threads to provide a similar functionality and can operate in the context of a single kernel thread without having their own local stacks.</p>
<p>User-space threads can be implemented following both multitasking approaches described above, but most of the time you’ll see an implementation ascribable to the <em>cooperative</em> category, where data integrity is less of an issue: there is no preemptive scheduler, so the developer decides where to yield control of the CPU and guarantees integrity across context switches.</p>
<p>The system that manages these lightweight threads can be backed by one or more classic kernel threads (organized in a pool), but most implementations tend to employ by default a number of kernel threads equal to the number of hardware CPUs.</p>
<p><em>Coroutines</em> and <em>Green Threads</em> are two examples of user-space lightweight threads with different characteristics.</p>
<p>Green threads recreate completely at user level what is available at kernel level with the usual system threads: a lightweight implementation of threads performing <strong>preemptive scheduling</strong> through a user-level scheduler built from scratch, which doesn’t need to invoke threading system calls to function.</p>
<p>The first implementation of green threads was used in the early releases of Java but was later abandoned in favor of system threads; green threads are now in vogue again for their scalability properties, after the improvements in memory performance of the last decade or so have made their advantages quite evident.</p>
<p>At the moment there are no Swift implementations of green threads, but the important thing to know is that green threads have the same API as normal threads; the developer does not need to do anything special to make them work the way system threads would.</p>
<p>Coroutines implement <strong>cooperative multitasking</strong> instead, leaving to the developer the responsibility of guaranteeing that a routine will play well with other concurrently running routines: the code must include special instructions specifying when a closure is ready to be paused so that other routines can run, giving the illusion of closures running concurrently.</p>
<p>While control of the CPU is usually yielded to the next routine waiting to be run with a specific <code class="inlinecode">yield</code>, other operations that are normally blocking have the same effect: sleep instructions, blocking operations on files and network sockets, and many more. Implementations of coroutines usually provide coroutine-aware versions of those basic operations.</p>
<p>The most serious attempt at implementing coroutines in Swift (even if this is not a pure-Swift implementation since it’s built on top of libdill) is represented by the <a href="https://github.com/Zewo/Venice">Venice</a> library that offers them among other features.</p>
<p>Let’s see a basic example using Venice:</p>
<pre><code class="swift">
let coroutine = try Coroutine {
    while true {
        performExpensiveComputation()
        try Coroutine.yield()
    }
}

let printer = try Coroutine {
    while true {
        try Coroutine.wakeUp(100.milliseconds.fromNow())
        print(".")
    }
}

try Coroutine.wakeUp(5.seconds.fromNow())
coroutine.cancel()
printer.cancel()
</code></pre>
<p>The first coroutine performs a series of expensive operations in a loop, <em>yielding</em> control of the CPU to allow other coroutines to execute cooperatively. Other coroutines will do the same, for example breaking up what they need to do into a series of steps and yielding back control periodically if needed.</p>
<p>The second coroutine waits for a non-blocking timer to elapse, yielding back control every time, before printing a dot.</p>
<p>The same <code class="inlinecode">wakeUp</code> is used in the main thread to wait for 5 seconds before cancelling both coroutines. Cancelling will result in the coroutine throwing a <em>VeniceError</em> and exiting.</p>
<p>While Venice coroutines don’t return values, other implementations of coroutines can look a bit more like Swift’s generators and return intermediate values, as <a href="https://github.com/JadenGeller/Yield">Yield</a> does. But note that this library has not been updated to Swift 3 and uses system threads; it’s presented here just to give you an idea of how the API would look.</p>
<p>Here is an example from the documentation that <em>generates</em> the Fibonacci sequence:</p>
<pre><code class="swift">
let maxFibbValue = 100
let fibbGenerator = Coroutine<Int> { yield in
    var (a, b) = (1, 1)
    while b < maxFibbValue {
        (a, b) = (b, a + b)
        yield(a) // Returns `a`, and then continues from here when `next` is called again
    }
}

let fibb = AnySequence { fibbGenerator }
for x in fibb {
    print(x) // -> 1, 2, 3, 5, 8, 13, 21, 34, 55, 89
}
</code></pre>
<p>Could some form of lightweight thread be a useful addition to the Swift language?</p>
<p>Definitely, and these new mechanisms would make the language way more scalable, something essential when developing server applications that deal with the network, an area where languages like Go and Erlang are at the moment a better choice.</p>
<p>And lightweight threads, as we’ll see, could be the scalable base mechanism needed to implement effectively the concurrency paradigms that will be described in the next sections.</p>
<h2 id="futures-and-promises">Futures and Promises</h2>
<p>Let’s now introduce another concept you could already be familiar with from other languages: tracking the execution of asynchronous closures through objects that contain a reference to their future return value (we’ve already seen something similar when discussing <em>OperationQueues</em>).</p>
<p>This pattern is especially useful on platforms that don’t have the luxury of being able to distribute asynchronous calls between multiple threads and, to avoid callback hell, need to provide some practical high level API built on top of some form of cooperative multitasking runtime.</p>
<p>Both <em>Futures</em> and <em>Promises</em> are placeholders for the real return value of a function; their completion status can be checked, and handlers can be registered to do something when the return value of the function they refer to becomes available, or to handle errors.</p>
<p>Our programs can also wait for the value of the future to be available or combine multiple futures in a chain when they are dependent on each other.</p>
<p>But what’s the difference between futures and promises?</p>
<p>If we follow the original nomenclature, <em>futures</em> are simple containers that store a value and can only be read once ready, while <em>promises</em> are objects that manage the life-cycle of a future, performing the transitions needed to complete it (normally only once), successfully or not (in the presence of errors).</p>
<p>Libraries tend to use the two terms interchangeably.</p>
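<p>Java’s <code class="inlinecode">CompletableFuture</code> is one example of a type that plays both roles in one object: the producing side completes it (the promise role), while consumers chain on it and read the value (the future role). A minimal sketch (the <code class="inlinecode">FutureDemo</code> class name is just demo scaffolding):</p>

```java
import java.util.concurrent.CompletableFuture;

public class FutureDemo {
    public static void main(String[] args) {
        // The placeholder for a value that is not available yet
        CompletableFuture<Integer> promise = new CompletableFuture<>();
        // Register a transformation to run once the value arrives (chaining)
        CompletableFuture<String> chained = promise.thenApply(n -> "value: " + (n * 2));

        promise.complete(21);               // promise role: fulfill, normally only once
        System.out.println(chained.join()); // future role: read when ready -> "value: 42"
    }
}
```

<p>Note that <code class="inlinecode">complete</code> returns <code class="inlinecode">false</code> on subsequent calls: the transition happens only once, matching the life-cycle described above.</p>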
<p>There are quite a few open source Swift libraries that implement Futures and Promises (and as we’ll see in the next section async/await): <a href="https://github.com/mxcl/PromiseKit">PromiseKit</a>, <a href="https://github.com/malcommac/Hydra">Hydra</a>, <a href="https://github.com/freshOS/then">then</a>, <a href="https://github.com/Thomvis/BrightFutures">BrightFutures</a>, <a href="https://github.com/arikis/Overdrive">Overdrive</a> and many others.</p>
<p>While all these implementations provide the basic features needed to use Futures and Promises (sometimes also called <em>Tasks</em>) with more or less the same API, some add extended functionality that could make your code more concise.</p>
<p>Let’s see an example to understand how Futures and Promises work in practice using <a href="https://github.com/malcommac/Hydra">Hydra</a>.</p>
<p>Suppose we have a <code class="inlinecode">retrieveUserData</code> function that retrieves the profile data for a user with a specific numeric id and calls a completion handler once done. Let’s improve the API using Promises and Futures.</p>
<pre><code class="swift">
func getUserData(_ id: Int) -> Promise<UserData> {
    return Promise<UserData>(in: .background, { resolve, reject in
        retrieveUserData(id: id, completion: { data, error in
            if let error = error {
                reject(error)
            } else {
                resolve(data)
            }
        })
    })
}

getUserData(42).then(.main, { user in
    // On successful completion
    print("Loaded data for user: \(user.username)")
}).catch(.main, { error in
    // On error
    print("Error: \(error)")
})
</code></pre>
<p>In the example above, the <code class="inlinecode">getUserData</code> function returns a <code class="inlinecode">Promise<UserData></code> object that is fulfilled with a value or rejected with an error in the completion handler, using either the <code class="inlinecode">resolve(value: UserData)</code> or <code class="inlinecode">reject(error: Error)</code> callbacks.</p>
<p>We then register a function to be called when the promise is fulfilled using <code class="inlinecode">.then</code>, specifying that the closure should be executed on the main queue. The <code class="inlinecode">.catch</code> method handles errors for all promises <em>in a chain</em>.</p>
<p>Yes, promises can be chained, and they also have a few other useful operators; they are definitely not just a cleaner alternative to callbacks.</p>
<p>We will now define two new functions that return promises and we’ll chain them together to perform three operations in sequence:</p>
<pre><code class="swift">
func getPhotos(user: UserData) -> Promise<[Photos]>
func bulkDeletePhotos(photos: [Photos]) -> Promise<Int>

getUserData(42).then(getPhotos)
    .then(bulkDeletePhotos)
    .catch { err in
        print("Error while deleting photos: \(err)")
    }
</code></pre>
<p>After the information about the user has been retrieved, we try to get a list of all the photos of this account and then perform a mass deletion. If any of those operations fails, the final catch block will handle the error.</p>
<p>Promise libraries also provide operators like <code class="inlinecode">all</code>, <code class="inlinecode">any</code>, <code class="inlinecode">retry</code>, <code class="inlinecode">recover</code>, <code class="inlinecode">validate</code> and many others.</p>
<p>Combining these operators, complex logic with multiple nested statements can be turned into something more concise while still keeping a good degree of clarity.</p>
<pre><code class="swift">
getUserData(42).retry(3)
    .validate { $0.age > 18 }
    .then { user in
        print("Username: \(user.username)")
    }.always {
        print("Done.")
    }
</code></pre>
<p>This time we’ll try to retrieve information on the user with id 42, retrying on failure at most three times, and if successful we’ll print the username after we have verified that the user is over 18 years old.</p>
<p>Using the <code class="inlinecode">always</code> method, we can also add a finalizing closure that will always be executed no matter what happens to the closures that precede it in the chain.</p>
<p>The <code class="inlinecode">all</code> and <code class="inlinecode">any</code> methods create a composite future that will be fulfilled only when either all, or at least one, of the underlying futures complete successfully.</p>
<pre><code class="swift">
let promises = [40, 41, 42, 43, 44, 45].map { getUserData($0) }

all(promises).then { users in
    print("All \(users.count) users retrieved!")
}.catch { err in
    print("Error: \(err)")
}

any(promises).then { users in
    print("At least some users retrieved!")
}
</code></pre>
<p>While these basic functionalities are common among different implementations, other methods could be available that, for example, introduce delays along a chain or perform additional transformations.</p>
<h2 id="asyncawait">Async/Await</h2>
<p>Now let’s talk about Async/Await, a feature that has become quite popular lately since it was finally added to JavaScript in the ES2017 version of the standard, but that was already available in other languages (C# introduced this API first, in version 5.0, a few years ago).</p>
<p>Async/Await can be considered an extension over the basic futures API.</p>
<p>In the single-threaded world of JavaScript, futures provided a better alternative to callback nesting, making it possible to enclose asynchronous closures in convenient objects that could be used to chain sequential operations, with the side effect of simplifying stacktrace browsing during debugging.</p>
<p>These new keywords enable something that at first glance appears to be only a small improvement over using promises directly: <code class="inlinecode">await</code> blocks execution until a function returning a promise completes, and <code class="inlinecode">async</code> annotates functions that, during their execution, wait for the completion of one or more futures.</p>
<p>But let’s see a simple example in Javascript that makes use of both keywords:</p>
<pre><code class="swift">
async function getUserProfileImageUrl(id) {
    let user = await getUserData(id);
    let img = await getImageData(user.imageId);
    return img.url;
}
</code></pre>
<p>With futures we could have obtained a reference to the <code class="inlinecode">UserData</code> structure and to <code class="inlinecode">ImageData</code> in a series of <code class="inlinecode">then</code> closures, extracting the URL as the last step.</p>
<p>Using Async/Await this asynchronous code looks more like classic synchronous code, an aesthetic improvement, but one that could have been replicated by waiting in a blocking way for the fulfillment of those two promises.</p>
<p>But look at this from another point of view.</p>
<p>With Async you are also delimiting a context that contains asynchronous calls and with Await you are clearly marking where the execution will stop waiting for a result.</p>
<p>These two pieces of information would be extremely useful to implement some form of <em>cooperative multitasking</em> under the hood, and this is what is likely to happen in most JavaScript implementations.</p>
<p>That wait will thus be a non-blocking wait in most implementations, with every await function still being executed in sequence. As we saw in the first part of the series, waiting in a non-blocking way is useful when you need to perform I/O or simply to increase CPU/resource utilization.</p>
<p>Let’s go back to Swift with some examples using <a href="https://github.com/malcommac/Hydra#awaitasync">Hydra</a>, but in this case too, other libraries provide a mostly identical API.</p>
<pre><code class="swift">
let asyncFunc = async({
    let user = try await(getUserData(42))
    let photos = try await(getPhotos(user: user))
    let success = try await(bulkDeletePhotos(photos: photos))
})
</code></pre>
<p>The async block will be executed in the <code class="inlinecode">.background</code> context, like the await functions, which run in the context in which they are called.</p>
<p>Additionally, Hydra and other libraries allow you to return a promise from within the async block, so that you’ll be able to use all the operators seen in the previous section or simply chain the block with other promises.</p>
<p>How is the failure of a promise handled with async/await?</p>
<p>To handle errors, await calls are usually surrounded by a catch statement, in JavaScript but also in the Swift libraries listed above, since, as you may have guessed, the <code class="inlinecode">try</code> in the previous sample is a sign that an Error will be thrown by await when a promise completes with a failure.</p>
<p>So, for most libraries you’ll be able to catch errors adding a <code class="inlinecode">do/catch</code> inside the async block.</p>
<p>If you are interested in learning more about async/await, I recommend checking out the open source libraries listed above and looking under the hood at how the various functions have been implemented.</p>
<h2 id="stm-software-transactional-memory">STM: Software Transactional Memory</h2>
<p>In this section we’ll take a look at Clojure, a LISP language running on the JVM that is usually a good source of interesting ideas.</p>
<p>If you want to know more about languages of the LISP family, check out the article I wrote a while ago about <a href="/2017/02/05/building-a-lisp-from-scratch-with-swift/">implementing a LISP from scratch in Swift</a>, that will guide you through the implementation of a minimal LISP while describing the fundamental components of an interpreter.</p>
<p><em>Pure</em> functional languages do not allow mutable data or the use of mutable state and as such are inherently thread-safe. If there is no mutable memory that needs to be protected from concurrent access, the problems of concurrency are already solved.</p>
<p>Clojure is an <em>impure</em> functional language, and manages mutable data through <em>immutable</em> and <em><a href="https://en.wikipedia.org/wiki/Persistent_data_structure">persistent</a></em> data structures that are concurrency-aware.</p>
<p>Managing mutability with immutable structures may seem a contradiction until you grok the meaning of the term <em>persistent</em> in this context.</p>
<p>Quoting <a href="https://www.cs.cmu.edu/~sleator/papers/making-data-structures-persistent.pdf">the article</a> that coined the term:</p>
<blockquote>
<p>Ordinary data structures are ephemeral in the sense that making a change to the structure destroys the old version, leaving only the new one.
We call a data structure persistent if it supports access to multiple versions.
The structure is partially persistent if all versions can be accessed but only the newest version can be modified, and fully persistent if every version can be both accessed and modified.</p>
</blockquote>
<p>You may already be familiar with the mechanism of <a href="https://en.wikipedia.org/wiki/Copy-on-write">Copy-On-Write</a>, used profusely in Swift, which speeds up the copy of a value when it’s reassigned by performing a real copy only when one of the clones of the original value is modified, saving memory and avoiding pointless copying.</p>
<p>In other words, when a data structure that supports <em>COW</em> gets copied to a new variable, the new variable will still point to the original area of memory, until a real copy of the original content is triggered by an attempt to change it.</p>
<p><em>Persistent</em> data structures take this idea a step further and try to share with the original value the parts of its structure that didn’t change after the update, performing something akin to a selective copy. All of this with the guarantee that the original memory content will be untouched and still accessible through the other references.</p>
<p>The image below shows a simple example of a persistent list, highlighting how an actual copy is performed only when the original data structure would otherwise become unusable, because the last element down the chain is not needed anymore.</p>
<p><img src="/imgs/persistent.png" srcset="/imgs/persistent@2x.png 2x" alt="Persistent list" /></p>
<p>Similarly to COW variables, using persistent data structures has the effect of isolating changes to the reference on which they were applied.</p>
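<p>To see how structural sharing works in practice, here is a toy persistent singly linked list, sketched in Go with made-up names (real persistent collections use more sophisticated tree-based structures): prepending allocates a single new node and shares the entire existing tail with the previous version, which stays valid and unchanged.</p>
<pre><code class="swift">
package main

import "fmt"

// node is an immutable cell of a persistent list: once created,
// neither its value nor its tail is ever modified.
type node struct {
    value int
    next  *node
}

// cons returns a new list whose tail is the existing one:
// the old list is untouched and both versions share it.
func cons(v int, tail *node) *node {
    return &node{value: v, next: tail}
}

func toSlice(n *node) []int {
    var out []int
    for ; n != nil; n = n.next {
        out = append(out, n.value)
    }
    return out
}

func main() {
    v1 := cons(2, cons(3, nil)) // version 1: [2 3]
    v2 := cons(1, v1)           // version 2: [1 2 3], sharing [2 3] with v1
    fmt.Println(toSlice(v1))    // the old version is still intact
    fmt.Println(toSlice(v2))
}
</code></pre>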
<p>This property is available with all the basic collection types in Clojure and, in addition to this, <em>transient</em>, mutable views of persistent data structures can be created in those scenarios where persistence is not needed, typically for local, performance-sensitive code.</p>
<p>Clojure guarantees thread-safety with different mechanisms building upon persistence through <a href="http://www.thiagotnunes.com/blog/2013/06/10/clojure-concurrency">4 container primitives</a>: <em>Vars</em>, <em>Refs</em>, <em>Atoms</em> and <em>Agents</em>.</p>
<p>The most interesting and most sophisticated are <em>Refs</em>, which are an integral component of a system based on <em>Software Transactional Memory</em>.</p>
<p>STM is based on the idea of grouping a series of operations on mutable variables, that need to happen atomically, in a single transaction that is guaranteed to be executed completely or not at all. You could already be familiar with this concept from the world of databases.</p>
<p>Multiple transactions can be executed concurrently but they will never be allowed to interfere with each other: transactions that try to perform conflicting modifications on the same set of data will simply fail and be retried by the STM runtime.</p>
<p>Refs contain the objects that can be modified during a transaction, and they can be either <em>dereferenced</em> to obtain the value they contain, or modified by setting a new value or by executing a function that alters the current value.</p>
<pre><code class="swift">
(def counter (ref 0))
(def counter2 (ref 0))

(deref counter) ; Dereferenced -> 0

(dosync
  (alter counter inc)
  (alter counter2 inc))
</code></pre>
<p>Let me just do a quick and dirty recap of the LISP notation: each pair of parentheses contains a series of terms that define a symbolic expression. These expressions are written in prefix notation, where the first element of this list of terms is an operator and the rest are operands. A sum of two integers would be expressed as <code class="inlinecode">(+ 1 2)</code>. As said above, check out <a href="/2017/02/05/building-a-lisp-from-scratch-with-swift/">this article</a> if you want to know more.</p>
<p>In that example we defined two Refs to integer values, named <code class="inlinecode">counter</code> and <code class="inlinecode">counter2</code>, and then <em>alter</em> their value in a transaction enclosed in a <code class="inlinecode">dosync</code> block, incrementing them.</p>
<p>Both values will always contain the same number regardless of how much contention there is on those two Refs. When run concurrently, for example with a few threads that repeatedly try to execute the transaction, a good number of those transactions will fail (silently) and be retried automatically and transparently.</p>
<p>STM implementations are usually lock-free and as such scale better than alternative solutions based on the classic locks we saw in part one, and as you can see there is not much else to do other than defining the content of the transaction.</p>
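<p>The optimistic retry loop at the heart of many lock-free STM runtimes can be approximated, for a single variable, with a compare-and-swap: read the current value, compute the new one, try to commit, and retry if another thread committed first. The following Go sketch is only an analogy (a real STM tracks read and write sets across multiple variables), but it shows the fail-and-retry pattern:</p>
<pre><code class="swift">
package main

import (
    "fmt"
    "sync"
    "sync/atomic"
)

func main() {
    var counter int64
    var wg sync.WaitGroup

    // Each "transaction" reads the current value, computes the new one
    // and commits with CompareAndSwap; a failed commit means another
    // goroutine got there first, so it silently retries.
    increment := func() {
        for {
            old := atomic.LoadInt64(&counter)
            if atomic.CompareAndSwapInt64(&counter, old, old+1) {
                return // committed
            }
        }
    }

    for i := 0; i < 1000; i++ {
        wg.Add(1)
        go func() {
            defer wg.Done()
            increment()
        }()
    }
    wg.Wait()
    fmt.Println(counter) // always 1000, regardless of contention
}
</code></pre>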
<p>There are just a few implementations of STM in Swift with varying degrees of sophistication, like <a href="https://github.com/typelift/Concurrent">Typelift’s Concurrent</a> and <a href="https://github.com/lbrndnr/swift-stm">Swift STM</a>.</p>
<p>Let’s see how STM looks with Concurrent using the bank account example, straight from their documentation:</p>
<pre><code class="swift">
typealias Account = TVar<UInt>

/// Some atomic operations
func withdraw(from account: Account, amount: UInt) -> STM<()> {
    return account.read().flatMap { balance in
        if balance > amount {
            return account.write(balance - amount)
        }
        throw TransactionError.insufficientFunds
    }
}

func deposit(into account: Account, amount: UInt) -> STM<()> {
    return account.read().flatMap { balance in
        return account.write(balance + amount)
    }
}

func transfer(from: Account, to: Account, amount: UInt) -> STM<()> {
    return from.read().flatMap { fromBalance in
        if fromBalance > amount {
            return withdraw(from: from, amount: amount)
                .then(deposit(into: to, amount: amount))
        }
        throw TransactionError.insufficientFunds
    }
}
</code></pre>
<p>The <code class="inlinecode">TVar</code> type represents a container that supports the transactional memory interface; in this case it contains an integer with the account balance, and the <code class="inlinecode">withdraw</code>, <code class="inlinecode">deposit</code> and <code class="inlinecode">transfer</code> functions change its value or perform a transfer between different accounts.</p>
<p>As you can see in the transfer function, a transaction is built by concatenating multiple operations:</p>
<pre><code class="swift">
let alice = Account(200)
let bob = Account(100)

let finalStatement =
    transfer(from: alice, to: bob, amount: 100)
        .then(transfer(from: bob, to: alice, amount: 20))
        .then(deposit(into: bob, amount: 1000))
        .then(transfer(from: bob, to: alice, amount: 500))
        .atomically()
</code></pre>
<p>The transaction will be executed only when <code class="inlinecode">atomically()</code> is called.</p>
<p>To learn more about STM and how it can be implemented I recommend the article <a href="https://www.microsoft.com/en-us/research/wp-content/uploads/2016/02/beautiful.pdf">Beautiful Concurrency</a> by Simon Peyton Jones.</p>
<h2 id="the-actor-model">The Actor Model</h2>
<p>In the first part of this series, we dealt with concurrency using locking primitives and more complex APIs based on pools of threads to control multiple accesses to shared memory and to perform concurrent operations.</p>
<p>Locks and derivatives are the most basic primitives we have at our disposal: through them we can limit access to code that modifies a shared area of memory, imposing entry checks and regulating the execution flow of multiple threads.</p>
<p>Locks do not scale well: with an increasing number of threads, or with the same memory (or, most of the time, the same data structure) shared in too many places, reasoning about the state of your programs can become quite challenging.</p>
<p>High level APIs like DispatchQueue allow you to model concurrent operations in a way that is easy to reason about, but they are based on threads, a low level primitive that, as we saw in the <a href="#coroutines-and-green-threads">green threads section</a>, is quite resource hungry when you consider the cost of context switches and the memory usage.</p>
<p>And everything we have described until now does not really take into consideration that our programs could need to synchronize their state <em>across multiple machines</em> and <em>between multiple instances</em> of our application.</p>
<p>Wouldn’t it be nice to be able to write concurrent code that could, nearly transparently, run in a distributed environment without any modification?</p>
<p>To support this additional use case we need to introduce new paradigms based on an idea that you are already familiar with from the days of Objective-C: <em>Message Passing</em>.</p>
<p>This approach allows us to handle concurrency without sharing memory (implicitly solving a lot of the concurrency issues we’ve seen until now) through higher level constructs that handle the exchange of messages between concurrent entities operating in isolation from each other.</p>
<p>We’ll take a look at two paradigms based on message passing: <em>The Actor Model</em> and <em>Communicating Sequential Processes</em>.</p>
<p>Implementations of the actor model can be found in various languages, from the most notable, Erlang, to languages like Java that support the paradigm through external libraries like Akka.</p>
<p>An actor is an object that provides various services through an interface that processes asynchronous messages, and that maintains a mutable internal state (if the actor is <em>stateful</em> rather than <em>stateless</em>) that is <strong>never shared</strong> with the actor’s clients, to guarantee thread-safety.</p>
<p>In most implementations, direct messages are received in the same order they were sent and are always processed sequentially since, internally, actors are single-threaded (but not usually backed by a real system thread) and as such, message processing can be considered as an atomic operation.</p>
<p>But messages are not actually sent <em>directly</em> to an actor object.</p>
<p>Each actor has its own <em>Mailbox</em>, a queue that stores the asynchronous messages directed to a specific actor, working as a buffer for incoming messages. A mailbox can have additional characteristics that change the way the actor receives messages, it could for example, limit the number of messages it holds or introduce the concept of message priority.</p>
<p>An actor receives messages but can also send messages to other actors if it knows <em>their address</em>, or, when it needs a destination actor, it can create a new actor instance. This property is known as <em>locality</em>.</p>
<p>Every message has a reference to its sender, so that it is always possible to reply back, maintaining locality.</p>
<p>A system that uses the actor model is made up of a network of multiple actors with various functionalities that communicate with each other constantly. Since each actor can create new actors, the topology of the network is dynamic. Smaller networks of actors can be interconnected to build bigger systems.</p>
<p>Since there are no mature Swift libraries implementing the actor model, let’s see some concrete examples using Scala, a language running on the JVM with lots of interesting features, and the Akka library.</p>
<p>We’ll create a counter actor that will maintain an integer value as its state and that will expose methods to other actors to increment this counter or get its value.</p>
<p>No concurrent access to this integer value will be possible, the actor will implicitly take care of that. This is functionally equivalent to having an integer counter protected from multiple accesses through a lock or some other equivalent basic concurrency primitive.</p>
<pre><code class="swift">
import akka.actor.Actor
import akka.actor.Props
import akka.event.Logging

class Counter extends Actor {
  var count = 0
  def receive = {
    case "incr" => count += 1
    case "get"  => sender() ! count
    case _      => println("Unknown message received")
  }
}

class Main extends Actor {
  val counter = context.actorOf(Props[Counter], "counter")

  counter ! "incr"
  counter ! "incr"
  counter ! "incr"
  counter ! "get"

  def receive = {
    case count: Int =>
      println(s"count was $count")
      context.stop(self)
    case _ => println("Unknown message received")
  }
}
</code></pre>
<p>Above we are defining two actors: the <em>Counter</em> actor that will manage the mutable <code class="inlinecode">count</code> and the <em>Main</em> actor that we’ll use to send commands to the counter.</p>
<p>The <code class="inlinecode">receive</code> method contains a message loop that defines the <em>behavior</em> of the actor, associating with each message string a closure that will be executed every time that specific message is received.</p>
<p>Akka requires that all the receive loops be exhaustive, so in this case we have to add a final catch-all case to our receive loops. Using something akin to enums with associated values (i.e. Scala’s case classes) and specifying all the cases would remove this requirement. Other implementations (e.g. Erlang actors) do not require exhaustiveness.</p>
<p>An interesting feature of actors is the ability to <em>alter their behavior</em> while they are running. Multiple message loops can be defined and the actor will be able to switch between them selecting one of them as active loop.</p>
<p>In the sample above, the Counter replies to <code class="inlinecode">incr</code> and <code class="inlinecode">get</code>, respectively incrementing the local counter or sending the value of the counter to the actor that sent the message.</p>
<p>In this example all the messages are sent in “fire and forget” mode using the <code class="inlinecode">!</code> operator, that sends the message asynchronously and returns right away. Alternatively, messages can be sent encapsulating the status of the call into a future.</p>
<p>The Main actor creates a Counter actor and then proceeds to increment the counter three times before retrieving its value. The value is then obtained asynchronously in its receive loop, where it is printed before the actor stops itself.</p>
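<p>Even without an actor library, the mechanics of the Counter actor can be mimicked with the primitives we’ll meet in the channels section below: a goroutine plays the actor’s single-threaded body and a channel plays its mailbox. This Go sketch, with made-up names, is only an illustration of the model, not a full actor runtime:</p>
<pre><code class="swift">
package main

import "fmt"

// message is what lands in the actor's mailbox; a non-nil reply
// channel plays the role of the sender's address.
type message struct {
    name  string
    reply chan int
}

// counterActor owns count exclusively: the only goroutine that
// touches it is the one draining the mailbox, so no lock is needed
// and messages are processed sequentially, in arrival order.
func counterActor(mailbox <-chan message) {
    count := 0
    for msg := range mailbox {
        switch msg.name {
        case "incr":
            count++
        case "get":
            msg.reply <- count
        default:
            fmt.Println("Unknown message received")
        }
    }
}

func main() {
    mailbox := make(chan message, 16)
    go counterActor(mailbox)

    for i := 0; i < 3; i++ {
        mailbox <- message{name: "incr"} // fire and forget
    }
    reply := make(chan int)
    mailbox <- message{name: "get", reply: reply}
    fmt.Println("count was", <-reply)
}
</code></pre>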
<p>Now that we’ve got this basic example down, let’s talk again about behaviors.</p>
<p>The fact that multiple receive loops can be defined beforehand and then swapped at runtime can be useful to implement a <a href="https://en.wikipedia.org/wiki/Finite-state_machine">Finite State Machine</a> to manage the various states the actor will find itself in during its lifetime.</p>
<p>An actor could for example be started in a uninitialized state and require a series of messages to be configured and perform a transition to its normal running state. Or it could have a more complex logic, easy to represent with a FSM, where every state could be implemented with a different receive loop and where transitions would be performed when specific messages are received. Furthermore, the ability to change the behavior could be used to encapsulate the internal state of stateful actors.</p>
<p>Let’s see a quick example going back to the counter actor: we’ll remove the need for an external integer counter by embedding the counter value in the receive loop.</p>
<pre><code class="swift">
class Counter extends Actor {
  def counter(n: Int): Receive = {
    case "incr" => context.become(counter(n + 1))
    case "get"  => sender() ! n
    case _      => println("Unknown message received")
  }
  def receive = counter(0)
}
</code></pre>
</code></pre>
<p>A simple actor with two states returning two alternating values can be easily defined as follows:</p>
<pre><code class="swift">
class PingPong extends Actor {
  def ping: Receive = {
    case "get" =>
      sender() ! "Ping!"
      context.become(pong)
    case _ => println("Unknown message received")
  }
  def pong: Receive = {
    case "get" =>
      sender() ! "Pong!"
      context.become(ping)
    case _ => println("Unknown message received")
  }
  def receive = ping
}
</code></pre>
</code></pre>
<p>Now that we have our system based on a huge number of actors, all operating concurrently and processing asynchronous messages, how do we handle abnormal errors and fault conditions?</p>
<p>While normal error conditions related to the logic of an actor can still be handled through error messages, unexpected exceptions (remember that Scala and Akka run on the JVM) that may crash an actor can’t.</p>
<p>Systems based on actors follow a <strong>“let it crash”</strong> approach, and focus on bringing the system back to a normal functioning state by deciding what to do with components marked as failed, instead of trying to keep components alive by stabilizing their state after it has been perturbed by an unexpected error.</p>
<p>Actor systems are fault-tolerant and allow you to choose a recovery strategy to make an actor (or a series of actors in a network) fully functional again after a fault.</p>
<p>A failing actor could, for example, simply be restarted with a clean slate, discarding its internal state; it could be stopped completely; or we could just ignore a class of errors and go on as if nothing happened. The strategy you choose depends on the error and on what you need to do to bring the actor back to its running state.</p>
<p>What you have read in this section barely scratches the surface of what modern actor systems have to offer; to learn more, check out this <a href="http://letitcrash.com/post/20964174345/carl-hewitt-explains-the-essence-of-the-actor">video with Carl Hewitt</a>, the <a href="http://doc.akka.io/docs/akka/current/scala/actors.html">Akka documentation</a> or read about <a href="http://learnyousomeerlang.com/the-hitchhikers-guide-to-concurrency">Erlang’s implementation</a>.</p>
<p>To conclude, the actor model has a lot of interesting characteristics, as we’ve seen, but it also has some weaknesses.</p>
<p>In some situations, it could be quite hard to model your problem following the rigid actor model and other approaches could be better suited for what you need to do.</p>
<p>When using actors you’ll also need to consider how messages flow in your system and configure the mailbox of each actor accordingly. Failing to do this could lead to mailbox overflows.</p>
<p>Actors are also the wrong tool when you want to parallelize sections of your code, that’s not the problem the actor model is trying to solve.</p>
<h2 id="communicating-sequential-processes-channels">Communicating Sequential Processes: Channels</h2>
<p>The Communicating Sequential Process model is another concurrency model based on message passing that, instead of focusing on the objects receiving or sending messages and what they do, revolves around the idea of <em>channels</em>, that compose the infrastructure needed to exchange messages between different entities running concurrently.</p>
<p>A channel can be used to send and receive messages between entities running in different tasks that will use it as a <em>communication channel</em>. You could have for example a task that takes care of centralized logging that exposes its services through a channel that every other task in the system would use.</p>
<p>From the point of view of the implementation, a channel is nothing more than a thread-safe FIFO queue that can hold multiple messages waiting to be received, if needed (<em>buffered</em> channels).</p>
<p>The sender usually blocks when the channel is <em>full</em> and waits until a receiver removes a message from the queue. Conversely, receivers block when the queue is empty and wait for new messages. Channels can be <em>closed</em> when they are not needed anymore, unblocking all the senders and receivers that were waiting on that queue.</p>
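<p>In Go, the language used for the examples at the end of this section, these blocking and closing rules are easy to observe: a buffered channel can be filled and closed, receivers drain the remaining messages, and after that every receive completes immediately with a zero value instead of blocking. A small illustrative sketch:</p>
<pre><code class="swift">
package main

import "fmt"

func main() {
    jobs := make(chan int, 3) // buffered: senders block only when full

    // Fill the buffer, then close the channel: no more sends are
    // allowed, but the buffered messages remain receivable.
    for i := 1; i <= 3; i++ {
        jobs <- i
    }
    close(jobs)

    // range drains the buffer and exits once the channel is both
    // closed and empty, so no receiver stays blocked.
    for j := range jobs {
        fmt.Println("received", j)
    }

    // A two-value receive distinguishes "closed" from a real message.
    v, ok := <-jobs
    fmt.Println(v, ok) // zero value and false after close
}
</code></pre>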
<p>Implementations of channels usually have a rich API that allows handling messages from different channels in a single reception block, and they are usually paired with some form of lightweight threads, either coroutines or green threads, since the two features work quite well together and make the system more scalable.</p>
<p>A few Swift implementations for CSP channels, each one with a different API and a different set of functionalities, are already available: <a href="https://github.com/Zewo/Venice">Venice</a>, <a href="https://github.com/typelift/Concurrent">Concurrent</a> and <a href="https://github.com/tidwall/Safe">Safe</a>.</p>
<p>The only library that also provides coroutines is <a href="https://github.com/Zewo/Venice">Venice</a>, but on the other hand it has a channel API a bit different from what you’d normally expect and it is not a pure-Swift framework. As far as I know, there are no Swift implementations of green threads.</p>
<p>The Go programming language is probably the reason behind the recent surge in popularity of CSP channels, since channels and goroutines (Go’s implementation of coroutines) are the cornerstone of its approach to concurrency.</p>
<p>Considering that every Swift implementation available at the moment lacks something (Safe is Swift 2 only, Concurrent and Venice do not support the <em>select</em> statement), I’ll present a few examples in Go, which has a clean and minimalistic API for both channels and goroutines. But I still recommend checking out those libraries, since what they offer could be enough for your use case.</p>
<p>We’ll create a simple program that defines a channel and uses it to send a string between a goroutine and the main thread. The main thread will wait for the string using the channel and it will just print it once it’s available.</p>
<pre><code class="swift">
package main

import "fmt"
import "time"

func main() {
    messages := make(chan string)

    go func() {
        time.Sleep(time.Second)
        messages <- "Hello!"
    }()

    msg := <-messages
    fmt.Println(msg)
}
</code></pre>
</code></pre>
<p>In the <code class="inlinecode">main</code> function, the entry point of the program, we define an unbuffered channel of strings with the <code class="inlinecode">make</code> function.</p>
<p>What follows is the declaration of a goroutine that will execute an anonymous closure responsible for sending a string through the channel.</p>
<p>As we said in the <a href="#coroutines-and-green-threads">Coroutines and Green Threads</a> section, goroutines are just lightweight threads; we could happily create hundreds of thousands of them.</p>
<p>After sleeping for a second, the goroutine will send a string message through the channel using the <code class="inlinecode"><-</code> operator. Depending on what is on the right side of the operator, it send or receive a message through a channel.</p>
<p>The main thread will block trying to extract a string from the messages channel until the goroutine will send one. Once it gets a hold of one, it will print it.</p>
<p>Optionally, channels can be buffered and be able to store more than one object and they can also be restricted to a specific direction (send-only or receive-only).</p>
<p>They can be used in different ways to coordinate work between a chain of goroutines or to send back results from a background worker goroutine that had some task to execute.</p>
<p>Since most implementations will be based on multiple channels coordinating multiple goroutines, each one with a different job to perform, a construct exists to handle messages from more than one channel in a single place: the <code class="inlinecode">select</code> block.</p>
<pre><code class="go">
package main

import (
    "fmt"
    "time"
)

func main() {
    ch1 := make(chan string, 3)
    ch2 := make(chan string, 3)
    done := make(chan bool)

    go func() {
        for i := 0; i < 10; i++ {
            ch1 <- "Message from 1"
            time.Sleep(time.Second * 1)
        }
    }()

    go func() {
        for i := 0; i < 10; i++ {
            ch2 <- "Message from 2"
            time.Sleep(time.Second * 2)
        }
        done <- true
    }()

    go func() {
        for {
            select {
            case msg1 := <-ch1:
                fmt.Println(msg1)
            case msg2 := <-ch2:
                fmt.Println(msg2)
            }
        }
    }()

    <-done
}
</code></pre>
<p>This time we have three anonymous closures executed in as many goroutines.</p>
<p>The first two will send a string and wait, with the slower second goroutine sending a boolean through a boolean channel when it is done with its strings. This channel is used as a completion signal by the main thread (not a perfect implementation).</p>
<p>The third goroutine receives the messages, continuously checking the two string channels, and prints everything it receives. The select block allows us to wait on more than one channel and receive all the messages from the channels regardless of the order in which they are sent.</p>
<p>Considering that each wait on a channel would otherwise have been blocking, this construct <em>is essential</em>; without it, channels would be a lot less useful.</p>
<p>Proponents of actors and CSP are members of two opposing schools of thought, so you’ll rarely find someone advocating the use of both approaches in a language.</p>
<p>Channels have great performance characteristics and are way simpler than actors, and that can sometimes lead to simple and clean architectures.</p>
<p>But in the end they are just blocking queues; they don’t solve any of the concurrency problems described in the first part of the series, and you’ll have to use them as building blocks of more complex mechanisms on your own.</p>
<p>While CSP channels introduce a lot of flexibility compared to the <em>one message queue per task</em> approach of the Actor model, and allow you to build a network of lightweight processes connected by multiple communication channels, they do not have much to say with regard to fault tolerance.</p>
<p>You will not have anything analogous to the actor model’s independent software components that can be restarted at will.</p>
<p>But probably the greatest weakness of CSP channels is that you’ll rarely be able to structure your program around the use of channels alone without creating convoluted hierarchies of channels.</p>
<p>In most cases, you’ll still need to resort to the basic locks seen in part one.</p>
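<p>To make the “just blocking queues” point concrete, here is a minimal sketch of an unbounded channel built in Swift on top of <code class="inlinecode">NSCondition</code> (the <code class="inlinecode">Channel</code> type is hypothetical and not part of any of the libraries above):</p>
<pre><code class="Swift">
import Foundation

// A minimal blocking channel: send() never blocks (unbounded buffer),
// receive() blocks the calling thread until a value is available.
class Channel<T> {
    private let cond = NSCondition()
    private var buffer: [T] = []

    func send(_ value: T) {
        cond.lock()
        buffer.append(value)
        cond.signal() // wake up one waiting receiver
        cond.unlock()
    }

    func receive() -> T {
        cond.lock()
        while buffer.isEmpty { // guard against spurious wakeups
            cond.wait()
        }
        let value = buffer.removeFirst()
        cond.unlock()
        return value
    }
}

let messages = Channel<String>()
let t = Thread {
    messages.send("Hello!")
}
t.start()
print(messages.receive()) // blocks until the background thread sends
</code></pre>
<p>Everything missing from this sketch, from buffering limits to <em>select</em>, is exactly the kind of mechanism you would have to build on top of it yourself.</p>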
<h2 id="swift-where-are-we-now">Swift: Where are we now</h2>
<p>Discussions on the future of concurrency on swift-evolution have yet to start, and the community will likely introduce the first proposals a few months after the release of Swift 4 (assuming that improving concurrency will be one of the goals of Swift 5).</p>
<p>So, until the end of 2017 it’s unlikely that discussions around concurrency will start.</p>
<p>But when it happens, I expect heated discussions with lots of input from the community, on a scale similar to what happened when discussing the access keywords or SE-110.</p>
<p>For now, in addition to the already cited <a href="https://github.com/apple/swift/blob/master/docs/OwnershipManifesto.md">Ownership Manifesto</a>, the only document that touches the subject is this <a href="https://github.com/apple/swift/blob/master/docs/proposals/Concurrency.rst">unmaintained draft</a> that contains quite a few interesting ideas.</p>
<p>The document outlines some of the challenges involved in making concurrency more safe and discusses possible approaches to implement Actors, CSP channels and Async/await.</p>
<p><strong>Update 8/17</strong></p>
<p>Chris Lattner wrote his thoughts on concurrency in a <a href="https://gist.github.com/lattner/31ed37682ef1576b16bca1432ea9f782">manifesto</a> that discusses all the topics of this post; check out his take on it. He and Joe Groff also published a <a href="https://gist.github.com/lattner/429b9070918248274f25b714dcfc7619">proposal for async/await</a>.</p>
<h2 id="a-few-words-on-kotlin">A few words on Kotlin</h2>
<p>Now let’s talk about concurrency in Kotlin, the de-facto counterpart of Swift on Android.</p>
<p>When using Java interoperability, Kotlin inherits all the concurrency functionality available in Java, from threads and locks to thread pools and concurrent data structures. But since one day Kotlin might not have an underlying JVM to depend on, a new, alternative approach to concurrency is in development.</p>
<p>I’m referring to an experimental implementation of some of the paradigms described in the previous sections based on <a href="https://kotlinlang.org/docs/reference/coroutines.html#experimental-status-of-coroutines">coroutines</a>, that at the moment are distributed as an external <a href="https://github.com/Kotlin/kotlinx.coroutines">kotlinx.coroutines</a> library.</p>
<p>The coroutines library <a href="https://github.com/Kotlin/kotlinx.coroutines/blob/master/coroutines-guide.md#composing-suspending-functions">contains implementations</a> of async/await, generators (which I didn’t describe in this article, but which have a structural similarity with Swift’s generators) and channels with support for select.</p>
<p>Swift could follow the same route: instead of choosing one paradigm in particular, multiple options could be offered, like in the proposal linked in the previous section, and they could be built on top of some form of lightweight threads.</p>
<h2 id="closing-thoughts">Closing Thoughts</h2>
<p>This article should have given you a good overview of the most popular paradigms and enough pointers to experiment and learn further.</p>
<p>We’ve discussed the basic principles behind Promises, Await/Async, STM, Actors and CSP, paradigms that could have a place in the future of Swift, maybe (and I realize I’ve said this more than once) built leveraging some lightweight mechanism like coroutines or green threads.</p>
<p>And while it’s too soon to speculate which new features will be added to Swift and when, we can all be sure of one thing: concurrency is coming.</p>
All about Concurrency in Swift - Part 1: The Present (2017-05-07) https://www.uraimo.com/2017/05/07/all-about-concurrency-in-swift-1-the-present
There are a few alternatives for handling concurrency in Swift; this article discusses everything you have at your disposal right now, preparing the ground for the next part of this series, which will cover what is likely to come next.<p><strong>Update 10/17:</strong> <em>Updated for Swift 4</em></p>
<p><strong>Update:</strong> <em>The second part of this series is now available: <a href="/2017/07/22/all-about-concurrency-in-swift-2-the-future/">All About concurrency in Swift - Part 2: The Future</a></em>.</p>
<p>The current release of the Swift language doesn’t yet include any native concurrency functionality like other modern languages such as Go or Rust do.</p>
<p>If you plan to perform tasks concurrently and need to deal with the resulting race conditions, your only option is to use external libraries like libDispatch or the synchronization primitives offered by Foundation or the OS.</p>
<p>In the first part of this series, we’ll take a look at what we have at our disposal with Swift 3, covering everything from Foundation locks, threads and timers to the language guarantees and the recently improved Grand Central Dispatch and Operation Queues.</p>
<p>Some basic concurrency theory and a few common concurrency patterns will also be described.</p>
<p><img src="/imgs/concurr.png" srcset="/imgs/concurr@2x.png 2x" alt="Example of klingon code with critical section" /></p>
<p>Even if they are available on every platform where Swift is available, the functions and primitives from the pthread library will not be discussed here, since higher-level alternatives exist for all of them.
The NSTimer class will also not be described here; take a look <a href="/swiftbites/nstimer-in-swift-3/">here</a> for info on how to use it with Swift 3.</p>
<p>As has already been announced multiple times, one of the major releases after Swift 4.0 (not necessarily Swift 5) will expand the language to better define the memory model and include new native concurrency features that will allow handling concurrency, and likely parallelism, without external libraries, defining a swifty, idiomatic approach to concurrency.</p>
<p>This will be the topic of the next article in this series, where we’ll discuss a few alternative approaches and paradigms implemented by other languages, how they could be implemented in Swift, and we’ll analyze a few open-source implementations already available today that let you use the Actors paradigm, Go’s CSP channels, Software Transactional Memory and more with the current release of Swift.</p>
<p>This second article will be completely speculative, and its main goal will be to give you an introduction to these subjects so that you’ll be able to participate in the, likely heated, discussions that will define how concurrency will be handled in future releases of Swift.</p>
<h4 id="the-playgrounds-for-this-and-other-articles-are-available-from-github-or-zipped"><em>The playgrounds for this and other articles are available from <a href="https://github.com/uraimo/Swift-Playgrounds">GitHub</a> or <a href="/archives/2017-05-07-ConcurrencyInSwift.playground.zip">Zipped</a>.</em></h4>
<h3 id="contents">Contents</h3>
<ul>
<li><a href="#multithreading-and-concurrency-primer">Multithreading and Concurrency Primer</a></li>
<li><a href="#language-guarantees">Language Guarantees</a></li>
<li><a href="#threads">Threads</a></li>
<li><a href="#synchronization-primitives">Synchronization Primitives</a>
<ul>
<li><a href="#nslock">NSLock</a></li>
<li><a href="#nsrecursivelock">NSRecursiveLock</a></li>
<li><a href="#nsconditionlock">NSConditionLock</a></li>
<li><a href="#nscondition">NSCondition</a></li>
<li><a href="#nsdistributedlock">NSDistributedLock</a></li>
<li><a href="#osatomic-where-art-thou">OSAtomic Where Art Thou?</a></li>
<li><a href="#on-synchronized-blocks">On Synchronized Blocks</a></li>
</ul>
</li>
<li><a href="#gcd-grand-central-dispatch">GCD: Grand Central Dispatch</a>
<ul>
<li><a href="#dispatch-queues">Dispatch Queues</a></li>
<li><a href="#using-queues">Using Queues</a></li>
<li><a href="#barriers">Barriers</a></li>
<li><a href="#singletons-and-dispatch_once">Singletons and Dispatch_once</a></li>
<li><a href="#dispatch-groups">Dispatch Groups</a></li>
<li><a href="#dispatch-work-items">Dispatch Work Items</a></li>
<li><a href="#dispatch-semaphores">Dispatch Semaphores</a></li>
<li><a href="#dispatch-assertions">Dispatch Assertions</a></li>
<li><a href="#dispatch-sources">Dispatch Sources</a></li>
</ul>
</li>
<li><a href="#operations-and-operationqueues">Operations and OperationQueues</a></li>
<li><a href="#closing-thoughts">Closing Thoughts</a></li>
</ul>
<h2 id="multithreading-and-concurrency-primer">Multithreading and Concurrency Primer</h2>
<p>Nowadays, regardless of what kind of application you are building, sooner or later you’ll have to consider the fact that your application will be running in an environment with multiple threads of execution.</p>
<p>Computing platforms with more than one processor, or processors with more than one hardware execution core, have been around for a few decades, and concepts like <em>thread</em> and <em>process</em> are even older than that.</p>
<p>Operating systems have been exposing these capabilities to user programs in various ways, and every modern framework or application will likely implement a few well-known design patterns involving multiple threads to improve flexibility and performance.</p>
<p>Before we start to delve into the specifics of how to deal with concurrency in Swift, let me briefly explain a few basic concepts that you need to know before considering whether you should use <em>Dispatch Queues</em> or <em>Operation Queues</em>.</p>
<p>First of all, you could ask: even if Apple’s platforms and frameworks use threads, why should you introduce them in your applications?</p>
<p>There are a few common circumstances that make the use of multiple threads a no-brainer:</p>
<ul>
<li>
<p><strong>Task group separation</strong>: Threads can be used to modularize your application from the point of view of execution flow. Different threads can be used to execute, in a predictable manner, a group of tasks of the same type, isolating them from the execution flow of other parts of your program and making it easier to reason about the current state of your application.</p>
</li>
<li>
<p><strong>Parallelize data-independent computations</strong>: Multiple software threads, backed by hardware threads or not (see the next point), can be used to parallelize multiple copies of the same task, each operating on a subset of an original input data structure.</p>
</li>
<li>
<p><strong>Clean way to wait for conditions or I/O</strong>: When using blocking I/O or when performing other kinds of blocking operations, background threads can be used to cleanly wait for the completion of these operations. The use of threads can improve the overall design of your application and make handling blocking calls trivial.</p>
</li>
</ul>
<p>But when multiple threads are executing your application code, a few assumptions that made sense when looking at your code from the point of view of a single thread cease to be valid.</p>
<p>In an ideal world where each thread of execution behaves independently and there is no sharing of data, concurrent programming is actually not much more complex than writing code that will be executed by a single thread. But if, as often happens, you plan to have multiple threads operating on the same data, you’ll need a way to regulate access to those data structures and to guarantee that every operation on that data completes as expected, without unwanted interactions with operations from other threads.</p>
<p>Concurrent programming requires additional guarantees from the language and the operating system, which need to explicitly state how variables (or, more generically, “resources”) will behave when multiple threads try to alter their value by accessing them at the same time.</p>
<p>The language needs to define a <em>Memory Model</em>, a set of rules that explicitly states how some basic statements will behave in the presence of concurrent threads, defining how memory can be shared and which kinds of memory accesses are valid.</p>
<p>Thanks to this, the user will have a language that behaves predictably in the presence of threads, and we’ll know that the compiler will only perform optimizations that respect what has been defined in the memory model.</p>
<p>Defining a memory model is a delicate step in the evolution of a language, since a model that is too strict could limit how the compiler will be allowed to evolve: new clever optimizations could be invalid as a consequence of past decisions on the memory model.</p>
<p>The memory model defines for example:</p>
<ul>
<li>
<p>Which language statements can be considered <em>atomic</em> and which are not, atomic operations being those that can be executed only as a whole, with no thread able to see partial results. It’s essential to know, for example, whether variables are initialized atomically or not.</p>
</li>
<li>
<p>How shared variables are handled by threads, whether they are cached by default, and whether it is possible to influence the caching behaviour with specific language modifiers.</p>
</li>
<li>
<p>Concurrency operators that are used to mark and regulate access to <em>critical sections</em> (sections of code that operate on shared resources), allowing, for example, only one thread at a time to follow a specific code path.</p>
</li>
</ul>
<p>Now let’s go back to discussing the use of concurrency in your programs.</p>
<p>To handle concurrency correctly, you’ll have to identify the <em>critical sections</em> in your program and use concurrency primitives or concurrency-aware data structures to regulate access to data shared among different threads.</p>
<p>Imposing access rules on these sections of code or data structures opens the way to another set of problems, deriving from the fact that while the desired outcome is that every thread gets executed and has a chance to modify the shared data, under some circumstances some threads might not execute at all, or the data could be altered in unexpected and unpredictable ways.</p>
<p>You’ll face an additional set of challenges and you’ll have to work around some common problems:</p>
<ul>
<li>
<p><strong>Race conditions</strong>: With multiple threads operating on the same data, for example reading and writing it concurrently, the outcome of the execution of a series of operations could become unpredictable and dependent on the order of execution of the threads.</p>
</li>
<li>
<p><strong>Resource contention</strong>: Multiple threads, possibly performing different tasks, trying to access the same resources will increase the amount of time needed to obtain those resources safely. These delays could lead to unexpected behaviour, or could require that you structure your program to regulate access to these resources.</p>
</li>
<li>
<p><strong>Deadlocks</strong>: Multiple threads waiting forever for each other to release the resources/locks they need, blocking the execution of that group of threads.</p>
</li>
<li>
<p><strong>Starvation</strong>: A thread might never be able to acquire the resource it needs (or a set of resources in a specific order), and could keep trying forever, unsuccessfully.</p>
</li>
<li>
<p><strong>Priority Inversion</strong>: A thread with lower priority could keep acquiring resources needed by a thread with higher priority, effectively inverting the priority assigned by the system.</p>
</li>
<li>
<p><strong>Non-determinism and Fairness</strong>: We can’t make assumptions about when and in what order a thread will be able to acquire a shared resource; this delay <a href="https://en.wikipedia.org/wiki/Unbounded_nondeterminism">cannot be determined a priori</a> and is heavily influenced by the amount of contention. A thread might even never be able to acquire a resource. But concurrency primitives used to guard a critical section can also be built to be <em>fair</em>, or to support <em>fairness</em>, guaranteeing access to the critical section to all the waiting threads while also respecting the request order.</p>
</li>
</ul>
<h2 id="language-guarantees">Language Guarantees</h2>
<p>Even if at the moment the Swift language itself doesn’t have features related to concurrency, it still offers some guarantees related to how properties are accessed.</p>
<p>Global variables, for example, are initialized atomically: we will never need to manually handle the case in which multiple threads try to initialize the same global variable concurrently, or worry that someone could see a partially initialized variable while initialization is still ongoing.</p>
<p>We’ll discuss this behaviour again when talking about implementing singletons below.</p>
<p>But it’s important to remember that lazy property initialization is instead not performed atomically, and the language for now does not provide annotations or modifiers to change this behaviour.</p>
<p>Access to class properties is likewise not atomic, and if you need to make it so, you’ll have to implement exclusive access manually, using locks or similar mechanisms.</p>
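<p>A minimal sketch of that manual approach, using a hypothetical <code class="inlinecode">Counter</code> class whose property is guarded by an NSLock:</p>
<pre><code class="Swift">
import Foundation

class Counter {
    private let lock = NSLock()
    private var _value = 0

    // Every read goes through the lock...
    var value: Int {
        lock.lock()
        defer { lock.unlock() }
        return _value
    }

    // ...and so does every mutation, making accesses to
    // the property effectively atomic.
    func increment() {
        lock.lock()
        defer { lock.unlock() }
        _value += 1
    }
}

let counter = Counter()
counter.increment()
print(counter.value) // 1
</code></pre>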
<h2 id="threads">Threads</h2>
<p>Foundation offers a Thread class, internally based on pthread, that can be used to create new threads and execute closures.</p>
<p>Threads can be created using the method <code class="inlinecode">detachNewThreadSelector:toTarget:withObject:</code> of the Thread class, or we can create a new thread by declaring a custom Thread subclass and overriding the <code class="inlinecode">main()</code> method:</p>
<pre><code class="Swift">
class MyThread: Thread {
    override func main() {
        print("Thread started, sleep for 2 seconds...")
        Thread.sleep(forTimeInterval: 2)
        print("Done sleeping, exiting thread")
    }
}
</code></pre>
<p>But since iOS 10 and macOS Sierra, it’s finally possible on all platforms to create a new thread using the initializer that takes the closure the thread will execute. All the examples in this article will still extend the base Thread class though, so that you don’t have to worry about having the right OS version to try them out.</p>
<pre><code class="Swift">
var t = Thread {
    print("Started!")
}
t.stackSize = 1024 * 16
t.start() // Time needed to spawn a thread: around 100us
</code></pre>
<p>Once we have a thread instance, we need to start it manually. As an optional step, we can also choose a custom stack size for the new thread.</p>
<p>Threads can be stopped abruptly by calling <code class="inlinecode">exit()</code>, but that’s never recommended, since it doesn’t give you the opportunity to cleanly end the current task. Most of the time you’ll implement the stopping logic yourself if you need it, or just use the <code class="inlinecode">cancel()</code> method and check the <code class="inlinecode">isCancelled</code> property inside your main closure to know if the thread is required to stop the current job before its natural end.</p>
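<p>A cooperative stopping logic based on <code class="inlinecode">cancel()</code> and <code class="inlinecode">isCancelled</code> could be sketched like this:</p>
<pre><code class="Swift">
import Foundation

class CancellableThread: Thread {
    override func main() {
        while !isCancelled { // check the flag between units of work
            // ...perform a small unit of work...
            Thread.sleep(forTimeInterval: 0.05)
        }
        print("Cancellation requested, cleaning up and exiting")
    }
}

let worker = CancellableThread()
worker.start()
Thread.sleep(forTimeInterval: 0.2)
worker.cancel() // only sets isCancelled, the thread exits on its own
</code></pre>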
<h2 id="synchronization-primitives">Synchronization Primitives</h2>
<p>When we have different threads that want to mutate shared data, it is essential to synchronize those threads in some way, to prevent data corruption and non-deterministic behavior.</p>
<p>The basic facilities usually used to synchronize threads are locks, semaphores and monitors.</p>
<p>Foundation provides all of them.</p>
<p>As you’ll see momentarily, the classes (yes, all of them are reference types) implementing these constructs <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0086-drop-foundation-ns.md#proposed-solution">have not lost the NS prefix</a> in Swift 3, but could in one of the next releases of Swift.</p>
<h3 id="nslock">NSLock</h3>
<p>NSLock is the basic type of lock that Foundation offers.</p>
<p>When a thread tries to lock this object, two things can happen: the thread will acquire the lock and proceed if the lock hasn’t already been acquired by another thread, or the thread will wait, blocking its execution, until the owner of the lock releases it. In other words, a lock is an object that can be acquired (or locked) by only one thread at a time, and this makes it perfect for guarding access to critical sections.</p>
<p>NSLock and the other Foundation locks are <em>unfair</em>, meaning that when a series of threads is waiting to acquire a lock, they will <strong>not</strong> acquire it in the same order in which they originally tried to lock it.</p>
<p>You can’t make assumptions about the execution order, and in cases of high thread contention, when many threads are trying to acquire the resource, some of your threads may be subject to <em>starvation</em> and never acquire the lock they are waiting for (or not acquire it in a timely fashion).</p>
<p>The time needed to acquire a lock without contention is measured in hundreds of nanoseconds, but that time grows rapidly when more than one thread tries to acquire the locked resource. So, from a performance point of view, locks are usually not the best solution for handling resource allocation.</p>
<p>Let’s see an example with two threads, and remember that since the order in which the lock is acquired is not deterministic, it could happen that T1 acquires the lock twice in a row (but that wouldn’t be the norm).</p>
<pre><code class="Swift">
let lock = NSLock()

class LThread: Thread {
    var id: Int = 0

    convenience init(id: Int) {
        self.init()
        self.id = id
    }

    override func main() {
        lock.lock()
        print(String(id) + " acquired lock.")
        lock.unlock()
        if lock.try() {
            print(String(id) + " acquired lock again.")
            lock.unlock()
        } else { // If already locked, move along.
            print(String(id) + " couldn't acquire lock.")
        }
        print(String(id) + " exiting.")
    }
}

var t1 = LThread(id: 1)
var t2 = LThread(id: 2)
t1.start()
t2.start()
</code></pre>
<p>Let me just add a word of warning for when you decide to use locks. Since it’s likely that sooner or later you’ll have to debug concurrency issues, always try to confine your use of locks within the bounds of some sort of data structure, and try not to refer directly to a single lock object in multiple places in your code base.</p>
<p>Checking the status of a synchronized data structure with few entry points while debugging a concurrency problem is way more pleasant than having to keep track of which part of your code is holding a lock while remembering the local status of multiple functions. Go the extra mile and structure your concurrent code well.</p>
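<p>For example, instead of referencing a shared lock from multiple places, you can hide both the lock and the data it guards inside a single type, sketched here as a hypothetical <code class="inlinecode">SynchronizedArray</code>:</p>
<pre><code class="Swift">
import Foundation

// The lock never leaks outside this type: every entry point
// acquires and releases it, so there is no lock state to track
// anywhere else in the code base.
class SynchronizedArray<T> {
    private let lock = NSLock()
    private var elements: [T] = []

    func append(_ element: T) {
        lock.lock()
        defer { lock.unlock() }
        elements.append(element)
    }

    var count: Int {
        lock.lock()
        defer { lock.unlock() }
        return elements.count
    }
}

let shared = SynchronizedArray<Int>()
shared.append(42)
print(shared.count) // 1
</code></pre>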
<h3 id="nsrecursivelock">NSRecursiveLock</h3>
<p>Recursive locks can be acquired multiple times by the thread that already holds the lock. This is useful in recursive functions, or when calling in sequence multiple functions that acquire the same lock. It <strong>would not work</strong> with a basic NSLock.</p>
<pre><code class="Swift">
let rlock = NSRecursiveLock()

class RThread: Thread {
    override func main() {
        rlock.lock()
        print("Thread acquired lock")
        callMe()
        rlock.unlock()
        print("Exiting main")
    }

    func callMe() {
        rlock.lock()
        print("Thread acquired lock")
        rlock.unlock()
        print("Exiting callMe")
    }
}

var tr = RThread()
tr.start()
</code></pre>
<h3 id="nsconditionlock">NSConditionLock</h3>
<p>Condition locks provide additional sub-locks that can be locked/unlocked independently from each other, to support more complex locking setups (e.g. consumer-producer scenarios).</p>
<p>A global lock (that locks regardless of a specific condition) is also available and behaves like a classic NSLock.</p>
<p>Let’s see a simple example with a lock that guards a shared integer, which a consumer prints and a producer updates every time it has been shown on screen.</p>
<pre><code class="Swift">
let NO_DATA = 1
let GOT_DATA = 2
let clock = NSConditionLock(condition: NO_DATA)
var SharedInt = 0

class ProducerThread: Thread {
    override func main() {
        for i in 0..<5 {
            clock.lock(whenCondition: NO_DATA) // Acquire the lock when NO_DATA
            // If we didn't have to wait for consumers we could have just used clock.lock()
            SharedInt = i
            clock.unlock(withCondition: GOT_DATA) // Unlock and set as GOT_DATA
        }
    }
}

class ConsumerThread: Thread {
    override func main() {
        for i in 0..<5 {
            clock.lock(whenCondition: GOT_DATA) // Acquire the lock when GOT_DATA
            print(i)
            clock.unlock(withCondition: NO_DATA) // Unlock and set as NO_DATA
        }
    }
}

let pt = ProducerThread()
let ct = ConsumerThread()
ct.start()
pt.start()
</code></pre>
<p>When creating the lock, we need to specify the starting condition, represented by an integer.</p>
<p>The <code class="inlinecode">lock(whenCondition:)</code> method will acquire the lock if the condition is met, or will wait until another thread sets that condition while releasing the lock with <code class="inlinecode">unlock(withCondition:)</code>.</p>
<p>A small improvement over basic locks that allows us to model slightly more complex scenarios.</p>
<h3 id="nscondition">NSCondition</h3>
<p>Not to be confused with condition locks, a condition provides a clean way to wait for a <em>condition</em> to occur.</p>
<p>When a thread that has acquired a lock verifies that an additional condition it needs to perform its work (some resource, another object being in a particular state, etc.) is not met, it needs a way to be put on hold and then resume once the condition is met.</p>
<p>This could be implemented by continuously or periodically checking for that condition (busy waiting), but in doing so, what would happen to the locks the thread holds? Should we keep them while we wait, or release them and hope we’ll be able to acquire them again when the condition is met?</p>
<p>Conditions provide a clean solution to this problem: a thread that has acquired the condition’s lock can be put on a <em>waiting</em> list for that condition and is woken up once another thread <em>signals</em> that the condition has been met.</p>
<p>Let’s see an example:</p>
<pre><code class="Swift">
let cond = NSCondition()
var available = false
var SharedString = ""

class WriterThread: Thread {
    override func main() {
        for _ in 0..<5 {
            cond.lock()
            SharedString = "😅"
            available = true
            cond.signal() // Notify and wake up the waiting thread/s
            cond.unlock()
        }
    }
}

class PrinterThread: Thread {
    override func main() {
        for _ in 0..<5 { // Just do it 5 times
            cond.lock()
            while !available { // Protect from spurious signals
                cond.wait()
            }
            print(SharedString)
            SharedString = ""
            available = false
            cond.unlock()
        }
    }
}

let writet = WriterThread()
let printt = PrinterThread()
printt.start()
writet.start()
</code></pre>
<h3 id="nsdistributedlock">NSDistributedLock</h3>
<p>Distributed locks are quite different from what we’ve seen until now and I don’t expect that you’ll need them frequently.</p>
<p>They are made to be shared between multiple applications and are backed by an entry on the file system (for example, a simple file). The file system will obviously need to be accessible by all the applications that need to acquire the lock.</p>
<p>This kind of lock is acquired using the <code class="inlinecode">try()</code> method, a non-blocking method that returns right away with a boolean indicating whether the lock was acquired. Acquiring the lock will usually require more than one attempt, performed manually with a proper delay between successive tries.</p>
<p>Distributed locks are released as usual using the <code class="inlinecode">unlock()</code> method.</p>
<p>Let’s see a basic example:</p>
<pre><code class="Swift">
var dlock = NSDistributedLock(path: "/tmp/MYAPP.lock")

if let dlock = dlock {
    var acquired = false
    while !acquired {
        print("Trying to acquire the lock...")
        usleep(1000)
        acquired = dlock.try()
    }
    // Do something...
    dlock.unlock()
}
</code></pre>
<h3 id="osatomic-where-art-thou">OSAtomic Where Art Thou?</h3>
<p>Atomic operations like those that were provided by <a href="https://www.mikeash.com/pyblog/friday-qa-2011-03-04-a-tour-of-osatomic.html">OSAtomic</a> are simple operations that allow you to set, get or compare-and-set variables without the classic locking logic, because they leverage specific CPU features (sometimes native atomic instructions) and provide far better performance than the locks described previously.</p>
<p>It goes without saying that they are extremely useful for building concurrent data structures, since the overhead needed to handle concurrency is reduced to a minimum.</p>
<p>OSAtomic has been deprecated since macOS 10.12 and was never available on Linux, but a few open source projects, like <a href="https://github.com/glessard/swift-atomics">this one</a> with its useful Swift extensions or <a href="https://github.com/bignerdranch/AtomicSwift">this one</a>, provide similar functionality. Also, check out the recently released <a href="https://github.com/macmade/AtomicKit">AtomicKit</a>.</p>
<h3 id="on-synchronized-blocks">On Synchronized Blocks</h3>
<p>In Swift you can’t create a @synchronized block out of the box as you would do in Objective-C, since there is no equivalent keyword available.</p>
<p>On Darwin, with a bit of code, you could roll out something similar to the original implementation of @synchronized using <code class="inlinecode">objc_sync_enter(OBJ)</code> and <code class="inlinecode">objc_sync_exit(OBJ)</code> to enter and exit an @objc object’s monitor, as @synchronized does under the hood. But this is not recommended: if you need something like that, it’s better to simply use a lock, which is more versatile.</p>
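<p>If you still want a @synchronized-like construct, a portable sketch can be built on top of an <code class="inlinecode">NSLock</code>. The <code class="inlinecode">synchronized</code> helper below is our own name, not a standard API:</p>

```swift
import Foundation

// Sketch of a @synchronized-like helper: run `body` while holding `lock`.
// `defer` guarantees the lock is released even if `body` throws.
func synchronized<T>(_ lock: NSLock, _ body: () throws -> T) rethrows -> T {
    lock.lock()
    defer { lock.unlock() }
    return try body()
}

let lock = NSLock()
var v = 0
synchronized(lock) {
    v += 1
}
print(v) // 1
```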
<p>And as we’ll see when describing Dispatch Queues, we can use queues to replicate this functionality with even less code performing a synchronous call on a serial queue:</p>
<pre><code class="Swift">
serialQueue.sync {
    // Only one thread at a time!
    v += 1
    print("Current value \(v)")
}
</code></pre>
<h4 id="the-playgrounds-for-this-and-other-articles-are-available-from-github-or-zipped-1"><em>The playgrounds for this and other articles are available from <a href="https://github.com/uraimo/Swift-Playgrounds">GitHub</a> or <a href="/archives/2017-05-07-ConcurrencyInSwift.playground.zip">Zipped</a>.</em></h4>
<h2 id="gcd-grand-central-dispatch">GCD: Grand Central Dispatch</h2>
<p>For those not already familiar with this API: Grand Central Dispatch (GCD) is a queue-based API that allows you to execute closures on pools of worker threads.</p>
<p>In other words, closures containing a job that needs to be executed can be added to a queue, which will execute them using a series of threads, either sequentially or in parallel depending on the queue’s configuration options. But regardless of the type of queue, jobs will always be started in <em>First-In First-Out</em> order, meaning that jobs are always started respecting the insertion order; the completion order will depend on the duration of each job.</p>
<p>This is a common pattern that can be found in nearly every relatively modern language runtime that handles concurrency: a thread pool is far easier to manage, inspect and control than a series of free and unconnected threads.</p>
<p>The GCD API had a few changes in Swift 3, <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0088-libdispatch-for-swift3.md">SE-0088</a> modernized its design and made it more object oriented.</p>
<h3 id="dispatch-queues">Dispatch Queues</h3>
<p>The GCD allows the creation of custom queues but also provide access to some predefined system queues.</p>
<p>To create a basic serial queue, a queue that will execute your closures sequentially, you just need to provide a string label that will identify it; it’s usually recommended to use a reverse-order domain name prefix to make it easier to track down the owner of the queue in stack traces.</p>
<pre><code class="Swift">
let serialQueue = DispatchQueue(label: "com.uraimo.Serial1") //attributes: .serial
let concurrentQueue = DispatchQueue(label: "com.uraimo.Concurrent1", attributes: .concurrent)
</code></pre>
<p>The second queue we created is concurrent, meaning that the queue will use all the available threads in its underlying thread pool when executing the jobs it contains. Order of execution is unpredictable in this case; don’t assume that the completion order of your closures will be in any way related to the insertion order.</p>
<p>The default queues can be retrieved from the <code class="inlinecode">DispatchQueue</code> object:</p>
<pre><code class="Swift">
let mainQueue = DispatchQueue.main
let globalDefault = DispatchQueue.global()
</code></pre>
<p>The <em>main</em> queue is the sequential main queue that handles the <em>main event loop</em> for graphical applications on either iOS or macOS, responding to events and updating the user interface. As we know, every alteration to the user interface should be performed on this queue and every long operation performed on this thread will render the user interface less responsive.</p>
<p>The runtime also provides access to other global queues with different priorities that can be identified by their <em>Quality of Service (Qos)</em> parameter.</p>
<p>The different levels of priority are declared in the <code class="inlinecode">DispatchQoS.QoSClass</code> enum, from highest to lowest:</p>
<ul>
<li>.userInteractive</li>
<li>.userInitiated</li>
<li>.default</li>
<li>.utility</li>
<li>.background</li>
<li>.unspecified</li>
</ul>
<p>It’s important to note that on mobile devices that provide a low power mode, <a href="https://mjtsai.com/blog/2017/04/03/beware-default-qos/">background queues will be suspended</a> when the battery is running low.</p>
<p>To retrieve a specific default global queue, use the <code class="inlinecode">global(qos:)</code> getter specifying the desired priority:</p>
<pre><code class="Swift">
let backgroundQueue = DispatchQueue.global(qos: .background)
</code></pre>
<p>The same priority specifier can be used, with or without other attributes, when creating custom queues:</p>
<pre><code class="Swift">
let serialQueueHighPriority = DispatchQueue(label: "com.uraimo.SerialH", qos: .userInteractive)
</code></pre>
<h3 id="using-queues">Using Queues</h3>
<p>Jobs, in the form of closures, can be submitted to a queue in two ways: <em>synchronously</em> using the <code class="inlinecode">sync</code> method or <em>asynchronously</em> with the <code class="inlinecode">async</code> method.</p>
<p>When using the former, the call to <code class="inlinecode">sync</code> blocks: it returns only once its closure has completed (useful when you need to wait for the closure to finish, but there are better approaches), whereas the latter adds the closure to the queue and returns immediately, scheduling the closure for deferred execution and allowing the current function to continue.</p>
<p>Let’s see a quick example:</p>
<pre><code class="Swift">
globalDefault.async {
    print("Async on globalDefault, first?")
}
globalDefault.sync {
    print("Sync on globalDefault, second?")
}
</code></pre>
<p>Multiple dispatch calls can be nested, for example when, after some low-priority background operation executed on a queue of our choosing, we need to update the user interface from the main queue.</p>
<pre><code class="Swift">
DispatchQueue.global(qos: .background).async {
    // Some background work here
    DispatchQueue.main.async {
        // It's time to update the UI
        print("UI updated on main queue")
    }
}
</code></pre>
<p>Closures can also be executed after a specific delay. Swift 3 finally allows you to specify the desired time interval in a more comfortable way with the utility enum <code class="inlinecode">DispatchTimeInterval</code>, which lets you compose intervals using four time units: <code class="inlinecode">.seconds(Int)</code>, <code class="inlinecode">.milliseconds(Int)</code>, <code class="inlinecode">.microseconds(Int)</code> and <code class="inlinecode">.nanoseconds(Int)</code>.</p>
<p>To schedule a closure for future execution use the <code class="inlinecode">asyncAfter(deadline:execute:)</code> method with a time interval:</p>
<pre><code class="Swift">
globalDefault.asyncAfter(deadline: .now() + .seconds(5)) {
    print("After 5 seconds")
}
</code></pre>
<p>If you need to execute multiple iterations of the same closure concurrently (as you used to do with <em>dispatch_apply</em>) you can use the <code class="inlinecode">concurrentPerform(iterations:execute:)</code> method. But beware: the closures will be executed concurrently <em>if possible in the context of the current queue</em>, so remember to always enclose a call to this method in a sync or async call running on a queue that supports concurrency.</p>
<pre><code class="Swift">
globalDefault.sync {
    DispatchQueue.concurrentPerform(iterations: 5) {
        print("\($0) times")
    }
}
</code></pre>
<p>While normally a queue is ready to process its closures upon creation, it can be configured to start in an idle state and to start processing jobs only when manually enabled.</p>
<pre><code class="Swift">
let inactiveQueue = DispatchQueue(label: "com.uraimo.inactiveQueue",
                                  attributes: [.concurrent, .initiallyInactive])
inactiveQueue.async {
    print("Done!")
}
print("Not yet...")
inactiveQueue.activate()
print("Gone!")
</code></pre>
<p>This is the first time we need to specify more than one attribute, but as you can see, you can just add multiple attributes with an array if needed.</p>
<p>Execution of jobs can also be suspended or resumed temporarily with methods inherited from <code class="inlinecode">DispatchObject</code>:</p>
<pre><code class="Swift">
inactiveQueue.suspend()
inactiveQueue.resume()
</code></pre>
<p>A <code class="inlinecode">setTarget(queue:)</code> method is also available; it must be used only to configure inactive queues, since calling it on an active queue results in a crash. Calling this method sets the priority of the queue to the priority of the queue passed as parameter.</p>
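<p>A hypothetical sketch of the intended usage (the queue label is ours): the queue is created inactive, retargeted, and only then activated.</p>

```swift
import Dispatch

// Retargeting an *inactive* queue before activation. Calling
// setTarget(queue:) on an already active queue traps at runtime.
let worker = DispatchQueue(label: "com.example.worker",
                           attributes: .initiallyInactive)
worker.setTarget(queue: DispatchQueue.global(qos: .utility))
worker.activate()
worker.sync {
    print("Running with the target queue's priority")
}
```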
<h3 id="barriers">Barriers</h3>
<p>Let’s say you added a series of closures, with different durations, to a specific queue, and that you now want to execute a job <em>only after</em> all the previous asynchronous tasks have completed. Barriers let you do that.</p>
<p>Let’s add 5 tasks (each sleeping for 1 second) to a concurrent queue and use a barrier to print something once the other jobs complete; we’ll do this specifying the flag <code class="inlinecode">DispatchWorkItemFlags.barrier</code> in our final <em>async</em> call:</p>
<pre><code class="Swift">
let concurrentQueue = DispatchQueue(label: "com.uraimo.Concurrent", attributes: .concurrent)
concurrentQueue.async {
    DispatchQueue.concurrentPerform(iterations: 5) { (id: Int) in
        sleep(1)
        print("Async on concurrentQueue, 5 times: " + String(id))
    }
}
concurrentQueue.async(flags: .barrier) {
    print("All 5 concurrent tasks completed")
}
</code></pre>
<p>The 5 tasks will be executed in parallel, without a specific order, by the concurrent queue, and you’ll see their messages appearing in groups of a size equal to the number of execution cores of your machine, but the final closure will always be executed last.</p>
<p>Barriers are a way to impose ordering on concurrent queues that normally don’t execute the registered tasks in a repeatable order.</p>
<p>As Arthur Hammer notes, it’s important to remember that dispatch barriers have no effect on serial queues <em>and on <a href="https://developer.apple.com/documentation/dispatch/1452797-dispatch_barrier_async?language=objc">any of the global concurrent queues</a></em>. You’ll need to define a new custom concurrent queue if you plan to use them.</p>
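<p>A classic use case for barriers on a custom concurrent queue is a reader-writer wrapper, where reads run concurrently and writes are submitted as barriers. The following is a sketch with hypothetical names, not a standard API:</p>

```swift
import Dispatch

// Reader-writer wrapper: concurrent reads, exclusive (barrier) writes.
final class SynchronizedDictionary {
    private var storage = [String: Int]()
    private let queue = DispatchQueue(label: "com.example.rwdict",
                                      attributes: .concurrent)

    func value(forKey key: String) -> Int? {
        // Reads run concurrently with other reads.
        return queue.sync { storage[key] }
    }

    func set(_ value: Int, forKey key: String) {
        // The barrier waits for in-flight reads, then runs exclusively.
        queue.async(flags: .barrier) {
            self.storage[key] = value
        }
    }
}

let dict = SynchronizedDictionary()
dict.set(1, forKey: "one")
print(dict.value(forKey: "one") ?? -1) // 1
```

Using <code class="inlinecode">async</code> for writes keeps the caller from blocking; a read submitted after the barrier is ordered behind it, so it observes the write.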
<h3 id="singletons-and-dispatch_once">Singletons and Dispatch_once</h3>
<p>As you may have already noticed, in Swift 3 there is no equivalent of <code class="inlinecode">dispatch_once</code>, a function that was used most of the time to build thread-safe singletons.</p>
<p>Luckily, Swift guarantees that global variables are initialized atomically and if you consider that constants can’t change their value after initialization, these two properties make global constants a great candidate to easily implement singletons:</p>
<pre><code class="Swift">
final class Singleton {
    public static let sharedInstance: Singleton = Singleton()
    private init() { }
    ...
}
</code></pre>
<p>We’ll declare the class as <code class="inlinecode">final</code> to deny the ability to subclass and we’ll make the designated initializer private, so that it will not be possible to manually create additional instances of this object. A public static constant will be the only entry point of the singleton and will be used to retrieve the single, shared, instance.</p>
<p>The same behaviour can be used to define blocks of code that will be executed only once:</p>
<pre><code class="Swift">
func runMe() {
    struct Inner {
        static let i: () = {
            print("Once!")
        }()
    }
    Inner.i
}
runMe()
runMe() // Constant already initialized
runMe() // Constant already initialized
</code></pre>
<p>It’s not really pretty to look at but it works, and it could be an acceptable implementation if it’s just a <em>one time thing™</em>.</p>
<p>But if we need to replicate exactly the functionality and API of <code class="inlinecode">dispatch_once</code>, we need to implement it from scratch, for example with an extension that serializes access through a queue, as described in the <a href="#on-synchronized-blocks">synchronized blocks section</a>:</p>
<pre><code class="Swift">
import Foundation
public extension DispatchQueue {
    private static var onceTokens = [Int]()
    private static var internalQueue = DispatchQueue(label: "dispatchqueue.once")

    public class func once(token: Int, closure: () -> Void) {
        internalQueue.sync {
            if onceTokens.contains(token) {
                return
            }
            onceTokens.append(token)
            closure()
        }
    }
}

let t = 1
DispatchQueue.once(token: t) {
    print("only once!")
}
DispatchQueue.once(token: t) {
    print("Two times!?")
}
DispatchQueue.once(token: t) {
    print("Three times!!?")
}
</code></pre>
<p>As expected, only the first of the three closures will be actually executed.</p>
<h3 id="dispatch-groups">Dispatch Groups</h3>
<p>If you have multiple tasks, even if added to different queues, and want to wait for their completion, you can group them in a dispatch group.</p>
<p>Let’s see an example. A task can be added to a specific group directly with the <em>sync</em> or <em>async</em> call:</p>
<pre><code class="Swift">
let mygroup = DispatchGroup()
for i in 0..<5 {
    globalDefault.async(group: mygroup) {
        sleep(UInt32(i))
        print("Group async on globalDefault: " + String(i))
    }
}
</code></pre>
<p>The tasks are executed on <code class="inlinecode">globalDefault</code>, but we can register a handler for <code class="inlinecode">mygroup</code>’s completion that will execute a closure on the queue we prefer once all of them have completed. The <code class="inlinecode">wait()</code> method can be used to perform a blocking wait instead.</p>
<pre><code class="Swift">
print("Waiting for completion...")
mygroup.notify(queue: globalDefault) {
    print("Notify received, done waiting.")
}
mygroup.wait()
print("Done waiting.")
</code></pre>
<p>Another way to track tasks with groups consists in manually entering and leaving a group, instead of specifying it when performing the call on the queue:</p>
<pre><code class="Swift">
for i in 0..<5 {
    mygroup.enter()
    sleep(UInt32(i))
    print("Group sync on MAINQ: " + String(i))
    mygroup.leave()
}
</code></pre>
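<p>Note that the snippet above runs everything synchronously. <code class="inlinecode">enter()</code> and <code class="inlinecode">leave()</code> become really useful when they bracket asynchronous work that the group cannot track by itself, as in this sketch:</p>

```swift
import Dispatch

// enter()/leave() pairs bracket asynchronous work started manually;
// every enter() must be balanced by exactly one leave().
let group = DispatchGroup()
let queue = DispatchQueue.global()

for i in 0..<3 {
    group.enter()
    queue.async {
        print("Working on \(i)")
        group.leave()
    }
}

group.wait() // blocks until every enter() has been matched by a leave()
print("All tasks done")
```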
<h3 id="dispatch-work-items">Dispatch Work Items</h3>
<p>Closures are not the only way to specify a job that needs to be executed by a queue, sometimes you might need a container type able to keep track of its execution status and for that we have <code class="inlinecode">DispatchWorkItem</code>. Every method that accepts a closure has a variant for work items.</p>
<p>Work items encapsulate a closure that can be submitted to a queue or executed directly, on the current thread, by invoking the <code class="inlinecode">perform()</code> method:</p>
<pre><code class="Swift">
let workItem = DispatchWorkItem {
    print("Done!")
}
workItem.perform()
</code></pre>
<p>And work items also provide other useful methods, like <code class="inlinecode">notify</code>, which, as with groups, allows a closure to be executed on a specific queue upon completion:</p>
<pre><code class="Swift">
workItem.notify(queue: DispatchQueue.main) {
    print("Notify on Main Queue!")
}
globalDefault.async(execute: workItem)
</code></pre>
<p>We can also wait until the closure has been executed or flag it for removal before the queue tries to execute it with the <code class="inlinecode">cancel()</code> method (that <em>does not</em> cancel closures during execution).</p>
<pre><code class="Swift">
print("Waiting for work item...")
workItem.wait()
print("Done waiting.")
workItem.cancel()
</code></pre>
<p>But it’s important to know that <code class="inlinecode">wait()</code> doesn’t just block the current thread waiting for completion but also <em>elevates</em> the priority of all the preceding work items in its queue, to try to complete this specific item as soon as possible.</p>
<h3 id="dispatch-semaphores">Dispatch Semaphores</h3>
<p>Dispatch Semaphores are locks that can be acquired by more than one thread depending on the current value of a counter.</p>
<p>Threads <code class="inlinecode">wait</code> on a semaphore when the counter, decremented every time the semaphore is acquired, reaches 0.</p>
<p>A slot to access the semaphore is released for the waiting threads calling <code class="inlinecode">signal</code> that has the effect of incrementing the counter.</p>
<p>Let’s see a simple example:</p>
<pre><code class="Swift">
let sem = DispatchSemaphore(value: 2)
// The semaphore will be held by groups of two pool threads
globalDefault.sync {
    DispatchQueue.concurrentPerform(iterations: 10) { (id: Int) in
        sem.wait(timeout: DispatchTime.distantFuture)
        sleep(1)
        print(String(id) + " acquired semaphore.")
        sem.signal()
    }
}
</code></pre>
<h3 id="dispatch-assertions">Dispatch Assertions</h3>
<p>Swift 3 introduces a new function to perform assertions on the current execution context, which lets you verify whether a closure is being executed on the expected queue. We can build predicates using the three enum cases of <code class="inlinecode">DispatchPredicate</code>: <code class="inlinecode">.onQueue</code>, to verify that we are on a specific queue, <code class="inlinecode">.notOnQueue</code>, to verify the opposite, and <code class="inlinecode">.onQueueAsBarrier</code>, to check whether the current closure or work item is acting as a barrier on a queue.</p>
<pre><code class="Swift">
dispatchPrecondition(condition: .notOnQueue(mainQueue))
dispatchPrecondition(condition: .onQueue(queue))
</code></pre>
<h4 id="the-playgrounds-for-this-and-other-articles-are-available-from-github-or-zipped-2"><em>The playgrounds for this and other articles are available from <a href="https://github.com/uraimo/Swift-Playgrounds">GitHub</a> or <a href="/archives/2017-05-07-ConcurrencyInSwift.playground.zip">Zipped</a>.</em></h4>
<h3 id="dispatch-sources">Dispatch Sources</h3>
<p>Dispatch Sources are a convenient way to handle system-level asynchronous events like kernel signals or system, file and socket related events using event handlers.</p>
<p>There are a few kinds of Dispatch Sources available, which can be grouped as follows:</p>
<ul>
<li><strong>Timer Dispatch Sources:</strong> <em>Used to generate events at a specific point in time or periodic events (DispatchSourceTimer).</em></li>
<li><strong>Signal Dispatch Sources:</strong> <em>Used to handle UNIX signals (DispatchSourceSignal).</em></li>
<li><strong>Memory Dispatch Sources:</strong> <em>Used to register for notifications related to the memory usage status (DispatchSourceMemoryPressure).</em></li>
<li><strong>Descriptor Dispatch Sources:</strong> <em>Used to register for different events related to files and sockets (DispatchSourceFileSystemObject, DispatchSourceRead, DispatchSourceWrite).</em></li>
<li><strong>Process dispatch sources:</strong> <em>Used to monitor external process for some events related to their execution state (DispatchSourceProcess).</em></li>
<li><strong>Mach related dispatch sources:</strong> <em>Used to handle events related to the <a href="http://fdiv.net/2011/01/14/machportt-inter-process-communication">IPC facilities</a> of the Mach kernel (DispatchSourceMachReceive, DispatchSourceMachSend).</em></li>
</ul>
<p>And you can also build your own dispatch sources if needed. All dispatch sources conform to the <code class="inlinecode">DispatchSourceProtocol</code> protocol that defines the basic operations required to register handlers and modify the activation state of the Dispatch Source.</p>
<p>Let’s see an example with <code class="inlinecode">DispatchSourceTimer</code> to understand how to use these objects.</p>
<p>Sources are created with the utility methods provided by <code class="inlinecode">DispatchSource</code>; in this snippet we’ll use <code class="inlinecode">makeTimerSource</code>, specifying the dispatch queue that we want to use to execute the handler.</p>
<p>Timer sources don’t have other parameters, so we just need to specify the queue to create one. As we’ll see, dispatch sources able to handle multiple kinds of events will usually require that you specify the identifier of the event you want to handle.</p>
<pre><code class="Swift">
let t = DispatchSource.makeTimerSource(queue: DispatchQueue.global())
t.setEventHandler{ print("!") }
t.scheduleOneshot(deadline: .now() + .seconds(5), leeway: .nanoseconds(0))
t.activate()
</code></pre>
<p>Once the Source is created, we register an event handler with <code class="inlinecode">setEventHandler(closure:)</code> and if no other configurations are required enable the dispatch source with <code class="inlinecode">activate()</code> (previous releases of libDispatch used the <em>resume()</em> method for this purpose).</p>
<p>Dispatch Sources are initially inactive, meaning that they will not start delivering events right away allowing further configuration. Once we are ready, the source can be activated with <code class="inlinecode">activate()</code> and if needed the event delivery can be temporarily suspended with <code class="inlinecode">suspend()</code> and resumed with <code class="inlinecode">resume()</code>.</p>
<p>Timer Sources require an additional step to configure which kind of timed events the object will deliver. In the example above we are defining a single event that will be delivered 5 seconds after the registration with a strict deadline.</p>
<p>We could have also configured the object to deliver periodic events, like we could have done with the <a href="/swiftbites/nstimer-in-swift-3/">Timer</a> object:</p>
<pre><code class="Swift">
t.scheduleRepeating(deadline: .now(), interval: .seconds(5), leeway: .seconds(1))
</code></pre>
<p>When we are done with a dispatch source and want to stop the delivery of events completely, we call <code class="inlinecode">cancel()</code>, which stops the event source, invokes the cancellation handler if one was set, and performs some final cleanup operations such as unregistering the handlers.</p>
<pre><code class="Swift">
t.cancel()
</code></pre>
<p>The API is still the same for the other dispatch source types, let’s see for example how <a href="https://github.com/IBM-Swift/Kitura-net/blob/master/Sources/KituraNet/IncomingSocketHandler.swift#L96">Kitura</a> initializes the read source it uses to handle asynchronous reads on an established socket:</p>
<pre><code class="Swift">
readerSource = DispatchSource.makeReadSource(fileDescriptor: socket.socketfd,
                                             queue: socketReaderQueue(fd: socket.socketfd))
readerSource.setEventHandler() {
    _ = self.handleRead()
}
readerSource.setCancelHandler(handler: self.handleCancel)
readerSource.resume()
</code></pre>
<p>The function <code class="inlinecode">handleRead()</code> will be called on a dedicated queue when new bytes become available in the socket’s incoming data buffer. Kitura also uses a <em>WriteSource</em> to perform buffered writes, using the dispatch source events <a href="https://github.com/IBM-Swift/Kitura-net/blob/master/Sources/KituraNet/IncomingSocketHandler.swift#L328">to efficiently pace the writes</a>, writing new bytes as soon as the socket channel is ready to send them. When doing I/O, read/write dispatch sources can be a good high-level alternative to the lower-level APIs you’d normally use on *nix platforms.</p>
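<p>To experiment with a read source without setting up sockets, a self-contained sketch can use a pipe instead (all names below are ours):</p>

```swift
import Dispatch
import Foundation

// A read source fires its event handler when bytes arrive on the
// read end of a pipe.
var fds: [Int32] = [0, 0]
_ = pipe(&fds)

let done = DispatchSemaphore(value: 0)
let readerSource = DispatchSource.makeReadSource(fileDescriptor: fds[0],
                                                 queue: DispatchQueue.global())
readerSource.setEventHandler {
    var buffer = [UInt8](repeating: 0, count: 16)
    let count = read(fds[0], &buffer, buffer.count)
    print("Read \(count) bytes")
    done.signal()
}
readerSource.activate()

// Writing to the other end of the pipe triggers the handler.
var message: [UInt8] = Array("hi".utf8)
_ = write(fds[1], &message, message.count)
done.wait()
readerSource.cancel()
```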
<p>And on the topic of dispatch sources related to files, another one that could be useful in some use cases is <code class="inlinecode">DispatchSourceFileSystemObject</code>, which lets you listen for changes to a specific file, from its name down to its attributes. With this dispatch source you’ll also be able to receive notifications when a file has been modified or deleted, essentially a subset of the events that on Linux are managed by the <em>inotify</em> kernel subsystem.</p>
<p>The remaining source types operate similarly, you can check out the complete list of what’s available in <a href="https://developer.apple.com/reference/dispatch/dispatchsource">libDispatch’s documentation</a> but remember that some of them like the Mach sources and the memory pressure source will work only on Darwin platforms.</p>
<h2 id="operations-and-operationqueues">Operations and OperationQueues</h2>
<p>Let’s talk briefly about Operation Queues, an additional API built on top of GCD that uses concurrent queues and models tasks as Operations, which are easy to cancel and whose execution can depend on the completion of other operations.</p>
<p>Operations can have a priority, which influences the order of execution, and are added to <code class="inlinecode">OperationQueues</code> that execute them asynchronously.</p>
<p>Let’s see a basic example:</p>
<pre><code class="Swift">
var queue = OperationQueue()
queue.name = "My Custom Queue"
queue.maxConcurrentOperationCount = 2

var mainqueue = OperationQueue.main // Refers to the queue of the main thread

queue.addOperation {
    print("Op1")
}
queue.addOperation {
    print("Op2")
}
</code></pre>
<p>We can also create a <em>Block Operation</em> object and configure it before adding it to the queue; if needed, we can add more than one closure to this kind of operation.</p>
<p>Note that <code class="inlinecode">NSInvocationOperation</code>, that creates an operation with target+selector, is not available in Swift.</p>
<pre><code class="Swift">
var op3 = BlockOperation(block: {
    print("Op3")
})
op3.queuePriority = .veryHigh
op3.completionBlock = {
    if op3.isCancelled {
        print("Someone cancelled me.")
    }
    print("Completed Op3")
}

var op4 = BlockOperation {
    print("Op4 always after Op3")
    OperationQueue.main.addOperation {
        print("I'm on main queue!")
    }
}
</code></pre>
<p>Operations can have a priority and a secondary completion closure that will be run once the main closure completes.</p>
<p>We can add a dependency from <code class="inlinecode">op4</code> to <code class="inlinecode">op3</code>, so that <code class="inlinecode">op4</code> will wait for the completion of <code class="inlinecode">op3</code> to execute.</p>
<pre><code class="Swift">
op4.addDependency(op3)
queue.addOperation(op4) // op3 will complete before op4, always
queue.addOperation(op3)
</code></pre>
<p>Dependencies can also be removed with <code class="inlinecode">removeDependency(operation:)</code> and are stored in a publicly accessible <code class="inlinecode">dependencies</code> array.</p>
<p>The current state of an operation can be examined using specific properties:</p>
<pre><code class="Swift">
op3.isReady //Ready for execution?
op3.isExecuting //Executing now?
op3.isFinished //Finished naturally or cancelled?
op3.isCancelled //Manually cancelled?
</code></pre>
<p>You can cancel all the operations present in a queue calling the <code class="inlinecode">cancelAllOperations</code> method, that sets the <code class="inlinecode">isCancelled</code> flag on the operations remaining in the queue. A single operation can be canceled invoking its <code class="inlinecode">cancel</code> method:</p>
<pre><code class="Swift">
queue.cancelAllOperations()
op3.cancel()
</code></pre>
<p>It’s recommended to check the <code class="inlinecode">isCancelled</code> property inside your operation to skip execution if the operation was cancelled after it was scheduled to run by the queue.</p>
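<p>A sketch of this cooperative-cancellation pattern, using a hypothetical <code class="inlinecode">Operation</code> subclass of our own:</p>

```swift
import Foundation

// The operation checks isCancelled and skips its work when it was
// cancelled before (or while) running.
final class CountingOperation: Operation {
    override func main() {
        for i in 0..<5 {
            if isCancelled { return } // bail out cooperatively
            print("Step \(i)")
        }
    }
}

let queue = OperationQueue()
let op = CountingOperation()
op.cancel() // cancelled before it ever runs
queue.addOperation(op)
queue.waitUntilAllOperationsAreFinished()
print("Finished: \(op.isFinished), cancelled: \(op.isCancelled)")
```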
<p>And finally, you can also stop the execution of new operations on an operation queue (the currently running operation will not be affected):</p>
<pre><code class="Swift">
queue.isSuspended = true
</code></pre>
<h4 id="the-playgrounds-for-this-and-other-articles-are-available-from-github-or-zipped-3"><em>The playgrounds for this and other articles are available from <a href="https://github.com/uraimo/Swift-Playgrounds">GitHub</a> or <a href="/archives/2017-05-07-ConcurrencyInSwift.playground.zip">Zipped</a>.</em></h4>
<h2 id="closing-thoughts">Closing Thoughts</h2>
<p>This article should have given you a good summary of what is possible today from the point of view of concurrency using the external frameworks that are available from Swift.</p>
<p><a href="/2017/07/22/all-about-concurrency-in-swift-2-the-future/">Part 2 will focus on what could come next</a>, in terms of language facilities that could handle concurrency “natively”, without resorting to external libraries. A few interesting paradigms will be described with the help of a few open source implementations already available today.</p>
<p>I hope that these two articles will be a good introduction to the world of concurrency and that they will help you understand and participate in the discussions that will take place on the swift-evolution mailing list when the community starts considering what to introduce in, let’s hope, Swift 5.</p>
<p>For more interesting content on concurrency and Swift, check out the <a href="https://www.cocoawithlove.com/tags/asynchrony.html">Cocoa With Love</a> blog.</p>
Building a LISP from scratch with Swift (2017-02-05) https://www.uraimo.com/2017/02/05/building-a-lisp-from-scratch-with-swift
This article describes how you can build a simple LISP, based on the 1978 article <i>'A Micro Manual For LISP - Not The Whole Truth'</i>, with Swift, taking advantage where possible of the features the language offers.<p>Some say that building a small language interpreter, especially a LISP, is one of those things you have to do at least once in your life as a programmer: an eye-opening experience that will give you new insights into how the tools you use every day work, and demystify a few concepts that seem daunting when seen from afar.</p>
<p>In this article, we’ll implement a minimal LISP based on the 1978 paper by John McCarthy titled <em><a href="https://github.com/jaseemabid/micromanual">A Micro-Manual For Lisp - Not The Whole Truth</a></em>, which defines a small and self-contained LISP, as a Swift framework that will be able to evaluate strings containing LISP symbolic expressions.</p>
<p>We’ll eventually use this compact interpreter to build a simple REPL (Read-Eval-Print Loop) that will interactively execute statements and print out the result of the evaluation. A playground to play around with the interpreter is also available.</p>
<p>This article will explain everything you need to know to roll out your own LISP interpreter, something that could be a great weekend project. Feel free to follow along, or just read the introduction and write your own interpreter using this post as a starting point for your alternative implementation.</p>
<p>The diagram below shows the overall design of what we are going to build:</p>
<p><img src="/imgs/lisp.png" srcset="/imgs/lisp@2x.png 2x" alt="REPL Diagram" /></p>
<p>The first functional block, the <em>Read</em> phase, reads some text containing code and, with a two-phase process, produces a syntax tree with an internal representation of the input program.</p>
<p>The first phase, represented by the <em>Lexer</em>, separates the input text into tokens (the building blocks, from a textual point of view, of the program); the <em>Parser</em> then takes this series of tokens and produces an <em><a href="https://en.wikipedia.org/wiki/Abstract_syntax_tree">Abstract Syntax Tree</a> (AST)</em>, a hierarchical representation of the source code.</p>
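<p>To make the lexing step concrete, here is a hedged sketch of a tokenizer for s-expressions (the <code class="inlinecode">tokenize</code> function is our own; the interpreter described later may implement this step differently):</p>

```swift
import Foundation

// Split an s-expression string into parenthesis and atom tokens by
// padding parentheses with spaces and splitting on whitespace.
func tokenize(_ source: String) -> [String] {
    let padded = source
        .replacingOccurrences(of: "(", with: " ( ")
        .replacingOccurrences(of: ")", with: " ) ")
    return padded.split(separator: " ").map(String.init)
}

print(tokenize("(CAR (QUOTE (A B C)))"))
```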
<p>Once we have an <em>AST</em> we’ll be able to evaluate the expression to produce a result that we’ll then print on screen for the user.</p>
<h4 id="the-library-with-the-interpreter-described-in-this-article-and-a-playground-to-test-it-are-avaliable-on-github"><em>The library with the interpreter described in this article and a playground to test it are available on <a href="https://github.com/uraimo/SwiftyLISP">GitHub</a>.</em></h4>
<p><strong>Contents:</strong></p>
<ul>
<li><a href="#lisp-basics">LISP Basics</a></li>
<li><a href="#building-the-interpreter">Building the interpreter</a>
<ul>
<li><a href="#lexer-and-parser">Lexer and Parser</a></li>
<li><a href="#evaluation-and-default-global-environment">Evaluation and Default Global Environment</a></li>
</ul>
</li>
<li><a href="#swiftylisp-repl">SwiftyLisp REPL</a></li>
<li><a href="#conclusion">Conclusion</a></li>
</ul>
<h2 id="lisp-basics">LISP Basics</h2>
<p>Let’s start with a brief recap of what we are going to implement, looking into <a href="/files/MicroManual-LISP.pdf">McCarthy’s article</a>, which essentially contains the definition of the language.</p>
<p>First of all, if you are not familiar with LISP, the acronym derives from <em>LISt Processor</em>, and that’s a good way to describe the languages of the LISP family: their essential data structure is the list, and your programs will perform operations on those lists.</p>
<p>As you may have guessed from my use of the term <em>family</em>, many variants or dialects of the original LISP defined by John McCarthy exist nowadays: from traditional languages like Racket, to languages like Clojure that are built on top of different technologies (the Java virtual machine and the Java runtime in this case) and are able to extend the underlying platform with the functionality that a LISP can provide through its different paradigm.</p>
<p>What we are going to implement here is a minimal LISP that contains the bare essential elements to do something useful.</p>
<p>A LISP interpreter can be described as an evaluator of programs expressed using a peculiar recursive data structure called a <em><a href="https://en.wikipedia.org/wiki/S-expression">symbolic expression</a></em> or <em>form</em>, something that can take the shape of either an <em>Atom</em> or a <em>List</em>. Atoms are simple series of alphanumeric characters that can assume different meanings, whereas Lists (also called compound forms) are sequences of other symbolic expressions, represented as a sequence of values enclosed in parentheses.</p>
<p>An additional kind of form exists in this LISP, the <em>special form</em>, which differs from other kinds of symbolic expressions because it has different evaluation rules for its sub-expressions.</p>
<p>To represent the data that your program will manipulate, we will again use the same symbolic-expression data type, ending up using the same data structure for both your source code and the data it operates on.</p>
<p>But what about the <em>AST</em>? A syntactically valid program is structured as a series of symbolic expressions, in other words a series of nested lists, so when converting the source code to an <em>AST</em> we will again use a data structure able to store lists to model our program.</p>
<p>Languages like LISP, where programs and their internal representation can be expressed with the language’s fundamental data type, are called <em>homoiconic</em>, and this property makes meta-programming (the ability of a program to modify itself or other programs in the same language) easier than in classic non-homoiconic languages (most of those you know, Swift included). You will be able to leverage the fact that code and data share the same representation to modify your code at runtime without resorting to complex mechanisms.</p>
<p>If you look at the end of McCarthy’s paper you’ll notice that building a LISP interpreter in LISP itself, called a <a href="http://wiki.c2.com/?MetaCircularEvaluator">Meta-Circular Evaluator</a>, is just a matter of a few lines of code.
The Swift interpreter we are about to build will do the same thing: it will evaluate these symbolic expressions recursively and produce another symbolic expression as result.</p>
<p>But let’s see an example of a LISP program expressed using symbolic expressions:</p>
<pre><code class="lisp">
(COUNT (QUOTE (A B C) ) 42)
</code></pre>
<p>In the example above, <code class="inlinecode">COUNT</code>, <code class="inlinecode">QUOTE</code>, <code class="inlinecode">A</code>, <code class="inlinecode">B</code>, <code class="inlinecode">C</code>, <code class="inlinecode">42</code> are all atoms (let’s ignore their meaning for now), and each sequence between parentheses is a list. Note how a list can contain any kind of symbolic expression, even sub-lists.</p>
<p>How will our interpreter evaluate this expression?</p>
<p>This expression uses <a href="https://en.wikipedia.org/wiki/Polish_notation">Polish notation</a>: each list is considered as an operator followed by the operands it will be applied to, e.g. a sum of two integers would be represented as <code class="inlinecode">(+ 1 2)</code>.</p>
<p>In the example above, the operator/function <code class="inlinecode">COUNT</code> will be applied on the operands/parameters <code class="inlinecode">(QUOTE (A B C))</code> and <code class="inlinecode">42</code>.</p>
<p>You have certainly noticed that in this definition of the language atoms are untyped: there is a single kind of atom, and common types like integers, booleans and strings are not available. This LISP does not have the complex type system we find in languages like Swift.</p>
<p>The micro-manual defines a series of atoms that perform basic operations and describes the value they produce once a list that contains them is evaluated. In the table below, <em>e</em> will denote generic symbolic expressions whereas <em>l</em> will be used for lists.</p>
<table>
<thead>
<tr>
<th>Atom</th>
<th>Structure</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Quote</td>
<td>(quote e1)</td>
<td>This atom, once evaluated, returns its sub-expression <strong>as is</strong>, <em>e.g. (quote A) = A</em></td>
</tr>
<tr>
<td>Car</td>
<td>(car l)</td>
<td>Returns the first element of a non-empty sub-list, <em>e.g. (car (quote (A B C))) = A</em></td>
</tr>
<tr>
<td>Cdr</td>
<td>(cdr l)</td>
<td>Returns all the elements of the sub-list after the first in a new list, <em>e.g. (cdr (quote (A B C))) = (B C)</em></td>
</tr>
<tr>
<td>Cons</td>
<td>(cons e l)</td>
<td>Returns a new list with e as first element followed by the content of the sublist, <em>e.g. (cons (quote A) (quote (B C))) = (A B C)</em></td>
</tr>
<tr>
<td>Equal</td>
<td>(equal e1 e2)</td>
<td>Returns an atom aptly named <strong>true</strong> if the two symbolic expressions are recursively equal and the empty list <strong>()</strong> (that serves as both nil and false value) if they are not, <em>e.g. (equal (car (quote (A B))) (quote A)) = true</em></td>
</tr>
<tr>
<td>Atom</td>
<td>(atom e)</td>
<td>Returns true if the symbolic expression is an atom or the empty list if it is a list, <em>e.g. (atom A) = true</em></td>
</tr>
<tr>
<td>Cond</td>
<td>(cond (p1 e1) (p2 e2) … (pn en))</td>
<td>Returns the first <strong>e</strong> expression whose <strong>p</strong> predicate expression is not equal to the empty list. This is basically a conditional atom with a slightly more convoluted syntax than a common if construct. <em>e.g. (cond ((atom (quote A)) (quote B)) ((quote true) (quote C))) = B</em></td>
</tr>
<tr>
<td>List</td>
<td>(list e1 e2 … en)</td>
<td>Returns a list of all the given expressions, identical to applying cons recursively to a sequence of expressions.</td>
</tr>
</tbody>
</table>
<p>The description contains the set of rules that will be used to evaluate these expressions.</p>
<p>If you look closely you’ll notice that <code class="inlinecode">cond</code> is slightly different from the others, since it conditionally evaluates its body depending on the sublists it contains. This is our first example of a <em>special form</em>; we’ll pay special attention to this detail when implementing the evaluator.</p>
<p>Now let’s see another category of operators, those used to define functions:</p>
<table>
<thead>
<tr>
<th>Atom</th>
<th>Structure</th>
<th>Description</th>
</tr>
</thead>
<tbody>
<tr>
<td>Lambda</td>
<td>( (lambda (v1 … vn) e) p1 … pn)</td>
<td>Defines a lambda expression with body <strong>e</strong> that describes an anonymous function using a series of variables <strong>v</strong>. The function will be evaluated using the provided parameters as values for those variables. <em>e.g. ((lambda (X Y) (cons (car X) Y)) (quote (A B)) (cdr (quote (C D)))) = (A D)</em></td>
</tr>
<tr>
<td>Defun</td>
<td>(defun &lt;name&gt; (v1 ... vn) e)</td>
<td>Defines a lambda expression and registers it in the current context so it can be used whenever we need it. We’ll be able to define a function like <em>(defun cadr (X) (car (cdr X)))</em> and use it in another expression like <em>(cadr (quote (A B C D)))</em>.</td>
</tr>
</tbody>
</table>
<p>McCarthy’s paper describes an additional operator that can be used to define local labeled lambda expressions, but we are not going to implement it; when we need something similar we’ll use <em>defun</em> instead.</p>
<h2 id="building-the-interpreter">Building The Interpreter</h2>
<p>Now that we are done describing the content of the paper it’s time to discuss the implementation of the interpreter.</p>
<p>In this section, each functional module that composes the interpreter will be analyzed in detail; for the full code of the interpreter check out this repository on <a href="https://github.com/uraimo/SwiftyLISP">Github</a>.</p>
<p>The first aspect that needs to be addressed is how symbolic expressions will be represented inside the interpreter, i.e. how the <em>AST</em> will be structured. This is important, since a good structure simplifies the evaluation.</p>
<h3 id="modeling-symbolic-expressions">Modeling Symbolic Expressions</h3>
<p>The most obvious way to model the symbolic expressions is to use a recursive <em>enum</em>:</p>
<pre><code class="Swift">
public enum SExpr{
case Atom(String)
case List([SExpr])
}
</code></pre>
<p>Usually you need <code class="inlinecode">indirect</code> when declaring a recursive enum, but in this case the array acts as a container for the nested values, so we can do without it. Other than that, there is not much to see here: this enum simply mimics the definition of a symbolic expression.</p>
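<p>To make the mapping between the textual form and the enum concrete, here is how the example expression from the earlier section would be represented as a value (the enum is redeclared so the snippet stands on its own):</p>

```swift
// (COUNT (QUOTE (A B C)) 42) expressed as a value of the recursive enum.
public enum SExpr {
    case Atom(String)
    case List([SExpr])
}

let expr: SExpr = .List([
    .Atom("COUNT"),
    .List([.Atom("QUOTE"),
           .List([.Atom("A"), .Atom("B"), .Atom("C")])]),
    .Atom("42")
])

// The outer list has three elements: the operator and its two operands.
if case let .List(elements) = expr {
    print(elements.count) // 3
}
```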
<p>Now let’s add a few other things to this enum: we’ll need a way to determine whether two expressions are equal and a way to print them. For that we are going to implement <code class="inlinecode">Equatable</code> and <code class="inlinecode">CustomStringConvertible</code> in two extensions.</p>
<pre><code class="Swift">
extension SExpr : Equatable {
public static func ==(lhs: SExpr, rhs: SExpr) -> Bool{
switch(lhs,rhs){
case let (.Atom(l),.Atom(r)):
return l==r
case let (.List(l),.List(r)):
guard l.count == r.count else {return false}
for (idx,el) in l.enumerated() {
if el != r[idx] {
return false
}
}
return true
default:
return false
}
}
}
extension SExpr : CustomStringConvertible{
public var description: String {
switch self{
case let .Atom(value):
return "\(value) "
case let .List(subxexprs):
var res = "("
for expr in subxexprs{
res += "\(expr) "
}
res += ")"
return res
}
}
}
</code></pre>
<p>Both functions recursively traverse the symbolic expression structure, triggering a call to themselves (using the equality operator or converting a <code class="inlinecode">SExpr</code> to String) to perform their duty.</p>
<p>Now that the data structure has been defined let’s look into how each component of the REPL diagram can be implemented.</p>
<p><img src="/imgs/lisp.png" srcset="/imgs/lisp@2x.png 2x" alt="REPL Diagram" /></p>
<h3 id="lexer-and-parser">Lexer and Parser</h3>
<p>The <em>Read</em> phase that translates the source code to an <em>AST</em> to be evaluated can be divided in two stages, each one performed by a dedicated component: the <em>Lexer</em> and the <em>Parser</em>.</p>
<p>The main job of the <em>Lexer</em> or <em>tokenizer</em> is to perform <a href="https://en.wikipedia.org/wiki/Lexical_analysis">lexical analysis</a> on an input block of text containing the source code.</p>
<p>The lexer breaks down a series of characters into a series of <em>lexemes</em> or <em>tokens</em>: strings that have meaning when considered in the context of a language. Tokens can be language keywords like <code class="inlinecode">if</code>, operators like <code class="inlinecode">=</code>, or various identifiers (e.g. variable names) and literals.</p>
<p>Since the <a href="https://en.wikipedia.org/wiki/Lexical_grammar">lexical grammar</a> of our language (the definition of what a valid token is) is extremely simple, the lexer/tokenizer will be simple too: it will just identify string tokens separated by spaces or parentheses.</p>
<p>Let’s add a <code class="inlinecode">read()</code> method to <code class="inlinecode">SExpr</code> to convert Strings to our enum representation and let’s start discussing the <em>tokenize</em> stage of the process.</p>
<pre><code class="Swift">
extension SExpr {
/**
Read a LISP string and convert it to a hierarchical S-Expression
*/
public static func read(_ sexpr:String) -> SExpr{
enum Token{
case pOpen,pClose,textBlock(String)
}
/**
Break down a string to a series of tokens
- Parameter sexpr: Stringified S-Expression
- Returns: Series of tokens
*/
func tokenize(_ sexpr:String) -> [Token] {
var res = [Token]()
var tmpText = ""
for c in sexpr {
switch c {
case "(":
if tmpText != "" {
res.append(.textBlock(tmpText))
tmpText = ""
}
res.append(.pOpen)
case ")":
if tmpText != "" {
res.append(.textBlock(tmpText))
tmpText = ""
}
res.append(.pClose)
case " ":
if tmpText != "" {
res.append(.textBlock(tmpText))
tmpText = ""
}
default:
tmpText.append(c)
}
}
return res
}
// Parser
// ...
// Read: tokenize -> parse -> result
let tokens = tokenize(sexpr)
let res = parse(tokens)
return res.subexpr ?? .List([])
}
}
</code></pre>
<p>The <code class="inlinecode">tokenize</code> method goes through all the characters of the input string, turning an opaque (from the point of view of syntax) string into a series of values defined in the <code class="inlinecode">Token</code> enum. The possible values are: <em>pOpen</em> (open parenthesis), <em>pClose</em> (close parenthesis) and <em>textBlock</em> (every other string, representing an atom). Everything is quite straightforward, since there are no special rules that could make the content read invalid.</p>
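<p>Here is a condensed, standalone sketch of the same tokenizer for experimentation (a final flush has been added so that a trailing atom outside parentheses is not lost), together with the token stream it emits for a small expression:</p>

```swift
enum Token: Equatable {
    case pOpen, pClose, textBlock(String)
}

func tokenize(_ sexpr: String) -> [Token] {
    var res = [Token]()
    var tmpText = ""
    // Emit the text accumulated so far as a single token, if any.
    func flush() {
        if !tmpText.isEmpty {
            res.append(.textBlock(tmpText))
            tmpText = ""
        }
    }
    for c in sexpr {
        switch c {
        case "(": flush(); res.append(.pOpen)
        case ")": flush(); res.append(.pClose)
        case " ": flush()
        default: tmpText.append(c)
        }
    }
    flush()
    return res
}

// (car (quote A)) -> [pOpen, textBlock("car"), pOpen,
//                     textBlock("quote"), textBlock("A"), pClose, pClose]
print(tokenize("(car (quote A))"))
```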
<p>The next phase is performed by the <em><a href="https://en.wikipedia.org/wiki/Parsing#Computer_languages">Parser</a></em>.</p>
<p>The purpose of a parser is to convert a series of tokens into an <em>AST</em> that represents our code in a form that is easy to check for syntax errors and easy to evaluate (and to optimize and compile, if we were building a compiler instead of an interpreter).</p>
<p>We are going to implement a very simple <em><a href="https://en.wikipedia.org/wiki/Top-down_parsing">top-down parser</a></em> that will consume the token array in its natural order and build the <em>AST</em>. If you plan to build a parser for a language with a more complex grammar you’d likely need something slightly more sophisticated, like a <em><a href="https://en.wikipedia.org/wiki/Recursive_descent_parser">Recursive Descent Parser</a></em> (easy to hand-code) or an <em><a href="https://en.wikipedia.org/wiki/LL_parser">LL Parser</a></em>.</p>
<p>For languages with complex grammars, though, the parser is usually generated with a parser generator (e.g. <a href="http://www.antlr.org/">ANTLR</a>, which recently introduced support for Swift), so you describe your grammar in a DSL instead of hand-coding the parser.</p>
<p>The parser will be definitely more convoluted than the lexer, but again, thanks to how simple this language is, it will be a really small and simple parser.</p>
<pre><code class="Swift">
extension SExpr {
/**
Read a LISP string and convert it to a hierarchical S-Expression
*/
public static func read(_ sexpr:String) -> SExpr{
// Tokenizer
// ...
func appendTo(list: SExpr?, node:SExpr) -> SExpr {
var list = list
if list != nil, case var .List(elements) = list! {
elements.append(node)
list = .List(elements)
}else{
list = node
}
return list!
}
/**
Parses a series of tokens to obtain a hierarchical S-Expression
- Parameter tokens: Tokens to parse
- Parameter node: Parent S-Expression if available
- Returns: Tuple with remaining tokens and resulting S-Expression
*/
func parse(_ tokens: [Token], node: SExpr? = nil) -> (remaining:[Token], subexpr:SExpr?) {
var tokens = tokens
var node = node
var i = 0
repeat {
let t = tokens[i]
switch t {
case .pOpen:
//new sexpr
let (tr,n) = parse( Array(tokens[(i+1)..<tokens.count]), node: .List([]))
assert(n != nil) //Cannot be nil
(tokens, i) = (tr, 0)
node = appendTo(list: node, node: n!)
if tokens.count != 0 {
continue
}else{
break
}
case .pClose:
//close sexpr
return (Array(tokens[(i+1)..<tokens.count]), node)
case let .textBlock(value):
node = appendTo(list: node, node: .Atom(value))
}
i += 1
}while(tokens.count > 0)
return ([],node)
}
let tokens = tokenize(sexpr)
let res = parse(tokens)
return res.subexpr ?? .List([])
}
}
</code></pre>
<p>The <code class="inlinecode">parse(tokens:node:)</code> method goes through every token produced by the lexer using <code class="inlinecode">.pOpen</code> and <code class="inlinecode">.pClose</code> to delimit lists and converts every other token to atoms.</p>
<p>Notice that the parsing is performed recursively, with every nested call receiving the array of tokens left to parse and the parent list that will contain the values parsed during the next recursion step (starting with <em>nil</em> for the root expression). When a close parenthesis is found, the list is considered complete and is returned to the caller along with the remaining tokens that still have to be parsed.</p>
<p>After these functions comes the actual body of the <code class="inlinecode">read()</code> method, which executes each step in sequence, returning the top-level form or, on error, an empty list (which doubles as false, as we saw in the previous section).</p>
<pre><code class="Swift">
let tokens = tokenize(sexpr)
let res = parse(tokens)
return res.subexpr ?? .List([])
}
}
</code></pre>
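<p>As a quick sanity check, assuming the <code class="inlinecode">SExpr</code> type and the <code class="inlinecode">read()</code> method above are in scope, we can inspect the tree built for a nested expression (just a sketch, not part of the library):</p>

```swift
// "(A (B C))" becomes List([Atom("A"), List([Atom("B"), Atom("C")])])
let ast = SExpr.read("(A (B C))")

if case let .List(elements) = ast,
   case .Atom("A") = elements[0],
   case let .List(inner) = elements[1] {
    print(inner.count) // 2: the atoms B and C
}
```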
<p>Now that we have a working read module, let’s add something to the <code class="inlinecode">SExpr</code> enum that will allow us to obtain an expression directly from a string literal, without manually invoking the <code class="inlinecode">read()</code> method, by implementing the <code class="inlinecode">ExpressibleByStringLiteral</code> protocol:</p>
<pre><code class="Swift">
extension SExpr : ExpressibleByStringLiteral,
ExpressibleByUnicodeScalarLiteral,
ExpressibleByExtendedGraphemeClusterLiteral {
public init(stringLiteral value: String){
self = SExpr.read(value)
}
public init(extendedGraphemeClusterLiteral value: String){
self.init(stringLiteral: value)
}
public init(unicodeScalarLiteral value: String){
self.init(stringLiteral: value)
}
}
</code></pre>
<p>With this we’ll be able to read programs directly from a string:</p>
<pre><code class="Swift">
let expr: SExpr = "(cond ((atom (quote A)) (quote B)) ((quote true) (quote C)))"
print(expr)
print(expr.eval()!) //B
</code></pre>
<h3 id="evaluation-and-default-global-environment">Evaluation and Default Global Environment</h3>
<p>The evaluation phase will be more complex than what we’ve seen so far: the <code class="inlinecode">eval()</code> function will recursively evaluate the <em>AST</em> and return the resulting evaluated symbolic expression.</p>
<p>First of all let’s collect all the basic operators defined by our language in a private dictionary called <code class="inlinecode">defaultEnvironment</code> that maps every operator atom name to a function of type <code class="inlinecode">(SExpr, [SExpr]?, [SExpr]?)->SExpr</code> implementing it.</p>
<p>These functions take a <code class="inlinecode">SExpr</code> parameter containing the original list (function name and parameters), evaluate it and return a <code class="inlinecode">SExpr</code> as result. The two optional arrays passed as second and third parameter contain a list of variables with their values, and are used for user-defined functions created via <code class="inlinecode">defun</code> and <code class="inlinecode">lambda</code>; in all other cases they are just nil. We’ll come back to this when we take a look at those operators.</p>
<p>To keep track of the basic builtin operators, the <code class="inlinecode">Builtins</code> enum has been declared with a function that identifies which operators don’t need sub-expression evaluation. Those are operators like <code class="inlinecode">quote</code> (whose sole purpose is to disable sub-expression evaluation), special forms like <code class="inlinecode">cond</code>, and the lambda-defining operators, which handle the evaluation of their sub-expressions internally.</p>
<pre><code class="Swift">
/// Basic builtins
fileprivate enum Builtins:String{
case quote,car,cdr,cons,equal,atom,cond,lambda,defun,list,
println,eval
/**
True if the given atom stops evaluation of sub-expressions.
Sub expressions will be evaluated lazily by the operator.
- Parameter atom: Stringified atom
- Returns: True if the atom is one of quote, cond, defun or lambda
*/
public static func mustSkip(_ atom: String) -> Bool {
return (atom == Builtins.quote.rawValue) ||
(atom == Builtins.cond.rawValue) ||
(atom == Builtins.defun.rawValue) ||
(atom == Builtins.lambda.rawValue)
}
}
</code></pre>
<p>All the <code class="inlinecode">defaultEnvironment</code> functions start with a simple check to verify that the minimum number of parameters has been provided, and then proceed to build up the result to return.</p>
<p>Let’s take a look at a few of those; check the <a href="https://github.com/uraimo/SwiftyLISP">full project</a> for the complete list.</p>
<pre><code class="Swift">
/// Global default builtin functions environment
///
/// Contains definitions for: quote,car,cdr,cons,equal,atom,cond,lambda,label,defun.
private var defaultEnvironment: [String: (SExpr, [SExpr]?, [SExpr]?)->SExpr] = {
var env = [String: (SExpr, [SExpr]?, [SExpr]?)->SExpr]()
env[Builtins.quote.rawValue] = { params,locals,values in
guard case let .List(parameters) = params, parameters.count == 2 else {return .List([])}
return parameters[1]
}
env[Builtins.cdr.rawValue] = { params,locals,values in
guard case let .List(parameters) = params, parameters.count == 2 else {return .List([])}
guard case let .List(elements) = parameters[1], elements.count > 1 else {return .List([])}
return .List(Array(elements.dropFirst(1)))
}
env[Builtins.equal.rawValue] = {params,locals,values in
guard case let .List(elements) = params, elements.count == 3 else {return .List([])}
var me = env[Builtins.equal.rawValue]!
switch (elements[1].eval(with: locals,for: values)!,elements[2].eval(with: locals,for: values)!) {
case (.Atom(let elLeft),.Atom(let elRight)):
return elLeft == elRight ? .Atom("true") : .List([])
case (.List(let elLeft),.List(let elRight)):
guard elLeft.count == elRight.count else {return .List([])}
for (idx,el) in elLeft.enumerated() {
let testeq:[SExpr] = [.Atom("Equal"),el,elRight[idx]]
if me(.List(testeq),locals,values) != SExpr.Atom("true") {
return .List([])
}
}
return .Atom("true")
default:
return .List([])
}
}
env[Builtins.atom.rawValue] = { params,locals,values in
guard case let .List(parameters) = params, parameters.count == 2 else {return .List([])}
switch parameters[1].eval(with: locals,for: values)! {
case .Atom:
return .Atom("true")
default:
return .List([])
}
}
// ...
return env
}()
</code></pre>
<p>While functions like <code class="inlinecode">quote</code> or <code class="inlinecode">cdr</code> just manipulate the parameter list to build an output list, other functions like <code class="inlinecode">equal</code> implement more complex logic (in this case a recursive equality check). To keep the source readable for didactic purposes, error checks have been kept to a minimum: additional parameters are ignored and when something goes wrong the empty list is returned.</p>
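<p>For instance, assuming the interpreter built so far is in scope (including the string-literal conformance from the previous section), <code class="inlinecode">equal</code> descends into nested lists:</p>

```swift
// Both operands evaluate to the list (A (B)), so the comparison
// recursively checks the sublists and succeeds.
let cmp: SExpr = "(equal (quote (A (B))) (quote (A (B))))"
print(cmp.eval()!) // true
```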
<p>For special forms like the conditional <code class="inlinecode">cond</code> a different handling of the evaluation is required.</p>
<p>Conditional operators are essential to implement recursion, because only with this kind of statement can we decide whether to stop the recursion or proceed with another iteration.</p>
<pre><code class="Swift">
env[Builtins.cond.rawValue] = { params,locals,values in
guard case let .List(parameters) = params, parameters.count > 1 else {return .List([])}
for el in parameters.dropFirst(1) {
guard case let .List(c) = el, c.count == 2 else {return .List([])}
if c[0].eval(with: locals,for: values) != .List([]) {
let res = c[1].eval(with: locals,for: values)
return res!
}
}
return .List([])
}
</code></pre>
<p>The implementation of <code class="inlinecode">cond</code>, after dropping the first element of the list (the <em>cond</em> atom itself), iterates through the remaining sublists until it finds one whose first member evaluates to something different from the empty list (which, as we already saw, means <em>false</em>); it then evaluates the second member of that sublist and returns it. With this kind of lazy evaluation we only evaluate what we actually need, and when evaluating a recursive function we don’t follow the infinite series of nested recursive calls contained in its body.</p>
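<p>We can observe the lazy behavior directly; in this sketch (which assumes the interpreter above is in scope, with <em>boom</em> being a deliberately undefined name) the second branch is simply never evaluated:</p>

```swift
// The first predicate, (atom (quote A)), evaluates to true,
// so (quote B) is returned and (boom) is never touched.
let c: SExpr = "(cond ((atom (quote A)) (quote B)) ((quote true) (boom)))"
print(c.eval()!) // B
```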
<p>Among those default functions, the <code class="inlinecode">defun</code> and <code class="inlinecode">lambda</code> operators allow the creation of user-defined functions that are then registered in a globally accessible dictionary called <code class="inlinecode">localContext</code>:</p>
<pre><code class="Swift">
/// Local environment for locally defined functions
public var localContext = [String: (SExpr, [SExpr]?, [SExpr]?)->SExpr]()
</code></pre>
<p>Let’s see how <code class="inlinecode">defun</code> (<code class="inlinecode">lambda</code> is mostly identical) can be implemented.</p>
<pre><code class="Swift">
env[Builtins.defun.rawValue] = { params,locals,values in
guard case let .List(parameters) = params, parameters.count == 4 else {return .List([])}
guard case let .Atom(lname) = parameters[1] else {return .List([])}
guard case let .List(vars) = parameters[2] else {return .List([])}
let lambda = parameters[3]
let f: (SExpr, [SExpr]?, [SExpr]?)->SExpr = { params,locals,values in
guard case var .List(p) = params else {return .List([])}
p = Array(p.dropFirst(1))
// Replace parameters in the lambda with values
if let result = lambda.eval(with:vars, for:p){
return result
}else{
return .List([])
}
}
localContext[lname] = f
return .List([])
}
</code></pre>
<p>This function requires a list with four symbolic expressions: one for the <em>defun</em> operator atom itself, one for the function name (as expected, a simple atom), and the last two for the variables list and the lambda body respectively. Once we have stored each component in a constant (note that, again, the empty list is used as error value), we define and register in <code class="inlinecode">localContext</code> a function of type <code class="inlinecode">(SExpr, [SExpr]?, [SExpr]?)->SExpr</code> that, as we’ll see momentarily, will be invoked by <code class="inlinecode">eval()</code> when the evaluator finds it in an expression.</p>
<p>During invocation, this anonymous function will evaluate the body of the lambda, replacing the variables from the original variables list with the current parameters, and return the result.</p>
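<p>Putting <code class="inlinecode">defun</code> to work, here is a sketch (assuming the interpreter above is in scope) of defining and invoking the <em>cadr</em> helper mentioned earlier:</p>

```swift
// Registers cadr in localContext; defun itself returns the empty list.
let definition: SExpr = "(defun cadr (x) (car (cdr x)))"
_ = definition.eval()

// eval() looks up cadr in localContext and substitutes x with (A B C D).
let call: SExpr = "(cadr (quote (A B C D)))"
print(call.eval()!) // B
```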
<p>To better understand what is happening there, let’s finally take a look at the <code class="inlinecode">eval()</code> function:</p>
<pre><code class="Swift">
public enum SExpr{
case Atom(String)
case List([SExpr])
/**
Evaluates this SExpression with the given functions environment
- Parameter environment: A set of named functions or the default environment
- Returns: the resulting SExpression after evaluation
*/
public func eval(with locals: [SExpr]? = nil, for values: [SExpr]? = nil) -> SExpr?{
var node = self
switch node {
case .Atom:
return evaluateVariable(node, with:locals, for:values)
case var .List(elements):
var skip = false
if elements.count > 1, case let .Atom(value) = elements[0] {
skip = Builtins.mustSkip(value)
}
// Evaluate all subexpressions
if !skip {
elements = elements.map{
return $0.eval(with:locals, for:values)!
}
}
node = .List(elements)
// Obtain a reference to the function represented by the first atom and apply it, local definitions shadow global ones
if elements.count > 0, case let .Atom(value) = elements[0], let f = localContext[value] ?? defaultEnvironment[value] {
let r = f(node,locals,values)
return r
}
return node
}
}
private func evaluateVariable(_ v: SExpr, with locals: [SExpr]?, for values: [SExpr]?) -> SExpr {
guard let locals = locals, let values = values else {return v}
if locals.contains(v) {
// The current atom is a variable, replace it with its value
return values[locals.firstIndex(of: v)!]
}else{
// Not a variable, just return it
return v
}
}
}
</code></pre>
<p>The evaluator traverses the <em>AST</em> performing different operations depending on the type of the form under evaluation.</p>
<p>When an atom is encountered, the evaluator tries to resolve it as a variable within the current context of local variables (set initially by <code class="inlinecode">defun</code> or <code class="inlinecode">lambda</code> and propagated between calls), but most of the time it will just return the atom as it is.</p>
<p>This is where variable substitution for user-defined lambdas is performed: for each atom, <code class="inlinecode">evaluateVariable</code> checks whether its name is present in the array of variables and, if it is, replaces the atom with the element at the same index in the values array.</p>
<p>We have more to consider when evaluating a list or compound form.</p>
<p>We’ll first try to evaluate recursively all the sub-expressions in the current list, but only if the current operator does not need to handle that evaluation itself. As said above, in this simple LISP only <code class="inlinecode">quote</code>, special forms and the lambda-defining operators fall into this category.</p>
<p>Once the sub-expressions have been evaluated, it’s time to apply the operator to its operands, performing a lookup for a lambda with the same name as the operator atom, first in <code class="inlinecode">localContext</code> and then in <code class="inlinecode">defaultEnvironment</code>. The order is important, since we want to be able to <em>shadow</em> the default definitions with functions we define manually.</p>
<p>If a lambda with that name exists, the function is invoked and the result returned to the previous step of the recursive evaluation.</p>
<p>This concludes the description of the basic interpreter; the whole thing takes roughly 400 lines of code.</p>
<h2 id="swiftylisp-repl">SwiftyLisp REPL</h2>
<p>It’s time to implement the REPL, and it won’t take long: the interpreter already provides all the basic functionality we need.</p>
<p>We’ll read a line from the terminal, convert it to a <code class="inlinecode">SExpr</code>, evaluate it and print the result, which will be nicely formatted thanks to the <code class="inlinecode">CustomStringConvertible</code> protocol.</p>
<pre><code class="Swift">
import SwiftyLisp
var exit = false
while(!exit){
print(">>>", terminator:" ")
let input = readLine(strippingNewline: true)
exit = (input == "exit")
if !exit {
let e = SExpr.read(input!)
print(e.eval()!)
}
}
</code></pre>
<p>The REPL is also available on Github in a <a href="https://github.com/uraimo/SwiftyLISP-REPL">separate repository</a>.</p>
<h2 id="conclusion">Conclusion</h2>
<p>This article described a minimal LISP interpreter to show you the basic building blocks of interpreters in general regardless of the language.</p>
<p>If you have never built something like this before it may seem daunting at first, but I hope to have shown that, with a bit of work, it is definitely something everyone can do.</p>
<p>Check out the complete project on <a href="https://github.com/uraimo/SwiftyLISP">Github</a> and let me know in the comments if you’d like to read more about interpreters and compilers!</p>
<p>For more interesting articles on building interpreters and compilers check out the <a href="https://github.com/aalhour/awesome-compilers">awesome-compilers</a> list.</p>
Unowned or Weak? Lifetime and Performance2016-10-27T00:00:00+02:00https://www.uraimo.com/2016/10/27/unowned-or-weak-lifetime-and-performanceThis article helps you choose between unowned and weak references through a discussion of object lifetimes and weak reference performance.<p><strong>Update 10/17:</strong> <em>Mike Ash wrote about some recent improvements to weak references handling <a href="https://mikeash.com/pyblog/friday-qa-2017-09-22-swift-4-weak-references.html">here</a>.</em></p>
<p>The usual explanation, that when dealing with retain cycles you should choose between unowned and weak references by considering the object lifetimes, is well known. But sometimes you may still be in doubt about which one to actually use, wondering whether defensively using only weak references is a good idea.</p>
<p>In this article, after a brief introduction, I’ll analyze the differences between the two in terms of lifetime and performance, with excerpts from the Swift sources, to help you choose which flavor of weak reference to use in different circumstances.</p>
<p><strong>Contents:</strong></p>
<ul>
<li><a href="#the-basics">The Basics</a></li>
<li><a href="#the-question-unowned-or-weak">The Question: Unowned or Weak?</a></li>
<li><a href="#performance-a-look-under-the-hood">Performance: A Look Under The Hood</a>
<ul>
<li><a href="#deconstructing-capture-lists-handling">Deconstructing capture lists handling</a></li>
</ul>
</li>
<li><a href="#conclusion">Conclusion</a></li>
<li><a href="#footnotes">Footnotes</a></li>
</ul>
<h4 id="get-the-playground-for-this-and-other-articles-from-github-or-zipped-click-here-instead-for-the-closure-sample-and-the-silsilgen-and-llvm-ir-output"><em>Get the playground for this and other articles from <a href="https://github.com/uraimo/Swift-Playgrounds">GitHub</a> or <a href="https://www.uraimo.com/archives/2016-10-27-UnownedWeakPlayground.playground.zip">zipped</a>. Click <a href="https://www.uraimo.com/archives/closure.zip">here</a> instead for the closure sample and the SIL, SILGen and LLVM IR output.</em></h4>
<h2 id="the-basics">The Basics</h2>
<p>As we all know, Swift leverages good old ARC (<em>Automatic Reference Counting</em>) to manage memory and, as a consequence, just as in Objective-C we’ll have to deal manually with retain cycles through a judicious use of weak references.</p>
<p>If you are not familiar with ARC, you just need to know that every <em>reference type</em> instance has a reference count (a simple integer value) associated with it, used to track how many variables or constants currently refer to that instance. Once that counter reaches zero, the instance is deallocated and the memory and resources it held are made available again.</p>
<p>You have a retain cycle every time two instances refer in some way to each other (e.g. two class instances each holding a property that refers to the other instance, as happens with two adjacent nodes in a doubly linked list), preventing those instances from being deallocated because their retain count always stays greater than zero.</p>
<p>To solve this, Swift, like many other languages, introduces the concept of <em>weak references</em>: references that are not considered by ARC and that, as such, will not increment the retain count of your objects.</p>
<p>Since weak references do not prevent instances from being deallocated, it’s essential to remember that at any point a weak reference may no longer point to a valid object. This is not an insurmountable problem, but it is something we need to consider when dealing with this kind of reference.</p>
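<p>This zeroing behavior can be observed directly with a few lines at top level; a minimal sketch with made-up names:</p>

```swift
class Resource {
    var name = "db-connection"
}

var strong: Resource? = Resource()
weak var weakRef = strong    // does not increment the strong reference count

assert(weakRef != nil)       // the instance is still alive here
strong = nil                 // the last strong reference is released
assert(weakRef == nil)       // the weak reference has been zeroed
```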
<p>Swift has two kinds of <em>weak</em> references: <code class="inlinecode">unowned</code> and <code class="inlinecode">weak</code>.</p>
<p>While they serve the same purpose, they differ slightly in the assumptions they make about your instance’s lifetime and have different performance characteristics.</p>
<p>Instead of looking at this from the perspective of retain cycles between classes, we’ll discuss it in the context of closures, which since the days of Objective-C has probably been the most common situation in which you have to deal with retain cycles. As happens with classes, using an external instance inside a closure creates a strong reference to that instance, or <em>captures</em> it, blocking its deallocation.</p>
<p>In Objective-C, following the standard pattern, you would have declared a weak reference to that instance outside the block and then a strong reference to it inside the block, to get a hold of it for the duration of the block’s execution. And obviously, you had to check that the reference was still valid.</p>
<p>To help deal with retain cycles, Swift introduces a new construct that simplifies and makes more explicit the capturing of external variables inside closures: the <em>capture list</em>. With a capture list, you declare at the top of your closure the external variables that will be used, specifying which kind of reference should be created internally.</p>
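<p>In Swift, the Objective-C weak/strong dance described above becomes a capture list plus a <code class="inlinecode">guard let</code>; a minimal sketch, with a made-up <code class="inlinecode">Downloader</code> type:</p>

```swift
final class Downloader {
    var completedCount = 0

    func start(onDone: () -> Void) {
        onDone()
    }
}

let downloader = Downloader()
// Capture weakly, then promote to a strong reference for the duration
// of the closure body, bailing out if the instance is already gone.
downloader.start { [weak downloader] in
    guard let strongDownloader = downloader else { return }
    strongDownloader.completedCount += 1
}
assert(downloader.completedCount == 1)
```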
<p>Let’s see a few examples of what is the result of capturing variables in different ways.</p>
<p>When you don’t use capture lists, the closure will create a strong reference to the value from the outer scope:</p>
<pre><code class="Swift">
var i1 = 1, i2 = 1

var fStrong = {
    i1 += 1
    i2 += 2
}

fStrong()
print(i1, i2) //Prints 2 and 3
</code></pre>
<p>Modifications happening inside the closure will alter the value of the original variables, as you would expect.</p>
<p>Using a capture list, a new constant valid inside the closure’s scope is created instead. If you don’t specify a capture modifier, the constant will simply be a copy of the original value, and this works with both value types and reference types.</p>
<pre><code class="Swift">
var fCopy = { [i1] in
    print(i1, i2)
}

fStrong()
print(i1, i2) //Prints 2 and 3
fCopy()       //Prints 1 and 3
</code></pre>
<p>In the example above we are declaring the <code class="inlinecode">fCopy</code> function before the call to <code class="inlinecode">fStrong</code>, and it’s when the function is declared that the private constant is initialized. As you can see, when we call the second function we still print the original value for <code class="inlinecode">i1</code>.</p>
<p>Specifying either <code class="inlinecode">weak</code> or <code class="inlinecode">unowned</code> before the name of an external variable with a reference type, the constant will instead be initialized as a weak reference to the original value; this specific form of capturing is the one we use to break retain cycles.</p>
<pre><code class="Swift">
class aClass {
    var value = 1
}

var c1 = aClass()
var c2 = aClass()

var fSpec = { [unowned c1, weak c2] in
    c1.value += 1
    if let c2 = c2 {
        c2.value += 1
    }
}

fSpec()
print(c1.value, c2.value) //Prints 2 and 2
</code></pre>
<p>The difference in how the two <code class="inlinecode">aClass</code> captured instances are handled inside the closure is a consequence of their different characteristics.</p>
<p>Unowned references are used when the original instance will never be <em>nil</em> while the closure is reachable, and they behave like implicitly unwrapped optionals. Trying to use the captured value after the original instance has been deallocated will result in a crash.</p>
<p>If, instead, the original instance we want to capture <em>could</em> be <em>nil</em> at some point during execution, we must declare the reference as <code class="inlinecode">weak</code> and verify that it is still valid before using it.</p>
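<p>The difference is easy to observe. In this sketch (with made-up names) the <code class="inlinecode">weak</code> capture simply becomes nil when the instance goes away; an <code class="inlinecode">unowned</code> capture used at the same point would trap instead:</p>

```swift
class Worker {
    var jobs = 0
}

var worker: Worker? = Worker()
let addJob: () -> Bool = { [weak worker] in
    guard let worker = worker else { return false }
    worker.jobs += 1
    return true
}

assert(addJob() == true)   // the instance is still alive
worker = nil               // releases the only strong reference
assert(addJob() == false)  // the weak capture is now nil, no crash
```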
<h2 id="the-question-unowned-or-weak">The Question: Unowned or Weak?</h2>
<p>Which one of the two weak reference types should you use?</p>
<p>This question can be answered simply by <strong>reasoning about the lifetime</strong> of the original object and of the closure that references it.</p>
<p><img src="/imgs/unownedbig.png" srcset="/imgs/unownedbig@2x.png 2x" alt="unowned vs weak" /></p>
<p>There are two possible scenarios:</p>
<ul>
<li>
<p>The closure has the same lifetime as the captured object, so the closure is reachable only as long as the object is reachable (e.g. simple back-references between an object and its parent). In this case, you should declare the reference as <strong>unowned</strong>.</p>
<p>A common example is the <code class="inlinecode">[unowned self]</code> used in many examples of small closures that do something in the context of their parent and that, not being referenced <em>or passed</em> anywhere else, do not outlive it.</p>
</li>
<li>
<p>The closure’s lifetime is independent of that of the captured object, so the closure could still be referenced when the object is no longer reachable. In this case you should declare the reference as <strong>weak</strong> and verify it’s not nil before using it (don’t force unwrap it).</p>
<p>A common example of this is the <code class="inlinecode">[weak delegate = self.delegate!]</code> you can see in some examples of closure referencing a completely unrelated (lifetime-wise) delegate object.</p>
</li>
</ul>
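<p>A minimal sketch of the first scenario, with a closure stored in a property whose lifetime coincides with its parent’s (the <code class="inlinecode">Counter</code> type is a made-up example):</p>

```swift
final class Counter {
    var total = 0

    // The stored closure lives exactly as long as the Counter instance
    // itself, so [unowned self] can never outlive self here.
    lazy var increment: () -> Void = { [unowned self] in
        self.total += 1
    }
}

let counter = Counter()
counter.increment()
counter.increment()
assert(counter.total == 2)
```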
<p>What if you are unsure about the lifetime relationship between two objects and don’t want to risk having an invalid unowned reference? Could always capturing defensively as <em>weak</em> be a good approach?</p>
<p>No, and not only because having a clear idea of your objects’ lifetimes is a good thing: the two attributes also have wildly different performance characteristics.</p>
<p>The most common implementation of weak references requires that each time a new reference is created, it is registered in a side table where every weak reference is associated with the object it refers to.</p>
<p>When an object does not have any strong references pointing to it, the runtime will start the deallocation process but before this happens, it will set to <em>nil</em> all the weak references that were pointing to the object. Because of this behavior, weak references implemented this way are called <em>zeroing weak references</em>.</p>
<p>This implementation has a tangible overhead, considering that an additional data structure needs to be maintained and that we need to guarantee the correctness of all operations on these global reference-holding structures even in the presence of concurrent access. It should not be possible, under any circumstance, to access the value pointed to by a weak reference once the deallocation process has started.</p>
<p>Weak references in Swift (<em>unowned</em> and, with some variations, <em>weak</em> too) employ a less convoluted and faster mechanism instead.</p>
<p>Every object in Swift keeps two reference counters: the usual <em>strong reference counter</em>, used to decide when ARC can safely <em>deinitialize</em> an object, and an additional <em>weak reference counter</em> that counts how many <em>unowned</em> or <em>weak</em> references have been created toward the object. When this second counter reaches zero, the object is <em>deallocated</em>.</p>
<p>It is important to understand that an object is not really deallocated until all its <em>unowned</em> references have been released: it will be kept reachable, but in an uninitialized state, its content being just garbage once deinitialization has occurred.</p>
<p>Every time an <em>unowned</em> reference is declared, the object’s <em>unowned reference counter</em> is incremented atomically (using <a href="http://llvm.org/docs/Atomics.html#libcalls-atomic">atomic gcc/llvm operations</a>, which perform basic operations like increment, decrement, compare and compare-and-swap in a fast and thread-safe way) to guarantee thread safety, and each time the reference is used, the strong reference count is checked to verify that the object is still valid before safely retaining it.</p>
<p>Trying to access an invalid object will result in a failed assertion and your application will fail gracefully with an error at runtime (that’s why this <em>unowned</em> implementation is called <em>unowned(safe)</em>).</p>
<p>As a further optimization, if your application is compiled with <code class="inlinecode">-Ofast</code>, <em>unowned</em> references will no longer be checked for object validity and will behave like the <em>__unsafe_unretained</em> references you have in Objective-C. If the object is invalid, your reference will point to deinitialized garbage memory (an implementation known as <em>unowned(unsafe)</em>).</p>
<p>When an <em>unowned</em> reference is released, if there are no more <em>strong</em> or <em>unowned</em> references, the object is finally deallocated. This is why the object cannot be deallocated completely as soon as the strong counter reaches zero: all the reference counters must remain accessible to verify the <em>unowned</em> and <em>strong</em> counts.</p>
<p>Swift’s <em>weak</em> references add an additional layer of indirection, wrapping <em>unowned</em> references in an optional container; this is useful to cleanly handle all the cases in which the pointed-to object could become nil after a deallocation. But this does not come for free: additional machinery is required to manage this optional correctly.</p>
<p>Considering all this, <em>unowned</em> <strong>should be preferred</strong> over <em>weak</em> whenever lifetime relationships allow it. But this is not the end of our story; let’s talk about performance<sup><a href="#myfootnote1">1</a></sup> now.</p>
<h2 id="performance-a-look-under-the-hood">Performance: A Look Under The Hood</h2>
<p>Before we look into the sources of the Swift project to verify what was said in the previous section, we need to understand how each kind of reference is managed by ARC, and to do that I need to explain a few things about swiftc, LLVM and SIL.</p>
<p>I’ll try to give you a short overview and explain only what is strictly necessary, if you want to learn more you’ll find some useful links in the footnotes.</p>
<p>Let’s start with a diagram that contains the basic functional blocks of <em>swiftc</em>, the Swift compiler, to give you an idea of what the whole compilation process entails:</p>
<p><img src="/imgs/swiftc.png" srcset="/imgs/swiftc@2x.png 2x" alt="swiftc block diagram" /></p>
<p>Swiftc follows an approach for the most part similar to other compilers built on top of LLVM like <em>clang</em>.</p>
<p>In the first part of the compilation process, managed by a language-specific frontend, the source code is parsed to produce an AST representation<sup><a href="#myfootnote2">2</a></sup> of your source, and the resulting AST is then analyzed from a semantic point of view to identify semantic errors.</p>
<p>At this point, in other LLVM-based compilers, after an additional step that performs a static analysis of your code through a dedicated component (displaying errors and warnings if necessary), the content of the AST is converted by the <em>IRGen</em> component into a lightweight, low-level, machine-independent representation called <a href="http://llvm.org/docs/LangRef.html">LLVM IR</a> (LLVM Intermediate Representation).</p>
<p>These two components, the static analyzer and IRGen, are separate even though some checks need to be performed in both of them, so there is usually a lot of code duplication between these two modules.</p>
<p>The IR is a <a href="https://en.wikipedia.org/wiki/Static_single_assignment_form">Static Single Assignment form</a> (SSA-form) compliant language and can be considered the RISC-style <a href="https://idea.popcount.org/2013-07-24-ir-is-better-than-assembly/">assembly language</a> of the LLVM register-based virtual machine. Being SSA-based greatly simplifies the next step of the compilation process, in which multiple optimization passes are applied to the IR obtained from the internal representation provided by the language frontend.</p>
<p>It’s important to know that one of the characteristics of IR is that it can be represented in three different forms: an in-memory representation (used internally), a serialized bitcode representation (the same bitcode <a href="https://developer.apple.com/library/tvos/documentation/IDEs/Conceptual/AppDistributionGuide/AppThinning/AppThinning.html">you already know</a>) and a human-readable form.</p>
<p>This last form is quite useful to inspect the final structure of the IR code that will be passed to the last step of the process, which converts our machine-independent IR to a platform-specific representation (e.g. x86, ARM, etc.). This last step is performed by dedicated LLVM platform backends.</p>
<p>But what makes swiftc different from other compilers based on LLVM?</p>
<p>The fundamental structural difference between swiftc and other LLVM-based compilers is the presence of an additional component, <em>SILGen</em>, right before <em>IRGen</em>, that performs diagnostic and optimization passes on your source, producing an intermediate high-level representation called <em>SIL</em> (Swift Intermediate Language) that will then be converted to LLVM IR. This makes it possible to consolidate all the language-specific checks in a single software component and simplifies <em>IRGen</em>.</p>
<p>The conversion from AST to IR is a two-step process. <em>SILGen</em> converts the source, represented as an AST, into <em>raw SIL</em>; the compiler then performs Swift diagnostic checks (printing errors or warnings if necessary) and optimizes the validated raw SIL through multiple passes, producing <em>canonical SIL</em>. As the diagram above shows, the canonical SIL is then converted into LLVM IR.</p>
<p>SIL<sup><a href="#myfootnote3">3</a></sup> is, again, an SSA-form language and extends the Swift syntax with additional constructs. It relies on Swift’s type system and is able to understand Swift declarations, but it’s important to remember that top-level Swift code and function bodies will be ignored when compiling hand-written (yes, we can write SIL and compile it) SIL sources.</p>
<p>In the second half of this section, we’ll analyze an example of canonical SIL to understand how unowned and weak references are handled by the compiler. Looking at the SIL generated from our code, a basic closure with a capture list, we’ll be able to see all the ARC-related function calls added by the compiler.</p>
<h4 id="get-the-playground-for-this-and-other-articles-from-github-or-zipped-click-here-instead-for-the-closure-sample-and-the-silsilgen-and-llvm-ir-output-1"><em>Get the playground for this and other articles from <a href="https://github.com/uraimo/Swift-Playgrounds">GitHub</a> or <a href="https://www.uraimo.com/archives/2016-10-27-UnownedWeakPlayground.playground.zip">zipped</a>. Click <a href="https://www.uraimo.com/archives/closure.zip">here</a> instead for the closure sample and the SIL, SILGen and LLVM IR output.</em></h4>
<h3 id="deconstructing-capture-lists-handling">Deconstructing capture lists handling</h3>
<p>Let’s start with this simple Swift example that declares two variables and captures them weakly in a closure:</p>
<pre><code class="Swift">
class aClass {
    var value = 1
}

var c1 = aClass()
var c2 = aClass()

var fSpec = { [unowned c1, weak c2] in
    c1.value = 42
    if let c2o = c2 {
        c2o.value = 42
    }
}

fSpec()
</code></pre>
<p>To generate canonical SIL for this sample, just compile the swift source file with <code class="inlinecode">xcrun swiftc -emit-sil sample.swift</code>. Raw SIL can be generated using the <code class="inlinecode">-emit-silgen</code> option.</p>
<p>If you run the command above you’ll notice that swiftc produces quite a lot of code, so let’s take a look at an excerpt of the output to learn what some basic SIL directives do and to understand the overall structure.</p>
<p>I’ve added a few multiline comments with explanations where needed (the convenient single-line comments are generated by the compiler); they should be enough to clarify what’s happening:</p>
<pre><code class="Swift">
/*
This file contains canonical SIL
*/
sil_stage canonical
/*
Some special import available only internally that can be used in SIL
*/
import Builtin
import Swift
import SwiftShims
/*
Definitions of three global variables for c1, c2 and the fSpec closure.
@_Tv4sample2c1CS_6aClass is the symbol name of the c1 variable and $aClass
its type (types start with $). Names are mangled here but can
be demangled into something more readable, as we'll see below.
*/
// c1
sil_global hidden @_Tv4sample2c1CS_6aClass : $aClass
// c2
sil_global hidden @_Tv4sample2c2CS_6aClass : $aClass
// fSpec
sil_global hidden @_Tv4sample5fSpecFT_T_ : $@callee_owned () -> ()
...
/*
A hierarchical scope definition that refers to positions in the original source.
Each SIL instruction will point to the sil_scope it was generated from.
*/
sil_scope 1 { parent @main : $@convention(c) (Int32, UnsafeMutablePointer<Optional<UnsafeMutablePointer<Int8>>>) -> Int32 }
sil_scope 2 { loc "sample.swift":14:1 parent 1 }
/*
An autogenerated @main function that contains the code of our original global
scope.
It follows the familiar C main() structure, accepting the number of
arguments and an arguments array. The function conforms to the C calling convention.
This function contains the instructions needed to invoke the closure above.
*/
// main
sil @main : $@convention(c) (Int32, UnsafeMutablePointer<Optional<UnsafeMutablePointer<Int8>>>) -> Int32 {
/*
Registers start with a % followed by a numeric id.
Every time a new register is defined (or at the beginning of a function for function
parameters) the compiler adds a trailing comment with the list of registers or instructions
that depend on its value (called users).
For other instructions, the id of the current instruction is provided.
In this case, register 0 will be used to calculate the content of register 4 and register 1
will be used to create the value of register 10.
*/
// %0 // user: %4
// %1 // user: %10
/*
Every function is decomposed into a series of basic blocks of instructions and each block ends
with a terminating instruction (a branch or a return).
This graph of blocks represents all the possible execution paths of the function.
*/
bb0(%0 : $Int32, %1 : $UnsafeMutablePointer<Optional<UnsafeMutablePointer<Int8>>>):
...
/*
Each SIL instruction has a reference to the source location that contains the Swift
instruction from which it originated and a reference to the scope it's part of.
We'll look at some of these below when analyzing this method.
*/
unowned_retain %27 : $@sil_unowned aClass, loc "sample.swift":9:14, scope 2 // id: %28
store %27 to %2 : $*@sil_unowned aClass, loc "sample.swift":9:14, scope 2 // id: %29
%30 = alloc_box $@sil_weak Optional<aClass>, var, name "c2", loc "sample.swift":9:23, scope 2 // users: %46, %44, %43, %31
%31 = project_box %30 : $@box @sil_weak Optional<aClass>, loc "sample.swift":9:23, scope 2 // user: %35
%32 = load %19 : $*aClass, loc "sample.swift":9:23, scope 2 // users: %34, %33
...
}
...
/*
A series of autogenerated methods for aClass, init/deinit,
setter/getter and other utility methods.
The comments added by the compiler clarify what they do.
*/
/*
Hidden functions are visible only inside their module.
@convention(method) is the default Swift method calling convention, an additional
parameter is added at the end to contain a reference to self.
*/
// aClass.__deallocating_deinit
sil hidden @_TFC4clos6aClassD : $@convention(method) (@owned aClass) -> () {
...
}
/*
@guaranteed parameters are guaranteed to be valid for the whole duration of the call.
*/
// aClass.deinit
sil hidden @_TFC4clos6aClassd : $@convention(method) (@guaranteed aClass) -> @owned Builtin.NativeObject {
...
}
/*
Functions annotated with [transparent] are small functions that can be inlined.
*/
// aClass.value.getter
sil hidden [transparent] @_TFC4clos6aClassg5valueSi : $@convention(method) (@guaranteed aClass) -> Int {
...
}
// aClass.value.setter
sil hidden [transparent] @_TFC4clos6aClasss5valueSi : $@convention(method) (Int, @guaranteed aClass) -> () {
...
}
// aClass.value.materializeForSet
sil hidden [transparent] @_TFC4clos6aClassm5valueSi : $@convention(method) (Builtin.RawPointer, @inout Builtin.UnsafeValueBuffer, @guaranteed aClass) -> (Builtin.RawPointer, Optional<Builtin.RawPointer>) {
...
}
/*
@owned specifies that the object is owned by the caller.
*/
// aClass.init() -> aClass
sil hidden @_TFC4clos6aClasscfT_S0_ : $@convention(method) (@owned aClass) -> @owned aClass {
...
}
// aClass.__allocating_init() -> aClass
sil hidden @_TFC4clos6aClassCfT_S0_ : $@convention(method) (@thick aClass.Type) -> @owned aClass {
...
}
/*
The closure.
*/
// (closure #1)
sil shared @_TF4closU_FT_T_ : $@convention(thin) (@owned @sil_unowned aClass, @owned @box @sil_weak Optional<aClass>) -> () {
...
/* SIL for the closure, see below */
...
}
...
/*
sil_vtable defines the virtual function table for the aClass class.
It contains as expected all the autogenerated methods.
*/
sil_vtable aClass {
#aClass.deinit!deallocator: _TFC4clos6aClassD // aClass.__deallocating_deinit
#aClass.value!getter.1: _TFC4clos6aClassg5valueSi // aClass.value.getter
#aClass.value!setter.1: _TFC4clos6aClasss5valueSi // aClass.value.setter
#aClass.value!materializeForSet.1: _TFC4clos6aClassm5valueSi // aClass.value.materializeForSet
#aClass.init!initializer.1: _TFC4clos6aClasscfT_S0_ // aClass.init() -> aClass
}
</code></pre>
<p>Now let’s go back to the main function, to see how the two class instances are retrieved and passed to the closure when it’s invoked.</p>
<p>This time all the symbols are demangled<sup><a href="#myfootnote4">4</a></sup> to make the snippet slightly more readable:</p>
<pre><code class="Swift">
// main
sil @main : $@convention(c) (Int32, UnsafeMutablePointer<Optional<UnsafeMutablePointer<Int8>>>) -> Int32 {
// %0 // user: %4
// %1 // user: %10
bb0(%0 : $Int32, %1 : $UnsafeMutablePointer<Optional<UnsafeMutablePointer<Int8>>>):
...
/*
References to the global variables are placed in three registers.
*/
%13 = global_addr @clos.c1 : $*aClass, loc "sample.swift":5:5, scope 1 // users: %26, %17
...
%19 = global_addr @clos.c2 : $*aClass, loc "sample.swift":6:5, scope 1 // users: %32, %23
...
%25 = global_addr @clos.fSpec : $*@callee_owned () -> (), loc "sample.swift":8:5, scope 1 // users: %48, %45
/*
c1 is unowned_retained.
This instruction increments the unowned reference count of the variable.
*/
%26 = load %13 : $*aClass, loc "sample.swift":9:14, scope 2 // user: %27
%27 = ref_to_unowned %26 : $aClass to $@sil_unowned aClass, loc "sample.swift":9:14, scope 2 // users: %47, %38, %39, %29, %28
unowned_retain %27 : $@sil_unowned aClass, loc "sample.swift":9:14, scope 2 // id: %28
store %27 to %2 : $*@sil_unowned aClass, loc "sample.swift":9:14, scope 2 // id: %29
/*
For c2 the process is more complex.
alloc_box creates a reference-counted container for this variable that will be stored
on the heap.
After the box has been created, an optional variable is initialized to point to c2 and stored
in the box. The box retains the value it contains, so, as you see below, once the box is
populated, the optional can be released.
At one point, while the value of c2 is being stored in the optional, the object is
temporarily strong_retained and then released.
*/
%30 = alloc_box $@sil_weak Optional<aClass>, var, name "c2", loc "sample.swift":9:23, scope 2 // users: %46, %44, %43, %31
%31 = project_box %30 : $@box @sil_weak Optional<aClass>, loc "sample.swift":9:23, scope 2 // user: %35
%32 = load %19 : $*aClass, loc "sample.swift":9:23, scope 2 // users: %34, %33
strong_retain %32 : $aClass, loc "sample.swift":9:23, scope 2 // id: %33
%34 = enum $Optional<aClass>, #Optional.some!enumelt.1, %32 : $aClass, loc "sample.swift":9:23, scope 2 // users: %36, %35
store_weak %34 to [initialization] %31 : $*@sil_weak Optional<aClass>, loc "sample.swift":9:23, scope 2 // id: %35
release_value %34 : $Optional<aClass>, loc "sample.swift":9:23, scope 2 // id: %36
/*
A reference to the closure is retrieved.
*/
// function_ref (closure #1)
%37 = function_ref @sample.(closure #1) : $@convention(thin) (@owned @sil_unowned aClass, @owned @box @sil_weak Optional<aClass>) -> (), loc "sample.swift":8:13, scope 2 // user: %44
/*
c1 is tagged with @unowned and the variable is then unowned_retained.
*/
strong_retain_unowned %27 : $@sil_unowned aClass, loc "sample.swift":8:13, scope 2 // id: %38
%39 = unowned_to_ref %27 : $@sil_unowned aClass to $aClass, loc "sample.swift":8:13, scope 2 // users: %42, %40
%40 = ref_to_unowned %39 : $aClass to $@sil_unowned aClass, loc "sample.swift":8:13, scope 2 // users: %44, %41
unowned_retain %40 : $@sil_unowned aClass, loc "sample.swift":8:13, scope 2 // id: %41
strong_release %39 : $aClass, loc "sample.swift":8:13, scope 2 // id: %42
/*
The box containing an optional with the value of c2 is strong_retained.
*/
strong_retain %30 : $@box @sil_weak Optional<aClass>, loc "sample.swift":8:13, scope 2 // id: %43
/*
Creates a closure object binding the function to its parameters.
*/
%44 = partial_apply %37(%40, %30) : $@convention(thin) (@owned @sil_unowned aClass, @owned @box @sil_weak Optional<aClass>) -> (), loc "sample.swift":8:13, scope 2 // user: %45
store %44 to %25 : $*@callee_owned () -> (), loc "sample.swift":8:13, scope 2 // id: %45
/*
Performs release on the c1 and c2's box variables (using the matching *_release functions).
*/
strong_release %30 : $@box @sil_weak Optional<aClass>, loc "sample.swift":14:1, scope 2 // id: %46
unowned_release %27 : $@sil_unowned aClass, loc "sample.swift":9:14, scope 2 // id: %47
/*
Loads the previously stored closure object, retains it strongly and invokes the function.
*/
%48 = load %25 : $*@callee_owned () -> (), loc "sample.swift":17:1, scope 2 // users: %50, %49
strong_retain %48 : $@callee_owned () -> (), loc "sample.swift":17:1, scope 2 // id: %49
%50 = apply %48() : $@callee_owned () -> (), loc "sample.swift":17:7, scope 2
...
}
</code></pre>
<p>The closure has a more complex structure:</p>
<pre><code class="Swift">
/*
The closure parameters are annotated with @sil annotations that specify how they will be
retained, we have an unowned aClass, c1, and a weak box with an optional containing c2.
*/
// (closure #1)
sil shared @clos.fSpec: $@convention(thin) (@owned @sil_unowned aClass, @owned @box @sil_weak Optional<aClass>) -> () {
// %0 // users: %24, %6, %5, %2
// %1 // users: %23, %3
/*
This function has three blocks, with the last two being executed conditionally depending
on the value of the c2 optional.
*/
bb0(%0 : $@sil_unowned aClass, %1 : $@box @sil_weak Optional<aClass>):
...
/*
c1 is strongly retained.
*/
strong_retain_unowned %0 : $@sil_unowned aClass, loc "sample.swift":10:5, scope 17 // id: %5
%6 = unowned_to_ref %0 : $@sil_unowned aClass to $aClass, loc "sample.swift":10:5, scope 17 // users: %11, %10, %9
/*
Using the internal Builtin package, an Int with value 42 is initialized using an integer
literal as parameter for an Int struct.
This value is then set as new value of c1 and once done the variable is released.
The class_method instruction, which we see here for the first time, retrieves a reference
to a function from the vtable of an object.
*/
%7 = integer_literal $Builtin.Int64, 42, loc "sample.swift":10:16, scope 17 // user: %8
%8 = struct $Int (%7 : $Builtin.Int64), loc "sample.swift":10:16, scope 17 // user: %10
%9 = class_method %6 : $aClass, #aClass.value!setter.1 : (aClass) -> (Int) -> () , $@convention(method) (Int, @guaranteed aClass) -> (), loc "sample.swift":10:14, scope 17 // user: %10
%10 = apply %9(%8, %6) : $@convention(method) (Int, @guaranteed aClass) -> (), loc "sample.swift":10:14, scope 17
strong_release %6 : $aClass, loc "sample.swift":10:16, scope 17 // id: %11
/*
And now it's the turn of c2.
The optional is retrieved and a branch to one of the last two blocks is performed
depending on its content.
If the optional has a value, the bb2 block will be executed before jumping to bb3;
if it doesn't, after a brief jump to bb1, the function will proceed to bb3,
releasing the retained parameters.
*/
%12 = load_weak %3 : $*@sil_weak Optional<aClass>, loc "sample.swift":11:18, scope 18 // user: %13
switch_enum %12 : $Optional<aClass>, case #Optional.some!enumelt.1: bb2, default bb1, loc "sample.swift":11:18, scope 18 // id: %13
bb1: // Preds: bb0
/*
Jumps to the end of the closure.
*/
br bb3, loc "sample.swift":11:18, scope 16 // id: %14
// %15 // users: %21, %20, %19, %16
bb2(%15 : $aClass): // Preds: bb0
/*
Invokes the setter for aClass, setting a value of 42, and proceeds.
*/
...
%17 = integer_literal $Builtin.Int64, 42, loc "sample.swift":12:21, scope 19 // user: %18
%18 = struct $Int (%17 : $Builtin.Int64), loc "sample.swift":12:21, scope 19 // user: %20
%19 = class_method %15 : $aClass, #aClass.value!setter.1 : (aClass) -> (Int) -> () , $@convention(method) (Int, @guaranteed aClass) -> (), loc "sample.swift":12:19, scope 19 // user: %20
%20 = apply %19(%18, %15) : $@convention(method) (Int, @guaranteed aClass) -> (), loc "sample.swift":12:19, scope 19
strong_release %15 : $aClass, loc "sample.swift":13:5, scope 18 // id: %21
br bb3, loc "sample.swift":13:5, scope 18 // id: %22
bb3: // Preds: bb1 bb2
/*
Releases both captured parameters and returns.
*/
strong_release %1 : $@box @sil_weak Optional<aClass>, loc "sample.swift":14:1, scope 17 // id: %23
unowned_release %0 : $@sil_unowned aClass, loc "sample.swift":14:1, scope 17 // id: %24
%25 = tuple (), loc "sample.swift":14:1, scope 17 // user: %26
return %25 : $(), loc "sample.swift":14:1, scope 17 // id: %26
}
</code></pre>
<p>At this point, ignoring for a moment the performance characteristics of the various ARC instructions, we can do a quick recap of what needs to be done for each kind of captured variable at different stages:</p>
<table>
<thead>
<tr>
<th>Action</th>
<th>Unowned</th>
<th>Weak</th>
</tr>
</thead>
<tbody>
<tr>
<td>Pre-call #1</td>
<td>unowned_retain the object</td>
<td>Create a @box, strong_retain the object, create an optional and store it in the @box, release the optional</td>
</tr>
<tr>
<td>Pre-call #2</td>
<td>strong_retain_unowned, unowned_retain and strong_release</td>
<td>strong_retain</td>
</tr>
<tr>
<td>Closure execution</td>
<td>strong_retain_unowned, unowned_release</td>
<td>load_weak, switch on Optional, strong_release</td>
</tr>
<tr>
<td>Post-call</td>
<td>unowned_release</td>
<td>strong_release</td>
</tr>
</tbody>
</table>
<p>As we saw in the SIL excerpts above, handling weak references involves more work, because they make use of an optional that must also be managed.</p>
<p>Here is a brief explanation of what each of the ARC instructions listed above does, as described in the <a href="https://github.com/apple/swift/blob/master/docs/SIL.rst">documentation</a>:</p>
<ul>
<li><strong>unowned_retain:</strong> <em>Increments the unowned reference count of the heap object.</em></li>
<li><strong>strong_retain_unowned:</strong> <em>Asserts that the strong reference count of the object is still positive, then increases it by one.</em></li>
<li><strong>strong_retain:</strong> <em>Increases the strong retain count of the object.</em></li>
<li><strong>load_weak:</strong> <em>Not really an ARC call but it increments the strong reference count of the object referenced by the optional.</em></li>
<li><strong>strong_release:</strong> <em>Decrements the strong reference count of the object. If the release operation brings the strong reference count of the object to zero, the object is destroyed and the weak references are cleared. When both its strong and unowned reference counts reach zero, the object’s memory is deallocated.</em></li>
<li><strong>unowned_release:</strong> <em>Decrements the unowned reference count of the object. When both its strong and unowned reference counts reach zero, the object’s memory is deallocated.</em></li>
</ul>
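<p>To connect these instructions back to source code, the sample the SIL comes from can be reconstructed roughly like this (a sketch based on the SIL above, so names and details are approximate):</p>
<pre><code class="Swift">
class aClass {
    var value = 0
}

let c1 = aClass()
let c2 = aClass()

// c1 is captured unowned and c2 weak: the closure receives an
// unowned reference and a @box holding a weak Optional, which is
// what the bb0 signature in the SIL shows.
let clos = { [unowned c1, weak c2] in
    c1.value = 42        // strong_retain_unowned, setter call, strong_release
    if let c2 = c2 {     // load_weak + switch_enum on the Optional
        c2.value = 42    // bb2: setter call, then strong_release
    }
}

clos()
</code></pre>
<p>The capture list is the only thing that changes between the two variants, but as the table shows, the code generated for each kind of capture differs significantly.</p>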
<p>Now let’s dig deeper into the Swift runtime to see how these instructions are implemented. The files that contain what we need are <a href="https://github.com/apple/swift/blob/master/stdlib/public/runtime/HeapObject.cpp">HeapObject.cpp</a>, <a href="https://github.com/apple/swift/blob/master/include/swift/Runtime/HeapObject.h">HeapObject.h</a>, <a href="https://github.com/apple/swift/blob/master/stdlib/public/SwiftShims/RefCount.h">RefCount.h</a> and, for a few minor definitions, <a href="https://github.com/apple/swift/blob/master/stdlib/public/runtime/Heap.cpp">Heap.cpp</a> and <a href="https://github.com/apple/swift/blob/master/stdlib/public/runtime/SwiftObject.mm">SwiftObject.mm</a>. The implementation of boxes can be found in <a href="https://github.com/apple/swift/blob/master/stdlib/public/runtime/MetadataImpl.h">MetadataImpl.h</a>, but I will not talk about them in this post.</p>
<p>Many of the ARC functions declared in these files come in three variants: a basic implementation for Swift objects and two additional implementations for non-native Swift objects, Bridge objects and Unknown objects. The last two variants will not be discussed here.</p>
<p>The first set of instructions we’ll discuss is the one related to unowned references.</p>
<p>The functions that implement <code class="inlinecode">unowned_retain</code> and <code class="inlinecode">unowned_release</code> can be found halfway through <a href="https://github.com/apple/swift/blob/master/stdlib/public/runtime/HeapObject.cpp">HeapObject.cpp</a>:</p>
<pre><code class="cpp">
SWIFT_RT_ENTRY_VISIBILITY
void swift::swift_unownedRetain(HeapObject *object)
    SWIFT_CC(RegisterPreservingCC_IMPL) {
  if (!object)
    return;

  object->weakRefCount.increment();
}

SWIFT_RT_ENTRY_VISIBILITY
void swift::swift_unownedRelease(HeapObject *object)
    SWIFT_CC(RegisterPreservingCC_IMPL) {
  if (!object)
    return;

  if (object->weakRefCount.decrementShouldDeallocate()) {
    // Only class objects can be weak-retained and weak-released.
    auto metadata = object->metadata;
    assert(metadata->isClassObject());
    auto classMetadata = static_cast<const ClassMetadata*>(metadata);
    assert(classMetadata->isTypeMetadata());
    SWIFT_RT_ENTRY_CALL(swift_slowDealloc)
        (object, classMetadata->getInstanceSize(),
         classMetadata->getInstanceAlignMask());
  }
}
</code></pre>
<p>While <code class="inlinecode">swift_unownedRetain</code>, the implementation of <code class="inlinecode">unowned_retain</code>, simply increments the unowned reference count (here called <code class="inlinecode">weakRefCount</code>) atomically, <code class="inlinecode">swift_unownedRelease</code> is more complex because, as described above, it needs to handle object deallocation, performing it only when there are no other unowned references left.</p>
<p>But there is nothing particularly complex here: as you can see <a href="https://github.com/apple/swift/blob/master/stdlib/public/SwiftShims/RefCount.h#L242">here</a>, the <code class="inlinecode">doDecrementShouldDeallocate</code> function, called by a similarly named function in the snippet above, doesn’t do much, and <code class="inlinecode">swift_slowDealloc</code> just frees the given pointer.</p>
<p>And once we have an unowned reference to an object, another instruction, <code class="inlinecode">strong_retain_unowned</code>, is used to create a strong reference:</p>
<pre><code class="cpp">
SWIFT_RT_ENTRY_VISIBILITY
void swift::swift_unownedRetainStrong(HeapObject *object)
    SWIFT_CC(RegisterPreservingCC_IMPL) {
  if (!object)
    return;
  assert(object->weakRefCount.getCount() &&
         "object is not currently weakly retained");

  if (! object->refCount.tryIncrement())
    _swift_abortRetainUnowned(object);
}
</code></pre>
<p>Since this object should already be weakly referenced, an assert verifies that the object is indeed weakly retained; once that is done, an attempt is made to increment its strong retain count. The attempt will fail if the object is in the process of being deallocated.</p>
<p>All the functions like <code class="inlinecode">tryIncrement</code> that modify the retain counters in some way are located in <a href="https://github.com/apple/swift/blob/master/stdlib/public/SwiftShims/RefCount.h">RefCount.h</a> and require just a few atomic operations to perform their task.</p>
<p>Let’s talk about weak references now. As we saw before, <code class="inlinecode">swift_weakLoadStrong</code> is used to obtain a strong reference to the object contained in the optional:</p>
<pre><code class="cpp">
HeapObject *swift::swift_weakLoadStrong(WeakReference *ref) {
  if (ref->Value == (uintptr_t)nullptr) {
    return nullptr;
  }

  // ref might be visible to other threads
  auto ptr = __atomic_fetch_or(&ref->Value, WR_READING, __ATOMIC_RELAXED);
  while (ptr & WR_READING) {
    short c = 0;
    while (__atomic_load_n(&ref->Value, __ATOMIC_RELAXED) & WR_READING) {
      if (++c == WR_SPINLIMIT) {
        std::this_thread::yield();
        c -= 1;
      }
    }
    ptr = __atomic_fetch_or(&ref->Value, WR_READING, __ATOMIC_RELAXED);
  }

  auto object = (HeapObject*)(ptr & ~WR_NATIVE);
  if (object == nullptr) {
    __atomic_store_n(&ref->Value, (uintptr_t)nullptr, __ATOMIC_RELAXED);
    return nullptr;
  }
  if (object->refCount.isDeallocating()) {
    __atomic_store_n(&ref->Value, (uintptr_t)nullptr, __ATOMIC_RELAXED);
    SWIFT_RT_ENTRY_CALL(swift_unownedRelease)(object);
    return nullptr;
  }
  auto result = swift_tryRetain(object);
  __atomic_store_n(&ref->Value, ptr, __ATOMIC_RELAXED);
  return result;
}
</code></pre>
<p>Obtaining a strong reference in this case requires <a href="https://github.com/apple/swift/pull/1454">more complex synchronization</a> that will reduce performance under heavy thread contention.</p>
<p>The <code class="inlinecode">WeakReference</code> object we see here for the first time is a simple struct that contains an integer <code class="inlinecode">Value</code> field pointing to the target object, which, like every Swift object, is represented in the runtime by the <code class="inlinecode">HeapObject</code> class.</p>
<p>Right after the weak reference is acquired for the current thread by setting the <code class="inlinecode">WR_READING</code> flag, the Swift object is retrieved from the <code class="inlinecode">WeakReference</code> container, and if it’s not valid anymore, or if it has become eligible for deallocation while we were waiting to acquire the resource, the current reference is set to <em>null</em>.</p>
<p>If the object is still valid, an attempt to retain is performed as expected.</p>
<p>Therefore, even from this point of view, we can expect the performance of weak references during common operations to be lower than that of the simpler unowned references (though from what I’ve seen, the major overhead seems to be optional handling).</p>
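<p>This Optional round trip is also easy to observe from the language side: every read of a weak reference goes through <code class="inlinecode">swift_weakLoadStrong</code> and can come back empty once the target is gone. A minimal sketch (the class name here is made up; note that playgrounds may keep objects alive longer than a compiled program would):</p>
<pre><code class="Swift">
class Target {
    var value = 0
}

weak var weakRef: Target?

do {
    let strong = Target()
    weakRef = strong
    // While a strong reference exists, load_weak succeeds and
    // hands back a (temporarily retained) non-nil Optional.
    assert(weakRef != nil)
}

// Once the last strong reference is released, load_weak detects the
// deallocating object and clears the reference to nil.
assert(weakRef == nil)
</code></pre>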
<h2 id="conclusion">Conclusion</h2>
<p>Does defensively using only weak references make sense? No, from the point of view of both performance and code clarity.</p>
<p>Using the right type of capture modifier makes explicit some lifetime characteristics of our code and makes it harder to reach wrong conclusions about how the code behaves when someone else, or future you, reads what you wrote.</p>
<h3 id="footnotes">Footnotes</h3>
<div style="font-size: 0.9em">
<br /><a name="myfootnote1" style="font-weight:bold">1</a>: <i> The first discussion on the weak/unowned dilemma with input from Apple can be found <a href="https://devforums.apple.com/message/987086#987086">here</a>, and a later discussion on Twitter with Joe Groff has been <a href="http://mjtsai.com/blog/2015/11/24/how-swift-implements-unowned-and-weak-references/">summarized here by Michael Tsai</a>. This article starts from there with the intention of providing a thorough and approachable explanation.</i>
<br /><a name="myfootnote2" style="font-weight:bold">2</a>: <i>A good description of ASTs can be found on <a href="https://en.wikipedia.org/wiki/Abstract_syntax_tree">Wikipedia</a> while <a href="https://medium.com/@slavapestov/the-secret-life-of-types-in-swift-ff83c3c000a5#.jyxx86n2x">this article from Slava Pestov</a> has more details on how this is implemented for the Swift compiler.</i>
<br /><a name="myfootnote3" style="font-weight:bold">3</a>: <i>For more information about SIL, check out the detailed <a href="https://github.com/apple/swift/blob/master/docs/SIL.rst">official SIL guide</a> and <a href="https://www.youtube.com/watch?v=Ntj8ab-5cvE">this video</a> from the 2015 LLVM Developers' Meeting. A quick reference for SIL instructions written by Lex Chou is available <a href="https://github.com/lexchou/swallow/tree/master/docs/en/sil-3-instruction-references">here</a>.</i>
<br /><a name="myfootnote4" style="font-weight:bold">4</a>: <i>To learn more about how name mangling is performed in Swift, read <a href="https://github.com/lexchou/swallow/tree/master/docs/en/sil-1-mangling">this reference</a> from Lex Chou.</i>
<br /><a name="myfootnote5" style="font-weight:bold">5</a>: <i>Mike Ash talked about weak references with an experimental approach in one of his <a href="https://www.mikeash.com/pyblog/friday-qa-2015-12-11-swift-weak-references.html">Friday Q&A</a>, it's not completely up-to-date with the current way things are named and implemented in Swift but the explanations are still valid.</i>
</div>
Recursive Tail Calls and Trampolines in Swift2016-05-05T00:00:00+02:00https://www.uraimo.com/2016/05/05/recursive-tail-calls-and-trampolines-in-swiftThe use of recursion can improve the overall design of your algorithms but it leaves you susceptible to stack overflows if the compiler is unable to perform specific optimizations. In this post we'll see what tail recursion and TCO are and how the use of trampolines can overcome the fact that we cannot rely on the Swift compiler performing tail call elimination.<p><strong>Update 10/17:</strong><em>This post has been verified with Swift 4, no changes required.</em></p>
<p><strong>Update 10/16:</strong><em>This post has been updated to Swift 3.</em></p>
<p><img src="/imgs/trampoline.png" srcset="/imgs/trampoline@2x.png 2x" alt="Recursion vs Trampoline" /></p>
<p>The use of recursion can often lead to a cleaner implementation of your algorithms, but compared with implementations <a href="http://c2.com/cgi/wiki?RecursionVsLoop">based on loops</a>, recursive ones incur the additional cost of allocating and managing a <a href="http://www.cs.uwm.edu/classes/cs315/Bacon/Lecture/HTML/ch10s07.html">new stack frame</a> for every method call performed, which makes the recursive implementation slower and can also quickly lead to stack exhaustion (aka stack overflow).</p>
<p>To avoid the risk of overflowing the stack, the recommended approach is to rewrite your algorithm using <em>tail recursion</em> to leverage the <em>tail call optimization</em> that some compilers provide.</p>
<p>But what’s the difference between recursion and tail recursion and what are those compilers actually doing to solve the issues above?</p>
<p>Tail recursion differs from generic recursion in that the result of the recursive call is always returned to the caller without any additional manipulation, and calculations are performed through an accumulator variable that passes partial results along the chain of successive recursive calls, until the end of the recursion is reached.</p>
<p>If this definition seems confusing, the example in the next section will make it clearer; for now you just need to know that this specific kind of recursion can be optimized: these recursive algorithms can be easily identified by a compiler and converted to a more efficient loop-based implementation that is not affected by stack size limitations.</p>
<p>But in Swift, <em>we can’t rely on</em> the fact that the compiler will always <a href="https://twitter.com/jl_hfl/status/551168151497748480">perform the tail call elimination</a> described above in every circumstance.</p>
<p>This limitation has already been discussed on <a href="https://www.natashatherobot.com/functional-swift-tail-recursion/">Natasha’s blog</a> before, and some work has been done on the draft of a <a href="https://github.com/apple/swift-evolution/pull/103/files">proposal</a> that aimed to add a few new attributes that would have made the behavior of the optimizer more verifiable, allowing us to explicitly state which recursive method calls we expected to be optimized (and if that didn’t happen, an error would have been thrown).</p>
<p>In this post we’ll see how the lack of predictable tail call elimination in Swift can be overcome using trampolines, and a few alternatives to recursion will be described.</p>
<h4 id="get-the-playground-for-this-post-from-github-or-zipped"><em>Get the playground for this post from <a href="https://github.com/uraimo/Swift-Playgrounds/">GitHub</a> or <a href="https://www.uraimo.com/archives/2016-05-05-recursive-trampoline.zip">zipped</a>.</em></h4>
<h2 id="triangular-numbers-with-recursion">Triangular numbers with recursion</h2>
<p>Let’s see an example of an algorithm that calculates the n-th <a href="https://en.wikipedia.org/wiki/Triangular_number">triangular number</a> recursively:</p>
<pre><code class="Swift">
func tri(n: Int) -> Int {
    if n <= 0 {
        return 0
    }
    return n + tri(n: n-1)
}

tri(n: 300) // 45150
</code></pre>
<p>In this example of simple recursion, the result of the recursive call is added to the value passed as a parameter, and the result of our initial method call <code class="inlinecode">tri(n: 300)</code> will be the sum of all those integer values, chained together through recursion.</p>
<p>To improve this algorithm by adding tail recursion, we’ll add an accumulator that is passed along to the next call:</p>
<pre><code class="Swift">
func ttri(n: Int, acc: Int = 0) -> Int {
    if n < 1 {
        return acc
    }
    return ttri(n: n-1, acc: acc+n)
}

ttri(n: 300) // 45150
</code></pre>
<p>Notice how the result of this algorithm is now built with the accumulator and the last step of the recursion just returns the accumulator value to complete the calculation.</p>
<p>But both functions will crash your application or playground when invoked with a large enough integer as parameter. Let’s see how trampolines can solve this issue.</p>
<h2 id="trampolines">Trampolines</h2>
<p>The idea behind trampolines is actually quite simple.</p>
<p>A trampoline is not much more than a loop that iteratively executes functions that can either return the next function in the sequence (in the form of a <a href="https://en.wikipedia.org/wiki/Thunk">thunk</a> or continuation, a data structure that contains the information needed to perform a specific function call) or another kind of value (in this case the content of the accumulator) that signals the end of the iteration.</p>
<p>The tail recursive function we wrote will have to be slightly modified if we want to execute it sequentially through a trampoline: we’ll need to rewrite it in <a href="https://en.wikipedia.org/wiki/Continuation-passing_style">continuation-passing style</a>.</p>
<blockquote>
<p><strong>Update</strong></p>
<p><span style="font-style: normal;">As <a href="https://twitter.com/oisdk">@oisdk</a> pointed out, the modified function we’ll see below will only slightly resemble actual CPS:</span></p>
<p>Here, closures are letting you simulate lazy evaluation to do pseudo tail-call optimisation. In Continuation-Passing style, you pass in a continuation as an extra parameter to your recursive function. The continuation represents what happens after the function it’s being passed into is called. Then, to evaluate the continuation, you (usually) pass in the identity function. This lets you transform non tail-recursive functions into tail-recursive ones. Obviously, since TCO isn’t guaranteed in Swift, this isn’t very useful.</p>
<p>Regardless, here’s what the triangular numbers example would look like in CPS:</p>
<pre><code class="Swift">
func triCont(n: Int, cont: @escaping (Int) -> Int) -> Int {
    return n <= 1 ? cont(1) : triCont(n: n-1) { r in cont(r+n) }
}

func id<A>(x: A) -> A { return x }

triCont(n: 10, cont: id) // 55
</code></pre>
<p><span style="font-style: normal;">Thanks for the great explanation.</span></p>
</blockquote>
<p>Instead of directly performing a recursive call, our <code class="inlinecode">ttri</code> function will now return an object <em>that wraps the actual call we performed previously</em>, and once the point where the execution should complete is reached, we’ll return a sentinel value with the content of the accumulator.</p>
<p>We start by defining a <code class="inlinecode">Result</code> enum that represents the range of values that the modified recursive function will be able to return: a <code class="inlinecode">.Done</code> value that signals the end of the recursion and contains the accumulator, and a <code class="inlinecode">.Call</code> value that contains a closure with the next function call to perform.</p>
<pre><code class="Swift">
enum Result<A> {
    case Done(A)
    case Call(() -> Result<A>)
}
</code></pre>
<p>And after that we define a new function, containing the modified <code class="inlinecode">ttri</code> tail recursive function and a section of code implementing the trampoline. This last part is usually placed in a separate function, but in this example everything is kept together to make the code more readable.</p>
<pre><code class="Swift">
func tritr(n: Int) -> Int {
    func ttri(n: Int, acc: Int = 0) -> Result<Int> {
        if n < 1 {
            return .Done(acc)
        }
        return .Call({ () -> Result<Int> in
            return ttri(n: n-1, acc: acc+n)
        })
    }

    // Trampoline section
    let acc = 0
    var res = ttri(n: n, acc: acc)
    while true {
        switch res {
        case let .Done(accu):
            return accu
        case let .Call(f):
            res = f()
        }
    }
}

tritr(n: 300)
</code></pre>
<p>Once you wrap your head around it, understanding what is happening in the trampoline section isn’t too hard.</p>
<p>After an initial call to the <code class="inlinecode">ttri</code> method to bootstrap the trampoline, the functions contained in the <code class="inlinecode">.Call</code> enum values are executed in sequence while the value of the accumulator is updated at each step:</p>
<pre><code class="Swift">
return .Call({ () -> Result<Int> in
    return ttri(n: n-1, acc: acc+n)
})
</code></pre>
<p>Even if the code is different, the behavior is still the same as that of our original recursive call.</p>
<p>Once we are done, the <code class="inlinecode">ttri</code> function returns a <code class="inlinecode">.Done</code> enum with the final result to be returned.</p>
<p>And even if this implementation is slower than the original one, because of all the additional code needed to operate the trampoline, this version solves our biggest issue, stack exhaustion: we’ll now be able to calculate every triangular number our heart desires… until we hit the size limit for integers.</p>
<p><strong>Update:</strong> Following a suggestion from <a href="https://twitter.com/oisdk">@oisdk</a>, the design of the <code class="inlinecode">ttri</code> function could be improved using the often forgotten <code class="inlinecode">@autoclosure</code> attribute:</p>
<pre><code class="Swift">
func call<A>(c: @autoclosure @escaping () -> Result<A>) -> Result<A> {
    return .Call(c)
}

func ttri(n: Int, acc: Int = 1) -> Result<Int> {
    return n <= 1 ? .Done(acc) : call(c: ttri(n: n-1, acc: acc+n))
}
</code></pre>
<p>Before we move on, let me add one more thing about that example: enclosing a block of code within a <code class="inlinecode">while true</code> is usually a <em>code smell</em>, but this time I did it to make the example more compact. A more proper loop check would have looked something like this:</p>
<pre><code class="Swift">
while case .Call(_) = res {
    switch res {
    case let .Done(accu):
        return accu
    case let .Call(f):
        res = f()
    }
}

if case let .Done(ac) = res {
    return ac
}
return -1
<p>And for an even more proper check, since we are using enums with associated values, we should have implemented a comparison operator for this specific enum and checked for <em>not done</em> at the top of the loop.</p>
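<p>As a sketch of that idea (the <code class="inlinecode">isDone</code> helper and the <code class="inlinecode">run</code> function are hypothetical additions, not part of the original design): since <code class="inlinecode">.Call</code> carries a closure, a real <code class="inlinecode">Equatable</code> conformance isn’t practical, but a computed property is enough for a <em>not done</em> loop condition:</p>
<pre><code class="Swift">
// Mirrors the Result enum defined earlier in the post.
enum Result<A> {
    case Done(A)
    case Call(() -> Result<A>)
}

// .Call wraps a closure, so Equatable conformance isn't feasible;
// a computed property gives us the same loop condition.
extension Result {
    var isDone: Bool {
        if case .Done = self { return true }
        return false
    }
}

func run<A>(_ start: Result<A>) -> A {
    var res = start
    while !res.isDone {
        if case let .Call(f) = res {
            res = f()
        }
    }
    guard case let .Done(value) = res else { fatalError("unreachable") }
    return value
}
</code></pre>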
<p>Now that the basics of how trampolines work have been explained, we can build a generic function that, given a function converted to use the <code class="inlinecode">Result</code> enum, returns a closure that executes the original function with a trampoline, hiding what’s happening under the hood:</p>
<pre><code class="Swift">
func withTrampoline<V, A>(f: @escaping (V, A) -> Result<A>) -> ((V, A) -> A) {
    return { (value: V, accumulator: A) -> A in
        var res = f(value, accumulator)
        while true {
            switch res {
            case let .Done(accu):
                return accu
            case let .Call(f):
                res = f()
            }
        }
    }
}
</code></pre>
<p>The body of the closure we return is essentially what we had in the trampoline section of the previous example, and <code class="inlinecode">withTrampoline</code> expects as a parameter a function of the form <code class="inlinecode">(V,A)->Result<A></code>, which is what we had before.
The most obvious difference from the previous version is that, since we can’t initialize the generic accumulator <code class="inlinecode">A</code> because we don’t yet know its concrete type, we’ll need to expose it as a parameter of the returned function, a minor annoyance.</p>
<p>Let’s see how to use our new utility function:</p>
<pre><code class="Swift">
var fin: (_ n: Int, _ a: Int) -> Result<Int> = { _, _ in .Done(0) }
fin = { (n: Int, a: Int) -> Result<Int> in
    if n < 1 {
        return .Done(a)
    }
    return .Call({ () -> Result<Int> in
        return fin(n-1, a+n)
    })
}

let f = withTrampoline(f: fin)
f(30, 0)
</code></pre>
<p>This is probably a bit more verbose than what you expected.</p>
<p>Since we need a reference to the current function inside the closure to use it in the thunk, we must declare a dummy reference before declaring the actual closure and use that valid reference in our function.</p>
<p>Declaring the <code class="inlinecode">fin</code> closure directly without the dummy and attempting to use it would get us a <em>Variable used within its own initial value</em> error. If you feel adventurous, an alternative to this ugly workaround is to use a <a href="https://stackoverflow.com/questions/24717460/cant-make-weak-reference-to-closure-in-swift">Z Combinator</a>.</p>
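<p>For the adventurous, such a fixed-point combinator fits in a handful of lines. This is only a sketch (the <code class="inlinecode">fix</code> helper and its naming are mine), shown with the plain recursive version of the triangular numbers for brevity:</p>
<pre><code class="Swift">
// A fixed-point combinator: f receives a reference to "itself",
// so the closure never has to name itself and the dummy-reference
// workaround above becomes unnecessary.
func fix<V, A>(_ f: @escaping (@escaping (V) -> A) -> (V) -> A) -> (V) -> A {
    return { v in f(fix(f))(v) }
}

// The triangular-number closure, written without a dummy variable:
let triangular = fix { recurse in
    { (n: Int) -> Int in
        n < 1 ? 0 : n + recurse(n - 1)
    }
}
</code></pre>
<p>Note that this version is still plain recursion, so it only solves the self-reference problem, not the stack usage.</p>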
<p>But if moving away from the traditional trampoline design is not an issue, we can improve on what we have above by simplifying the <code class="inlinecode">Result</code> enum and keeping track of the function to call inside the trampoline, instead of saving it as an associated value of the enum:</p>
<pre><code class="Swift">
enum Result2<V, A> {
    case Done(A)
    case Call(V, A)
}

func withTrampoline2<V, A>(f: @escaping (V, A) -> Result2<V, A>) -> ((V, A) -> A) {
    return { (value: V, accumulator: A) -> A in
        var res = f(value, accumulator)
        while true {
            switch res {
            case let .Done(accu):
                return accu
            case let .Call(num, accu):
                res = f(num, accu)
            }
        }
    }
}

let f2 = withTrampoline2 { (n: Int, a: Int) -> Result2<Int, Int> in
    if n < 1 {
        return .Done(a)
    }
    return .Call(n-1, a+n)
}

f2(30, 0)
</code></pre>
<p>Way cleaner and more compact.</p>
<h4 id="get-the-playground-for-this-post-from-github-or-zipped-1"><em>Get the playground for this post from <a href="https://github.com/uraimo/Swift-Playgrounds/">GitHub</a> or <a href="https://www.uraimo.com/archives/2016-05-05-recursive-trampoline.zip">zipped</a>.</em></h4>
<h2 id="swifty-alternatives-to-recursion">Swifty Alternatives to recursion</h2>
<p>As you already know if you’ve read some of the posts in the <a href="/category/functional">Swift and the functional approach series</a>, Swift provides a few features that can be helpful to build alternative implementations for algorithms that are usually implemented recursively.</p>
<p>For example, triangular numbers could have been calculated with just a <a href="/2016/01/06/10-Swift-One-Liners-To-Impress-Your-Friends/">simple functional one liner</a> using reduce:</p>
<pre><code class="Swift">
(1...30).reduce(0, +) // 465
</code></pre>
<p>Or we could have created a <a href="/2015/11/12/experimenting-with-swift-2-sequencetype-generatortype/">Sequence or an Iterator</a> to generate a sequence with all the possible triangular numbers:</p>
<pre><code class="Swift">
class TriangularSequence: Sequence {
    func makeIterator() -> AnyIterator<Int> {
        var i = 0
        var acc = 0
        return AnyIterator {
            print("# Returning " + String(i))
            i = i + 1
            acc = acc + i
            return acc
        }
    }
}

var fs = TriangularSequence().makeIterator()
for _ in 1...30 {
    print(fs.next())
}
</code></pre>
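<p>Since Swift 3 the standard library also offers the <code class="inlinecode">sequence(state:next:)</code> function, which can express the same lazy generation even more compactly. A small sketch:</p>
<pre><code class="Swift">
// Each step advances an index and adds it to the accumulator,
// lazily yielding 1, 3, 6, 10, 15, ...
let triangulars = sequence(state: (0, 0)) { (state: inout (Int, Int)) -> Int? in
    state.0 += 1
    state.1 += state.0
    return state.1
}

let firstFive = Array(triangulars.prefix(5)) // [1, 3, 6, 10, 15]
</code></pre>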
<p>And these are only some of the alternative implementations we could have built using what Swift provides.</p>
<h2 id="closing-thoughts">Closing Thoughts</h2>
<p>This post describes the limitations that Swift has with regard to recursion and shows how trampolines, the by-the-book workaround for languages lacking TCO, can be implemented in Swift. But am I advocating the use of trampolines in your code?</p>
<p>Definitely not.</p>
<p>In Swift, considering that the language is not purely functional, anything that could be solved with a complex construct like trampolines can always be solved in a better way (producing code that is more readable and whose behavior is easier to grasp) using one of the features the language provides. <em>Don’t over-engineer your code</em>; your future self will be glad you didn’t.</p>
<p><em>Thanks to <a href="https://twitter.com/oisdk">@oisdk</a> for his insightful comments.</em></p>
Swift And C: Everything You Need to Know on Types, Pointers and more2016-04-07T00:00:00+02:00https://www.uraimo.com/2016/04/07/swift-and-c-everything-you-need-to-knowNow that Swift is available on Linux and other OSes, there are even more situations where you could need to interact with C code from your Swift 3 application. This article will hopefully shed some light on the most non-obvious details and give you some practical examples and tips of how to interact with C APIs like the C standard library.<p><strong>Update 5/17:</strong><em>A few minor improvements here and there related to other additions in Swift 3</em></p>
<p><strong>Update 3/17:</strong><em>This post has been updated to Swift 3 and extended with way more details and examples on UnsafePointers.</em></p>
<p><br /></p>
<p>Only a few months have passed since the open-source release of Swift, but the language has already <a href="http://uraimo.com/category/swiftporting/">been ported</a> to numerous new platforms, and new projects to port Swift somewhere else pop up every month.</p>
<p>The availability on different platforms has turned mixing Swift and C from something that appeared to be an esoteric practice with very limited practical utility, other than wrapping native libraries, into something you could have to deal with daily, depending on where your code is running.</p>
<p>While some of the basics of C interoperability are well explained in the official guide <a href="https://developer.apple.com/library/prerelease/ios/documentation/Swift/Conceptual/BuildingCocoaApps/InteractingWithCAPIs.html#//apple_ref/doc/uid/TP40014216-CH8-ID17">Using Swift with Cocoa and Objective-C</a>, more than a few things, especially related to the actual usage of bridged functions in real-world scenarios, still remain mysterious, documented and explained properly only in a handful of blog posts.</p>
<p>This article will hopefully shed some light on the most non-obvious details and give you some practical examples of how to interact with C APIs; and while this post has been written mainly for people who plan to start developing in Swift on Linux, everything explained here also applies to Darwin-based OSes.</p>
<p>After a brief description of how C types are imported into Swift, we’ll delve into the specifics of pointers, strings and functions, and conclude with a short tutorial on creating mixed Swift/C projects using LLVM modules.</p>
<h4 id="get-the-mixed-swiftc-playground-for-this-post-from-github-or-zipped"><em>Get the mixed Swift/C playground for this post from <a href="https://github.com/uraimo/Swift-Playgrounds/tree/swift2">GitHub</a> or <a href="https://www.uraimo.com/archives/2016-04-07-Swift-And-C.zip">zipped</a>.</em></h4>
<h3 id="contents">Contents</h3>
<ul>
<li><a href="#c-types">C Types</a>
<ul>
<li><a href="#arrays-and-structs">Arrays and Structs</a></li>
<li><a href="#enums">Enums</a></li>
<li><a href="#unions">Unions</a></li>
<li><a href="#the-size-of-things">The size of things</a></li>
<li><a href="#null-nil-0">Null, nil, 0</a></li>
</ul>
</li>
<li><a href="#macros">Macros</a></li>
<li><a href="#working-with-pointers">Working with Pointers</a>
<ul>
<li><a href="#allocating-memory">Allocating memory</a></li>
<li><a href="#pointer-conversion">Pointer Conversion</a></li>
<li><a href="#pointer-arithmetic">Pointer arithmetic</a></li>
<li><a href="#working-with-addresses">Working with addresses</a></li>
</ul>
</li>
<li><a href="#working-with-strings">Working with Strings</a></li>
<li><a href="#working-with-functions">Working with Functions</a>
<ul>
<li><a href="#unmanaged">Unmanaged</a></li>
</ul>
</li>
<li><a href="#working-with-files">Working with Files</a></li>
<li><a href="#bitwise-operations">Bitwise operations</a></li>
<li><a href="#swift-and-c-mixed-projects">Swift and C: Mixed Projects</a>
<ul>
<li><a href="#swift-package-manager">Swift Package Manager</a></li>
</ul>
</li>
<li><a href="#swift-3-changes">Swift 3 Changes</a></li>
<li><a href="#closing-thoughts">Closing Thoughts</a></li>
</ul>
<h2 id="c-types">C Types</h2>
<p>For each one of the basic C types, Swift provides a corresponding equivalent type that can be used when interoperating with C functions from Swift:</p>
<table>
<thead>
<tr>
<th>C Type</th>
<th>Swift C Type</th>
<th>Typealias of</th>
</tr>
</thead>
<tbody>
<tr>
<td>bool</td>
<td>CBool</td>
<td>Bool</td>
</tr>
<tr>
<td>char,unsigned char</td>
<td>CChar, CUnsignedChar</td>
<td>Int8, UInt8</td>
</tr>
<tr>
<td>short, unsigned short</td>
<td>CShort, CUnsignedShort</td>
<td>Int16, UInt16</td>
</tr>
<tr>
<td>int, unsigned int</td>
<td>CInt, CUnsignedInt</td>
<td>Int32, UInt32</td>
</tr>
<tr>
<td>long, unsigned long</td>
<td>CLong, CUnsignedLong</td>
<td>Int, UInt</td>
</tr>
<tr>
<td>long long, unsigned long long</td>
<td>CLongLong, CUnsignedLongLong</td>
<td>Int64, UInt64</td>
</tr>
<tr>
<td>wchar_t, char16_t, char32_t</td>
<td>CWideChar, CChar16, CChar32</td>
<td>UnicodeScalar, UInt16, UnicodeScalar</td>
</tr>
<tr>
<td>float, double</td>
<td>CFloat, CDouble</td>
<td>Float, Double</td>
</tr>
</tbody>
</table>
<p>The table above, in addition to what is already described in the official documentation, shows the actual Swift type each C typealias refers to.</p>
<p>Even if you should use the Swift C types whenever possible when writing code that interacts with C APIs, you’ll notice that the result of the import from C performed by Swift will most of the time simply use the usual Swift fixed-size types you are already familiar with.</p>
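<p>Since the Swift C types are plain typealiases, the mapping in the table can be verified directly. A minimal sanity check (this snippet is illustrative and not part of any imported header):</p>

```swift
// The Swift C types are simple typealiases, so their metatypes compare
// equal to the fixed-size types listed in the third column of the table.
print(CBool.self == Bool.self) // true
print(CChar.self == Int8.self) // true
print(CInt.self == Int32.self) // true

// A CInt can be used anywhere an Int32 is expected, with no conversion.
let answer: CInt = 42
let sum: Int32 = answer + 1
print(sum) // 43
```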
<h3 id="arrays-and-structs">Arrays and Structs</h3>
<p>Let’s now talk about compound data structures like arrays and structs.</p>
<p>In an ideal world, you would expect that a global array like this one:</p>
<pre><code class="c">
//header.h
char name[] = "IAmAString";
</code></pre>
<p>would be translated as a Swift String or at least as an array of some kind of character type. Well… this is what happens instead, once we try to use that imported <em>name</em> array in Swift:</p>
<pre><code class="swift">
print(name) // (73, 65, 109, 65, 83, 116, 114, 105, 110, 103, 0)
</code></pre>
<p>This alone is more than enough to recommend using pointers to sequences of objects instead of plain arrays whenever you can on the C layer of mixed Swift/C applications, to avoid painful translations once you reach the Swift layer.</p>
<p>But wait, can that global string declared as an array be recovered only with a convoluted piece of code converting the tuple to something more useful? Actually no, we’ll see how to fix that tuple with a few lines of code when discussing pointers.</p>
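<p>To anticipate the idea with a self-contained sketch (the four-element tuple below is a stand-in for the imported <em>name</em> tuple): since the tuple’s elements are laid out contiguously in memory, we can view them in place as a null-terminated C string and let <code class="inlinecode">String(cString:)</code> do the decoding.</p>

```swift
// Hedged sketch: recover a String from a char array imported as a tuple.
var tuple: (CChar, CChar, CChar, CChar) = (72, 105, 33, 0) // "Hi!" plus terminator
let recovered = withUnsafeBytes(of: &tuple) { raw -> String in
    // Reinterpret the tuple's raw bytes as a null-terminated C string
    String(cString: raw.baseAddress!.assumingMemoryBound(to: CChar.self))
}
print(recovered) // Hi!
```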
<p>Luckily, the situation is not so dire when dealing with structs, which are converted to Swift structs as expected, and their members are treated in the same predictable way: each one is converted recursively to the related Swift type.</p>
<p>For example, this struct:</p>
<pre><code class="c">
typedef struct {
char name[5];
int value;
int anotherValue;
} MyStruct;
</code></pre>
<p>is converted to a <code class="inlinecode">MyStruct</code> Swift struct. This clean conversion simplifies struct initialization too, which is also performed as usual:</p>
<pre><code class="swift">
let ms = MyStruct(name: (0, 0, 0, 0, 0), value: 1, anotherValue:2)
print(ms)
</code></pre>
<p>In one of the next sections we’ll see that this is not the only way to allocate and initialize an instance of a struct; especially if we just need a pointer to an empty object, it could be easier to allocate a new empty struct instance manually through a pointer.</p>
<h3 id="enums">Enums</h3>
<p>If you need to access a C enum from Swift, declaring it as you usually do in C:</p>
<pre><code class="c">
typedef enum {
ConnectionErrorCouldNotConnect = 0,
ConnectionErrorDisconnected = 1,
ConnectionErrorResetByPeer = 2
} ConnectionError;
</code></pre>
<p>will get you something completely different from what you expect: once imported in Swift, that enum will be represented by a structure and some global variables:</p>
<pre><code class="Swift">
struct ConnectionError : RawRepresentable, Equatable{ }
var ConnectionErrorCouldNotConnect: ConnectionError {get}
var ConnectionErrorDisconnected: ConnectionError {get}
var ConnectionErrorResetByPeer: ConnectionError {get}
</code></pre>
<p>And it’s quite obvious that this way we’ll lose all the capabilities that native Swift enums provide. But getting what we want is just a matter of using a specific macro in C:</p>
<pre><code class="c">
typedef NS_ENUM(NSInteger, ConnectionError) {
ConnectionErrorCouldNotConnect,
ConnectionErrorDisconnected,
ConnectionErrorResetByPeer
};
</code></pre>
<p>Using the <code class="inlinecode">NS_ENUM</code> macro (more details on this macro that is equivalent to declaring a classic C enum, <a href="http://nshipster.com/ns_enum-ns_options/">here</a>), this is how Swift will import the enum:</p>
<pre><code class="Swift">
enum ConnectionError: Int {
case CouldNotConnect
case Disconnected
case ResetByPeer
}
</code></pre>
<p>Notice that the conversion also stripped away the prefix that the enum had; this is one of the conversion rules of Swift that you can also see in action when you use the standard iOS/OSX frameworks from Swift.</p>
<p>Additionally, Swift also provides a <code class="inlinecode">NS_OPTIONS</code> macro that can be used to declare option sets conforming to <code class="inlinecode">OptionSetType</code>. To learn more about this macro, check out the official <a href="https://developer.apple.com/library/prerelease/ios/documentation/Swift/Conceptual/BuildingCocoaApps/InteractingWithCAPIs.html#//apple_ref/doc/uid/TP40014216-CH8-ID17">documentation</a>.</p>
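<p>Once imported via <code class="inlinecode">NS_ENUM</code>, the type behaves like any native Swift enum, with raw values, dot syntax and exhaustive switches. A small usage sketch, mirroring the imported type with a plain Swift declaration:</p>

```swift
// Swift-side equivalent of the imported NS_ENUM declaration.
enum ConnectionError: Int {
    case CouldNotConnect
    case Disconnected
    case ResetByPeer
}

let error = ConnectionError.Disconnected
switch error {
case .CouldNotConnect: print("could not connect")
case .Disconnected: print("disconnected")
case .ResetByPeer: print("reset by peer")
}
print(error.rawValue) // 1
```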
<h3 id="unions">Unions</h3>
<p>Let’s talk about unions, an interesting C type that has no Swift counterpart.</p>
<p>Swift supports unions only partially: while unions are imported, not every kind of field is supported and, as a consequence, some of the fields you declared in C may not be available (at the moment there is no documentation about what is not supported).</p>
<p>Let’s see an example of actual usage of this scarcely documented C type:</p>
<pre><code class="c">
//header.h
union TestUnion {
int i;
float f;
unsigned char asChar[4];
} testUnion;
</code></pre>
<p>Here we’ve declared a <code class="inlinecode">TestUnion</code> type with a related <code class="inlinecode">testUnion</code> union variable. Each field represents a different view on the same 4-byte chunk of memory: in C we would be able to access <code class="inlinecode">testUnion</code> either as an integer, a float, or as a set of bytes.</p>
<p>Since there is nothing similar to unions in Swift, this type will be imported as a <em>struct</em>:</p>
<pre><code class="Swift">
MemoryLayout<TestUnion>.stride // 4 bytes
testUnion.i = 33
testUnion.f // 4.624285e-44
testUnion.i // 33
testUnion.asChar // (33, 0, 0, 0)
testUnion.f = 1234567
testUnion.f // 1234567
testUnion.i // 1234613304
testUnion.asChar // (56, 180, 150, 73)
</code></pre>
<p>The first line verifies that this type, as we would expect from a union, is indeed only 4 bytes long, and the next lines modify one of the fields to verify that the values contained in the others are updated too. But why, when we set <code class="inlinecode">testUnion</code> to <code class="language-plaintext highlighter-rouge">33</code> using the integer field, do we get <code class="language-plaintext highlighter-rouge">4.624285e-44</code> when we read the field as a float?</p>
<p>This is related to how unions work. You can think of a union as a bag of bytes that can be set or read using the formatting rules of each one of the fields it’s composed of. What we did above was setting that 4-byte memory area with the same bit content an <code class="inlinecode">Int32(33)</code> would have had, and then reading that memory area back interpreting its bit pattern as an IEEE 754 float.</p>
<p>But let’s verify this using the useful (but dangerous) <code class="inlinecode">unsafeBitCast</code> function:</p>
<pre><code class="Swift">
var fv:Float32 = unsafeBitCast(Int32(33), to: Float.self) // 4.624285e-44
</code></pre>
<p>Here we are doing exactly the kind of conversion that happens when we access the union as a float, we are taking the bits that compose an <code class="inlinecode">Int32</code> with value <code class="language-plaintext highlighter-rouge">33</code> and assigning to them a variable with <code class="inlinecode">Float</code> type without any conversion or safety checks.</p>
<p>Now that we’ve seen how this behaves, could we implement a similar struct manually in Swift?</p>
<p>Even without checking the source, we can guess that <code class="inlinecode">TestUnion</code> is just a simple struct backed by a memory block of 4 bytes (it’s not important in which form) and the properties we access are just computed properties that hide all the conversion operations in their getters and setters.</p>
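<p>That guess can be turned into a working sketch (this is not the actual code generated by the importer): a struct with 4 bytes of raw storage and computed properties that reinterpret the same bits with <code class="inlinecode">unsafeBitCast</code>.</p>

```swift
// Manual re-implementation of a TestUnion-like type: one 4-byte storage
// property, with computed properties reinterpreting the same bit pattern.
struct ManualUnion {
    private var storage: UInt32 = 0
    var i: Int32 {
        get { return unsafeBitCast(storage, to: Int32.self) }
        set { storage = unsafeBitCast(newValue, to: UInt32.self) }
    }
    var f: Float {
        get { return unsafeBitCast(storage, to: Float.self) }
        set { storage = unsafeBitCast(newValue, to: UInt32.self) }
    }
}

var u = ManualUnion()
u.i = 33
print(u.f) // the bit pattern of Int32(33) read as an IEEE 754 float
u.f = 1234567
print(u.i) // the float's bit pattern read back as an integer
```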
<h3 id="the-size-of-things">The size of things</h3>
<p>In Swift you can obtain the data-only or memory size of a specific type (primitive or compound) using the <code class="inlinecode">MemoryLayout<T></code> generic struct and the properties and functions it provides.</p>
<p>If you are interested in the size of the data contained in a variable, ignoring any additional space added to guarantee memory alignment, you can use the <code class="inlinecode">size</code> property of the type or its static method <code class="inlinecode">size(ofValue:)</code>.</p>
<p>As you could have guessed, <code class="inlinecode">MemoryLayout</code> also provides an additional property and function that correctly retrieve the size of variables or types taking into account the additional space needed for alignment, and that you should favor over the previous ones most of the time: <code class="inlinecode">stride</code> and its static method <code class="inlinecode">stride(ofValue:)</code>.</p>
<p>Let’s see an example where you’ll notice the difference between the values returned by size and stride:</p>
<pre><code class="Swift">
print(MemoryLayout<CChar>.stride) // 1 byte
struct Struct1{
let anInt64:Int64
let anInt16:Int16
let b:Bool
}
print(MemoryLayout<Struct1>.size) // 11 (8+2+1) bytes
print(MemoryLayout<Struct1>.stride) // 16 bytes (11 + 5 of padding)
</code></pre>
<p>And while the amount of additional space added to abide by the alignment rules of the processor architecture can be obtained as the difference between the values returned by <code class="inlinecode">stride</code> and <code class="inlinecode">size</code>, an additional property, <code class="inlinecode">alignment</code>, is also available.</p>
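<p>A quick check of all three properties on a padded struct (the struct here is illustrative):</p>

```swift
// size counts only the data, alignment is dictated by the widest field,
// and stride is size rounded up to a multiple of alignment.
struct Padded {
    let big: Int64 // 8 bytes, forces 8-byte alignment
    let flag: Bool // 1 byte
}
print(MemoryLayout<Padded>.size)      // 9
print(MemoryLayout<Padded>.alignment) // 8
print(MemoryLayout<Padded>.stride)    // 16
```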
<h3 id="null-nil-0">Null, nil, 0</h3>
<p>Luckily, Swift does not have an additional constant to represent null values: you can just use the Swift <code class="inlinecode">nil</code>, regardless of the type of the specific variable or parameter.</p>
<p>As we’ll see when talking about pointers, <code class="inlinecode">nil</code> also gets automatically translated to a null <code class="inlinecode">Unsafe[Mutable]RawPointer?</code> when passed as a parameter.</p>
<p>If you need a typed null pointer instead, you can create one assigning the <code class="inlinecode">nil</code> value to an optional pointer:</p>
<pre><code class="Swift">
let p: UnsafeMutablePointer<UInt8>? = nil
</code></pre>
<p>Just don’t try to unwrap this optional, since it contains a nil value; assigning nil to an implicitly unwrapped optional pointer is also fine.</p>
<h2 id="macros">Macros</h2>
<p>Simple C defines are translated in Swift as global constants, something like this in C:</p>
<pre><code class="Swift">
#define MY_CONSTANT 42
</code></pre>
<p>Will be translated as:</p>
<pre><code class="Swift">
let MY_CONSTANT = 42
</code></pre>
<p>More complex macros and preprocessor directives will be <em>completely ignored by Swift</em> and will not be available.</p>
<p>Swift also provides a simple conditional compilation statement and some support functions that can be used to include certain sections of code only for specific OSes, architectures or versions of Swift.</p>
<pre><code class="Swift">
#if arch(arm) && os(Linux) && swift(>=2.2)
import Glibc
#else
import Darwin
#endif
puts("Hello!")
</code></pre>
<p>In this example, we import the standard C library from a different source depending on whether we are compiling this application on an ARM Linux host or not.</p>
<p>The functions available to customize the compilation behavior are: <code class="inlinecode">os()</code> (with valid values: <em>OSX, iOS, watchOS, tvOS, Linux</em>), <code class="inlinecode">arch()</code> (with valid values: <em>x86_64, arm, arm64, i386</em>) and <code class="inlinecode">swift()</code> (that requires its parameter to be specified as <em>>=version number</em>). The results of these functions can be combined with some basic logical operators to build complex rules: <strong>&&</strong>, <strong>||</strong>, <strong>!</strong>.</p>
<p>And since this is the first time we see it, remember that while on OSX you would import <em>Darwin</em> (or one of the frameworks that depends on it) in your projects to gain access to the libc functions, on a platform like Linux you’ll need to import <em>Glibc</em>.</p>
<h2 id="working-with-pointers">Working with Pointers</h2>
<p>Pointers are automatically translated to different kinds of <code class="inlinecode">UnsafePointer<Pointee></code> objects depending on the characteristics of the value they point to:</p>
<table>
<thead>
<tr>
<th>C Pointer</th>
<th>Swift Type</th>
</tr>
</thead>
<tbody>
<tr>
<td>int *</td>
<td>UnsafeMutablePointer<Int32></td>
</tr>
<tr>
<td>const int *</td>
<td>UnsafePointer<Int32></td>
</tr>
<tr>
<td>NSDate**</td>
<td>AutoreleasingUnsafeMutablePointer<NSDate></td>
</tr>
<tr>
<td>struct UnknownType *</td>
<td>OpaquePointer</td>
</tr>
<tr>
<td>void *</td>
<td>UnsafeMutableRawPointer</td>
</tr>
<tr>
<td>const void *</td>
<td>UnsafeRawPointer</td>
</tr>
</tbody>
</table>
<p>While the general rule is that mutable pointer instances point to mutable variables, for class objects, as in the third example <code class="inlinecode">NSDate**</code>, pointers to objects passed by reference are translated as <code class="inlinecode">AutoreleasingUnsafeMutablePointer</code>.</p>
<p>Moreover, if the type we are pointing to is not completely defined or cannot be represented in Swift (structs or unions that can only partially be translated from C), the pointer will be translated as an <code class="inlinecode">OpaquePointer</code>, an untyped pointer, essentially just a struct that contains some bits. Values pointed to by an <code class="inlinecode">OpaquePointer</code> cannot be accessed directly; the pointer variable will need to be converted first.</p>
<p>Conversions from <code class="inlinecode">UnsafeMutablePointer<Type></code> to <code class="inlinecode">UnsafePointer<Type></code> are performed automatically (for example when you pass a mutable pointer to a function requiring an immutable one), while a compiler error is raised the other way around.</p>
<p>A pointer to an immutable value cannot be converted to a pointer to a mutable value <em>implicitly</em>; Swift tries to guarantee a minimum of safety even in this circumstance, and this is an example of one of the operations on pointers that, as we’ll see, need to be performed manually. To obtain a mutable unsafe pointer from an immutable one you must use the initializer <code class="inlinecode">UnsafeMutablePointer(mutating:)</code>.</p>
<p>It’s important to note that since Swift 3.0 conversion between pointers with different element types using the init method is not possible anymore, casting will need to be done explicitly with specific methods as we’ll see in <a href="#pointers-conversion">one of the following sections</a>.</p>
<p>The release 3.0 of Swift most notably added the <code class="inlinecode">UnsafeRawPointer</code> type to deal with untyped pointers, pointers that would normally be represented in C as void pointers and that were represented in Swift 2.x as <code class="inlinecode">Unsafe[Mutable]Pointer<Void></code>.</p>
<p>UnsafeRawPointers have a few methods that, as we’ll see in the following sections, will make our life a lot easier by simplifying all operations related to pointer conversions.</p>
<p>All the pointer types, with the exclusion of raw pointers and opaque pointers, are type safe (the compiler will perform type checks on the pointer and its content) and guarantee address alignment with the <em>Pointee</em> type they point to (the pointer, even after operations that move its address forwards or backwards, will always point at the start of a Pointee value).</p>
<p>The “unsafe” prefix in all those type names, instead, refers to how we access the content: interacting with pointers directly allows us to circumvent the safety measures the language has put in place when we access objects, and is therefore considered inherently unsafe.</p>
<p>But what about the lifetime of the pointed objects? Are they handled through ARC at all?</p>
<p>As we already know, Swift uses ARC to manage the lifetime of reference types (and some structs and enums are <em>tracked</em> too when they contain reference types) and to track ownership; do unsafe pointers behave in some peculiar way?</p>
<p>No, <code class="inlinecode">Unsafe[Mutable]Pointer<Type></code> structs will be tracked if they point to a reference type (a class object) or if they contain some reference to be tracked. Keep this in mind, as it will help explain an apparently weird behavior when we talk about memory allocation.</p>
<p>But again, this is not true for raw pointers and opaque pointers, which are not managed by ARC. Because of this, in some circumstances you’ll need a specific utility class called <code class="inlinecode">Unmanaged</code> to control the lifetime of the pointed objects; we’ll discuss this later.</p>
<p>Now that we know how pointers are represented, there are two things left to describe: how pointers can be dereferenced to obtain or modify the value they point to and how we can obtain a pointer to a new or preexisting Swift variable.</p>
<p>Once you get a hold of a non-Void <code class="inlinecode">Unsafe[Mutable]Pointer<T></code> retrieving or modifying the pointed value is straightforward using the <code class="inlinecode">pointee</code> property:</p>
<pre><code class="Swift">
var anInt:Int = myIntPointer.pointee //UnsafeMutablePointer<Int> --> Int
myIntPointer.pointee = 42
myIntPointer[0] = 43
</code></pre>
<p>And you can also access a specific element in a sequence of values of the same type as you would do in C, using the convenient array subscript, where each increment of the index moves you forward by <code class="inlinecode">MemoryLayout<T>.stride</code> bytes to the next element of the sequence.</p>
<p>On the other hand, if you need to obtain an <code class="inlinecode">Unsafe[Mutable]Pointer</code> to a Swift variable to use it as parameter of a function call and <em>only in that situation</em>, this can be done easily using the same operator we use to pass <strong>inout</strong> parameters by address to functions:</p>
<pre><code class="Swift">
var i = 42
functionThatNeedsAPointer(&i)
</code></pre>
<p>Considering that the operator cannot be used to perform that conversion outside of the described function invocation context, if you need to get a hold of the pointer variable to perform further computation (e.g. a pointer type conversion), Swift provides the <code class="inlinecode">withUnsafePointer(to:body:)</code> and <code class="inlinecode">withUnsafeMutablePointer(to:body:)</code> utility functions:</p>
<pre><code class="Swift">
withUnsafePointer(to: &i) { (ptr: UnsafePointer<Int>) -> Void in
let vptr = UnsafeRawPointer(ptr)
functionThatNeedsAVoidPointer(vptr)
}
let r = withUnsafePointer(to: &i) { (ptr: UnsafePointer<Int>) -> Int in
let vptr = UnsafeRawPointer(ptr)
return functionThatNeedsAVoidPointerAndReturnsInt(vptr)
}
</code></pre>
<p>This function creates a pointer object for the given variable and passes it to a closure that can then use it and optionally return a value. The pointer is guaranteed to be valid for the duration of the closure and considering it’s meant to be used only in that context <em>it can’t be returned</em> to the outside scope.</p>
<p>This way, the ability to access the variable unsafely is limited to the well-defined scope of the closure. In the sample above we are converting the int pointer to a void pointer before passing it to a function, in the <em><a href="#pointers-conversion">“Pointers Conversion”</a></em> section we’ll see how to convert one pointer type to another.</p>
<p>And let’s briefly talk again about <code class="inlinecode">OpaquePointer</code>: there is nothing special about it, and it can easily be converted to a specific typed pointer thanks to the initializers of <code class="inlinecode">Unsafe[Mutable]Pointer</code> and then accessed using the <code class="inlinecode">pointee</code> property like any other Unsafe[Mutable]Pointer:</p>
<pre><code class="Swift">
// ptr is an untyped OpaquePointer
var iptr = UnsafePointer<Int>(ptr)
print(iptr.pointee)
</code></pre>
<p>Swift also provides unsafe buffer pointers, which are used to manipulate areas of memory that contain a sequence of values of the same type.</p>
<p>We can obtain a buffer pointer to the contents of a classic Swift array with a slightly different syntax, using one of its methods <code class="inlinecode">withUnsafe[Mutable]BufferPointer</code>, which exposes the array’s storage as an <code class="inlinecode">Unsafe[Mutable]BufferPointer</code>:</p>
<pre><code class="Swift">
let array: [Int8] = [ 65, 66, 67, 0 ]
puts(array) // ABC
array.withUnsafeBufferPointer { (ptr: UnsafeBufferPointer<Int8>) in
puts(ptr.baseAddress! + 1) //BC
}
</code></pre>
<p>Note that <code class="inlinecode">Unsafe[Mutable]BufferPointer</code> also exposes a <code class="inlinecode">baseAddress</code> property that contains the base address of the buffer.</p>
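<p>The mutable variant works the same way and lets us modify the array’s contiguous storage in place. A small sketch:</p>

```swift
// withUnsafeMutableBufferPointer hands the closure a writable view on the
// array's contiguous storage; changes are reflected back in the array.
var numbers: [Int32] = [1, 2, 3, 4]
numbers.withUnsafeMutableBufferPointer { (buf: inout UnsafeMutableBufferPointer<Int32>) in
    for i in buf.indices {
        buf[i] *= 2
    }
}
print(numbers) // [2, 4, 6, 8]
```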
<p>Following the choice made with the <code class="inlinecode">withUnsafePointer(to:body:)</code> method, a global function with the same name is also provided for unsafe buffers.</p>
<p>The proposal <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0138-unsaferawbufferpointer.md">SE-0138</a>, implemented in Swift 3.0.1, adds a new global function and a new array method, both called <code class="inlinecode">withUnsafeBytes</code>, that allow manipulating the area of memory pointed to by a typed pointer via a <em>raw buffer pointer</em> to a series of bytes (UInt8).</p>
<p>This can come in handy when you want to inspect or alter the underlying raw representation of a high-level data structure.</p>
<p>Let’s try this with Swift’s strings, which are built internally on a <code class="inlinecode"> _StringCore </code> struct, a struct that contains three components: a pointer to an area of memory with the actual characters, a bit mask with the length and other flags, and an owner reference.</p>
<pre><code class="Swift">
public struct _StringCore {
// Internals
public var _baseAddress: UnsafeMutableRawPointer?
var _countAndFlags: UInt
public var _owner: AnyObject?
...
</code></pre>
<p>We’ll use <code class="inlinecode">withUnsafeBytes</code> to print each byte that makes up the structure, then we’ll build a pointer from the first 8 bytes of the struct (obtaining the <code class="inlinecode">_baseAddress</code> pointer, stored as little endian, with the least significant byte placed in the first position of the byte sequence), and finally we’ll create a new string using the character buffer pointed to by <code class="inlinecode">_baseAddress</code> as input.</p>
<pre><code class="Swift">
import Foundation
var str = "iAmAStringHello"
withUnsafeBytes(of:&str){ ptr in
ptr.forEach{
//Print the content of the underlying _StringCore structure
print(String(format:"0x%x",$0))
}
let strptr = UnsafePointer<UInt8>(bitPattern:
UInt(ptr[0]) | // e.g. 0x26
(UInt(ptr[1]) << 8) | // 0x58
(UInt(ptr[2]) << 16) | // 0xef
(UInt(ptr[3]) << 24) | // 0x0e
(UInt(ptr[4]) << 32) | // 0x01
(UInt(ptr[5]) << 40) | // 0x00
(UInt(ptr[6]) << 48) | // 0x00
(UInt(ptr[7]) << 56) ) // 0x00
print(String(format:"0x%x",UInt(bitPattern: strptr)))
//Print the c string stored at address 0x000000010eef5826
print(String(cString:strptr!)) // Prints: iAmAStringHello
}
</code></pre>
<p>As we’ll see in a while, we won’t need to reconstruct the addresses we stored in memory every time; that piece of code can be greatly simplified as follows, changing the way we access that block of memory and with a few pointer casts:</p>
<pre><code class="Swift">
let tmpp = UnsafeMutableRawPointer(ptr.baseAddress!).assumingMemoryBound(to: Int.self)
let strptr = UnsafeMutablePointer<UInt8>(bitPattern: tmpp.pointee)
</code></pre>
<p>I’m not going to explain this yet, after reading the next sections you’ll understand what I did here.</p>
<p>There is another kind of pointer we haven’t discussed yet: function pointers.</p>
<p>Since Swift 2.0, C function pointers are imported in Swift as closures with a special attribute <code class="inlinecode">@convention(c)</code> that specifies that the closure conforms to C calling conventions, we’ll see what this means in one of the <a href="#working-with-functions">following sections</a>.</p>
<p>Ignoring these details for a moment, the essential thing to know about function pointers is that every time an imported C function expects a function pointer parameter we will be able to use a closure declared in-place or a Swift function reference as parameter.</p>
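<p>A classic example is the C standard library’s <code class="inlinecode">qsort</code>, which expects a comparator function pointer; a capture-less Swift closure converts to it automatically (the raw pointers just have to be rebound to the element type by hand):</p>

```swift
#if os(Linux)
import Glibc
#else
import Darwin
#endif

var values: [Int32] = [3, 1, 2]
// The trailing closure becomes a @convention(c) function pointer because
// it captures nothing from the surrounding scope.
qsort(&values, values.count, MemoryLayout<Int32>.stride) { a, b in
    let x = a!.assumingMemoryBound(to: Int32.self).pointee
    let y = b!.assumingMemoryBound(to: Int32.self).pointee
    return x < y ? -1 : (x == y ? 0 : 1)
}
print(values) // [1, 2, 3]
```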
<h3 id="allocating-memory">Allocating memory</h3>
<p>Until now we have only obtained pointers to existing Swift objects, but we have never allocated memory manually. In this section we’ll see how this can be done in Swift using the recommended approach, or with the functions of the <em>malloc</em> family as we would have done in C (which can make sense in a few very specific circumstances).</p>
<p>But before we start, we should be aware that unsafe pointers like good old C pointers can have three possible states during their lifetime:</p>
<ul>
<li><em>Unallocated</em>: No memory has been reserved for the pointer.</li>
<li><em>Allocated</em>: The pointer points to a valid allocated memory location but its value is uninitialized.</li>
<li><em>Initialized</em>: The pointer points to an allocated and initialized memory location.</li>
</ul>
<p>Pointers will move between these states in response to the operations we’ll perform on them.</p>
<p>The recommended approach for dealing with pointers, the one you should choose most of the time, consists in using the methods that <code class="inlinecode">UnsafeMutablePointer</code> provides: allocate a new block of memory and retrieve a pointer to it, initialize it, and, once we’re done with it, clean up its content and deallocate the memory it refers to.</p>
<p>Let’s see a basic example:</p>
<pre><code class="Swift">
var ptr = UnsafeMutablePointer<CChar>.allocate(capacity: 10)
ptr.initialize(from: [CChar](repeating: 0, count: 10))
//Do something with the object
ptr[3] = 42
ptr.deinitialize(count: 10) //Clean up
ptr.deallocate(capacity: 10) //Let's free the memory we allocated
</code></pre>
<p>Here we have allocated a block of memory for 10 CChars (Int8) using <code class="inlinecode">allocate(capacity:)</code> (which for raw pointers has an additional parameter to specify the alignment). That’s basically equivalent to calling <code class="inlinecode">malloc</code> specifying the total size of the memory chunk and then casting the result to the specific pointer type we need, but way less error prone since we don’t have to specify the size manually.</p>
<p>Once the space for the <code class="inlinecode">UnsafeMutablePointer</code> has been allocated, we must initialize the mutable object with one of the <code class="inlinecode">initialize</code> methods, for example <code class="inlinecode">initialize(to:Pointee, count:Int)</code> to initialize with <em>count</em> copies of a <em>Pointee</em> value, or the <code class="inlinecode">initialize(from: Collection)</code> method specifying the initial content with another sequence. When we are done with the object and want to free the allocated resources, we first clean up its content with <code class="inlinecode">deinitialize</code> and then proceed to deallocate the pointer with <code class="inlinecode">deallocate(capacity:)</code>.</p>
<p>It’s important to note that the Swift runtime will not do these last two things for you: as it was your responsibility to allocate the memory needed by the variable, it’s also your responsibility to deallocate it once you are done working on that memory area.</p>
<p>Let’s see another example with a pointer to a more complex Swift value type this time:</p>
<pre><code class="Swift">
var sptr = UnsafeMutablePointer<String>.allocate(capacity: 1)
sptr.initialize(to:"Test String")
print(sptr[0])
print(sptr.pointee)
sptr.deinitialize(count: 1)
sptr.deallocate(capacity: 1)
</code></pre>
<p>The sequence of operations comprising the allocation &amp; initialization and cleanup &amp; deallocation phases is the same for value types and reference types, but if you play around with it you’ll notice that for some value types (like integers, floats or some simple structs) the initialization is not actually necessary and you can just set the content with the <code class="inlinecode">pointee</code> property or via array subscript.</p>
<p>But this will definitely not work when your pointer points to a class or to some specific structs or enums. Sometimes initialization is required, but why?</p>
<p>The reason behind this behavior is related to what happens, from the memory management point of view, when you modify the content in one of the ways described above. Let’s see a snippet that doesn’t require manual initialization, and then one that crashes miserably if we don’t initialize the UnsafeMutablePointer before altering its content.</p>
<pre><code class="Swift">
struct MyStruct1{
var int1:Int
var int2:Int
}
var s1ptr = UnsafeMutablePointer<MyStruct1>.allocate(capacity: 5)
s1ptr[0] = MyStruct1(int1: 1, int2: 2)
s1ptr[1] = MyStruct1(int1: 1, int2: 2) //This always works!
s1ptr.deinitialize(count: 5)
s1ptr.deallocate(capacity: 5)
</code></pre>
<p>No problem here, this just works, but let’s see this other example:</p>
<pre><code class="Swift">
class TestClass{
var aField:Int = 0
}
struct MyStruct2{
var int1:Int
var int2:Int
var tc:TestClass // we have introduced a field with a reference type
}
var s2ptr = UnsafeMutablePointer<MyStruct2>.allocate(capacity: 5)
s2ptr.initialize(from: [MyStruct2(int1: 1, int2: 2, tc: TestClass()), // Remove the initialization
MyStruct2(int1: 1, int2: 2, tc: TestClass())]) // and you'll have a crash below
s2ptr[0] = MyStruct2(int1: 1, int2: 2, tc: TestClass())
s2ptr[1] = MyStruct2(int1: 1, int2: 2, tc: TestClass())
s2ptr.deinitialize(count: 2)
s2ptr.deallocate(capacity: 5)
</code></pre>
<p>What happens here is related to what was described at the beginning of the <a href="#working-with-pointers">Working with Pointers</a> section: <code class="inlinecode">MyStruct2</code> contains a reference and its lifetime is managed with ARC. When we modify one of the values in the memory block it points to, the Swift runtime will try to release the previous object contained in that slot, and if the memory contains garbage because it was never initialized, your application will crash.</p>
<p>Be aware of this and, to be on the safe side, favor initializing your UnsafeMutablePointers with <code class="inlinecode">initialize</code> once they have been allocated over just setting the memory directly.</p>
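<p>As a quick sketch of the safe pattern for reference types (using the same allocation API shown above), the full lifecycle looks like this:</p>
<pre><code class="Swift">
class Counter {
    var count = 0
}
let cptr = UnsafeMutablePointer<Counter>.allocate(capacity: 1)
cptr.initialize(to: Counter())  // Sets up the ARC bookkeeping for this slot
cptr.pointee.count = 42         // Now plain access through pointee is fine
cptr.pointee = Counter()        // The runtime correctly releases the old instance
cptr.deinitialize()             // Releases the last instance stored in the slot
cptr.deallocate(capacity: 1)
</code></pre>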
<p>The alternative approach we hinted at at the beginning of this section consists in importing the standard C library (<em>Darwin</em> or <em>Glibc</em> on Linux) and using the familiar functions of the <em>malloc</em> family:</p>
<pre><code class="Swift">
var ptr = malloc(10 * MemoryLayout<CChar>.stride).bindMemory(to: CChar.self, capacity: 10)
ptr[0] = 11
ptr[1] = 12
free(ptr)
</code></pre>
<p>As you can see we are not initializing the instances like we did when we followed the recommended approach, because, as noted in the last paragraph, with a type like CChar or some basic structs this works.</p>
<p>Let’s now see two additional examples of usage of two common functions: <code class="inlinecode">memcpy</code> and <code class="inlinecode">mmap</code>:</p>
<pre><code class="Swift">
var val = [CChar](repeating: 1, count: 10)
var buf = [CChar](repeating: 0, count: val.count)
memcpy(&buf, &val, buf.count*MemoryLayout<CChar>.stride)
buf // [1,1,1,1,1,1,1,1,1,1]
let mptr = mmap(nil, Int(getpagesize()), PROT_READ | PROT_WRITE, MAP_ANON | MAP_PRIVATE, -1, 0)!
if (Int(bitPattern: mptr) == -1) { //MAP_FAILED not available, but its value is (void*)-1
perror("mmap error")
abort()
}
// Bind the *uninitialized* memory to the Int type, for initialized memory we should have used .assumingMemoryBound(to:)
let iptr = mptr.bindMemory(to: Int.self, capacity: Int(getpagesize())/MemoryLayout<Int>.stride)
iptr[0] = 3
munmap(mptr, Int(getpagesize()))
</code></pre>
<p>This code is similar to what you would have done in C, note that you can easily retrieve the size of a memory page with <code class="inlinecode">getpagesize()</code>.</p>
<p>But while the first example just shows that we can use <code class="inlinecode">memcpy</code>, the second shows a real use case for the alternative approach to memory allocation: here we are mapping a new memory page, but we could have mapped a specific memory area or a file, and in that case we would have needed only direct access to the preexisting content, without any initialization.</p>
<p>Let’s see a real world example taken from <a href="https://github.com/uraimo/SwiftyGPIO">SwiftyGPIO</a>, here I’m <a href="https://github.com/uraimo/SwiftyGPIO/blob/master/Sources/SwiftyGPIO.swift#L294">mapping the area of memory</a> that contains the registers for the digital GPIO pins of the Raspberry Pi that will be used throughout the library to read or write values:</p>
<pre><code class="Swift">
// BCM2708_PERI_BASE = 0x20000000
// GPIO_BASE = BCM2708_PERI_BASE + 0x200000 /* GPIO controller */
// BLOCK_SIZE = 4*1024
private func initIO(id: Int){
let mem_fd = open("/dev/mem", O_RDWR | O_SYNC)
guard (mem_fd > 0) else {
print("Can't open /dev/mem")
abort()
}
let gpio_map = mmap(
nil, //Any address in our space will do
BLOCK_SIZE, //Map length
PROT_READ|PROT_WRITE,// Enable reading & writing to mapped memory
MAP_SHARED, //Shared with other processes
mem_fd, //File to map
off_t(GPIO_BASE) //Offset to GPIO peripheral
)!
close(mem_fd)
let gpioBasePointer = gpio_map.assumingMemoryBound(to: Int.self)
if (Int(bitPattern:gpioBasePointer) == -1) { //MAP_FAILED not available, but its value is (void*)-1
perror("GPIO mmap error")
abort()
}
gpioGetPointer = gpioBasePointer.advanced(by: 13)
gpioSetPointer = gpioBasePointer.advanced(by: 7)
gpioClearPointer = gpioBasePointer.advanced(by: 10)
inited = true
}
</code></pre>
<p>Once we’ve mapped the 4KB area starting at <code class="language-plaintext highlighter-rouge">0x20200000</code>, we retrieve the address of the 3 registers we are interested in and from there we just read or write their values through the <code class="inlinecode">pointee</code> property.</p>
<h3 id="pointer-conversion">Pointer Conversion</h3>
<p>Conversions between Unsafe[Mutable]Pointers with a different type can be performed in a few ways.</p>
<p>If you need to bind the content pointed to by a pointer to a different type <em>temporarily</em>, obtaining what we could consider a different <em>view</em> of the pointed data, you can use <code class="inlinecode">withMemoryRebound(to:capacity:body:)</code>, which provides an <code class="inlinecode">UnsafeMutablePointer</code> in the context of a closure where you can modify the data, momentarily circumventing the type safety rules of the original pointer.</p>
<pre><code class="Swift">
// With ptr being an UnsafeMutablePointer<UInt8>
let charPtr = ptr.withMemoryRebound(to: CChar.self, capacity: 11, {
(cptr) -> String in
return String(validatingUTF8: cptr)!
})
</code></pre>
<p>To perform a permanent type conversion, you’ll need raw pointers.</p>
<p>A raw pointer pointing to the same address can simply be created by initializing a raw pointer with the typed pointer you want to convert, and then rebinding it with either <code class="inlinecode">assumingMemoryBound(to:)</code> or <code class="inlinecode">bindMemory(to:capacity)</code>, depending on the state of the memory. If the block of memory the pointer points to is uninitialized, use <code class="inlinecode">bindMemory(to:capacity)</code>; using this method on already bound memory rebinds it to the new type. On memory that was already initialized, already bound to the new type and correctly aligned, use <code class="inlinecode">assumingMemoryBound(to:)</code> instead.</p>
<p>We have already seen a few examples with these methods in the previous section but now let’s isolate only what we really need to perform a simple pointer conversion:</p>
<pre><code class="Swift">
UnsafeRawPointer(typedPointer).bindMemory(to: UInt8.self, capacity: 1024)
</code></pre>
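<p>To make the difference concrete, here is a small sketch (the names are hypothetical) that permanently converts a pointer to a single UInt32 into a byte-level view of the same memory:</p>
<pre><code class="Swift">
let wordPtr = UnsafeMutablePointer<UInt32>.allocate(capacity: 1)
wordPtr.initialize(to: 0x0A0B0C0D)
// The memory is initialized and bound to UInt32, so rebinding it
// to UInt8 is done with bindMemory(to:capacity:)
let bytePtr = UnsafeRawPointer(wordPtr).bindMemory(to: UInt8.self,
                                                   capacity: MemoryLayout<UInt32>.stride)
print(bytePtr[0]) // 0x0D on a little-endian platform
wordPtr.deallocate(capacity: 1)
</code></pre>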
<p>In some very specific circumstances, and only after serious consideration, you could reach the conclusion that, for example for performance reasons, it makes sense to do without the safety features that Swift provides and just work with raw pointers, maybe performing some <a href="https://en.wikipedia.org/wiki/Type_punning">type punning</a>.</p>
<p>As we have already seen, most of the functionality you’ll need in these cases is offered by the <code class="inlinecode">Unsafe[Mutable]RawPointer</code> types. Let’s see a few other methods that are available to operate on chunks of memory without having to worry about type safety checks.</p>
<pre><code class="Swift">
// Get a value starting at a specific offset and cast it to a type
let value = ptr.load(fromByteOffset: 8, as: UInt8.self)
// Set a value at a specific offset and with the specific type
ptr.storeBytes(of: 0xFA, toByteOffset: 8, as: UInt8.self)
// Copy bytes from another pointer
ptr.copyBytes(from: anotherPtr, count: 10 * MemoryLayout<UInt8>.stride)
</code></pre>
<p>Now we are ready to go back to the char array we had at the beginning of this article, with all the information we have now and knowing that a <code class="inlinecode">CChar</code> tuple is automatically converted to a pointer of a sequence of <code class="inlinecode">CChar</code> we can convert the tuple to a String easily:</p>
<pre><code class="Swift">
let namestr = withUnsafePointer(to: &name, { (ptr) -> String? in
let charPtr = ptr.withMemoryRebound(to: CChar.self, capacity: 11, {
(cptr) -> String in
return String(validatingUTF8: cptr)!
})
return charPtr
})
print(namestr!) //IAmAString
</code></pre>
<h3 id="pointer-arithmetic">Pointer arithmetic</h3>
<p>In C it is quite common to use pointer arithmetic to move through sequences or to get a reference to a specific member of a compound variable. Can this be done in Swift too?</p>
<p>Sure: <code class="inlinecode">UnsafePointer</code> and its mutable variant provide a few convenient methods that perform the same operations that in C we would express by incrementing or decrementing pointers: <code class="inlinecode">successor()</code>, <code class="inlinecode">predecessor()</code>, <code class="inlinecode">advanced(by:)</code> and <code class="inlinecode">distance(to:)</code>.</p>
<pre><code class="Swift">
var aptr = UnsafeMutablePointer<CChar>.allocate(capacity: 5)
aptr.initialize(from: [33,34,35,36,37])
print(aptr.successor().pointee) // 34
print(aptr.advanced(by: 3).pointee) // 36
print(aptr.advanced(by: 3).predecessor().pointee) // 35
print(aptr.distance(to: aptr.advanced(by: 3))) // 3
aptr.deinitialize(count: 5)
aptr.deallocate(capacity: 5)
</code></pre>
<p>Even if I’ve presented these methods first and they are the ones I recommend you use, it’s still possible to increment and decrement an unsafe pointer by adding an integer, obtaining a pointer to one of the other elements in a sequence:</p>
<pre><code class="Swift">
print((aptr+1).pointee) // 34
print((aptr+3).pointee) // 36
print(((aptr+3)-1).pointee) // 35
</code></pre>
<p>When you increment or decrement an Unsafe[Mutable]Pointer, the actual address is moved by multiples of <code class="inlinecode">MemoryLayout<Pointee>.stride</code>. Raw pointers or opaque pointers are instead just incremented or decremented by the given number of bytes.</p>
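<p>A short sketch makes the difference visible: advancing a typed pointer by one is equivalent to advancing the corresponding raw pointer by one stride, not by one byte:</p>
<pre><code class="Swift">
let base = UnsafeMutablePointer<Int32>.allocate(capacity: 2)
let raw = UnsafeRawPointer(base)
// Typed pointers move in units of MemoryLayout<Pointee>.stride (4 bytes here)
print(UnsafeRawPointer(base + 1) == raw + MemoryLayout<Int32>.stride) // true
// Raw pointers move one byte at a time
print(UnsafeRawPointer(base + 1) == raw + 1) // false
base.deallocate(capacity: 2)
</code></pre>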
<p>The classic use case for pointer arithmetic is the traversal of the components of structures, or of structures that contain nested sub-structures.</p>
<p>Let’s see an example that uses some of the techniques explained so far with a data structure composed of a header, defined as a <code class="inlinecode">Header</code> struct with a few fields, and a body with a series of float values. We will first allocate an untyped pointer to the whole block of memory and then obtain two typed pointers to the two distinct logical segments:</p>
<pre><code class="Swift">
struct Header{
let field1: Int64 // 8 bytes
let field2: Int32 // 4 bytes
let field3: Int64 // 8 bytes
}
let numValues = 100 // 100 float values in the body
let ptr = UnsafeMutableRawPointer.allocate(
bytes: MemoryLayout<Header>.stride + numValues * MemoryLayout<Float>.stride,
alignedTo: MemoryLayout<Header>.alignment)
let header = ptr.bindMemory(to: Header.self, capacity: 1)
let data = (ptr + MemoryLayout<Header>.stride).bindMemory(to: Float.self, capacity: numValues)
</code></pre>
<p>With the code above we end up with two <code class="inlinecode">UnsafeMutablePointer</code> instances: one pointing at the beginning of the memory area, and one offset by <code class="inlinecode">MemoryLayout<Header>.stride</code> (24 bytes rather than 20, because of the alignment padding inserted after <code class="inlinecode">field2</code>), pointing at the series of 100 floats.</p>
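<p>Continuing the snippet above, we can verify the layout by writing through the two typed pointers and then freeing the whole block with a single deallocation:</p>
<pre><code class="Swift">
header.pointee = Header(field1: 1, field2: 2, field3: 3)
for i in 0..<numValues {
    data[i] = Float(i)
}
print(header.pointee.field3) // 3
print(data[99]) // 99.0
// One deallocation for the whole block, mirroring the single allocation
ptr.deallocate(bytes: MemoryLayout<Header>.stride + numValues * MemoryLayout<Float>.stride,
               alignedTo: MemoryLayout<Header>.alignment)
</code></pre>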
<h3 id="working-with-addresses">Working with addresses</h3>
<p>Sometimes you could need to retrieve the actual pointer value to perform some calculation with it or to store it like an integer. This could be done before Swift 3.0 with an unsafe bit cast to an integer of compatible size:</p>
<pre><code class="Swift">
import Foundation
func address<T>(of: UnsafePointer<T>) -> UInt {
return unsafeBitCast(of, to:UInt.self)
}
var a = 1
print( String(format:"0x%016x",address(of:&a)) )
</code></pre>
<p>But since Swift 3.0 you can accomplish the same thing way more safely using an <code class="inlinecode">Int</code> or <code class="inlinecode">UInt</code> initializer with <em>every</em> pointer type:</p>
<pre><code class="Swift">
import Foundation
func address<T>(of: UnsafePointer<T>) -> UInt {
return UInt(bitPattern: of)
}
var a = 1
print( String(format:"0x%016x", address(of: &a)) )
</code></pre>
<p>What if you already have an integer representation of a pointer address and need to build an unsafe pointer from it?</p>
<p>Swift has a specific unsafe pointer initializer to do this without resorting again to an unsafe bit cast. You can create either a typed pointer or a raw pointer using a specific integer bit pattern as the address. The address will need to be correctly aligned for the Pointee type, meaning that <code class="inlinecode">pattern % MemoryLayout<Pointee>.alignment</code> should be 0.</p>
<pre><code class="Swift">
let ptr = UnsafeMutableRawPointer(bitPattern: 0x12345678)!
</code></pre>
<p>But please note that trying to unwrap a pointer with a bit pattern equal to zero (a null pointer) will result in a crash, like it would for any other Swift optional with a nil value.</p>
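<p>For addresses computed at runtime that could be zero, prefer handling the optional explicitly instead of force unwrapping (the address below is just an example value):</p>
<pre><code class="Swift">
let address: UInt = 0x12345678
if let raw = UnsafeMutableRawPointer(bitPattern: address) {
    print("Valid pointer: \(raw)")
} else {
    print("The bit pattern was zero, no pointer was created")
}
</code></pre>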
<h4 id="get-the-mixed-swiftc-playground-for-this-post-from-github-or-zipped-1"><em>Get the mixed Swift/C playground for this post from <a href="https://github.com/uraimo/Swift-Playgrounds/">GitHub</a> or <a href="https://www.uraimo.com/archives/2016-04-07-Swift-And-C.zip">zipped</a>.</em></h4>
<h2 id="working-with-strings">Working with Strings</h2>
<p>When a C function has a char pointer parameter, as we now know, that parameter is imported in Swift as an <code class="inlinecode">Unsafe[Mutable]Pointer<Int8></code>, but since Swift automatically converts strings into pointers to UTF8 buffers, you’ll be able to invoke those functions simply passing a string as the parameter, without converting it manually first.</p>
<p>Alternatively, if you need to perform additional operations with that pointer before invoking the function expecting a char pointer, Swift strings also provide the <code class="inlinecode">withCString</code> method that passes the UTF8 char buffer to a closure that can optionally return a value.</p>
<pre><code class="Swift">
puts("Hey! I was a Swift string!") //Passing a swift string to a libc method
var testString = "AAAAA"
testString.withCString { (ptr: UnsafePointer<Int8>) -> Void in
// Do something with ptr
functionThatExpectsAConstCharPointer(ptr)
}
</code></pre>
<p>Turning a C string into a full fledged Swift string is straightforward, just use the <code class="inlinecode">init(cString:)</code> or <code class="inlinecode">init?(validatingUTF8:)</code> String initializers, but remember that the C string must be <strong>null terminated</strong>.</p>
<pre><code class="Swift">
let swiftString = String(validatingUTF8: aCString)!
</code></pre>
<p>If you are porting some C code that deals with strings to Swift, for example something handling user input, you could need to compare the value of each character in a string against a single ASCII character value or an ASCII range. Can this be done in Swift, considering how strings are structured?</p>
<p>Yes, but I will not delve too much into the specifics of Swift strings; if you want to learn more about how strings in Swift are structured read <a href="http://oleb.net/blog/2014/07/swift-strings/">this article from Ole Begemann</a>, and <a href="http://andybargh.com/unicode/">this article by Andy Bargh</a> to learn more about Unicode.</p>
<p>Let’s see an example of a function that verifies whether a string is composed only of basic ASCII printable characters, something that looks like it could have been ported from C:</p>
<pre><code class="Swift">
func isPrintable(text:String)->Bool{
for scalar in text.unicodeScalars {
let charCode = scalar.value
guard (charCode>31)&&(charCode<127) else {
return false // Unprintable character
}
}
return true
}
</code></pre>
<p>What in C was likely a comparison between a char’s integer value and an ASCII range for each character of the string doesn’t change much in Swift: we just use the value of each unicode scalar for the comparison. Clearly, this approach makes sense only if those strings have graphemes composed of a single scalar, which is not true in the general case.</p>
<p>And what about simple conversions between characters and their numerical ASCII value?</p>
<p>To convert a numerical value to the corresponding <code class="inlinecode">Character</code> or <code class="inlinecode">String</code> we must first convert it to a <code class="inlinecode">UnicodeScalar</code>, while the most compact way to do the opposite uses the specific initializer that only the <code class="inlinecode">UInt8</code> type provides:</p>
<pre><code class="Swift">
let c = Character(UnicodeScalar(70)) // "F"
let s = String(UnicodeScalar(70)) // "F"
let asciiForF = UInt8(ascii:"F") // 70
</code></pre>
<p>The guard statement in the previous example could have been improved with <code class="inlinecode">UInt8(ascii:)</code> to increase readability.</p>
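<p>For example, the printable-ASCII check could be rewritten like this (a sketch equivalent to the version above):</p>
<pre><code class="Swift">
func isPrintable(text: String) -> Bool {
    let space = UInt32(UInt8(ascii: " "))  // 32, the first printable character
    let tilde = UInt32(UInt8(ascii: "~"))  // 126, the last printable character
    for scalar in text.unicodeScalars {
        guard scalar.value >= space && scalar.value <= tilde else {
            return false // Unprintable character
        }
    }
    return true
}
</code></pre>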
<h2 id="working-with-functions">Working with Functions</h2>
<p>As happens with strings, Swift is able to automatically convert closures into C function pointers when used as parameters, but with a major twist: closures that will be used as C function pointer parameters <em>cannot capture any value outside of their context</em>.</p>
<p>To enforce this, these kinds of closures (and closures that are the result of a conversion from a C function pointer) are automatically annotated with a specific type attribute, <code class="inlinecode">@convention(c)</code>, that, as described in the chapter on type attributes of the <a href="https://developer.apple.com/library/ios/documentation/Swift/Conceptual/Swift_Programming_Language/Attributes.html#//apple_ref/doc/uid/TP40014097-CH35-ID347">Swift Language Reference</a>, indicates the calling convention the closure conforms to. The possible values are: <em>c</em>, <em>objc</em> and <em>swift</em>.</p>
<p>A few alternatives to work around this limitation are described in <a href="http://chris.eidhof.nl/posts/swift-c-interop.html">this article by Chris Eidhof</a> and consist in using a block-based function, if you are on a Darwin-derived OS and are calling a function that also has a block variant, or passing a retained environment object to the function, following a common C pattern.</p>
<p>Now let’s talk briefly about variadic functions.</p>
<p>Swift does not support traditional variadic C functions, and this will be clear the first time you try to call a function like <code class="inlinecode">printf</code> from Swift: compile-time error. If you really need to use one of those, the only viable alternative is building a wrapper function in C that fixes the number of parameters, or a wrapper that accepts multiple parameters indirectly using a <em>va_list</em> (which is supported by Swift).</p>
<p>So, even if <code class="inlinecode">printf</code> does not work, <code class="inlinecode">vprintf</code> or similar functions will.</p>
<p>To turn an array of parameters or a variadic Swift parameter list into a va_list pointer, each parameter must implement <code class="inlinecode">CVarArgType</code>; then you just need to call <code class="inlinecode">withVaList</code> to obtain a <code class="inlinecode">CVaListPointer</code> that points to your list of parameters (<code class="inlinecode">getVaList</code> is also available, but the documentation recommends avoiding it). Let’s see a short example with <em>vprintf</em>:</p>
<pre><code class="Swift">
withVaList(["a", "b", "c"]) { ptr -> Void in
vprintf("Three strings: %s, %s, %s\n", ptr)
}
</code></pre>
<h3 id="unmanaged">Unmanaged</h3>
<p>We’ve learned more or less everything we need to know about pointers, but there is still something that we won’t be able to handle with what we know now.</p>
<p>What if we pass a Swift reference object as a parameter to a C function that returns its result in a callback? Can we be sure that during this switch of context the Swift object will still be there and that ARC will not have released it? No, we can’t make that assumption.</p>
<p>Meet <code class="inlinecode">Unmanaged</code>, a class with some interesting utility methods that we’ll use to manage situations like the one described above. With <em>Unmanaged</em> you will be able to alter the retain count of an object and convert it to a <code class="inlinecode">COpaquePointer</code> if you need to pass it around.</p>
<p>Let’s get right into it and solve the issue we described, here is an example of a C function similar to what was described before:</p>
<pre><code class="Swift">
// cstuff.c
void aCFunctionWithContext(void* ctx, void (*function)(void* ctx)){
sleep(3);
function(ctx);
}
</code></pre>
<p>And some Swift code that calls it:</p>
<pre><code class="Swift">
class AClass : CustomStringConvertible {
var aProperty:Int=0
var description: String {
return "A \(type(of: self)) with property \(self.aProperty)"
}
}
var value = AClass()
let unmanaged = Unmanaged.passRetained(value)
let uptr = unmanaged.toOpaque()
let vptr = UnsafeMutableRawPointer(uptr)
aCFunctionWithContext(vptr){ (p:UnsafeMutableRawPointer?) -> Void in
var c = Unmanaged<AClass>.fromOpaque(p!).takeUnretainedValue()
c.aProperty = 2
print(c) //A AClass with property 2
}
</code></pre>
<p>With the <code class="inlinecode">passRetained</code> and <code class="inlinecode">passUnretained</code> methods, <em>Unmanaged</em> wraps a given object for us, respectively incrementing its reference count or leaving it untouched.</p>
<p>Since the callback needs a void pointer, we first obtain an opaque pointer with <code class="inlinecode">toOpaque()</code> and then convert it to an <code class="inlinecode">UnsafeMutableRawPointer</code>.</p>
<p>In the callback, we perform the same operations in reverse to obtain a reference to the original class and modify its value.</p>
<p>When extracting the class from the unmanaged object, we could have called either <code class="inlinecode">takeRetainedValue</code> or <code class="inlinecode">takeUnretainedValue</code>, that following the same pattern described before, respectively decrement or leave unmodified the reference count of the value.</p>
<p>In this case, we are not decrementing the reference count so that the class will not become releasable once we are out of the closure scope, this class will have to be released manually somewhere else through the <code class="inlinecode">unmanaged</code> instance we declared initially.</p>
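<p>For one-shot callbacks, an alternative sketch (reusing the names from the example above) balances the retain inside the callback itself, so no separate release is needed elsewhere:</p>
<pre><code class="Swift">
aCFunctionWithContext(vptr) { (p: UnsafeMutableRawPointer?) -> Void in
    // takeRetainedValue consumes the +1 added by passRetained, so the
    // object becomes releasable when the closure scope ends
    let c = Unmanaged<AClass>.fromOpaque(p!).takeRetainedValue()
    c.aProperty = 2
    print(c)
}
</code></pre>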
<p>And this is just one simple, and maybe not the best, example of a category of problems that <code class="inlinecode">Unmanaged</code> can solve. To learn more about Unmanaged, check out this <a href="http://nshipster.com/unmanaged/">NSHipster article</a>.</p>
<h2 id="working-with-files">Working with Files</h2>
<p>Since on some platforms we could have to deal directly with files using the standard C library functions, let’s see some examples of how to read from and write to files:</p>
<pre><code class="Swift">
let fd = fopen("aFile.txt", "w")
fwrite("Hello Swift!", 12, 1, fd)
let res = fclose(fd)
if res != 0 {
    print(String(cString: strerror(errno)))
}
let fd2 = fopen("aFile.txt", "r")
var array = [Int8](repeating: 0, count: 13)
fread(&array, 12, 1, fd2)
fclose(fd2)
let str = String(validatingUTF8: array)!
print(str) // Hello Swift!
</code></pre>
<p>As you can see there is nothing weird or convoluted about file access; this is essentially the same code we would have written in C. Notice that we have full access to <code class="inlinecode">errno</code> and all the related functions.</p>
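<p>Reading a file line by line works the same way; here is a sketch using <code class="inlinecode">fgets</code> (assuming aFile.txt exists):</p>
<pre><code class="Swift">
let file = fopen("aFile.txt", "r")!
var line = [Int8](repeating: 0, count: 256)
// fgets stops at a newline or at count-1 bytes, always null-terminating
while fgets(&line, Int32(line.count), file) != nil {
    if let text = String(validatingUTF8: line) {
        print(text, terminator: "")
    }
}
fclose(file)
</code></pre>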
<h2 id="bitwise-operations">Bitwise Operations</h2>
<p>Since it’s highly likely that you’ll need to perform bit mask operations when interoperating with C, <a href="/2016/02/05/Dealing-With-Bit-Sets-In-Swift/">I recommend a post I wrote a while ago</a> on the subject that should cover all you need to know.</p>
<h4 id="get-the-mixed-swiftc-playground-for-this-post-from-github-or-zipped-2"><em>Get the mixed Swift/C playground for this post from <a href="https://github.com/uraimo/Swift-Playgrounds/">GitHub</a> or <a href="https://www.uraimo.com/archives/2016-04-07-Swift-And-C.zip">zipped</a>.</em></h4>
<h2 id="swift-and-c-mixed-projects">Swift and C: Mixed Projects</h2>
<p>Swift projects can access libraries written in C using a bridging header following the same procedure used for Objective-C libraries.</p>
<p>But since this does not work for framework projects, let’s see a more general alternative approach that involves only slightly more typing. We’ll create an <a href="http://clang.llvm.org/docs/Modules.html">LLVM module</a> that will contain the C code we want to export to Swift.</p>
<p>Let’s suppose we’ve added to our Swift project a C source file:</p>
<pre><code class="c">
// CExample.c
#include "CExample.h"
#include <stdio.h>
void printStuff(){
printf("Printing something!\n");
}
void giveMeUnsafeMutablePointer(int* param){ }
void giveMeUnsafePointer(const int * param){ }
</code></pre>
<p>And its header:</p>
<pre><code class="c">
// CExample.h
#ifndef CExample_h
#define CExample_h
#include <stdio.h>
#define IAMADEFINE 42
void printStuff();
void giveMeUnsafeMutablePointer(int* param);
void giveMeUnsafePointer(const int * param);
typedef struct {
char name[5];
int value;
} MyStruct;
char name[] = "IAmAString";
char* anotherName = "IAmAStringToo";
#endif /* CExample_h */
</code></pre>
<p>To keep the C sources separated from the rest we’ve put these files in the <em>CExample</em> sub-directory in the root of the project.</p>
<p>We must now create a <em>module.map</em> file in that same directory and this file will declare what our C module exports and from which header file.</p>
<pre><code class="c">
module CExample [system] {
header "CExample.h"
export *
}
</code></pre>
<p>As you can guess, we are exporting all the content declared in the header but modules can also selectively export only part of what’s declared.</p>
<p>Furthermore, in this example the actual source of the library is contained in the project, but if you need to expose to Swift one of the libraries installed on your system you can just create a <em>module.map</em> (not necessarily in its own directory) and specify as <code class="inlinecode">header</code> one or more of your system’s headers. In this case you’ll likely also need to specify the name of the library your headers refer to, using the <code class="inlinecode">link libname</code> directive in your module map (which links that library as you would manually do with <em>-llibname</em>). And you can also declare more than one module in a single <em>module.map</em>.</p>
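<p>For example, a module map exposing the system zlib (assuming the header lives in the usual location on your platform) could look like this:</p>
<pre><code class="c">
module CZlib [system] {
    header "/usr/include/zlib.h"
    link "z"
    export *
}
</code></pre>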
<p>To learn more about LLVM modules and all the options available check out the <a href="http://clang.llvm.org/docs/Modules.html">official documentation</a>.</p>
<p>The last step consists in adding the module directory to the search path of the compiler. To do this, open the project properties and add the module path (<em>${SRCROOT}/CExample</em>) to <strong>Import Paths</strong> under <strong>Swift Compiler - Search Paths</strong>:</p>
<p><img src="/imgs/swiftandc_1.png" alt="Importing c module" /></p>
<p>And that’s it, we can now import from Swift the new module and use what it contains:</p>
<pre><code class="Swift">
import CExample
printStuff()
print(IAMADEFINE) //42
// Pass a bogus pointer at address 0x1
giveMeUnsafePointer(UnsafePointer<Int32>(bitPattern: 1))
giveMeUnsafeMutablePointer(UnsafeMutablePointer<Int32>(bitPattern: 1))
let ms = MyStruct(name: (0, 0, 0, 0, 0), value: 1)
print(ms)
print(name) // (73, 65, 109, 65, 83, 116, 114, 105, 110, 103, 0)
//print(String(validatingUTF8:name)!) // Cannot convert it
print(anotherName) //0xXXXXXX pointer address
print(String(validatingUTF8:anotherName)!) //IAmAStringToo
</code></pre>
<h3 id="swift-package-manager">Swift Package Manager</h3>
<p>The Swift Package Manager supports the creation of C modules via module.map as <a href="https://github.com/apple/swift-package-manager/blob/master/Documentation/Usage.md#require-system-libraries">described in the documentation</a>. You can either create mixed projects or create a main project importing various sub-projects, each one providing a wrapper module (with likely some Swift to decorate the original C API) for each different C library you need to use.</p>
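<p>As a sketch, a Swift 3-era system-library package wrapping zlib would consist of just a <em>module.map</em> and a Package.swift along these lines (the names are illustrative):</p>
<pre><code class="Swift">
import PackageDescription

let package = Package(
    name: "CZlib",
    pkgConfig: "zlib",           // Lets SwiftPM find headers and link flags
    providers: [
        .Brew("zlib"),           // Hints for installing the system library
        .Apt("zlib1g-dev")
    ]
)
</code></pre>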
<h2 id="swift-3-changes">Swift 3 Changes</h2>
<p>The third major release of Swift introduces many changes related to pointers and other functionalities shown in this article, some of the pertaining Swift Evolution proposals are: <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0016-initializers-for-converting-unsafe-pointers-to-ints.md">SE-0016</a>, <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0055-optional-unsafe-pointers.md">SE-0055</a>, <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0076-copying-to-unsafe-mutable-pointer-with-unsafe-pointer-source.md">SE-0076</a>, <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0101-standardizing-sizeof-naming.md">SE-0101</a>, <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0107-unsaferawpointer.md">SE-0107</a>, <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0136-memory-layout-of-values.md">SE-0136</a> and finally <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0138-unsaferawbufferpointer.md">SE-0138</a> that extends SE-0107 and has been implemented in Swift 3.0.1.</p>
<p>With <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0107-unsaferawpointer.md">SE-0107</a>, a new <code class="inlinecode">UnsafeRawPointer</code> type has been introduced to replace the <code class="inlinecode">UnsafePointer<Void></code> that was used in Swift 2.2 code for pointer without type information and strict aliasing for the pointers element type is now enforced. A comprehensive <a href="https://swift.org/migration-guide/se-0107-migrate.html">migration guide</a> is available to help the transition.</p>
<h2 id="closing-thoughts">Closing Thoughts</h2>
<p>I hope this article has at least shed some light on the mysterious and scantily documented world of Swift and C interoperability, but I don’t really expect to have covered everything you could encounter in your projects.</p>
<p>You’ll find yourself in situations where a bit of experimenting will be needed to make things work as you want, and it’s likely that C interoperability will be improved with new constructs over the next releases of Swift (UnsafePointer and all the related functions were introduced only in Swift 2.0; before that, interoperability with C was a bit more convoluted).</p>
A Short Swift GYB Tutorial
2016-02-09T00:00:00+01:00
https://www.uraimo.com/2016/02/09/a-short-swift-gyb-tutorial
The GYB tool is used internally in Swift to simplify source files with many snippets of code that follow a common pattern. GYB provides some additional directives that are parsed by the tool to generate the final source files. This short tutorial describes how to use GYB in your own projects.<p><em>Update 9/2019:</em> Updated the post and added a detailed description of the line-directive tool.</p>
<p>The GYB (Generate Your Boilerplate) tool <a href="https://github.com/apple/swift/blob/master/utils/gyb.py">is a Swift tool</a> intended for internal use that generates source files starting from a template.</p>
<p>It’s extremely useful when you have more than one struct/class/enum that share a common structure, and you are not eager to maintain multiple versions of what is, actually, the same code. Every time you are writing the same set of methods or properties for slightly different objects, your maintenance effort (and bugs resulting from careless copy/paste) could be reduced using GYB. The tool is used extensively throughout the <a href="https://github.com/apple/swift/search?utf8=%E2%9C%93&q=filename%3A*.gyb&type=Code">Swift codebase</a> and there are just a few things you need to know to use it in your projects.</p>
<p>As a diligent guinea-pig (AFAIK, the only other project that uses GYB at the moment is <a href="https://github.com/SwiftAndroid/swift-jni">Swift-JNI</a>, part of the Android Swift porting project), I’ve used GYB extensively in the first release of <a href="https://github.com/uraimo/Bitter">Bitter</a>, a Swift library that simplifies working with bits and bitwise operations, where I had a lot of very similar code inside extensions for each one of the fixed size Swift Ints.</p>
<p>With this tool, I was able to define a single template that the tool was able to expand to the 10 separate extensions I initially coded by hand.</p>
<p>Let’s see what you need to know to start using GYB.</p>
<h2 id="gyb-engine-elements">GYB Engine Elements</h2>
<p>The GYB templating engine is quite simple but requires a minimal knowledge of Python. A template is composed of the following elements:</p>
<ul>
<li>
<p>Lines starting with <strong>%</strong> (leading whitespace is ignored) are followed by Python code and are used for control flow statements; as you would expect, those statements are closed with a <strong>%end</strong>. Statements can be nested.</p>
</li>
<li>
<p>Lines that do not start with an <strong>%</strong> are treated as text and simply inserted into the output.</p>
</li>
<li>
<p>Elements with the form <strong>${VARIABLE_NAME}</strong> are replaced with the value of <strong>VARIABLE_NAME</strong>.</p>
</li>
<li>
<p>The <strong>%</strong> and <strong>$</strong> characters are escaped respectively as <strong>%%</strong> and <strong>$$</strong>.</p>
</li>
<li>
<p>Blocks of Python code can be added to the template and are delimited by <strong>%{</strong> and <strong>}%</strong>; the indentation outside the block is stripped, so it’s irrelevant for your Python code.</p>
</li>
</ul>
<p>Let’s see what we can do with these few simple rules with an example, taken from the <a href="https://github.com/uraimo/Bitter/blob/9e9d048c7ee4e503ae7e386380f32b0a587f0126/Templates/Bitter.swift.gyb">Bitter template</a>, that adds to all fixed size integers a computed property <code class="inlinecode">allOnes</code>, that returns an Int/UInt initialized with a bit pattern with all ones:</p>
<pre><code class="Swift">
%{
intTypes = [8,16,32,64]
}%
% for intType in intTypes:
% for sign in ['','U']:
/// Extension that adds a few additional functionalities to ${sign}Int${intType}
extension ${sign}Int${intType} {
    /// Returns a ${sign}Int${intType} with all ones
    % if sign == '':
    public static var allOnes: Int${intType} { return Int${intType}(bitPattern: UInt${intType}.max) }
    % else:
    public static var allOnes: UInt${intType} { return UInt${intType}.max }
    % end
}
% end
% end
</code></pre>
<p>With a Python block we declare an array with all the fixed sizes of the Ints available in Swift and then iterate over it, using an internal loop to consider signed and unsigned integers too. We then output two different snippets depending on the value of the <code class="inlinecode">sign</code> variable: the first one if the variable is empty (signed integers), the second one if it’s not (unsigned integers).</p>
<p>In this example we are using simple if/else and for statements, but we could have used anything that is valid in Python, like an <code class="inlinecode">elif</code> or a variation of that for loop.</p>
<p>Running this through GYB we’ll get 8 extensions, one for each fixed size integer, from Int8/UInt8 to Int64/UInt64.</p>
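<p>For reference, here is roughly what the generated output looks like for the 8 bit pair (the remaining six extensions follow the same pattern):</p>
<pre><code class="Swift">
/// Extension that adds a few additional functionalities to Int8
extension Int8 {
    /// Returns a Int8 with all ones
    public static var allOnes: Int8 { return Int8(bitPattern: UInt8.max) }
}
/// Extension that adds a few additional functionalities to UInt8
extension UInt8 {
    /// Returns a UInt8 with all ones
    public static var allOnes: UInt8 { return UInt8.max }
}
</code></pre>
<p>Note that for signed integers <code class="inlinecode">allOnes</code> is simply -1, the two’s complement value with every bit set.</p>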
<h2 id="generating-the-source">Generating the source</h2>
<p>You can download GYB from the Swift repository:</p>
<pre><code class="Bash">
wget https://github.com/apple/swift/raw/master/utils/gyb
wget https://github.com/apple/swift/raw/master/utils/gyb.py
chmod +x gyb
</code></pre>
<p>And parse your template this way:</p>
<pre><code class="Bash">
./gyb --line-directive '' -o ../Sources/Bitter/Bitter.swift Bitter.swift.gyb
</code></pre>
<p>The <code class="inlinecode">-o</code> option specifies the output file, while the last filename specifies the name of the file containing the template.</p>
<p>Without the <code class="inlinecode">--line-directive ''</code> parameter, GYB outputs additional debugging comments, that for each section of the output describe which element in the original template was executed to generate it.</p>
<p>These comments are useful while you are still in the process of writing your template, but once you are done they can be disabled this way to obtain a clean output.</p>
<h2 id="better-debugging-of-generated-sources-with-line-directive">Better debugging of generated sources with line-directive</h2>
<p>But what about debugging code that has been generated with GYB?</p>
<p>That’s where the comments we purged from the output in the previous step come in handy: they are parsed by an additional tool, <em>line-directive</em>, that simplifies what would otherwise be an excruciating process.</p>
<p>Download it from the main repository on GitHub:</p>
<pre><code class="Bash">
wget https://github.com/apple/swift/raw/master/utils/line-directive
chmod +x line-directive
</code></pre>
<p>To explain how this works, here is a detailed explanation by GYB’s author, Dave Abrahams, directly from the line-directive usage documentation.</p>
<p><code class="inlinecode">#sourceLocation</code> is a directive used by tools like the Swift compiler and debugger to adjust the lines reported in diagnostics and to determine what source you see when you’re stepping.</p>
<p><code class="inlinecode">#sourceLocation</code> corresponds to <code class="inlinecode">#line</code> in C/C++ which is inserted by code generators like Lex/Flex/Yacc/Bison so that you deal with the actual code you wrote and not the generated result. For dealing with errors in the Swift generated by your gyb source it’s important that your tools can take you to the right line in your gyb file rather than in generated .swift file. If you don’t have such a tool, manually indirecting through the generated code is tedious, but at least it’s possible since gyb leaves <code class="inlinecode">#sourceLocation</code> information behind.</p>
<p>But Swift’s <code class="inlinecode">#sourceLocation</code> directive is suboptimal for the purposes of the freeform code generation done with gyb because it can only appear between grammatically-complete declarations and statements. So instead of inserting <code class="inlinecode">#sourceLocation</code> directives, gyb inserts <code class="inlinecode">//###sourceLocation</code> comments (by default, it’s tunable).</p>
<p>This line-directive tool remaps file and line information in the output of your swift compiler (or whatever tool you are using to process generated source, gyb is not swift-specific) so that the error points to the right place in the gyb source.</p>
<p>You invoke it as follows:</p>
<pre><code class="Bash">
line-directive <generated-sources> -- <compilation command>
</code></pre>
<p>For example, if you have foo.swift.gyb, bar.swift.gyb, and baz.swift, instead of:</p>
<pre><code class="Bash">
gyb foo.swift.gyb -o foo.swift
gyb bar.swift.gyb -o bar.swift
swiftc foo.swift bar.swift baz.swift
</code></pre>
<p>You do this:</p>
<pre><code class="Bash">
gyb foo.swift.gyb -o foo.swift
gyb bar.swift.gyb -o bar.swift
line-directive foo.swift bar.swift -- swiftc foo.swift bar.swift baz.swift
</code></pre>
Dealing with Bit Sets in Swift2016-02-05T00:00:00+01:00https://www.uraimo.com/2016/02/05/Dealing-With-Bit-Sets-In-SwiftSwift provides a convenient set of functionalities for fixed size integers and binary operations but you'll soon discover that in some cases the language is a bit opinionated in regard to how those operations should be performed. This post explains some of the gotchas and describes Bitter, a Swift bit manipulation library.<p><strong>Update 10/17:</strong><em>This post has been updated to Swift 4.</em></p>
<p><strong>Update 10/16:</strong><em>This post has been updated to Swift 3.</em></p>
<p>Swift provides convenient fixed size integers and the usual set of bitwise operators you already know, so dealing with bit sets would seem pretty straightforward.</p>
<p>But you’ll soon discover that the language and the standard library always put safety first, and dealing with bits and different integer types requires a few more type conversions than what you may be used to. This post describes the gotchas you should be aware of.</p>
<p>Before explaining what I mean, let’s get up to speed with the basics of integer types and bitwise operations.</p>
<h4 id="get-the-playground-for-this-article-from-github-or-zipped"><em>Get the playground for this article from <a href="https://github.com/uraimo/Swift-Playgrounds/tree/swift2">GitHub</a> or <a href="https://www.uraimo.com/archives/2016-02-5-Bitwise-Playground.playground.zip">zipped</a>.</em></h4>
<h2 id="integer-types-and-bitwise-operators">Integer Types and bitwise operators</h2>
<p>Swift offers a set of integer types with different fixed length and signedness: <strong>Int/UInt</strong>, <strong>Int8/UInt8</strong>(8 bits), <strong>Int16/UInt16</strong>(16 bits), <strong>Int32/UInt32</strong>(32 bits) and <strong>Int64/UInt64</strong>(64 bits).</p>
<p>The types Int and UInt are platform dependent and are equivalent to <strong>Int32/UInt32</strong> on 32 bit platforms and to <strong>Int64/UInt64</strong> on 64 bit platforms. The others have the specified length regardless of the target platform you compiled for.</p>
<p>Fixed length types are extremely useful when used in combination with bitwise operators because they make explicit the size of the data you are working on, and when performing operations on single bits you’ll rarely use the platform dependent <code class="inlinecode">Int</code> or <code class="inlinecode">UInt</code>.</p>
<p>Variables with a fixed size integer type can be initialized with a binary, octal or hexadecimal value this way:</p>
<pre><code class="Swift">
var int1:UInt8 = 0b10101010
var int2:UInt8 = 0o55
var int3:UInt8 = 0xA7
</code></pre>
<p>In regards to bitwise operations, Swift supports what you’d expect: <strong>unary NOT</strong>(~ operator), <strong>AND</strong>(& operator), <strong>OR</strong>(| operator), <strong>XOR</strong>(^ operator) and the <strong>left and right shift</strong>(<< and >> operators).</p>
<p>It’s important to remember that for unsigned integers, shifting left or right by the given number of positions inserts zeros from the opposite direction. For signed integers instead, the sign is preserved by inserting copies of the sign bit instead of zeros when shifting to the right.</p>
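<p>A quick example with 8 bit integers shows the difference:</p>
<pre><code class="Swift">
let unsigned: UInt8 = 0b1000_0000 // 128
let u = unsigned >> 2             // 32: zeros are inserted from the left
let signed: Int8 = -128           // bit pattern 0b1000_0000
let s = signed >> 2               // -32: the sign bit is replicated
</code></pre>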
<p>For integers more than one byte long, Swift also provides a few useful computed properties to deal with endianness conversion: <code class="inlinecode">littleEndian</code>, <code class="inlinecode">bigEndian</code> and <code class="inlinecode">byteSwapped</code>, that respectively convert from the current integer representation to little or big endian, or to the opposite endianness.
One last thing: is there a way to tell whether we are on a 32 or 64 bit platform?</p>
<p>Sure, but considering that the <a href="http://ankit.im/swift/2016/01/12/swift-mysterious-builtin-module/">Builtin module</a> is not accessible, we can only compare the size of <em>Int</em> with one of the two fixed size integers that correspond to the two supported platform widths:</p>
<pre><code class="Swift">
MemoryLayout<Int>.stride==MemoryLayout<Int32>.stride //Are we on a 32bits platform? Nope.
</code></pre>
<p>I’m using the <code class="inlinecode">stride</code> property here, but in this case <code class="inlinecode">size</code> would have worked as well.</p>
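<p>The endianness properties mentioned above are easy to try out; <code class="inlinecode">byteSwapped</code> always reverses the byte order, while the result of the other two depends on the endianness of the host:</p>
<pre><code class="Swift">
let value: UInt32 = 0x11223344
value.byteSwapped   // 0x44332211
// On a little endian host (e.g. x86 or most ARM configurations)
// littleEndian is the identity and bigEndian equals byteSwapped.
value.littleEndian
value.bigEndian
</code></pre>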
<h2 id="integer-type-conversions">Integer type conversions</h2>
<p>Swift does not perform implicit type conversions, and as you may have already noticed when doing mixed types arithmetic operations, you need to explicitly convert the variables of your expression to a type big enough to hold your results.</p>
<p>When multiple integers appear in the same expression, Swift can only infer the type of free integer literals when they are used alongside typed variables of one single type; as before, no implicit conversion toward the bigger integer type is performed.</p>
<p>Let’s see an example of what is allowed and what isn’t:</p>
<pre><code class="Swift">
var u8:UInt8 = 1
u8 << 2 //4: The number 2 is considered an UInt8 and u8 is shifted
// to the left by 2 positions
var by2:Int16 = 1
u8 << by2 //Error: Operands of different types, doesn't compile
u8 << UInt8(by2) //4: It works, we manually converted the Int type,
// but this is NOT SAFE!
</code></pre>
<p>Why is it not safe, you may ask?</p>
<p>Because when converting a bigger integer type to a smaller one or an unsigned integer to a signed one, Swift does not perform any truncation of the contained value, and <em>fails at runtime</em> when the value being converted overflows the receiving type.</p>
<p>This must be kept in mind, and it’s extremely important when you perform conversions on integers that hold something entered by the user or coming from some external component.</p>
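<p>When a value could overflow the receiving type, the failable <code class="inlinecode">init(exactly:)</code> initializer returns nil instead of trapping, which makes it a good fit for validating external input:</p>
<pre><code class="Swift">
// UInt8(300) would trap at runtime with an overflow error
let tooBig = UInt8(exactly: 300) // nil, 300 doesn't fit in 8 bits
let fits = UInt8(exactly: 200)   // Optional(200)
</code></pre>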
<p>But luckily, Swift provides a way to perform <em>bit</em> truncating conversions using the <code class="inlinecode">init(truncatingIfNeeded:)</code> constructor, quite useful when you are performing operations on bits and are <strong>not interested in the actual decimal value of the integer</strong>.</p>
<pre><code class="Swift">
var u8:UInt8=UInt8(truncatingIfNeeded:1000)
u8 // 232
</code></pre>
<p>In this sample we converted the <em>Int</em> 1000 that has a binary representation of <code class="inlinecode">0b1111101000</code> to an <em>UInt8</em> just keeping the 8 least significant bits and discarding everything else. That way we obtained 232, that has a binary representation of <code class="inlinecode">0b11101000</code>.</p>
<p>And this works the same way for every combination of Int<em>n</em> or UInt<em>n</em> types: the sign of the signed <em>Int</em> is ignored and the bit sequence is simply used to initialize a new integer variable. Between signed and unsigned integers of the same size, <code class="inlinecode">init(bitPattern:)</code> is also available, but the result is the same as the usual truncating conversion.</p>
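<p>For example, reinterpreting bit patterns between same-sized types:</p>
<pre><code class="Swift">
let minusOne: Int8 = -1                      // bit pattern 0b11111111
let asUnsigned = UInt8(bitPattern: minusOne) // 255, same bits
let backToSigned = Int8(bitPattern: 232)     // -24, 0b11101000 reinterpreted
</code></pre>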
<p>The only drawback of this safety-first/no-assumptions approach is that when you need to perform a lot of type conversions, your code starts to become bloated with all those truncating conversions.</p>
<p>But luckily, in Swift we can extend base types with new methods and we can use this to add some utility methods that truncate to a specific size to all the integer types, for example:</p>
<pre><code class="Swift">
extension Int {
    public var toU8: UInt8 { get { return UInt8(truncatingIfNeeded: self) } }
    public var to8: Int8 { get { return Int8(truncatingIfNeeded: self) } }
    public var toU16: UInt16 { get { return UInt16(truncatingIfNeeded: self) } }
    public var to16: Int16 { get { return Int16(truncatingIfNeeded: self) } }
    public var toU32: UInt32 { get { return UInt32(truncatingIfNeeded: self) } }
    public var to32: Int32 { get { return Int32(truncatingIfNeeded: self) } }
    public var toU64: UInt64 { get {
        // truncatingIfNeeded keeps the bit pattern and never traps,
        // plain UInt64(self) would trap at runtime for negative values
        return UInt64(truncatingIfNeeded: self)
    } }
    public var to64: Int64 { get {
        return Int64(self) // No difference if the platform is 32 or 64 bits
    } }
}
extension Int32 {
    public var toU8: UInt8 { get { return UInt8(truncatingIfNeeded: self) } }
    public var to8: Int8 { get { return Int8(truncatingIfNeeded: self) } }
    public var toU16: UInt16 { get { return UInt16(truncatingIfNeeded: self) } }
    public var to16: Int16 { get { return Int16(truncatingIfNeeded: self) } }
    public var toU32: UInt32 { get { return UInt32(bitPattern: self) } } // same size, bits reinterpreted
    public var to32: Int32 { get { return self } }
    public var toU64: UInt64 { get {
        // Sign-extended and reinterpreted; plain UInt64(self) would trap for negative values
        return UInt64(truncatingIfNeeded: self)
    } }
    public var to64: Int64 { get {
        return Int64(self) // No difference if the platform is 32 or 64 bits
    } }
}
var h1 = 0xFFFF04
h1
h1.toU8 // Instead of UInt8(truncatingIfNeeded:h1)
var h2:Int32 = 0x6F00FF05
h2.toU16 // Instead of UInt16(truncatingIfNeeded:h2)
</code></pre>
<h2 id="common-bitwise-patterns">Common Bitwise Patterns</h2>
<p>Now, let’s see some of this in action with some common bitwise operation patterns, just as an excuse to talk about something really useful that is not available in Swift.</p>
<h3 id="multi-byte-component-extraction">Multi-byte Component Extraction</h3>
<p>A combination of AND and right shift is commonly used to extract single bits or bytes from longer sequences; let’s see an example where we want to extract the single color components from the RGB representation of a color:</p>
<pre><code class="Swift">
let swiftOrange = 0xED903B
let red = (swiftOrange & 0xFF0000) >> 16 //0xED
let green = (swiftOrange & 0x00FF00) >> 8 //0x90
let blue = swiftOrange & 0x0000FF //0x3B
</code></pre>
<p>Here we are isolating the bits we are interested in by performing an <em>AND</em> with a bitmask that has at <code class="inlinecode">1</code> only the bits we want to keep in the result; the rest is zeroed out. To obtain an 8 bit representation of the component we need, we shift the result of the <em>AND</em> to the right, by 16 positions for the red component (2 bytes to the right) and 8 for the green component (1 byte to the right).
And that’s it. This masking+shifting pattern has a wide range of applications, but when used in sub-expressions it can make your code unreadable really fast. So, why not implement it as a subscript of all the integer types? In other words, why not add the ability to access the single byte components of an Int with an index, like we do with arrays?</p>
<p>For example, let’s add the subscript to <code class="inlinecode">UInt32</code>:</p>
<pre><code class="Swift">
extension UInt32 {
    public subscript(index: Int) -> UInt32 {
        get {
            precondition(index < 4, "Byte set index out of range")
            return (self & (0xFF << (index.toU32*8))) >> (index.toU32*8)
        }
        set(newValue) {
            precondition(index < 4, "Byte set index out of range")
            self = (self & ~(0xFF << (index.toU32*8))) | (newValue << (index.toU32*8))
        }
    }
}
var i32:UInt32=982245678 //HEX: 3A8BE12E
print(String(i32,radix:16,uppercase:true)) // Printing the hex value
i32[3] = i32[0]
i32[1] = 0xFF
i32[0] = i32[2]
print(String(i32,radix:16,uppercase:true)) //HEX: 2E8BFF8B
</code></pre>
<h3 id="the-magical-xor">The magical XOR</h3>
<p>Some of you may know XOR from the simple and quite useless <em>XOR</em> cipher, that lets you encrypt a bit stream with a key and retrieve the original value by performing again an <em>XOR</em> with the same key.
For the sake of brevity, let’s use a message and a key with the same size:</p>
<pre><code class="Swift">
let secretMessage = 0b10101000111110010010101100001111000 // 0x547C95878
let secretKey = 0b10101010101010000000001111111111010 // 0x555401FFA
let result = secretMessage ^ secretKey // 0x12894782
let original = result ^ secretKey // 0x547C95878
print(String(original,radix:16,uppercase:true)) // Printing the hex value
</code></pre>
<p>The same property of the <em>XOR</em> can be used for other things; the simplest one is the <a href="https://en.wikipedia.org/wiki/XOR_swap_algorithm">XOR swap</a>, that swaps two integer variables without an additional temporary variable:</p>
<pre><code class="Swift">
var x=1
var y=2
x = x ^ y
y = y ^ x // y is now 1
x = x ^ y // x is now 2
</code></pre>
<p>Not really useful in Swift since you can do the same trick using tuples (see item #11 <a href="https://www.uraimo.com/2016/01/06/10-Swift-One-Liners-To-Impress-Your-Friends/">here</a>).</p>
<p>Another thing you can do with the <em>XOR</em>, but that I will not describe here, is building a variation of the classic doubly linked list: the XOR linked list. It’s a way more interesting use of the <em>XOR</em>; learn more about it on <a href="https://en.wikipedia.org/wiki/XOR_linked_list">Wikipedia</a>.</p>
<h3 id="double-negation-is-that-bit-set">Double Negation: Is That Bit Set?</h3>
<p>Another common pattern, similar to the first one presented, is using a bitmask in conjunction with a suspicious looking double negation to check whether a specific bit or a specific bit pattern is set in the input bit sequence:</p>
<pre><code class="Swift">
let input:UInt8 = 0b10101101
let mask:UInt8 = 0b00001000
let isSet = !!(input & mask) // If the 4th bit is set this is equal to 1.
// But this code is not valid in Swift...
</code></pre>
<p>The double negation is based on the specific behavior of the logical negation in C/C++ (and a few other languages) and the fact that in C/C++ booleans are implemented with integers (0 as false, 1 as true), quoting from the C99 standard:</p>
<blockquote>
<p>The result of the logical negation operator ! is 0 if the value of its operand compares unequal to 0, 1 if the value of its operand compares equal to 0. The result has type int. The expression !E is equivalent to (0==E).</p>
</blockquote>
<p>Considering this, the behavior of the double negation becomes more clear. The first logical <em>NOT</em> turns our masked input into either a 0 or 1 if the operand was respectively a number greater than 0 or 0 (effectively inverting its boolean value as we’d expect), while the second logical <em>NOT</em> turns the input back to the original boolean value, but now, the only possible value will be 0 or 1. Maybe the explanation is a bit messy, but you should get the idea of how this works.</p>
<p>But Swift has a proper boolean type and the logical <em>NOT</em> works only on those booleans. So, what can we do?</p>
<p>Let’s define a custom operator (usually I don’t really like them, but let’s make an exception) that implements the double negation for the <code class="inlinecode">UInt8</code> type!</p>
<pre><code class="Swift">
prefix operator ~~
prefix func ~~(value: UInt8) -> UInt8 {
    return (value > 0) ? 1 : 0
}
~~7 // 1
~~0 // 0
let isSet = ~~(input & mask) // 1 as expected
</code></pre>
<p>As an improvement we could return a <code class="inlinecode">Bool</code> instead of a <code class="inlinecode">UInt8</code>, to use it directly in conditional statements, but we’d lose the ability to embed it in other integer expressions.</p>
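<p>For completeness, here is a sketch of that <code class="inlinecode">Bool</code> returning alternative, written as an extension method (the <code class="inlinecode">isSet</code> name is hypothetical, it’s not part of Bitter):</p>
<pre><code class="Swift">
extension UInt8 {
    /// True if any of the bits selected by `mask` is set
    public func isSet(mask: UInt8) -> Bool {
        return (self & mask) != 0
    }
}
let flags: UInt8 = 0b10101101
flags.isSet(mask: 0b00001000) // true, usable directly in an if
</code></pre>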
<h2 id="bitter-a-library-for-bits-manipulation">Bitter: A library for bits manipulation</h2>
<p><img src="https://raw.githubusercontent.com/uraimo/Bitter/master/logo.png" srcset="https://raw.githubusercontent.com/uraimo/Bitter/master/logo.svg?sanitize=true" alt="Bitter" style="width:500px" /></p>
<p>All the alternative approaches to manipulate bit sets presented in this post are part of <a href="https://github.com/uraimo/Bitter">Bitter</a>, a library that tries to offer a more “swifty” interface for bit set manipulation.</p>
<p>To recap what you’ll find in Bitter (available via CocoaPods, Carthage and SwiftPM):</p>
<ul>
<li>Convenient computed properties for bit pattern truncating conversions</li>
<li>Integer byte indexed subscripting for every integer type</li>
<li>The double negation operator</li>
<li>and more…</li>
</ul>
<p>The library is still young and feedback is highly appreciated, feel free <a href="https://github.com/uraimo/Bitter">to try it out</a> and open issues if something doesn’t work or you have ideas for additional features!</p>
10 Swift One Liners To Impress Your Friends2016-01-06T00:00:00+01:00https://www.uraimo.com/2016/01/06/10-Swift-One-Liners-To-Impress-Your-FriendsFollowing a programming meme of a few years ago, this post presents a few Swift one liners that solve common problems following a functional approach. Will some of your friends be impressed?<p><strong>Update 10/17:</strong><em>This post has been updated for Swift 4</em></p>
<p><strong>Update 09/16:</strong><em>This post has been updated for Swift 3</em></p>
<p>A few years ago, at the peak of the Functional Renaissance™, a blog post that presented <a href="https://mkaz.github.io/2011/05/31/10-scala-one-liners-to-impress-your-friends/">10 Scala functional one liners</a> became quite popular and was rapidly followed by a series of articles that implemented the same one liners in other languages like <a href="http://blog.fogus.me/2011/06/03/10-haskell-one-liners-to-impress-your-friends/">Haskell</a>, <a href="http://programmingzen.com/2011/06/02/10-ruby-one-liners-to-impress-your-friends/">Ruby</a>, <a href="http://arturoherrero.com/10-groovy-one-liners-to-impress-your-friends/">Groovy</a>, <a href="http://freegeek.in/blog/2011/06/10-clojure-one-liners/">Clojure</a>, <a href="http://codeblog.dhananjaynene.com/2011/06/10-python-one-liners-to-impress-your-friends/">Python</a>, <a href="https://gist.github.com/1004837">C#</a>, <a href="http://willwhim.wordpress.com/2011/06/02/fsharp-one-liners-to-impress-your-friends/">F#</a>, <a href="http://ricardo.cc/2011/06/02/10-CoffeeScript-One-Liners-to-Impress-Your-Friends.html">CoffeeScript</a>.</p>
<p>We’ll never know how many people were actually impressed by those one liners during social gatherings, but my guess is that at least for the uninitiated the more complex examples were a good incentive to learn more about functional programming.</p>
<p>Let’s see how Swift fares against the other languages, trying to solve the same 10 exercises using one liners, maybe obtaining something interesting in the process (see #6 and #10).</p>
<h4 id="get-the-playground-for-this-article-from-github-or-zipped"><em>Get the playground for this article from <a href="https://github.com/uraimo/Swift-Playgrounds">GitHub</a> or <a href="https://www.uraimo.com/archives/2016-01-6-Swift-One-Liners-Playground.playground.zip">zipped</a>.</em></h4>
<h3 id="1-multiply-each-element-of-an-array-by-2">#1 Multiply each element of an array by 2</h3>
<p>Not much to see in this first example, easily solvable using map as <a href="https://www.uraimo.com/2015/10/08/Swift2-map-flatmap-demystified/">we all know</a>.</p>
<pre><code class="Swift">
(1...1024).map{$0 * 2}
</code></pre>
<h3 id="2-sum-a-list-of-numbers">#2 Sum a list of numbers</h3>
<p>This exercise is solved using reduce and the plus operator, leveraging the fact that the plus operator is a function, but the solution is obvious, we’ll see in a moment a few more creative uses of <code class="inlinecode">reduce</code>.</p>
<pre><code class="Swift">
(1...1024).reduce(0,+)
</code></pre>
<h3 id="3-verify-if-exists-in-a-string">#3 Verify if Exists in a String</h3>
<p>Let’s verify if a tweet contains at least one of a few selected keywords using <code class="inlinecode">filter</code>:</p>
<pre><code class="Swift">
let words = ["Swift","iOS","cocoa","OSX","tvOS"]
let tweet = "This is an example tweet larking about Swift"
let valid = !words.filter({tweet.contains($0)}).isEmpty
valid //true
</code></pre>
<p><em>Update:</em> <a href="https://twitter.com/oisdk">@oisdk</a> suggests a few better alternatives:</p>
<pre><code class="Swift">
words.contains(where:tweet.contains)
</code></pre>
<p>Way more concise, and this one:</p>
<pre><code class="Swift">
tweet.characters.split(separator:" ")
.lazy
.map(String.init)
.contains(where:Set(words).contains)
</code></pre>
<h3 id="4-read-in-a-file">#4 Read in a File</h3>
<p>Reading a file into an array of lines is not possible through an easy built-in like in other languages but we can create something short that doesn’t need a <em>for</em> using a combination of <code class="inlinecode">split</code> and <code class="inlinecode">map</code>:</p>
<pre><code class="Swift">
let path = Bundle.main.path(forResource:"test", ofType: "txt")
let lines = try? String(contentsOfFile: path!).characters
.split{$0 == "\n"}
.map(String.init)
if let lines = lines {
    lines[0] // O! for a Muse of fire, that would ascend
    lines[1] // The brightest heaven of invention!
    lines[2] // A kingdom for a stage, princes to act
    lines[3] // And monarchs to behold the swelling scene.
}
</code></pre>
<p>That last step with map and the string constructor turns our arrays of characters into strings.</p>
<h3 id="5-happy-birthday-to-you">#5 Happy Birthday to You!</h3>
<p>This will print the Happy Birthday song to the console, a simple use of <code class="inlinecode">forEach</code> with a range and the ternary operator.</p>
<pre><code class="Swift">
let name = "uraimo"
(1...4).forEach{print("Happy Birthday " + (($0 == 3) ? "dear \(name)":"to You"))}
</code></pre>
<h3 id="6-filter-list-of-numbers">#6 Filter list of numbers</h3>
<p>In this case we are asked to partition a sequence using a provided filtering function. Many languages have, in addition to the usual map, flatMap, reduce, filter, etc., a <code class="inlinecode">partitionBy</code> function that does exactly that; Swift, as you know, doesn’t have anything similar (the <em>NSArray</em> method that filters by <em>NSPredicate</em> is not what we want here).</p>
<p>Therefore, we could solve this extending <code class="inlinecode">Sequence</code> with a <code class="inlinecode">partitionBy</code> function that we’ll use to partition an integer array:</p>
<pre><code class="Swift">
extension Sequence {
    typealias Element = Self.Iterator.Element
    func partitionBy(fu: (Element) -> Bool) -> ([Element],[Element]) {
        var first: [Element] = []
        var second: [Element] = []
        for el in self {
            if fu(el) {
                first.append(el)
            } else {
                second.append(el)
            }
        }
        return (first, second)
    }
}
let part = [82, 58, 76, 49, 88, 90].partitionBy{$0 < 60}
part // ([58, 49], [82, 76, 88, 90])
</code></pre>
<p>It’s not really a one liner and the approach is imperative. But could we use filter to improve it a little?</p>
<pre><code class="Swift">
extension Sequence {
    func anotherPartitionBy(fu: (Self.Iterator.Element) -> Bool)
            -> ([Self.Iterator.Element],[Self.Iterator.Element]) {
        return (self.filter(fu), self.filter({ !fu($0) }))
    }
}
let part2 = [82, 58, 76, 49, 88, 90].anotherPartitionBy{$0 < 60}
part2 // ([58, 49], [82, 76, 88, 90])
</code></pre>
<p>This is slightly better, but it traverses the sequence two times, and trying to turn it into a one liner by removing the enclosing function would leave us with too much duplicated stuff (the filtering function and the array would be used in two places).</p>
<p>Can we build something that will transform the original sequence into a partition tuple using a single stream of data? Yes we can, using <code class="inlinecode">reduce</code>.</p>
<pre><code class="Swift">
var part3 = [82, 58, 76, 49, 88, 90].reduce( ([],[]), {
    (a: ([Int],[Int]), n: Int) -> ([Int],[Int]) in
    (n < 60) ? (a.0 + [n], a.1) : (a.0, a.1 + [n])
})
part3 // ([58, 49], [82, 76, 88, 90])
</code></pre>
<p>What we are doing here is building the result tuple that contains the two partitions, an element at a time, testing each element of the original sequence using the filtering function and appending this element to the first or second partition array depending on the filtering result.</p>
<p>Finally, a true one liner! But notice that building the partition arrays with all those appends will actually make it way slower than the two previous implementations.</p>
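<p>Since the post is updated for Swift 4, it’s worth noting that <code class="inlinecode">reduce(into:)</code> can express the same single-pass partition while mutating the accumulator in place, avoiding the cost of rebuilding both arrays at every step:</p>
<pre><code class="Swift">
let part4 = [82, 58, 76, 49, 88, 90].reduce(into: ([Int](), [Int]())) { acc, n in
    if n < 60 { acc.0.append(n) } else { acc.1.append(n) }
}
part4 // ([58, 49], [82, 76, 88, 90])
</code></pre>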
<h3 id="7-fetch-and-parse-an-xml-web-service">#7 Fetch and Parse an XML web service</h3>
<p>Some of the languages above don’t rely on external libraries and have more than one option available by default to deal with XML (e.g. Scala, that “natively”, albeit awkwardly, supports XML parsing into objects), but Foundation provides only the SAX parser NSXMLParser, and as you may have already guessed we are not going to use it.</p>
<p>There are a few alternative open source libraries we could use in this case, some of them written in C or Objective-C and others in pure Swift.</p>
<p>This time we are going to use the pure-Swift <a href="https://github.com/tadija/AEXML">AEXML</a>:</p>
<pre><code class="Swift">
let xmlDoc = try? AEXMLDocument(xmlData: NSData(contentsOf: URL(string:"https://www.ibiblio.org/xml/examples/shakespeare/hen_v.xml")!)!)
if let xmlDoc = xmlDoc {
    var prologue = xmlDoc.root.children[6]["PROLOGUE"]["SPEECH"]
    prologue.children[1].stringValue // Now all the youth of England are on fire,
    prologue.children[2].stringValue // And silken dalliance in the wardrobe lies:
    prologue.children[3].stringValue // Now thrive the armourers, and honour's thought
    prologue.children[4].stringValue // Reigns solely in the breast of every man:
    prologue.children[5].stringValue // They sell the pasture now to buy the horse,
}
</code></pre>
<h3 id="8-find-minimum-or-maximum-in-a-list">#8 Find minimum (or maximum) in a List</h3>
<p>We have various ways to find the minimum and maximum of a sequence, among those the <code class="inlinecode">min</code> and <code class="inlinecode">max</code> functions:</p>
<pre><code class="Swift">
//Find the minimum of an array of Ints
[10,-22,753,55,137,-1,-279,1034,77].sorted().first
[10,-22,753,55,137,-1,-279,1034,77].reduce(Int.max, min)
[10,-22,753,55,137,-1,-279,1034,77].min()
//Find the maximum of an array of Ints
[10,-22,753,55,137,-1,-279,1034,77].sorted().last
[10,-22,753,55,137,-1,-279,1034,77].reduce(Int.min, max)
[10,-22,753,55,137,-1,-279,1034,77].max()
</code></pre>
<h3 id="9-parallel-processing">#9 Parallel Processing</h3>
<p>Some languages allow enabling, in a simple and transparent way, parallel processing of sequences for operations like <em>map</em> and <em>flatMap</em>, to speed up the execution of sequential and independent operations using thread pools under the hood.</p>
<p>This feature is not yet available in Swift but can be built using GCD: <a href="http://moreindirection.blogspot.it/2015/07/gcd-and-parallel-collections-in-swift.html">http://moreindirection.blogspot.it/2015/07/gcd-and-parallel-collections-in-swift.html</a></p>
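<p>To give an idea of what such a helper could look like, here is a minimal sketch of a parallel map built on <code class="inlinecode">DispatchQueue.concurrentPerform</code> (the <code class="inlinecode">parallelMap</code> name is made up for this example, it’s not a standard library API):</p>
<pre><code class="Swift">
import Dispatch

extension Array {
    func parallelMap<T>(_ transform: (Element) -> T) -> [T] {
        var results = [T?](repeating: nil, count: count)
        results.withUnsafeMutableBufferPointer { buffer in
            // Each iteration writes to a distinct index, so no locking is needed
            DispatchQueue.concurrentPerform(iterations: count) { i in
                buffer[i] = transform(self[i])
            }
        }
        return results.map { $0! }
    }
}

let squares = [1, 2, 3, 4].parallelMap { $0 * $0 } // [1, 4, 9, 16]
</code></pre>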
<h3 id="10-sieve-of-erathostenes">#10 Sieve of Eratosthenes</h3>
<p>The good old sieve of Eratosthenes is used to find all the prime numbers up to a given upper limit <em>n</em>.</p>
<p>Starting with the sequence of all the integers from 2 up to <em>n</em>, the algorithm removes the multiples of each integer, until we are left with just the prime numbers. And to speed up the execution, we don’t actually need to remove the multiples of every integer: we can stop once we reach the square root of <em>n</em>.</p>
<p>Based on that definition the first implementation could look like this:</p>
<pre><code class="Swift">
var n = 50
var primes = Set(2...n)
(2...Int(sqrt(Double(n)))).forEach {
    primes.subtract(Set(stride(from: 2*$0, to: n+1, by: $0)))
}
primes.sorted()
</code></pre>
<p>We use the outer range to iterate over the integers we want to check, and for each one we calculate a sequence of multiples of that number using <code class="inlinecode">stride(from:to:by:)</code>. Those sequences are then subtracted from a <em>Set</em> initialized with all the integers from 2 to <em>n</em>.</p>
<p>But as you can see, to actually remove the multiples we use an external mutable <em>Set</em>, introducing a side-effect.</p>
<p>To eliminate side-effects, as we should always try to do, we will first calculate all the subsequences, flatMap them into a single array of multiples, and then remove those integers from the original Set.</p>
<pre><code class="Swift">
var sameprimes = Set(2...n)
sameprimes.subtract(Set(2...Int(sqrt(Double(n))))
    .flatMap { stride(from: 2*$0, to: n+1, by: $0) })
sameprimes.sorted()
</code></pre>
<p>Way cleaner and a nice example of <a href="https://www.uraimo.com/2015/10/08/Swift2-map-flatmap-demystified/">usage of <em>flatMap</em> to flatten nested arrays</a>.</p>
<h3 id="11-bonus-tuple-swap-via-destructuring">#11 Bonus: Tuple swap via destructuring</h3>
<p>As a bonus, not everyone knows that, as in other languages with a tuple type, you can leverage tuple destructuring to perform a compact variable swap:</p>
<pre><code class="Swift">
var a = 1, b = 2
(a, b) = (b, a)
a // 2
b // 1
</code></pre>
<p>And that’s all, as expected Swift is as expressive as many other languages.</p>
<p><em>Thanks to <a href="https://twitter.com/oisdk">@oisdk</a> for reviewing the post</em></p>
Xcode 7.2 and Swift 2.1.1 Released [Updated, XCode 7.2.1]2015-12-08T00:00:00+01:00https://www.uraimo.com/2015/12/08/XCode-7.2-and-Swift-2.1.1-releasedA new XCode version and a maintenance release of the Swift compiler are now available<p><strong>Update 01/03/16:</strong></p>
<p>Xcode 7.2.1 has been released. It is a maintenance release with minor fixes and stability improvements; no updates to Swift are included.
Those minor fixes include:</p>
<ul>
<li><code class="Swift">xcodebuild test</code> will no longer timeout.</li>
<li>Resolved a debugger crash that could occur when your code was depending on a binary Swift library or framework.</li>
<li>The certificates used for Apple Wallet passes, Safari Push notifications and Safari extensions have been updated.</li>
</ul>
<p><strong>Original Xcode 7.2 Release Post:</strong></p>
<p>With the new version of Xcode released today, a maintenance release of the Swift compiler is also available. As expected, there is nothing new regarding the language itself, only a few fixes, as described in the <a href="https://developer.apple.com/library/ios/releasenotes/DeveloperTools/RN-Xcode/Chapters/xc7_release_notes.html">release notes</a>.</p>
<p>For more information about what changed from release 2.0 to 2.1.x, check out this <a href="http://www.uraimo.com/2015/09/20/Swift2.1-released/">specific post</a> and this post on <a href="http://www.uraimo.com/2015/09/29/Swift2.1-Function-Types-Conversion-Covariance-Contravariance/">Swift 2.1 Function Types Conversion: Covariance and Contravariance</a>.</p>
<p>Here is a summary of the changes:</p>
<ul>
<li>
<p>In previous releases of Swift, if a type had a mutable property of protocol type, chained accesses to properties of that property were always treated as mutations of the property, even if the second property was only read, not written. For example:</p>
<pre><code class="Swift">
protocol Countable {
    var count: Int { get }
}

class MyObject {
    var widgets : Countable {
        didSet { print("in didSet") }
    }
}

var obj : MyObject = ...
let count = obj.widgets.count
</code></pre>
<p>The example above would perform a spurious write back to the property widgets, causing didSet to unexpectedly fire. The workaround was to split the access into separate expressions. For example:</p>
<pre><code class="Swift">
let widgets = obj.widgets
let count = widgets.count
</code></pre>
<p>This bug has now been fixed (radar 22953072).</p>
</li>
<li>
<p>Swift data types that are imported into Swift from C struct types (such as CGRect and CGPoint) can now be inspected in the debugger (radar 23088739).</p>
</li>
<li>
<p>A bug that caused some Swift projects to crash in whole module optimization with a “program too clever” LLVM error has been fixed (radar 23200656).</p>
</li>
<li>
<p>A bug has been fixed where Swift code calling an Objective-C method that took a block returning a nonnull NSString * with a Swift closure would be miscompiled, causing the compiled program to crash (radar 23285766).</p>
</li>
</ul>
Swift 3.0 Upcoming Changes2015-12-03T00:00:00+01:00https://www.uraimo.com/2015/12/03/Swift-3-upcoming-changesSwift is now finally opensource, this post introduces the features and modifications that will be available with release 3.0 of the language.<p>It finally happened, today <a href="http://swift.org/">Swift has been opensourced</a> and it’s <a href="https://github.com/apple/swift">available on GitHub</a>.</p>
<p>And with its public release of the source, the first information on what will be included in the next major release, Swift 3.0, is finally available.</p>
<p>From now on, everything related to the evolution of the language will be tracked through the aptly named <a href="https://github.com/apple/swift-evolution">Swift-Evolution</a> repository on GitHub.</p>
<p>Proposals for updates to the language (in the form of new features or alterations to the current behaviour, that will come from Apple or the community) will be discussed following the <a href="https://github.com/apple/swift-evolution/blob/master/process.md">Swift Evolution Process</a> on the dedicated <a href="https://swift.org/community/#mailing-lists">mailing lists</a>, and Apple will review each proposal and ultimately decide what will be accepted or rejected.</p>
<p>At the moment, Swift 3.0 is planned to be released in the fall of 2016.</p>
<p>The update will focus on a few key areas:</p>
<ul>
<li>
<p><strong>ABI Stabilization:</strong> Stabilize the application binary interface (ABI) to guarantee binary backward compatibility. This will involve the stabilization of core details of the language implementation like function calling conventions and internal language objects name mangling.</p>
</li>
<li>
<p><strong>Language refinement:</strong> Even if the language is still in its infancy, the next releases will start to make the language more consistent, outlining a set of core principles that define the language and eliminating features that deviate from those principles.</p>
</li>
<li>
<p><strong>Improvements to generics:</strong> The work on generics will reach completion and some known gaps will be filled.</p>
</li>
<li>
<p><strong>Type system cleanup and documentation of behaviour</strong></p>
</li>
<li>
<p><strong>Complete the alignment of core libraries to the API guidelines</strong></p>
</li>
</ul>
<p>No updates related to new concurrency primitives or C++ interoperability will be included in the next major release, but these are among the new features that will be available in future releases.</p>
<p>Regarding the actual changes coming with Swift 3.0, for now we know that a few proposals have been accepted and will be included:</p>
<h3 id="better-translation-of-objective-c-apis-into-swift"><a href="https://github.com/apple/swift-evolution/blob/master/proposals/0005-objective-c-name-translation.md">Better Translation of Objective-C APIs Into Swift</a></h3>
<p>The automatic translation of Objective-C APIs will be improved to adhere more strictly to what is described in the <a href="https://swift.org/documentation/api-design-guidelines.html">Swift API Design Guidelines</a>, so, as you can imagine, more than a few method names will change.</p>
<p>To give you an idea of some of the changes: the translator will drop the NS prefix from all Foundation classes, boolean properties will gain an <em>is</em> prepended to their name (e.g. a boolean <code class="inlinecode">empty</code> property will become <code class="inlinecode">isEmpty</code>), and methods will lose redundant portions of their name (e.g. <code class="inlinecode">UIBezierPath.moveToPoint(p : CGPoint)</code> will become <code class="inlinecode">UIBezierPath.moveTo(p : CGPoint)</code>, and methods like <code class="inlinecode">UIBezierPath.bezierPathByReversingPath()</code> will simply become <code class="inlinecode">UIBezierPath.reversing()</code>).</p>
<h3 id="removal-of-currying-func-declaration-syntax"><a href="https://github.com/apple/swift-evolution/blob/master/proposals/0002-remove-currying.md">Removal of currying func declaration syntax</a></h3>
<p>To simplify the language by removing some rarely used syntactic sugar, Swift 3.0 will lose the curried function declaration syntax (e.g. <code class="inlinecode">func foo(a:Int)(b:Int)</code>). Not a big loss in my opinion.</p>
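<p>For the record, the same behaviour can still be expressed explicitly with a function that returns a closure; a minimal sketch:</p>
<pre><code class="Swift">
// The removed curried syntax: func add(a: Int)(b: Int) -> Int { return a + b }
// Its explicit, still valid equivalent:
func add(a: Int) -> (Int) -> Int {
    return { b in a + b }
}
let addTwo = add(a: 2)
addTwo(3) // 5
</code></pre>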
<h3 id="removal-of--var-from-function-parameters-and-pattern-matching"><a href="https://github.com/apple/swift-evolution/blob/master/proposals/0003-remove-var-parameters-patterns.md">Removal of var from Function Parameters and Pattern Matching</a></h3>
<p>It will no longer be possible to create local mutable copies of function parameters, or to bind variables inside if/switch/guard/for/etc… using the <em>var</em> keyword. Thus, no more implicit variable shadowing will be performed, and that’s a good thing.</p>
<p>To give you a practical example, this will not be valid Swift 3.0 code:</p>
<pre><code class="Swift">
if var x = getOptionalInt() {
    x += 1
    return x
}
</code></pre>
<p>But something like that could be easily updated this way, introducing a new temporary variable:</p>
<pre><code class="Swift">
if let x = getOptionalInt() {
    var x = x
    x += 1
    return x
}
</code></pre>
<h3 id="removal-of-the--and--operators"><a href="https://github.com/apple/swift-evolution/blob/master/proposals/0004-remove-pre-post-inc-decrement.md">Removal of the ++ and -- operators</a></h3>
<p>The unary increment and decrement operators will be completely removed: they were implemented only by a few types, prone to non-obvious implicit behaviour, and prone to encourage the implementation of some overly “clever” code.</p>
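<p>The replacement is simply the compound assignment operators:</p>
<pre><code class="Swift">
var counter = 0
// counter++  // removed: the value it returned (pre vs post increment) was a common source of confusion
counter += 1  // explicit increment
counter -= 1  // explicit decrement
counter // 0
</code></pre>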
<p>And that’s all for now.</p>
<p>The list of features is obviously not complete yet; more proposals will be discussed and accepted in the following months. I’ll keep an eye on <a href="https://github.com/apple/swift-evolution">Swift-Evolution</a> and the mailing list to follow the progress of the release.</p>
<h3 id="updates">Updates</h3>
<h3 id="101215"><em>10/12/15</em></h3>
<p>The first proposal from the community <a href="https://github.com/apple/swift-evolution/blob/master/proposals/0007-remove-c-style-for-loops.md">has been accepted</a>: there will be no more C-style for loops (the good old indexed for) in Swift 3.0 (a warning will be displayed in Swift 2.2). A lot of additional modifications are being proposed on the mailing list; I recommend checking it out, as many discussions detail the rationale behind current language choices, explanations you will not find anywhere else.</p>
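<p>For reference, a quick sketch of the range- and stride-based forms that replace the C-style loop:</p>
<pre><code class="Swift">
// C-style, removed in Swift 3:
// for var i = 0; i < 10; i += 2 { print(i) }

// Range-based equivalent for unit steps:
for i in 0..<5 {
    print(i)
}
// stride for non-unit steps:
for i in stride(from: 0, to: 10, by: 2) {
    print(i) // 0, 2, 4, 6, 8
}
</code></pre>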
<h3 id="81215"><em>8/12/15</em></h3>
<p>Read an <a href="http://arstechnica.com/apple/2015/12/craig-federighi-talks-open-source-swift-and-whats-coming-in-version-3-0/">interesting interview</a> with Craig Federighi about the future of open source Swift.</p>
<p>After just one day, more than a few additional proposals for modifications are already being discussed on the dedicated mailing list (<a href="https://lists.swift.org/pipermail/swift-evolution/2015-December/date.html">archive here</a>) and proposal documents <a href="https://github.com/apple/swift-evolution/tree/master/proposals">are starting to appear</a> on GitHub.</p>
Experimenting with Swift 3 Sequences and Iterators2015-11-12T00:00:00+01:00https://www.uraimo.com/2015/11/12/experimenting-with-swift-2-sequencetype-generatortypeIn this article, part of a series on Swift and the functional approach, we'll explore what we need to do to build our own sequences in Swift 3, discuss the differences between finite and infinite sequences and examine what we can do with them in a few example scenarios.<p><strong>Update 02/17:</strong> <em>Improved the code snippets, with less cryptic variable names.</em></p>
<p><strong>Update 10/16:</strong> <em>This post has been updated to Swift 3.</em></p>
<p>In this article, part of a series on <a href="http://uraimo.com/category/functional/">Swift and the functional approach</a>, we’ll explore what we need to do to build our own sequences in Swift 3, discuss the differences between finite and infinite sequences and examine what we can do with them in a few example scenarios.</p>
<p><img src="/imgs/sequences.png" srcset="/imgs/sequences@2x.png 2x" alt="A Sequence" /></p>
<h4 id="get-the-playground-for-this-post-from-github-or-zipped"><em>Get the playground for this post from <a href="https://github.com/uraimo/Swift-Playgrounds/">GitHub</a> or <a href="https://github.com/uraimo/Swift-Playgrounds/raw/master/archives/2015-11-12-SequenceTypeGeneratorTypePlayground.playground.zip">zipped</a>.</em></h4>
<p>The <code class="inlinecode">Sequence</code> standard protocol is defined in the documentation simply as <em>a type that can be iterated with a for…in loop</em>. The section of the protocol definition relevant to our custom implementation is in the top half:</p>
<pre><code class="Swift">
public protocol Sequence {
    /// A type that provides the sequence's iteration interface and
    /// encapsulates its iteration state.
    associatedtype Iterator : IteratorProtocol
    /// Returns an iterator over the elements of this sequence.
    func makeIterator() -> Iterator
    ...
    ...
}
</code></pre>
<p>The protocol contains an <em>associated type</em> (Swift’s somewhat unusual way of making protocols generic) that refers to the <code class="inlinecode">IteratorProtocol</code> protocol; we’ll need to implement this too in some way when creating a sequence. Our custom <code class="inlinecode">Sequence</code> will return a custom iterator with a specific element type when <code class="inlinecode">makeIterator()</code> is called.</p>
<p>Sequences also provide many other interesting methods, already implemented via protocol extensions, like <strong>map</strong>, <strong>flatmap</strong> (check out my in-depth article on <a href="http://www.uraimo.com/2015/10/08/Swift2-map-flatmap-demystified/">map and flatMap</a>), <strong>filter</strong>, <strong>reduce</strong>, <strong>subsequence functions</strong> and others.</p>
<p>Having these for free makes <code class="inlinecode">Sequence</code> a bit more useful than just a type that can be used with a for-each loop.</p>
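<p>A quick illustration of those free protocol extension methods, here applied to a range (which is itself a sequence):</p>
<pre><code class="Swift">
let evens = (1...10).filter { $0 % 2 == 0 } // [2, 4, 6, 8, 10]
let doubled = evens.map { $0 * 2 }          // [4, 8, 12, 16, 20]
let total = doubled.reduce(0, +)            // 60
</code></pre>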
<p>Let’s take a look at the <code class="inlinecode">IteratorProtocol</code> definition:</p>
<pre><code class="Swift">
public protocol IteratorProtocol {
    /// The type of element traversed by the iterator.
    associatedtype Element
    ...
    ...
    /// - Returns: The next element in the underlying sequence if a next element
    ///   exists; otherwise, `nil`.
    mutating func next() -> Element?
}
</code></pre>
<p>This simple protocol contains just a <code class="inlinecode">next()</code> method, responsible for returning the next element in the sequence managed by this iterator. It’s <strong>very</strong> important that the iterator returns <em>nil</em> when the sequence ends; we’ll see why below, when we build an infinite sequence.</p>
<p>Let’s build a simple iterator that produces numbers from the well known Fibonacci sequence:</p>
<pre><code class="Swift">
class FibonacciIterator : IteratorProtocol {
    var nextValues = (0, 1)
    var stopsAt: Int
    var iterationsCount = 0

    init(iteratorLength: Int) {
        stopsAt = iteratorLength
    }

    func next() -> Int? {
        guard iterationsCount < stopsAt else {
            return nil
        }
        iterationsCount += 1
        let next = nextValues.0
        nextValues = (nextValues.1, nextValues.0 + nextValues.1)
        return next
    }
}
</code></pre>
<p>To return a finite sequence we need an additional initializer that we’ll use to specify the sequence length, so we can return <em>nil</em> instead of a new element when we reach it. There is not much else to see here other than the tuple swap trick that saves us a few lines, but let’s see how to use this iterator:</p>
<pre><code class="Swift">
var fg = FibonacciIterator(iteratorLength: 10)
while let fib = fg.next() {
    print(fib)
}
</code></pre>
<p>This way we’ll iterate on the elements until <em>nil</em> is returned.</p>
<p>Implementing a <em>Sequence</em> for this iterator is straightforward:</p>
<pre><code class="Swift">
class FibonacciSequence : Sequence {
    var stopsAt: Int

    init(sequenceLength: Int) {
        stopsAt = sequenceLength
    }

    func makeIterator() -> FibonacciIterator {
        return FibonacciIterator(iteratorLength: stopsAt)
    }
}

let arr = Array(FibonacciSequence(sequenceLength: 10))
for f in FibonacciSequence(sequenceLength: 10) {
    print(f)
}
</code></pre>
<p>The sequence can be used in a for-each as expected, but it can also be used to create other collections, like the array seen above.</p>
<p>But there is no need to declare the iterator as a separate entity; we can use the <code class="inlinecode">AnyIterator<T></code> class to make this example more compact:</p>
<pre><code class="Swift">
class CompactFibonacciSequence : Sequence {
    var stopsAt: Int

    init(sequenceLength: Int) {
        stopsAt = sequenceLength
    }

    func makeIterator() -> AnyIterator<Int> {
        var nextValues = (0, 1)
        var iterationsCount = 0
        return AnyIterator {
            guard iterationsCount < self.stopsAt else {
                return nil
            }
            iterationsCount += 1
            let next = nextValues.0
            nextValues = (nextValues.1, nextValues.0 + nextValues.1)
            return next
        }
    }
}
</code></pre>
<p>This will work exactly like the previous sequence; the only substantial difference is that the <code class="inlinecode">AnyIterator<Int></code> returned by <code class="inlinecode">makeIterator()</code> now conforms to <em>Sequence</em> too, it’s no longer just a simple object implementing <em>IteratorProtocol</em> like the one we started with.</p>
<p>Not really that useful here, considering that the iterator is already embedded in a sequence, but in some circumstances, a simple sequence generated with <code class="inlinecode">AnyIterator(body:)</code> could be more than enough for what we want to do.</p>
<p>For instance, we could create a sequence with the first 10 numbers of the <a href="https://en.wikipedia.org/wiki/Lucas_number">Lucas sequence</a>, a numeric series similar to Fibonacci that starts with <em>2,1</em> instead of <em>0,1</em>, generating a quite different sequence (i.e. 2, 1, 3, 4, 7, 11, 18, 29, etc…), using just an iterator and initializing an array with it:</p>
<pre><code class="Swift">
var nextValues = (2, 1)
var iterationsCount = 0
let lucas = AnyIterator { () -> Int? in
    guard iterationsCount < 10 else {
        return nil
    }
    iterationsCount += 1
    let next = nextValues.0
    nextValues = (nextValues.1, nextValues.0 + nextValues.1)
    return next
}
let a = Array(lucas) //[2, 1, 3, 4, 7, 11, 18, 29, 47, 76]
</code></pre>
<p>Definitely not bad, we removed a lot of boilerplate, but since we can improve our algorithm further with a formula involving the <a href="https://en.wikipedia.org/wiki/Golden_ratio">golden ratio</a>, let’s do it:</p>
<pre><code class="Swift">
import Darwin

let Phi = (sqrt(5) + 1.0)/2
let phi = 1/Phi

func luc(n: Int) -> Int {
    // Rounding guards against floating point error before truncating to Int
    return Int((pow(Phi, Double(n)) + pow(-phi, Double(n))).rounded())
}

var c = 0
let compactLucas = AnyIterator { () -> Int? in
    defer { c += 1 }  // advance the counter on every call
    return c < 10 ? luc(n: c) : nil
}
let a2 = Array(compactLucas) //[2, 1, 3, 4, 7, 11, 18, 29, 47, 76]
</code></pre>
<p>Does it really work? Yes, feel free to play around with it using the <a href="https://github.com/uraimo/Swift-Playgrounds/">playground</a> (<a href="https://github.com/uraimo/Swift-Playgrounds/raw/master/archives/2015-11-12-SequenceTypeGeneratorTypePlayground.playground.zip">zip</a>).</p>
<p>To try out some of the functional(ish) facilities that <code class="inlinecode">Sequence</code> provides, we’ll now build a derived sequence that will only return <em>even</em> numbers from the Lucas sequence:</p>
<pre><code class="Swift">
c = 0
let evenCompactLucas = AnyIterator { () -> Int? in
    defer { c += 1 }
    return c < 10 ? luc(n: c) : nil
}.filter { $0 % 2 == 0 }
let a3 = Array(evenCompactLucas) //[2, 4, 18, 76]
</code></pre>
<p>Notice that we are redeclaring our <code class="inlinecode">AnyIterator</code> because the previous one has already been used up and will return no more elements: it has reached the end of the sequence it was able to generate, and from now on it will only return <em>nil</em>. That aside, you can also notice how easily we modified the original sequence to return a modified set of objects. We could perform even bolder transformations using the map methods.</p>
<h3 id="infinite-sequences">Infinite Sequences</h3>
<p>But now, what if we remove the nil termination requirement described above to build an infinite sequence of all the possible Lucas numbers?</p>
<pre><code class="Swift">
c = 0
let infiniteLucas = AnyIterator { () -> Int? in
    defer { c += 1 }
    return luc(n: c)
}
</code></pre>
<p>Converting the original finite sequence was easy, and now we have a new sequence with no limit on the number of results it can generate. But it’s also easy to understand that we’ll now need a way to limit the number of elements it produces, to be able to traverse the sequence using the usual control flow constructs.</p>
<p>And luckily the <code class="inlinecode">Sequence</code> protocol comes to the rescue with one of its methods:</p>
<pre><code class="Swift">
let a4 = Array(infiniteLucas.prefix(10)) //[2, 1, 3, 4, 7, 11, 18, 29, 47, 76]
for f in infiniteLucas.prefix(10) {
    print(f)
}
</code></pre>
<p>This way we’ll extract 10 elements from the sequence into a newly created sequence, and use it as we did in the previous examples.</p>
<p>But let’s go a step further and again apply a <em>filter</em> to our sequence, to obtain a sequence of even Lucas numbers:</p>
<pre><code class="Swift">
var onlyEvenLucas = infiniteLucas.filter({$0 % 2 == 0})
for f in onlyEvenLucas.prefix(10) {
    print(f)
}
</code></pre>
<p><em>Well… this will not work as expected.</em></p>
<p>Assuming you are using a playground, you’ll see an error where we declared <code class="inlinecode">onlyEvenLucas</code>, highlighting that a stack overflow happened. If you wrote this in an application, you’ll likely see the application crash instead.</p>
<p>The reason why this happens is related to how <em>filter</em> works on normal sequences, as you may already know. When we apply a filter to the original sequence, the filtering is carried out instantly and all the elements of the sequence are consumed eagerly, but without the terminating <em>nil</em> we have no way to specify when this operation should complete.</p>
<p>Let’s see visually what’s happening using a more verbose infinite sequence of integers that will print some text every time a value is requested from the iterator:</p>
<pre><code class="Swift">
class InfiniteSequence : Sequence {
    func makeIterator() -> AnyIterator<Int> {
        var i = 0
        return AnyIterator {
            print("# Returning " + String(i))
            i += 1
            return i
        }
    }
}

var fs = InfiniteSequence().filter({$0 % 2 == 0}).makeIterator()
for i in 1...5 {
    print(fs.next())
}
</code></pre>
<p>If you run this you’ll verify the behavior described above: the filtering on <code class="inlinecode">InfiniteSequence</code> will start consuming the sequence… <em>until it can no longer proceed, a few minutes or hours later.</em></p>
<p>Luckily, obtaining the behavior we expected is again quite easy, we just need to <strong>lazily evaluate</strong> the infinite Lucas sequence:</p>
<pre><code class="Swift">
var onlyEvenLucas = infiniteLucas.lazy.filter({$0 % 2 == 0})
for f in onlyEvenLucas.prefix(10) {
    print(f)
}
</code></pre>
<p>Retrieving <code class="inlinecode">.lazy</code> from the original sequence, we’ll get a new <code class="inlinecode">LazySequence</code> on which operations such as <strong>map</strong>, <strong>flatMap</strong>, <strong>reduce</strong> or <strong>filter</strong> will be executed lazily: the real evaluation will be performed on demand, only when a terminal operation (as other languages call them) further down the chain, such as <code class="inlinecode">next</code> or something that needs the whole sequence content, is performed.</p>
<p>Making your infinite sequences lazy is a required step, since Swift sequences are not lazy by default (they were in the first few releases of Swift 1.0). Detailed information about how to implement <em>LazySequenceProtocol</em> directly (which most of the time could be the right approach) is available in the <a href="http://swiftdoc.org/v3.0/type/LazySequence/">documentation</a>, and it’s likely I’ll do a future post on it.</p>
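<p>As a side note, the standard library’s <code class="inlinecode">sequence(first:next:)</code> offers another compact way to build an infinite sequence; combined with <code class="inlinecode">.lazy</code>, the filter below becomes a deferred step instead of an eager (and never-ending) pass:</p>
<pre><code class="Swift">
let naturals = sequence(first: 1) { $0 + 1 }         // 1, 2, 3, ...
let lazyEvens = naturals.lazy.filter { $0 % 2 == 0 } // nothing evaluated yet
let firstFive = Array(lazyEvens.prefix(5))           // [2, 4, 6, 8, 10]
</code></pre>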
<p><em>Note: Thanks to Rennie for his suggestion of making the samples a bit less concise :)</em></p>
Error Handling: From Objective-C to Swift and Back2015-11-03T00:00:00+01:00https://www.uraimo.com/2015/11/03/error-handling-from-objective-c-to-swift-2-and-backSwift introduces error handling constructs like do,try,catch and try?, in this, I hope, comprehensive article we'll discuss this new feature in detail, how it affects the base frameworks and how Swift modules with the new error handling can be integrated in legacy Objective-C applications.<p><strong>Update 10/16:</strong><em>This post has been updated to Swift 3</em></p>
<h4 id="the-code-for-this-article-is-available-as-a-playground-on-github-or-zipped"><em>The code for this article is available as a playground on <a href="https://github.com/uraimo/Swift-Playgrounds">Github</a> or <a href="https://www.uraimo.com/archives/2015-11-Swift2ErrorHandling.zip">zipped</a>.</em></h4>
<p>Swift introduces error handling constructs like <strong>do/catch</strong> and <strong>try</strong> and its variants. In this article we’ll discuss this new feature, how it affects the base frameworks, and how Swift modules that manage errors this way can be integrated into legacy Objective-C applications.</p>
<p><img src="/imgs/error_header.png" alt="trycatch" /></p>
<p>In the first releases of Swift, error handling was performed in a way that essentially mimicked how it has always been performed in Objective-C (the approach recommended by <a href="https://developer.apple.com/library/ios/documentation/Cocoa/Conceptual/ProgrammingWithObjectiveC/ErrorHandling/ErrorHandling.html">Apple</a>).</p>
<p>In the good old days of Objective-C, when a method could fail with a recoverable error a <code class="inlinecode">NSError</code> pointer was added as the last parameter of the function, and in case of errors it was used to return a description of what happened.</p>
<p>Unrecoverable errors, which could prevent the application from continuing its execution normally, were <em>sometimes</em> handled with exceptions, which Objective-C also supported. Using <code class="inlinecode">NSError</code> for error handling was definitely the favorite approach.</p>
<p>Back to Swift, something like this in Objective-C:</p>
<pre><code class="Objective-C">
NSError *error = nil;
NSArray *imageFiles = [[NSFileManager defaultManager] contentsOfDirectoryAtPath:@"./" error: &error];
</code></pre>
<p>was translated in Swift 1.2 as:</p>
<pre><code class="Swift">
var error: NSError? = nil
let manager = NSFileManager.defaultManager()
var array = manager.contentsOfDirectoryAtPath(path:"./", error: error)
</code></pre>
<p>The <code class="inlinecode">contentsOfDirectoryAtPath</code> function was defined this way:</p>
<pre><code class="Swift">
contentsOfDirectoryAtPath(path: String, error error: NSErrorPointer) -> [AnyObject]?
</code></pre>
<p>And NSErrorPointer was defined as:</p>
<pre><code class="Swift">
typealias NSErrorPointer = AutoreleasingUnsafePointer<NSError?>
</code></pre>
<p>Even without going into the details of the types involved, it’s easy to see that in Swift 1.x there was nothing new about how errors were handled.</p>
<p>Everything changed with Swift 2.0.</p>
<p>Let’s see how error handling works (<a href="https://github.com/uraimo/Swift-Playgrounds">download the playground</a> if you want to play with these examples) before discussing how a Swift component using the new constructs can be integrated in a legacy Objective-C application (hint: it’s not that hard).</p>
<h2 id="error-handling-in-swift">Error Handling in Swift</h2>
<p>The following simple example shows the basic syntax:</p>
<pre><code class="Swift">
enum MyError : Error {
    case AnError
    case AnotherError
    case JustAnotherError
}

func throwsError() throws -> Int {
    throw MyError.AnotherError
}

do {
    try throwsError()
} catch MyError.AnError {
    print("AnError")
} catch MyError.AnotherError {
    print("AnotherError") // AnotherError will be caught and printed
} catch {
    print("Something else happened")
}

do {
    do {
        try throwsError()
    } catch MyError.AnError {
        print("AnError")
    }
} catch MyError.AnotherError {
    print("AnotherError") // AnotherError will be caught and printed
} catch {
    print("Something else happened")
}
</code></pre>
<p>This example creates a custom error by defining a new enum that conforms to the <code class="inlinecode">Error</code> protocol; each value refers to a different condition for this error.</p>
<p>When an error needs to be returned from the function, we simply <em>throw</em> one of the values available in the <code class="inlinecode">MyError</code> enum; the function completes its execution and returns control to the caller, which will handle the error. Notice that a throwing function has to state explicitly that it could, under some circumstances, throw an error, by adding the <code class="inlinecode">throws</code> keyword to its declaration.</p>
<p>To perform the actual error handling, calls to functions that could throw need to be preceded by <code class="inlinecode">try</code> (or one of its alternatives, as we’ll see) and need to be enclosed in a <code class="inlinecode">do/catch</code> block, which defines the context in which the errors will be managed.</p>
<p>Each catch handles a specific error in its body. In the example above, our function always throws the same error, which is handled by the second catch, resulting in the name of the error being printed to the console.</p>
<p>The <em>do/catch</em> does not have to cover all possible error conditions: if no catch block is able to handle an error, the error is simply propagated to the outer scope, and so on, until a catch able to manage the error is found.</p>
<p>The nested <em>do/catch</em> in the previous snippet shows that in action: the inner <em>do/catch</em> handles only <code class="inlinecode">.AnError</code> errors, while the surrounding <em>do/catch</em> is able to handle the remaining alternatives.</p>
<p>But Swift error handling has a lot more to offer, as shown in this more complex example:</p>
<pre><code class="Swift">
enum MyError2: Error {
    case GenericError
    case DetailedError(String)
    case NumericError(Int)
}

func throwsDetailedError() throws -> Int {
    throw MyError2.DetailedError("Some details here")
}

func shouldNeverThrow() throws -> Int {
    return 0
}

do {
    defer {
        //Clean up
    }
    try throwsDetailedError()
    var value = try! shouldNeverThrow()
    var imNil = try? throwsDetailedError()
} catch MyError2.GenericError {
    print("GenericError")
} catch MyError2.DetailedError(let message) {
    print("Error: \(message)") //Will print Error: Some details here
} catch MyError2.NumericError(let number) where number > 0 {
    print("Error with id: " + String(number))
} catch {
    print("Something else happened: " + String(describing: error))
}
</code></pre>
<p>There is much more going on in this example: this time our custom error can also carry parameters for specific conditions, and those parameters can then be bound to a variable in the catch clause. If no error type or binding variable is specified, as in the last catch, the error is automatically bound to an <code class="inlinecode">error</code> variable.</p>
<p>Another interesting thing is the use of <code class="inlinecode">defer</code> (I have placed it inside the <em>do/catch</em> but it doesn’t necessarily need to be there, it could be at the beginning of the current function), which provides the same functionality that is usually provided by a <code class="inlinecode">finally</code> block in Objective-C and other languages. The code contained in defer is guaranteed to be executed, no matter what, and because of this it is usually used to perform mandatory clean-up operations.</p>
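<p>To make that guarantee concrete, here is a minimal standalone sketch (the <code class="inlinecode">processResource</code> function is illustrative, not part of the article’s example) showing that a deferred block runs when the enclosing scope exits, after the rest of the body:</p>
<pre><code class="Swift">
func processResource() {
    print("acquire")
    defer {
        // Runs when the scope exits, even on an early return or a throw
        print("release")
    }
    print("work")
}

processResource() // prints acquire, work, release
</code></pre>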
<p>Perhaps more interesting is the ability to perform pattern matching in catch clauses just as you are used to with switches. The code above contains a very simple example (a <code class="inlinecode">where</code> clause on the bound value), just to show that it can be done, but everything available for switches is available here.</p>
<p>And then there is the first variation on <code class="inlinecode">try</code>.</p>
<p>Using <code class="inlinecode">try!</code> you are disabling error propagation for the <code class="inlinecode">shouldNeverThrow</code> call and wrapping it in a runtime assertion that will generate a runtime error (and crash your application) if the function throws. This allows you to skip error handling in those situations where you can be completely sure that, even though a function is declared to throw, no error will actually ever be thrown.</p>
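<p>A minimal sketch of when <code class="inlinecode">try!</code> is and isn’t safe (the <code class="inlinecode">parse</code> helper is hypothetical, not part of the article):</p>
<pre><code class="Swift">
enum ParseError: Error {
    case notANumber
}

func parse(_ s: String) throws -> Int {
    guard let n = Int(s) else { throw ParseError.notANumber }
    return n
}

// Safe: a literal that always parses, so the call can never actually throw
let n = try! parse("42")

// Unsafe: this would crash at runtime instead of propagating the error
// let m = try! parse("not a number")
</code></pre>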
<p>The second and last variation of try is <code class="inlinecode">try?</code>, which handles errors by producing an optional value that will contain the returned value if available, or that will be nil in case of an error. You lose information about what kind of error was thrown (which may not be important), but you gain the ability to use the resulting optional in combination with all the statements that support them, from <code class="inlinecode">if let</code> to <a href="http://www.uraimo.com/2015/10/08/Swift2-map-flatmap-demystified/">map &amp; flatMap</a>.</p>
<p>This is just an example of what you could do using <a href="http://www.uraimo.com/2015/10/08/Swift2-map-flatmap-demystified/">map/flatMap</a>:</p>
<pre><code class="Swift">
var convertedInt = (try? shouldNeverThrow()).map { String($0) }
convertedInt /* String? with content "0" */
</code></pre>
<p>Not really that useful taken alone, but with a lot of functions possibly throwing errors, something like this, or simply a wise use of optionals, could help in some cases to turn code littered with do/catches into something more readable.</p>
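<p>As a sketch of that idea (the <code class="inlinecode">loadConfig</code> function is hypothetical), a do/catch used only to fall back to a default value can collapse into a single expression with <code class="inlinecode">try?</code> and nil-coalescing:</p>
<pre><code class="Swift">
enum ConfigError: Error {
    case missing
}

func loadConfig() throws -> String {
    throw ConfigError.missing
}

// With do/catch:
var config: String
do {
    config = try loadConfig()
} catch {
    config = "default"
}

// The same intent in one line:
let config2 = (try? loadConfig()) ?? "default"
</code></pre>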
<h2 id="base-frameworks-and-error-handling">Base Frameworks and Error Handling</h2>
<p>With the introduction of the new error handling in Swift 2.0, the <code class="inlinecode">contentsOfDirectoryAtPath</code> function and all the functions of the base frameworks that return NSErrors now have a slightly different prototype:</p>
<pre><code class="Swift">
func contentsOfDirectoryAtPath(path: String) throws -> [String]
</code></pre>
<p>Every function that returned errors using an <code class="inlinecode">NSError</code> is now a function that throws, but with one less parameter, and what the function throws is none other than the original <code class="inlinecode">NSError</code>. And if you check the documentation on <code class="inlinecode">NSError</code>, you’ll notice that, among other protocols, it now implements the <code class="inlinecode">Error</code> protocol.</p>
<p>And this alteration of the function prototype is the result of automatic bridging, as described in Apple’s “<a href="https://developer.apple.com/library/prerelease/ios/documentation/Swift/Conceptual/BuildingCocoaApps/WorkingWithCocoaDataTypes.html#//apple_ref/doc/uid/TP40014216-CH6-ID61">Using Swift with Cocoa and Objective-C</a>”:</p>
<blockquote>
<p>Swift automatically bridges between the Error type and the NSError class.</p>
</blockquote>
<blockquote>
<p>Objective-C methods that produce errors are imported as Swift methods that throw, and Swift methods that throw are imported as Objective-C methods that produce errors, according to Objective-C error conventions.</p>
</blockquote>
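<p>On the Swift side, this means the thrown value can still be caught and inspected as an <code class="inlinecode">NSError</code>; a minimal sketch (using the Swift 3 Foundation naming, which differs from the Swift 2 name quoted above):</p>
<pre><code class="Swift">
import Foundation

do {
    let files = try FileManager.default.contentsOfDirectory(atPath: "/path/that/does/not/exist")
    print(files)
} catch let error as NSError {
    // The bridged error keeps the original NSError domain and code
    print("Domain: \(error.domain) Code: \(error.code)")
}
</code></pre>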
<p>Let’s see what this means for Objective-C projects that need to integrate Swift code.</p>
<h2 id="swift-error-handling-and-legacy-objective-c-applications">Swift Error Handling and Legacy Objective-C Applications</h2>
<p>Suppose that you have an existing Objective-C project and that you plan to migrate gradually to Swift, or that you simply want to extend an existing Objective-C application with a new Swift component, essentially turning your project into a mixed language application.</p>
<p>This section will use a simple OS X console application to show how this can be done; to follow along, create a new console application or get the <a href="https://github.com/uraimo/Swift-Playgrounds">full sample on Github</a>.</p>
<p>Once the project has been created, to start writing some Swift simply add a new Swift file to your project; when asked if a bridging header needs to be created, click <em>No</em>. With a bridging header you would be able to use Objective-C classes in Swift; in this section we’ll do the opposite.</p>
<p>Paste this code in your newly created Swift file:</p>
<pre><code class="Swift">
import Foundation

@objc enum MyError: Int, Error {
    case AnError
    case AnotherError
}

public class MyClass: NSObject {
    public func throwAnError() throws {
        throw MyError.AnotherError
    }

    public func callMe() {
        print("Someone called!")
    }
}
</code></pre>
<p>We are going to invoke these two methods from Objective-C, but before that it’s important to know the module name of our project as shown under <em>Product Module Name</em> in <em>Build Settings</em>:</p>
<p><img src="/imgs/modulename.png" alt="product module name" /></p>
<p>To access the Swift code we just wrote from Objective-C, we just need to import an auto-generated header with a name that follows the format <em>ProductModuleName-Swift.h</em> (the actual file will be placed in the temporary build directory of your project and regenerated when needed).</p>
<p>Considering that our product name was <em>ErrorHandling</em>, the generated Swift header file will be named <em>ErrorHandling-Swift.h</em>.</p>
<p>The <code class="inlinecode">main.m</code> of your project should look like this:</p>
<pre><code class="Objective-C">
#import <Foundation/Foundation.h>
#import "ErrorHandling-Swift.h"
int main(int argc, const char * argv[]) {
    @autoreleasepool {
        MyClass* c = [MyClass new];
        NSError* err = nil;

        [c throwAnErrorAndReturnError:&err];
        NSLog(@"Domain:%@ Code:%ld Message:%@", err.domain, (long)err.code, err.localizedDescription);

        [c callMe];
    }
    return 0;
}
</code></pre>
<p>In this example, we are simply creating a new instance of <code class="inlinecode">MyClass</code> and then invoking its two methods in sequence. There is not much to see other than the fact that the name of our <code class="inlinecode">throwAnError</code> function is now <code class="inlinecode">throwAnErrorAndReturnError</code>.
This is the result of the automatic bridging process described in the previous section.</p>
<p>The header that has been generated for us has this content:</p>
<pre><code class="Objective-C">
SWIFT_CLASS("_TtC13ErrorHandling7MyClass")
@interface MyClass : NSObject
- (BOOL)throwAnErrorAndReturnError:(NSError * __nullable * __null_unspecified)error;
- (void)callMe;
- (nonnull instancetype)init OBJC_DESIGNATED_INITIALIZER;
@end

typedef SWIFT_ENUM(NSInteger, MyError) {
  MyErrorAnError = 0,
  MyErrorAnotherError = 1,
};

static NSString * const MyErrorDomain = @"ErrorHandling.MyError";
</code></pre>
<p>As expected, the bridging process has added an NSError parameter to our error throwing function but also appended <em>AndReturnError</em> to its name following a common naming convention.</p>
<p>Also, since we added the <code class="inlinecode">@objc</code> modifier to our enum (an optional step, not required to perform the bridging), we now have a convenient Objective-C enum that can be used in conjunction with the error code contained in the NSError.</p>
<p>Running our program produces the following output:</p>
<pre><code class="Objective-C">
ErrorHandling[8104:413890] Domain:MyError Code:1 Message:The operation couldn’t be completed. (MyError error 1.)
Someone called!
Program ended with exit code: 0
</code></pre>
<p>Even if calling Swift code from Objective-C is not as straightforward as doing the opposite, the process is still painless; as promised at the beginning of the article, error handling in a mixed language project is not that hard.</p>
Effective Method Swizzling in Swift2015-10-23T00:00:00+02:00https://www.uraimo.com/2015/10/23/effective-method-swizzling-with-swiftDespite the static nature of the language, it's still possible to perform method swizzling (runtime method implementation substitution) in Swift following a few simple rules.<p><strong>Update 11/16:</strong><em>This post and the example project have been updated to Swift 3 with the new dispatch_once syntax.</em></p>
<h4 id="get-the-sample-project-for-this-article-from-github-or-zipped"><em>Get the sample project for this article from <a href="https://github.com/uraimo/SwizzlingInSwift/">GitHub</a> or <a href="https://github.com/uraimo/SwizzlingInSwift/archive/master.zip">zipped</a>.</em></h4>
<p>Method Swizzling is a well known practice in Objective-C and in other languages that support dynamic method dispatching.</p>
<p>Through swizzling, the implementation of a method can be replaced with a different one <em>at runtime</em>, by changing the mapping between a specific #selector(method) and the function that contains its implementation.</p>
<p><img src="/imgs/swizzling.png" srcset="/imgs/swizzling@2x.png 2x" alt="Swizzling diagram" /></p>
<p>While this seems extremely convenient, this functionality does not come without drawbacks. When performing this sort of alteration at runtime, you can’t take advantage of all the safety checks that are usually available at compile time. Swizzling is something that should be used with care.</p>
<p>The definitive article on how to swizzle in Objective-C is available on <a href="http://nshipster.com/method-swizzling/">NSHipster</a> (and some additional details are <a href="https://www.mikeash.com/pyblog/friday-qa-2010-01-29-method-replacement-for-fun-and-profit.html">here</a>) and a comprehensive discussion on the perils of using method swizzling can be found on <a href="http://stackoverflow.com/q/5339276">Stackoverflow</a>.</p>
<p>Swift takes a static approach regarding method dispatching, but it’s still possible to perform method swizzling if some conditions are met.</p>
<p>Before giving you some pointers on how to use swizzling with Swift, let me reiterate that this technique should be used sparingly, only when a more “swifty” alternative to solve your problem does not exist, and should not be considered a real alternative to subclassing or to the use of protocols and extensions.</p>
<p>As described in another article on <a href="http://nshipster.com/swift-objc-runtime/">NSHipster</a>, performing swizzling in Swift with a class from one of the base frameworks (Foundation, UIKit, etc…), except for a few gotchas, is not that different from what you were used to in Objective-C:</p>
<pre><code class="Swift">
extension UIViewController {

    public override static func initialize() {
        // Make sure this isn't a subclass
        if self !== UIViewController.self {
            return
        }

        struct Inner {
            static let i: () = {
                let originalSelector = #selector(UIViewController.viewWillAppear(_:))
                let swizzledSelector = #selector(UIViewController.newViewWillAppear(_:))

                let originalMethod = class_getInstanceMethod(UIViewController.self, originalSelector)
                let swizzledMethod = class_getInstanceMethod(UIViewController.self, swizzledSelector)

                let didAddMethod = class_addMethod(UIViewController.self, originalSelector, method_getImplementation(swizzledMethod), method_getTypeEncoding(swizzledMethod))

                if didAddMethod {
                    class_replaceMethod(UIViewController.self, swizzledSelector, method_getImplementation(originalMethod), method_getTypeEncoding(originalMethod))
                } else {
                    method_exchangeImplementations(originalMethod, swizzledMethod)
                }
            }()
        }
        let _ = Inner.i
    }

    // MARK: - Method Swizzling

    func newViewWillAppear(animated: Bool) {
        self.newViewWillAppear(animated)
        // descriptiveName is assumed to be defined elsewhere (e.g. as an associated property)
        if let name = self.descriptiveName {
            print("viewWillAppear: \(name)")
        } else {
            print("viewWillAppear: \(self)")
        }
    }
}
</code></pre>
<p>In this example, additional operations need to be performed for every UIViewController in the application, but the original behavior of the <code class="inlinecode">viewWillAppear</code> method needs to be preserved; this can be done only through swizzling.</p>
<p>The <code class="inlinecode">viewWillAppear</code> method implementation will be replaced with the implementation of a new method named <code class="inlinecode">newViewWillAppear</code> in <code class="inlinecode">initialize</code>. <em>Note that after the swizzling, what in the code seems to be a recursive call to <code class="inlinecode">newViewWillAppear</code> will become a call to the original <code class="inlinecode">viewWillAppear</code> method.</em></p>
<p>The first difference from the recommended Objective-C approach is that the swizzling is not performed in <code class="inlinecode">load</code>.</p>
<p>The <code class="inlinecode">load</code> method is guaranteed to be called when the definition of a class is loaded and this makes it the right place to perform method swizzling.</p>
<p>But <code class="inlinecode">load</code> is an Objective-C-only method and cannot be overridden in Swift; trying to do it anyway will result in a compile time error. The next best place to perform the swizzling is in <code class="inlinecode">initialize</code>, a method called right before the first method of your class is invoked.</p>
<p>Enclosing all the operations that modify the methods in the lazy initializer of a static constant ensures that the procedure will be performed only once (since the initialization of global and static variables uses dispatch_once behind the scenes).</p>
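<p>The one-time guarantee is easy to verify in isolation; a minimal sketch (the <code class="inlinecode">Counter</code> and <code class="inlinecode">Once</code> names are illustrative):</p>
<pre><code class="Swift">
final class Counter {
    static var runs = 0
}

struct Once {
    // The initializer of a static constant runs at most once, lazily
    static let token: () = {
        Counter.runs += 1
    }()
}

let _ = Once.token
let _ = Once.token
print(Counter.runs) // 1: the initialization block ran only once
</code></pre>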
<p>And that’s what you need to know for classes from base frameworks or for bridged Objective-C classes. When instead, you plan to use pure Swift classes there are a few additional things you should keep in mind to be able to perform method swizzling correctly.</p>
<h2 id="method-swizzling-with-swift-classes">Method Swizzling with Swift classes</h2>
<p>To use method swizzling with your Swift classes there are two requirements that you must comply with:</p>
<ul>
<li>The class containing the methods to be swizzled must extend NSObject</li>
<li>The methods you want to swizzle must have the dynamic attribute</li>
</ul>
<p>More information about why this is necessary can be found in Apple’s “<a href="https://developer.apple.com/library/prerelease/ios/documentation/Swift/Conceptual/BuildingCocoaApps/InteractingWithObjective-CAPIs.html#//apple_ref/doc/uid/TP40014216-CH4-XID_38">Using Swift with Cocoa and Objective-C</a>”:</p>
<blockquote>
<p><strong>Requiring Dynamic Dispatch</strong></p>
<p>While the @objc attribute exposes your Swift API to the Objective-C runtime, it does not guarantee dynamic dispatch of a property, method, subscript, or initializer. <strong>The Swift compiler may still devirtualize or inline member access to optimize the performance of your code, bypassing the Objective-C runtime</strong>. When you mark a member declaration with the dynamic modifier, access to that member is always dynamically dispatched. Because declarations marked with the dynamic modifier are dispatched using the Objective-C runtime, they’re implicitly marked with the @objc attribute.</p>
<p>Requiring dynamic dispatch is rarely necessary. <strong>However, you must use the dynamic modifier when you know that the implementation of an API is replaced at runtime</strong>. For example, you can use the <code class="inlinecode">method_exchangeImplementations</code> function in the Objective-C runtime to swap out the implementation of a method while an app is running. If the Swift compiler inlined the implementation of the method or devirtualized access to it, the new implementation would not be used.</p>
</blockquote>
<p>This also means that you can’t perform swizzling if the method that you want to replace has not been declared as dynamic.</p>
<p>Let’s see how this translates to code:</p>
<pre><code class="Swift">
class TestSwizzling: NSObject {
    dynamic func methodOne() -> Int {
        return 1
    }
}

extension TestSwizzling {
    // In Objective-C you'd perform the swizzling in load(),
    // but that method cannot be overridden in Swift
    override class func initialize() {
        // Perform this one time only
        struct Inner {
            static let i: () = {
                let originalSelector = #selector(TestSwizzling.methodOne)
                let swizzledSelector = #selector(TestSwizzling.methodTwo)
                let originalMethod = class_getInstanceMethod(TestSwizzling.self, originalSelector)
                let swizzledMethod = class_getInstanceMethod(TestSwizzling.self, swizzledSelector)
                method_exchangeImplementations(originalMethod, swizzledMethod)
            }()
        }
        let _ = Inner.i
    }

    func methodTwo() -> Int {
        // It will not be a recursive call anymore after the swizzling
        return methodTwo() + 1
    }
}

var c = TestSwizzling()
print(c.methodOne()) //2
print(c.methodTwo()) //1
</code></pre>
<p>In this simplified example the implementations of <code class="inlinecode">methodOne</code> and <code class="inlinecode">methodTwo</code> will be replaced with one another, just before the first method of the <code class="inlinecode">TestSwizzling</code> object is called.</p>
<h3 id="closing-remarks">Closing remarks</h3>
<p>As you have seen, it’s still possible to perform method swizzling in Swift, but in my opinion, most of the time it should not end up in actual production code. What a quick fix using swizzling can solve can often be better solved (for various definitions of better) by refactoring your code and rethinking its architecture.</p>
Swift 3: Map and FlatMap Demystified2015-10-08T00:00:00+02:00https://www.uraimo.com/2015/10/08/Swift2-map-flatmap-demystifiedSwift is a language still slightly in flux, with new functionalities and alterations of behavior being introduced in every release. Much has already been written about the functional aspects of Swift and how to approach problems following a more functional approach.<br/>This short article will try to give a clear and complete explanation of how <i>map</i> and especially <i>flatMap</i> work for different types in Swift 2.0 and 3.0, with references to the current library headers.<p><strong>Update 12/16:</strong><em>This post has been verified with Swift 3, minimal changes were required.</em></p>
<h4 id="get-this-and-other-playgrounds-from-github-or-zipped"><em>Get this and other playgrounds from <a href="https://github.com/uraimo/Swift-Playgrounds">GitHub</a> or <a href="https://www.uraimo.com/archives/2015-10-MapFlatMapPlayground.playground.zip">zipped</a>.</em></h4>
<p>Swift is a language still slightly in flux, with new functionalities and alterations of behavior being introduced in every release. Much has already been written about the functional aspects of Swift and how to approach problems following a more “pure” functional approach.</p>
<p><img src="/imgs/bind.png" srcset="/imgs/bind@2x.png 2x" alt="Bind" />
<span class="imgcaption">Mysterious but correct depiction of the monadic bind</span></p>
<p>Considering that the language is still in its infancy, when trying to understand some specific topics you’ll often end up reading a lot of articles referring to old releases of the language or, worse, descriptions that mix up different releases. Sometimes, searching for articles on <code class="inlinecode">flatMap</code>, you could even fortuitously find more than one <a href="http://khanlou.com/2015/09/what-the-heck-is-a-monad/">really good article</a> explaining <a href="http://codeplease.io/2015/08/05/monads/">Monads</a> in the context of Swift.</p>
<p>Add to the lack of comprehensive and recent material the fact that many of these concepts, even with examples or daring metaphors, are not obvious, especially for someone used to the imperative way of thinking.</p>
<p>With this short article (part of a series on <a href="http://uraimo.com/category/functional/">Swift and the functional approach</a>) I’ll try to give a clear and thorough explanation of how <code class="inlinecode">map</code> and especially <code class="inlinecode">flatMap</code> work for different types, with references to the current library headers.</p>
<h3 id="contents">Contents</h3>
<ul>
<li><a href="#map">Map</a>
<ul>
<li><a href="#map-on-optionals">Map on Optionals</a></li>
<li><a href="#map-on-sequences">Map on Sequences</a></li>
</ul>
</li>
<li><a href="#flatmap">FlatMap</a>
<ul>
<li><a href="#flatmap-on-optionals">FlatMap on Optionals</a></li>
<li><a href="#flatmap-on-sequences">FlatMap on Sequences</a></li>
</ul>
</li>
</ul>
<h2 id="map">Map</h2>
<p>Map has the more obvious behavior of the two *map functions, it simply performs a closure on the input and, like <code class="inlinecode">flatMap</code>, it can be applied to <a href="http://swiftdoc.org/v3.0/type/Optional/">Optionals</a> and <a href="http://swiftdoc.org/v3.0/protocol/Sequence/">Sequences</a> (i.e. arrays, dictionaries, etc..).</p>
<h3 id="map-on-optionals">Map on Optionals</h3>
<p>For Optionals, the map function has the following prototype:</p>
<pre><code class="Swift">
public enum Optional<Wrapped> : ... {
    ...
    /*
     - Parameter transform: A closure that takes the unwrapped value
       of the instance.
     - Returns: The result of the given closure. If this instance is `nil`,
       returns `nil`.
    */
    public func map<U>(_ transform: (Wrapped) throws -> U) rethrows -> U?
    ...
}
</code></pre>
<p>The map function expects a closure with signature <code class="inlinecode">(Wrapped) -> U</code>: if the optional has a value, it applies the closure to the unwrapped content and then wraps the result in an optional to return it (an additional declaration is present for implicitly unwrapped optionals, but this does not introduce any difference in behavior; just be aware of it when map doesn’t actually return an optional).</p>
<p>Note that the output type can be different from the type of the input, which is likely the most useful feature.</p>
<p>Straightforward, this does not need additional explanations, let’s see some real code from the <a href="https://github.com/uraimo/Swift-Playgrounds">playground</a> for this post:</p>
<pre><code class="Swift">
var o1: Int? = nil
var o1m = o1.map({$0 * 2})
o1m /* Int? with content nil */

o1 = 1
o1m = o1.map({$0 * 2})
o1m /* Int? with content 2 */

var os1m = o1.map({ (value) -> String in
    String(value * 2)
})
os1m /* String? with content "2" */

os1m = o1.map({ (value) -> String in
    String(value * 2)
}).map({"number " + $0})
os1m /* String? with content "number 2" */
</code></pre>
<p>Using map on optionals can save us an if each time we need to modify the original optional (map applies the closure to the content of the optional only if the optional has a value, otherwise it just returns nil). But the most interesting feature we get for free is the ability to concatenate multiple map operations that will be executed sequentially, thanks to the fact that a call to <code class="inlinecode">map</code> always returns an optional. Interesting, but quite similar to, and more verbose than, what we could get with optional chaining.</p>
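<p>The saved if is easy to see side by side; a minimal sketch:</p>
<pre><code class="Swift">
let n: Int? = 3

// With map, the nil check is implicit:
let doubled = n.map { $0 * 2 } // Optional(6)

// The equivalent manual version:
var doubledManually: Int? = nil
if let value = n {
    doubledManually = value * 2
}
</code></pre>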
<h3 id="map-on-sequences">Map on Sequences</h3>
<p>But it’s with <code class="inlinecode">Sequences</code> like arrays and dictionaries that the convenience of using map-like functions is hard to miss:</p>
<pre><code class="Swift">
var a1 = [1,2,3,4,5,6]
var a1m = a1.map({$0 * 2})
a1m /* [Int] with content [2, 4, 6, 8, 10, 12] */

let ao1: [Int?] = [1,2,3,4,5,6]
var ao1m = ao1.map({$0! * 2})
ao1m /* [Int] with content [2, 4, 6, 8, 10, 12] */

var a1ms = a1.map({ (value) -> String in
    String(value * 2)
}).map { (stringValue) -> Int? in
    Int(stringValue)
}
a1ms /* [Int?] with content [.Some(2),.Some(4),.Some(6),.Some(8),.Some(10),.Some(12)] */
</code></pre>
<p>This time we are calling the .map function defined on <code class="inlinecode">Sequence</code> as follows:</p>
<pre><code class="Swift">
/*
- Parameter transform: A mapping closure. `transform` accepts an
element of this sequence as its parameter and returns a transformed
value of the same or of a different type.
- Returns: An array containing the transformed elements of this
sequence.
*/
func map<T>(_ transform: (Element) throws -> T) rethrows -> [T]
</code></pre>
<p>The transform closure of type <code class="inlinecode">(Element) -> T</code> is applied to every member of the collection, and all the results are then packed in an array whose element type is the output type of the closure and returned. As we did in the optionals example, sequential operations can be pipelined by invoking <code class="inlinecode">map</code> on the result of a previous <code class="inlinecode">map</code> operation.</p>
<p>This basically sums up what you can do with <code class="inlinecode">map</code>, but before moving to <code class="inlinecode">flatMap</code>, let’s see three additional examples:</p>
<pre><code class="Swift">
var s1: String? = "1"
var i1 = s1.map {
    Int($0)
}
i1 /* Int?? with content 1 */

var ar1 = ["1","2","3","a"]
var ar1m = ar1.map {
    Int($0)
}
ar1m /* [Int?] with content [.Some(1),.Some(2),.Some(3),nil] */

ar1m = ar1.map {
    Int($0)
}
.filter({$0 != nil})
.map {$0! * 2}
ar1m /* [Int?] with content [.Some(2),.Some(4),.Some(6)] */
</code></pre>
<p>Not every String can be converted to an Int, so our integer conversion closure will always return an Int?.
What happens in the first example with that Int?? is that we end up with an optional of an optional, due to the additional wrapping performed by map. To actually get the contained value we will need to unwrap the optional two times; not a big problem, but this starts to get a little inconvenient if we need to chain an additional operation to that map. As we’ll see, <code class="inlinecode">flatMap</code> will help with this.</p>
<p>In the example with the array, if a String cannot be converted, as happens for the 4th element of <code class="inlinecode">ar1</code>, then that element in the resulting array will be nil. But again, what if we want to concatenate an additional map operation after this first map and apply the transformation just to the valid (not nil) elements of our array, to obtain a shorter array with only numbers?</p>
<p>Well, we’ll just need an intermediate filtering step to sort out the valid elements and prepare the stream of data for the successive map operations. Wouldn’t it be more convenient if this behavior was embedded in <code class="inlinecode">map</code>?
We’ll see that this is another use case for <code class="inlinecode">flatMap</code>.</p>
<h2 id="flatmap">FlatMap</h2>
<p>The differences between <code class="inlinecode">map</code> and <code class="inlinecode">flatMap</code> could appear to be minor but they are definitely not.</p>
<p>While <code class="inlinecode">flatMap</code> is still a map-like operation, it applies an additional step called <code class="inlinecode">flatten</code> right after the mapping phase.
Let’s analyze <code class="inlinecode">flatMap</code>’s behavior with some code like we did in the previous section.</p>
<h3 id="flatmap-on-optionals">FlatMap on Optionals</h3>
<p>The definition of the function is a bit different, but the functionality is similar, as the reworded comment implies:</p>
<pre><code class="Swift">
public enum Optional<Wrapped> : ... {
    ...
    /*
     - Parameter transform: A closure that takes the unwrapped value
       of the instance.
     - Returns: The result of the given closure. If this instance is `nil`,
       returns `nil`.
    */
    public func flatMap<U>(_ transform: (Wrapped) throws -> U?) rethrows -> U?
    ...
}
</code></pre>
<p>There is a substantial difference regarding the closure: <code class="inlinecode">flatMap</code> expects a <code class="inlinecode">(Wrapped) -> U?</code> this time.</p>
<p>With optionals, flatMap applies the closure (which itself returns an optional) to the content of the input optional, and the result, after being “flattened”, is returned as a single optional.</p>
<p>Essentially, compared to what <code class="inlinecode">map</code> did, <code class="inlinecode">flatMap</code> also unwraps one layer of optionals.</p>
<pre><code class="Swift">
var fo1: Int? = nil
var fo1m = fo1.flatMap({$0 * 2})
fo1m /* Int? with content nil */

fo1 = 1
fo1m = fo1.flatMap({$0 * 2})
fo1m /* Int? with content 2 */

var fos1m = fo1.flatMap({ (value) -> String? in
    String(value * 2)
})
fos1m /* String? with content "2" */

var fs1: String? = "1"
var fi1 = fs1.flatMap {
    Int($0)
}
fi1 /* Int? with content 1 */

var fi2 = fs1.flatMap {
    Int($0)
}.map {$0 * 2}
fi2 /* Int? with content 2 */
</code></pre>
<p>The last snippet contains an example of chaining; no additional unwrapping is needed when using <code class="inlinecode">flatMap</code>.</p>
<p>As we’ll see again when we describe the behavior with Sequences, this is the result of applying the flattening step.</p>
<p>The <code class="inlinecode">flatten</code> operation has the sole function of “unboxing” nested containers. A container can be an array, an optional or any other type capable of containing a value of a container type. Think of an optional containing another optional, as we’ve just seen, or an array containing other arrays, as we’ll see in the next section.</p>
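<p>The unboxing is easiest to see in the resulting types; a minimal sketch:</p>
<pre><code class="Swift">
let s: String? = "5"

// map wraps the closure's Int? result in another optional: Int??
let mapped = s.map { Int($0) }

// flatMap flattens the nesting back to a single optional: Int?
let flatMapped = s.flatMap { Int($0) }
</code></pre>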
<p>This behavior adheres to what happens with the <code class="inlinecode">bind</code> operation on Monads, to learn more about them, read <a href="http://khanlou.com/2015/09/what-the-heck-is-a-monad/">here</a> and <a href="http://robnapier.net/flatmap">here</a>.</p>
<h3 id="flatmap-on-sequences">FlatMap on Sequences</h3>
<p>Sequence provides the following implementations of <code class="inlinecode">flatMap</code>:</p>
<pre><code class="Swift">
/// - Parameter transform: A closure that accepts an element of this
/// sequence as its argument and returns a sequence or collection.
/// - Returns: The resulting flattened array.
///
public func flatMap<SegmentOfResult : Sequence>(_ transform: (Element) throws -> SegmentOfResult) rethrows -> [SegmentOfResult.Iterator.Element]
/// - Parameter transform: A closure that accepts an element of this
/// sequence as its argument and returns an optional value.
/// - Returns: An array of the non-`nil` results of calling `transform`
/// with each element of the sequence.
///
public func flatMap<ElementOfResult>(_ transform: (Element) throws -> ElementOfResult?) rethrows -> [ElementOfResult]
</code></pre>
<p><code class="inlinecode">flatMap</code> applies one of those transform closures to each element of the sequence and then packs the results in a new, flattened array.</p>
<p>These two comment blocks describe the two functionalities of <code class="inlinecode">flatMap</code>: sequence flattening and nil optionals filtering.</p>
<p>Let’s see what this means:</p>
<pre><code class="Swift">
var fa1 = [1,2,3,4,5,6]
var fa1m = fa1.flatMap({$0 * 2})
fa1m /*[Int] with content [2, 4, 6, 8, 10, 12] */
var fao1:[Int?] = [1,2,3,4,nil,6]
var fao1m = fao1.flatMap({$0})
fao1m /*[Int] with content [1, 2, 3, 4, 6] */
var fa2 = [[1,2],[3],[4,5,6]]
var fa2m = fa2.flatMap({$0})
fa2m /*[Int] with content [1, 2, 3, 4, 5, 6] */
</code></pre>
<p>While the result of the first example doesn’t differ from what we obtained using <code class="inlinecode">map</code>, it’s clear that the next two snippets show something with practical uses, saving us the need for convoluted manual flattening or filtering.</p>
<p>In the real world, there will be many instances where using <code class="inlinecode">flatMap</code> will make your code way more readable and less error-prone.</p>
<p>And an example of all this is the last snippet from the previous section, that we can now improve with the use of <code class="inlinecode">flatMap</code>:</p>
<pre><code class="Swift">
var far1 = ["1","2","3","a"]
var far1m = far1.flatMap {
    Int($0)
}
far1m /* [Int] with content [1, 2, 3] */

far1m = far1.flatMap {
    Int($0)
}.map { $0 * 2 }
far1m /* [Int] with content [2, 4, 6] */
</code></pre>
<p>It may look like just a minimal improvement in this context, but with longer chains it becomes something that greatly improves readability.</p>
<p>And let me reiterate: in this context too, the behavior of Swift’s flatMap is aligned with the <code class="inlinecode">bind</code> operation on Monads (“flatMap” is usually used as a synonym of “bind”); you can learn more about this reading <a href="http://khanlou.com/2015/09/what-the-heck-is-a-monad/">here</a> and <a href="http://robnapier.net/flatmap">here</a>.</p>
<p>Learn more about Sequence and IteratorProtocol protocols in the <a href="http://www.uraimo.com/2015/11/12/experimenting-with-swift-2-sequencetype-generatortype/">next article</a> in the series.</p>
<h6>Drawing inspired by <a href="https://github.com/VincentToups/emacs-utils/blob/master/monads.md">emacs-utils</a> documentation.</h6>
Swift 2.1 Function Types Conversion: Covariance and Contravariance2015-09-29T00:00:00+02:00https://www.uraimo.com/2015/09/29/Swift2.1-Function-Types-Conversion-Covariance-ContravarianceWith Swift 2.1 function types will support type conversion, discover covariance and contravariance and what this means for you<p><strong>Update 10/16:</strong><em>This post has been updated for Swift 3, click <a href="https://github.com/uraimo/Swift-Playgrounds/commit/c7436ecb90c50eaa6685be50d953e8c6a84b2097#diff-893181bb364e4fc6f00f690be4bbac97">here</a> to see what changed in the code samples.</em></p>
<h4 id="this-swift-21-post-requires-xcode71-beta-or-later-get-this-and-other-playgrounds-from-github-or-zipped"><em>This Swift 2.1 post requires Xcode7.1 beta or later, get this and other playgrounds from <a href="https://github.com/uraimo/Swift-Playgrounds">GitHub</a> or <a href="https://www.uraimo.com/archives/2015-09-VariancePlayground.playground.zip">zipped</a>.</em></h4>
<p>In Swift 2.1, coming with Xcode 7.1 (see the <a href="http://adcdownload.apple.com/Developer_Tools/Xcode_7.1_beta_2/Xcode_7.1_beta_2_Release_Notes.pdf">change log</a>), function types will support covariance and contravariance; let’s see why this will matter.</p>
<p>In the context of computer science and types, the term variance refers to how the relationship between two types influences the relationship between the complex types that have been derived from them. This relationship between complex types is the result of invariance, covariance, or contravariance applied to the original types. Understanding how this derived relationship is defined is essential to effectively use complex types.</p>
<p>To clarify this, using pseudocode, let’s consider a complex parametric type <code class="inlinecode">List<T></code> and two other simple types: <code class="inlinecode">Car</code> and <code class="inlinecode">Maserati</code>, a subtype of <code class="inlinecode">Car</code>.</p>
<p>Invariance, covariance and contravariance can be explained as follows, considering the relationship that could bind the two types obtained by choosing a specific <code class="inlinecode">T</code> for <code class="inlinecode">List</code>:</p>
<ul>
<li>
<p><strong>Covariance</strong>: If <code class="inlinecode">List<Maserati></code> is also a subtype of <code class="inlinecode">List<Car></code>, then the type relationship between the original types is preserved on <code class="inlinecode">List</code>, because <code class="inlinecode">List</code> is <em>covariant</em> on its original type.</p>
</li>
<li>
<p><strong>Contravariance</strong>: If instead <code class="inlinecode">List<Car></code> is a subtype of <code class="inlinecode">List<Maserati></code>, then the type relationship between the original types is reversed on <code class="inlinecode">List</code>, because <code class="inlinecode">List</code> is <em>contravariant</em> on its original type.</p>
</li>
<li>
<p><strong>Invariance</strong>: <code class="inlinecode">List<Car></code> is not a subtype of <code class="inlinecode">List<Maserati></code>, nor the opposite; the two complex types have no derived relationship.</p>
</li>
</ul>
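<p>Swift’s own <code class="inlinecode">Array</code> is an example of a covariant complex type: reusing the <code class="inlinecode">Car</code>/<code class="inlinecode">Maserati</code> hierarchy from above (a minimal sketch, the types are just placeholders), an array of the subtype can be used where an array of the supertype is expected.</p>
<pre><code class="Swift">
class Car {}
class Maserati: Car {}

let maseratis: [Maserati] = [Maserati(), Maserati()]

// Array is covariant on its element type:
// a [Maserati] can be used where a [Car] is expected
let cars: [Car] = maseratis
</code></pre>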
<p>Each language employs a specific set of variance approaches. Knowing how complex types are related to each other helps in understanding whether two complex types are compatible and could be used interchangeably in some situations, in the same way a type and its subtypes can.</p>
<p>In the context of function types, and this is what changes with Swift 2.1, complex type compatibility boils down to one simple question: when is it safe to use an alternative function of type <em>B</em> where a function of type <em>A</em> was expected?</p>
<p>The general rule is that a compatible function type can accept more generic (parent) parameter types (the replacement function will happily handle the more specialized arguments that clients of <em>A</em> pass in) and can return a more specific subtype as its result (clients of <em>A</em> will simply treat the result as the parent type declared by <em>A</em>): contravariance applies to the parameters and covariance to the result.</p>
<p>Before Swift 2.1, function types behaved in an invariant way, if you try to run the code below in a playground, you will get a few variations of this error: <code class="inlinecode">Cannot convert value of type '(Int)->Int' to expected argument type '(Int) -> Any'</code>.</p>
<pre><code class="Swift">
func testVariance(foo: (Int) -> Any) { foo(1) }

func innerAnyInt(p1: Any) -> Int { return 1 }
func innerAnyAny(p1: Any) -> Any { return 1 }
func innerIntInt(p1: Int) -> Int { return 1 }
func innerIntAny(p1: Int) -> Any { return 1 }

testVariance(foo: innerIntAny)
testVariance(foo: innerAnyInt)
testVariance(foo: innerAnyAny)
testVariance(foo: innerIntInt)
</code></pre>
<p>With Swift 2.1, this changes, function type conversion is supported and function types are now <strong>contravariant regarding parameter types</strong> and <strong>covariant regarding the result type</strong>.</p>
<p>Back to the code sample: it is now legal to pass all three functions, <code class="inlinecode">Any->Any</code>, <code class="inlinecode">Any->Int</code> and <code class="inlinecode">Int->Int</code>, to the <code class="inlinecode">testVariance</code> function with its <code class="inlinecode">Int->Any</code> parameter.</p>
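<p>To see the same rule applied to the <code class="inlinecode">Car</code>/<code class="inlinecode">Maserati</code> hierarchy from the beginning of the post, here is a minimal sketch (the <code class="inlinecode">test</code> and <code class="inlinecode">upgrade</code> names are made up for illustration):</p>
<pre><code class="Swift">
class Car {}
class Maserati: Car {}

// Expects a function from Maserati to Car
func test(handler: (Maserati) -> Car) -> Car {
    return handler(Maserati())
}

// Accepts a more generic parameter (Car) and returns a more specific
// result (Maserati): contravariant in the parameter, covariant in the result
func upgrade(car: Car) -> Maserati {
    return Maserati()
}

// Legal since Swift 2.1 thanks to function type conversion
let result = test(handler: upgrade)
</code></pre>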
Swift 2.1 Released2015-09-20T00:00:00+02:00https://www.uraimo.com/2015/09/20/Swift-2.1-releasedThe release 2.1 of Swift is now available<p>With the release of Xcode 7.1, Swift 2.1 is now available.</p>
<p>It’s not a major release, so don’t expect big changes to the language. Here is a list of what changed:</p>
<ul>
<li>
<p>Expressions interpolated in strings may now contain string literals. For example, <code class="inlinecode">"My name is \(attributes["name"]!)"</code> is now a valid expression. <em>Highly appreciated</em>.</p>
</li>
<li>
<p>Conversions between function types are supported, exhibiting covariance in function result types and contravariance in function parameter types (See the specific post on this: <a href="http://www.uraimo.com/2015/09/29/Swift2.1-Function-Types-Conversion-Covariance-Contravariance/">Swift 2.1 Function Types Conversion</a>).</p>
</li>
<li>
<p>Enums imported from C now automatically conform to the <code class="inlinecode">Equatable</code> protocol, including a default implementation of the == operator. This conformance allows you to use C enum pattern matching in switch statements with no additional code.</p>
</li>
</ul>
<p>And some less evident changes:</p>
<ul>
<li>
<p>The <code class="inlinecode">NSNumber.unsignedIntegerValue</code> property now has the type <code class="inlinecode">UInt</code> instead of <code class="inlinecode">Int</code>, as do other methods and properties that use the <code class="inlinecode">NSUInteger</code> type in Objective-C and whose names contain “unsigned”. Most other uses of <code class="inlinecode">NSUInteger</code> in system frameworks are imported as <code class="inlinecode">Int</code>, as they were in Xcode 7.</p>
</li>
<li>
<p>Field getters and setters are now created for named unions imported from C. In addition, an initializer with a named parameter for the field is provided.</p>
<p>For example, given the following Objective-C typedef:</p>
<pre><code class="Swift">
typedef union IntOrFloat {
    int intField;
    float floatField;
} IntOrFloat;
</code></pre>
<p>Importing this typedef into Swift generates the following interface:</p>
<pre><code class="Swift">
struct IntOrFloat {
    var intField: Int { get set }
    init(intField: Int)
    var floatField: Float { get set }
    init(floatField: Float)
}
</code></pre>
</li>
<li>
<p>Bitfield members of C structs are now imported into Swift.</p>
</li>
<li>
<p>The type <code class="inlinecode">dispatch_block_t</code> now refers to the type <code class="inlinecode">@convention(block) () -> Void</code>, as it did in Swift 1.2.</p>
<p>This change allows programs using <code class="inlinecode">dispatch_block_create</code> to work as expected, solving an issue that surfaced in Xcode 7.0 with Swift 2.0.</p>
</li>
<li>
<p>Editing a file does not trigger a recompile of files that depend upon it if the edits only modify declarations marked private.</p>
</li>
<li>
<p>Error messages produced when the type checker cannot solve its constraint system continue to improve in many cases.</p>
<p>For example, errors in the body of generic closures (for instance, the argument closure to map) are much more usefully diagnosed.</p>
</li>
</ul>
A Brief iOS9 UIStackView Guide2015-09-08T00:00:00+02:00https://www.uraimo.com/2015/09/08/ios9-uistackview-guide-swiftHow to use the new iOS9 UIStackViews, all you need to know with a practical example project<h4 id="this-swift-20-guide-requires-xcode7-beta-or-later-get-the-complete-project-from-github-or-zipped"><em>This Swift 2.0 Guide requires Xcode7 beta or later, get the complete project from <a href="https://github.com/uraimo/uistackview-sample">GitHub</a> or <a href="https://github.com/uraimo/uistackview-sample/archive/master.zip">zipped</a>.</em></h4>
<p>iOS9 introduces UIStackViews, a new component that greatly simplifies building layouts that can be broken down to vertical or horizontal sequences of UIViews, providing an alternative to manually positioning views using auto-layout.</p>
<p>Acting as an invisible container, each UIStackView is able to display a single sequence of subviews (<em>arranged views</em>) aligned either vertically or horizontally, automatically resizing its content according to the current screen size and adapting to changes in orientation. How these subviews are actually positioned depends on a few properties that define how the subviews should be aligned, spaced and, if needed, resized.</p>
<p>What happens under the hood is that the UIStackView class manages auto-layout constraints for you.
Think of UIStackView as an abstraction layer above auto-layout that simplifies the creation of a well defined subset of layouts. You can start building your layout from a main UIStackView and add nested UIStackView until all your UIViews are positioned correctly.</p>
<p>If you have done any Android development, you’ll notice that the UIStackView concept is quite similar to LinearLayout, likely the most used Android layout scheme, which in turn borrowed ideas from and improved upon the multitude of layouts that were already available in Java Swing.</p>
<h2 id="the-basics">The Basics</h2>
<p>As usual, UIStackViews can be created both programmatically and in Interface Builder.</p>
<p>In Interface Builder you can add a new vertically or horizontally aligned UIStackView choosing the right control from the Object Library and once the view is in place, new views can be added dragging controls inside the UIStackView.</p>
<p>A new UIStackView can also be wrapped around one or more existing views: just select them and click the new <em>Stack icon</em> <img src="/imgs/uistackview00.png" style="display:inline;margin:0;" /> you’ll find in the bottom bar of Interface Builder.</p>
<p>Quite simple, but in this guide we’ll create a basic nested layout programmatically.</p>
<p><img src="/imgs/uistackview02.gif" alt="UIStackView example" /></p>
<p>In this brief example, a vertical UIStackView, placed right below the status bar, will contain four controls: two UILabels, one horizontal UIStackView and one UIButton. Three buttons with default icons will be placed inside an inner horizontal UIStackView.</p>
<p>Let’s start, create a new <em>Single View Application</em>, verifying that the selected <em>Deployment Target</em> is 9.0+.</p>
<p>Open your only <code class="inlinecode">ViewController</code> and replace the <code class="inlinecode">viewDidLoad</code> method with this one:</p>
<pre><code class="Swift">
var stackView = UIStackView()
var nestedStackView = UIStackView()

override func viewDidLoad() {
    super.viewDidLoad()

    stackView.translatesAutoresizingMaskIntoConstraints = false
    self.view.addSubview(stackView)

    // Main UIStackView constraints, nearly fills its parent view
    self.view.addConstraints(NSLayoutConstraint.constraintsWithVisualFormat("V:|-30-[stackView]-30-|",
        options: NSLayoutFormatOptions.AlignAllLeading, metrics: nil, views: ["stackView":stackView]))
    self.view.addConstraints(NSLayoutConstraint.constraintsWithVisualFormat("H:|-10-[stackView]-10-|",
        options: NSLayoutFormatOptions.AlignAllLeading, metrics: nil, views: ["stackView":stackView]))

    stackView.axis = .Vertical
    stackView.alignment = .Fill
    stackView.spacing = 25
    stackView.distribution = .FillEqually

    var lbl = UILabel()
    lbl.text = "Label 1"
    lbl.backgroundColor = UIColor.redColor()
    stackView.addArrangedSubview(lbl)

    lbl = UILabel()
    lbl.text = "Label 2"
    lbl.backgroundColor = UIColor.greenColor()
    stackView.addArrangedSubview(lbl)

    nestedStackView.axis = .Horizontal
    nestedStackView.alignment = .Fill
    nestedStackView.spacing = 25
    nestedStackView.distribution = .FillEqually
    nestedStackView.addArrangedSubview(UIButton(type: .InfoDark))
    nestedStackView.addArrangedSubview(UIButton(type: .InfoLight))
    nestedStackView.addArrangedSubview(UIButton(type: .ContactAdd))
    stackView.addArrangedSubview(nestedStackView)

    let btn = UIButton(type: .System)
    btn.setTitle("Press Me", forState: .Normal)
    stackView.addArrangedSubview(btn)
}
</code></pre>
<p>To specify a vertical orientation for the main UIStackView we set the <code class="inlinecode">axis</code> property to <code class="inlinecode">.Vertical</code>; since the distribution is <code class="inlinecode">.FillEqually</code>, the four arranged controls will share the available vertical space equally. The three buttons with default styles contained in the inner <code class="inlinecode">nestedStackView</code> will be arranged in a similar fashion along the horizontal axis. The alignment, distribution and spacing properties will be explained in the next section, ignore them for now.</p>
<p>Sooner or later you will need to hide and show some of the arranged views and this is quite straightforward for UIStackViews, just set the <code class="inlinecode">hidden</code> property of one of your views.</p>
<p>To test this, let’s add an action to the UIButton and a new <code class="inlinecode">pressedMe</code> method as follows:</p>
<pre><code class="Swift">
...
    btn.setTitle("Press Me", forState: .Normal)
    btn.addTarget(self, action: "pressedMe:", forControlEvents: UIControlEvents.TouchUpInside)
    stackView.addArrangedSubview(btn)
}

func pressedMe(sender: UIButton!) {
    UIView.animateWithDuration(0.5) {
        self.nestedStackView.hidden = !self.nestedStackView.hidden
    }
}
</code></pre>
<p>Clicking the button will now hide or show the inner UIStackView with a short animation, and the main UIStackView will reposition the remaining views according to the properties specified in <code class="inlinecode">viewDidLoad</code>.</p>
<p>If needed, subviews can also be completely removed from the UIStackView and all the contained arranged views will be, again, repositioned according to the current properties.</p>
<pre><code class="Swift">
func pressedMe(sender: UIButton!) {
    stackView.removeArrangedSubview(nestedStackView)
    nestedStackView.removeFromSuperview()
}
</code></pre>
<p>Removal is a two-step process: calling the <code class="inlinecode">removeArrangedSubview</code> method will remove the view from the UIStackView and reposition the remaining subviews, but <strong>will not remove</strong> the view from its superview.
The removed view needs to be completely removed from its superview too, or it will still be shown outside the UIStackView. To do this, simply invoke <code class="inlinecode">removeFromSuperview</code> on the removed view.</p>
<h2 id="uistackview-alignment-distribution-and-spacing">UIStackView: Alignment, Distribution And Spacing</h2>
<p>Let’s take a look at the positioning properties UIStackView exposes:</p>
<p><img src="/imgs/uistackview01.png" alt="UIStackView properties" /></p>
<h3 id="axis">Axis</h3>
<p>Defines along which axis your views will be positioned, has two possible values: <em>Vertical, Horizontal</em>.</p>
<h3 id="alignment">Alignment</h3>
<p>The alignment property specifies the perpendicular (to the selected axis) alignment for your views, the value <em>Fill</em> will also resize all your views to fill the available space, the other values will not modify your views size. Available values are: <em>Fill, Leading, Top, FirstBaseline, Center, Trailing, Bottom, LastBaseline</em>.</p>
<h3 id="distribution">Distribution</h3>
<p>Distribution specifies how the subviews should be resized or distributed to fill <em>all</em> the available space along the axis, the possible values can be divided in two groups: fill and spacing values.</p>
<p><em>Fill</em> values modify the size of the subviews if they don’t fill (or fit in) all the available space. The spacing between the subviews will be the one specified with the <em>spacing</em> property.</p>
<ul>
<li><em>Fill</em>: The subviews will be shrunk or stretched according to their content compression resistance or hugging priority. If you didn’t set any, one of the subviews will be resized to fill the available space.</li>
<li><em>FillEqually</em>: Disregarding any constraint, the subviews will be resized to the same size along the axis.</li>
<li><em>FillProportionally</em>: The subviews will be proportionally resized according to the original size of each subview.</li>
</ul>
<p><em>Spacing</em> values fill the space along the axis altering the spacing between the subviews, the size of the subviews will be modified, according to the compression resistance, only if the subviews still don’t fit or if any auto-layout ambiguity arises.</p>
<ul>
<li><em>EqualSpacing</em>: The subviews will be equally spaced</li>
<li><em>EqualCentering</em>: The subviews center axis will be equally spaced</li>
</ul>
<h3 id="spacing">Spacing</h3>
<p>The spacing property is expressed in points and its meaning depends on the current <em>distribution</em> value.</p>
<p>If the UIStackView <em>distribution</em> property is either EqualSpacing or EqualCentering, the spacing property represents the minimum spacing among subviews. If a FillProportionally <em>distribution</em> is selected instead, the spacing between subviews will be exactly the chosen value.</p>
<h2 id="uistackviews-ios7-backports">UIStackViews: iOS7+ Backports</h2>
<p>UIStackView is supported only from iOS9 onward, but a few backports that partially implement this feature on iOS7 or later have already been built:</p>
<ul>
<li><a href="https://github.com/oarrabi/OAStackView">OAStackView</a></li>
<li><a href="https://github.com/tomvanzummeren/TZStackView">TZStackView</a></li>
</ul>