High Performance Programming

by Tyler Swann, in collaboration with Monash DeepNeuron

Welcome

Welcome to Monash DeepNeuron's High Performance Programming (C++ edition), a book aimed at teaching techniques for developing programs that are both fast and safe. Throughout this book you will learn the C++ programming language along with techniques for working with computer memory, building algorithm intuition, parallel computing and more.

How to use this book

This book is designed to be read cover-to-cover. Concepts in later chapters will build upon concepts from previous chapters. On either side of the page there are arrow buttons that will move you between pages and chapters. You can also search for specific content using the search button in the top left or by pressing the S key.

Synopsis

Chapter 1 - Getting Started - Setup & Introduction to C++
Chapter 2 - Basics of C++ - Types, Variables, Operators, IO, Conditionals, Loops and Functions
Chapter 3 - Memory - Pointers, Slices, References, Dynamic Memory and The Standard Library
Chapter 4 - Intermediate C++ - Functional Programming, Namespaces, Enumerations, Unions, Structures
Chapter 5 - Generic Programming - Classes, Templates, Generics and Concepts
Chapter 6 - Algorithms & Data Structures - Iterators, Data Structures, Algorithms, Ranges and Views
Chapter 7 - Parallel Programming - Parallel Algorithms, Atomics, Threads, Mutexes & Locks and Async

Suggestions, Fixes and Contributions

Refer to the source code of this book for details on how to contribute changes, fix typos or create new content for this book.


      version: 1.0.0

Getting Started

Let's begin by setting up your device for developing with C++. In this chapter we will discuss:

Installing WSL (Windows)
Installing Homebrew for system package management
Setting up Git and the basics of source version control
Installing a C++ compiler
Installing bpt, a C++ package and build tool
Installing VSCode, a text editor
Writing a C++ program that prints "Hello World!"
How to compile and execute programs
How to use Compiler Explorer to share code snippets.

WSL

In this section we will install the Windows Subsystem for Linux (WSL), which runs a virtualized Linux kernel on Windows. This makes managing developer tools far easier and separates your development OS (Linux) from your personal OS (Windows).

Note: This section only applies to Windows users.

Update Windows and Virtualization Check

Before we begin, it is best to ensure we have the most recent Windows update available. Go to Settings > Updates and install any updates to your system.

Secondly, you will want to ensure that virtualization is enabled on your device. To do this, open 'Task Manager', click 'More details', open the 'Performance' tab and make sure you are on the CPU section. In the details below the CPU's graph there should be an entry called 'Virtualization' with the value 'Enabled' next to it. If it doesn't say 'Enabled', you will need to turn on hardware virtualization in your computer's BIOS; this feature is called SVM on AMD systems and is usually listed as Intel VT-x or Intel Virtualization Technology on Intel systems. If you are comfortable doing this, go for it, but if you do not want to do this yourself, do not worry. We will ensure everyone is set up correctly in the first meetup. Continue reading, as there will be a way you can start coding at the end of the sections.

Task Manager Example

Windows Terminal

To get started with WSL we will want a new terminal environment for the WSL shell. Fortunately, Microsoft has an awesome project called Windows Terminal (WT). It is able to hold many instances of different shells and is fully customizable. To install it, simply open the Microsoft Store app, search for "Windows Terminal" and click "install".

WSL Install

To install WSL, we need to open a PowerShell terminal with administrative privileges. Click on the Windows Start button (bottom left icon on the taskbar), type "PowerShell" and select "Run as Administrator". This will open a new shell. Now run:

> wsl --install -d Ubuntu-20.04

This may require a reboot. This will install WSL as well as an image of Ubuntu. Click Start again, type "Ubuntu" and run the application. Follow the on screen instructions to create your user and password for WSL. These are different from your Windows credentials. Now open WT and press ctrl + , to open its settings. On the settings page that pops up, the first drop down called "Default Profile" should now have an option called Ubuntu (or something similar). Choose this as your default profile.

WSL is now installed. Create a new shell tab with ctrl + shift + t and the shell prompt should now display your WSL username.

Command Line Notation

In this chapter and throughout the book, we’ll show some commands used in the terminal. Lines that you should enter in a terminal all start with $. You don’t need to type the $ character; it’s the command line prompt shown to indicate the start of each command. Lines that don’t start with $ typically show the output of the previous command. Additionally, PowerShell-specific examples will use > rather than $.

APT & Packages

Before you begin, you will need to update your system's packages. Packages on Ubuntu are managed by a tool called apt. For some context, updating packages typically takes two steps: first you update the package index, then you upgrade the relevant packages.

# `sudo` represents 'super user do'. 
# This runs a command with admin. privileges.
# Update apt's local package index.
$ sudo apt update

# The `-y` flag means upgrade yes to all.
# This bypasses confirming package upgrades.
# Upgrade packages with new versions
$ sudo apt upgrade -y

You will also need to install some packages with apt that we will use for C++ development.

# Installs specified packages (separated by spaces).
$ sudo apt install git curl wget ca-certificates build-essential

WSL should be installed and ready to go.

Installing Software

In this section we will install all the relevant software for developing C++ programs.

WSL & Linux

To get started, open a new terminal session (for WSL, use the WSL terminal) and update your system package manager's local package index. This is a list of all available packages and their versions. We can then install some system critical packages that we need in order to develop C++ programs. From there we can install Homebrew, a cross platform package manager which we will use to install our C++ compiler(s) and debuggers.

# Update apt (replace apt with relevant package manager if you are not on Ubuntu)
$ sudo apt update
$ sudo apt upgrade -y

# Install system packages
$ sudo apt install git curl wget ca-certificates build-essential

# Install Homebrew and update
$ /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
$ brew update
$ brew upgrade

# Install C++ compiler(s), debuggers
$ brew install gcc llvm gdb

MacOS

To begin, open a new terminal session and install Homebrew, a cross platform package manager which we will then use to install our C++ compiler(s), debuggers, cURL and Git.

$ /bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
$ brew update
$ brew upgrade
$ brew install gcc llvm gdb curl git

Verify Installation

You can verify that GCC installed the correct version by running the following command. The output should be similar to this.

$ gcc-12 -v
Reading specs from /home/linuxbrew/.linuxbrew/Cellar/gcc/12.2.0/bin/../lib/gcc/current/gcc/x86_64-pc-linux-gnu/12/specs
COLLECT_GCC=gcc-12

# Other info ...

gcc version 12.2.0 (Homebrew GCC 12.2.0)

Authenticating Git with GitHub

If you have a GitHub (or other) account and you want to link it to your machine, run the following commands, replacing the <> placeholders with your personal details. These commands set the name and email that Git records on your commits.

$ git config --global user.name "<github-username>"
$ git config --global user.email "<github-email>"

Installing bpt

bpt is a build and packaging tool for C++. It makes consuming C++ libraries, running tests and packaging your code much easier compared to conventional methods (notably CMake).

# Linux (WSL included)
$ curl bpt.pizza/get/linux -Lo bpt

# MacOS
$ curl bpt.pizza/get/macos -Lo bpt

# Both
$ chmod a+x bpt
$ ./bpt install-yourself

Installing VSCode

Go to VSCode's Download page and install it for your machine's host OS.

Note: For WSL users, this means install on the Windows side.

On its own, VSCode is just a text editor like Windows Notepad but with coloured text; however, using extensions we can set it up for developing with any language. Open VSCode as you would any other app in Windows, MacOS or Linux. In VSCode, open the extension marketplace tab. In the search bar, search for each of the following extensions, click on it and then click its install button.

Note: For WSL users, only install the extensions marked 'WSL only' on the Windows side. The other extensions must be installed inside WSL. Install them after opening VSCode in WSL (instructions below).

C/C++
GitLens
Git Graph
GitHub Pull Requests and Issues
Sonarlint
Remote development (WSL only)
WSL (WSL only)
Remote SSH (WSL only)

You may have to restart VSCode for the extensions to load. Finally, press ctrl + , to open settings. In the search bar, search for "cpp default standard". In the drop down, select c++20.

To open VSCode from the terminal, open a new terminal window and type:

# `.` represents 'this' directory
$ code .

This will open VSCode in the current directory, which should be ~, your user's home directory. WSL users, make sure to launch VSCode from your WSL terminal this time. And that is it; everything should be set up and ready to go.

Hello World!

If you've never programmed before, a "Hello World" program is the simplest kind of program and is often used to introduce a language. The first Hello World was created in Brian Kernighan's 1972 "A Tutorial Introduction to the Language B".

Introducing C++

Before you write your first C++ program, I will cover a basic synopsis of the language's features.

C++ is a high-level, general purpose compiled programming language. It is strongly-typed with a static-type system and supports multi-paradigm programming.

Most of you will have had exposure to interpreted languages (Python, Ruby, Bash and, through its bytecode virtual machine, Java). These rely on a secondary program, called the interpreter, that runs alongside your code, converting the "higher level" instructions into machine code (binary) as it reads through the code.

C++ works differently: it is a compiled language. This simply means all of the C++ code is converted into machine instructions (by a compiler) before you execute the program. This has the benefit of allowing software to run on "bare metal", meaning the code you write is actually running on the machine (to some degree).

Because of C++'s ability to run on bare metal, many people claim it is a "low-level" language; however, this could not be more false. Almost all programming languages are mid-to-high level, because most support general abstraction techniques that take you away from dealing with the machine directly. Only assembly and bytecode-style languages, such as x86_64 assembly or LLVM IR, could be considered "low-level", as these give direct control over memory and CPU instructions.

But C++ can still interact more directly with the hardware, so how can that be if it isn't a low-level language? Two things give C++ its power over hardware. The first is its memory model. Many languages have little or no notion of memory; data is data and it is as big or small as it is. How big is an int in Python? For many this doesn't cross their minds when writing Python because it doesn't need to, and that is one of the many benefits of Python. However, there are limits to the resources you can use in some circumstances, and sometimes you need to be able to guarantee certain memory usage from your software. C++ is one of the languages that has a "conscious" notion of memory usage and gives you control over these resources.

There is one problem with this: not all computer architectures are the same, and they don't all share the same notion of memory. To tackle this, C++ uses the notion of a universal abstract machine. This is C++'s second power over hardware. It has mechanisms for interacting with the underlying hardware through the OS, but how it gets there is not the concern of the developer (unless developing in kernel-space as opposed to user-space). You can use standardized features to access these resources effectively.

I mentioned above that C++ is strongly typed with a static type system. What does this mean? Strong typing basically means that values will not be implicitly cast (converted) to a different type (there are some exceptions to this, but we'll cover them in chapter 2). A static type system means that all data must have a fixed type that is known at compile time.

To wrap up, I'll briefly discuss the paradigms and styles you can write C++ in. The most obvious is procedural, similar to C. This paradigm simply uses free functions that operate on free data, performing instructions according to a procedure. Paired with procedural programming, C++ also allows for an imperative style of programming, which consists of functions changing the system's state. This style centers mostly on telling the computer exactly what you want done. C++ also supports object-oriented programming (OOP), with its primary IO library using many OOP patterns to create runtime polymorphism. The most popular paradigm used in C++ today is generic programming. C++ has many features that allow you to write code for generic types as opposed to creating new functions for every possible combination of types. Finally, C++ also supports functional programming patterns that allow for creating general purpose algorithms that are composed to create more specific data manipulations.

Hopefully this gives you an idea of the kind of language C++ is.

Hello C++

To begin we are going to open a new terminal window. We are going to create a directory called "hello", enter it and create some files and open VSCode there.

# Makes new directory
$ mkdir hello

# Enter `hello`
$ cd hello

# Create files `hello.cxx` and `README.md`
$ touch hello.cxx README.md

# Open VSCode
$ code .

Open the hello.cxx file by clicking it on the left file view.

Here is "Hello World" in C++.

// This is a comment, these are ignored by the compiler

/// Preprocessor statement using `#` symbol
/// The preprocessor runs at compile time before the code is compiled
/// `#include` copies the header `iostream` into the current file
#include <iostream>

/// Main function
/// Entry point of the executable.
/// Takes no arguments and returns an `int`.
auto main () -> int
{
    /// From the namespace `std`.
    /// Use `cout` (character out).
    /// Put (<<) the string literal to stream.
    /// From `std`, put the `endl` manipulator (a newline, then a flush).
    std::cout << "Hello World!" << std::endl;

    /// Return 0 on successful termination.
    return 0;
}

Size Qualifiers / Primitive Type	`short`	`unsigned short`	`signed`	`unsigned`	`long`	`unsigned long`	`long long`	`unsigned long long`
`bool`	❌	❌	❌	❌	❌	❌	❌	❌
`char`	❌	❌	✅	✅	❌	❌	❌	❌
`wchar_t`	❌	❌	❌	❌	❌	❌	❌	❌
`int`	✅	✅	✅	✅	✅	✅	✅	✅
`float`	❌	❌	❌	❌	❌	❌	❌	❌
`double`	❌	❌	❌	❌	✅	❌	❌	❌

Category	Equivalent values are..	Incomparable values are..
`std::strong_ordering`	indistinguishable	not allowed
`std::weak_ordering`	distinguishable	not allowed
`std::partial_ordering`	distinguishable	allowed

Address	Value
0x00007fff59ae6e9d	...
0x00007fff59ae6e99	0x00000004
0x00007fff59ae6e94	0x000091f5
0x00007fff59ae6e90	...

Address	Value
0x00007fff59ae6e9d	...
0x00007fff59ae6e99	0x00000004
0x00007fff59ae6e94	0x000091f5
0x00007fff59ae6e90	0x00007fff59ae6e99
0x00007fff59ae6e88	...