Skip to content

002 Some basic UNIX

Introduction

Purpose

Although this course is about coding in Python, it can be of great value to you to learn at least some basic concepts and commands of the operating system. To that end, in this session we will learn some basic unix commands. You will be able to use these in almost any modern computing operating system may use: the unix shell is a core part of linux and macOS and is directly available to you even in Windows 10/11. If you use these notes through the JupyterLab interface, even from a mobile device, you will have access to a unix shell to run commands.

There are many online tutorials on unix. A good place to start backup material and some more features for the material we will cover today is software-carpentry.org.

Prerequisites

You will need some understanding of the following:

Remember that you can 'run' the code in a code block using the 'run' widget (above) or hitting the keys ('typing') and at the same time.

Running unix commands

Unix in code cells

The code cells in this notebook take Python commands by default, but we can run unix commands either by pre-pending a single command with !:

!pwd
/Users/philiplewis/Documents/GitHub/geog0111/notebooks

or by using the cell magic %%bash:

%%bash

# comment is after #
pwd
/Users/philiplewis/Documents/GitHub/geog0111/notebooks

If you are using these notes through the JupyterLab interface you have access to a terminal to run unix commands.

~, ., ..

You will be used to the idea of navigating the filesystem from any previous computing you have ever done. You may have done this by clicking your way to a certain 'location' using File explorer (in Windows) or Finder (in MacOS), but you will have some familiarity with the tree-like nature of a filesystem: you go up or down in the system to find your way to the files and directories you want.

When we do this typing command in the unix shell, the concepts are exactly the same, but we have some new symbols to learn to help us navigate:

- ~ (tilde/twiddle)
- . (dot)
- .. (dot-dot)

The tilde symbol ~ is a shorthand to refer to your home directory (this would generally be C:\Users\username on windows, /Users/username on MacOS or /home/username in linux, where username is your username). On windows, the file separator is \ (backslash), but on posix (unix-like) systems it is '/', forward slash. This can cause some issues when changing operating system. You should try to use the posix '/' whenever you can as this is more portable.

The symbol . means the current directory, and .. refers to one level up in the directory tree.

cd, pwd

The command cd filepath is used in the shell to change from one directory to another. Typically, when you start a shell, you will be in your home directory. We can explicitly 'go to' (i.e. cd to) your home with cd ~. We use the command pwd to print the current working directory.

So the following sequence:

%%bash

cd ~
pwd
/Users/philiplewis

changes directory to our home, and prints the directory name.

In the following command, we use cd .. to go up one directory. Notice that each time we start a new shell here, the location is re-set to the location of these notebooks (e.g. geog0111/notebooks or geog0111/notebooks_lab depending on where you run the notebook from).

%%bash

pwd
cd ..
pwd
/Users/philiplewis/Documents/GitHub/geog0111/notebooks
/Users/philiplewis/Documents/GitHub/geog0111

Exercise 1

  • Use cd and .. to move one directory at a time to your home directory. Check where you are at each stage using pwd.
  • Use cd and ~ to go straight to your home directory. Then go from there back to the notebook directory. Check where you are at each stage using pwd.

ls ls -lh *

The command ls lists the files specified. For example:

%%bash
cd ~/geog0111/notebooks
ls R*
README.md

Here, R* uses the wildcard *, so R* means R followed by zero or more characters.

If we specify the option -lh then it provides a long listing (-l) with file size in 'human-readable' format (-h):

%%bash
cd ~/geog0111/notebooks
ls -lh README.md
-rw-r--r--  1 philiplewis  staff   7.0K 12 Jul 12:18 README.md

Here, the file size if 3.3K (3.3 KBytes), and the file is owned by the user plewis. The field -rw-r--r-- provides information on file permissions. Ignoring the first -, it is in 3 sets of 3 bits:

rw-  r--  r--

which refer to permissions for the user, group, and everyone, respectively. The permission fields are rwx, meaning permissions of read, write, and execute, respectively. Execute here means that we can run the file as a script. In the example above, the no execute permission is set (it is not a script file), the user has read and write permission, and group and everyone have only read permissions. So, only the user can write to this file, but everyone can read it.

These fields rwx can be viewed as 3 bits which we can interpret as a base-8 (octal) number (i.e. between 0 and 7) where:

--- -> 0
--x -> 1
-w- -> 2
-wx -> 3
r-- -> 4
r-x -> 5
rw- -> 6
rwx -> 7

Following that, we interpret the field rw-r--r-- from above as 644. The most common file permissions you will likely see or need are:

644 -> rw-r--r--
755 -> rwxr-xr-x

Exercise 2

  • examine the file permissions for files ~/geog0111/notebooks/bin/* (in the directory ~/geog0111/notebooks/bin)
  • what do you notice about these?
  • why do you think this is so?

chmod

We can change file permissions with the command chmod. For example:

%%bash
cd ~/geog0111/notebooks

ls -lh ../README.md
chmod 755 ../README.md

ls -lh ../README.md
chmod 644 ../README.md
-rw-r--r--  1 philiplewis  staff   7.1K 12 Jul 12:18 ../README.md
-rwxr-xr-x  1 philiplewis  staff   7.1K 12 Jul 12:18 ../README.md

First the permissions of the file are 644 as we saw above, then we use chmod 755 to change to 755, then back again to 644. Most commonly, we will use this later ion to apply execute permission to a file:

chmod 755 filename

absolute and relative pathnames

A posix directory name that starts with the file separator '/' is called an absolute pathname: it is addressed from the root of the file system (/). An example of an absolute filename is /home/jovyan/geog0111/README.md. If the filename starts with ~, it is in effect an absolute pathname. For example:

%%bash

ls -l ~/geog0111/notebooks/README.md
-rw-r--r--  1 philiplewis  staff  7201 12 Jul 12:18 /Users/philiplewis/geog0111/notebooks/README.md

A relative pathname is one that does not start with / or ~. It is specified relative to where we are in the filesystem in the current shell. For example:

%%bash

cd ~/geog0111/notebooks

ls -l ../README.md
-rw-r--r--  1 philiplewis  staff  7279 12 Jul 12:18 ../README.md

Recall that we use .. to specify 'up one level'. Then:

%%bash

# cd to absolute path ~/geog0111/notebooks/images
cd ~/geog0111/notebooks/images
pwd

# now relatuve cd up one and down to bin
cd ../bin
pwd

# now relative cd up one level
cd ..
pwd
/Users/philiplewis/geog0111/notebooks/images
/Users/philiplewis/geog0111/notebooks/bin
/Users/philiplewis/geog0111/notebooks

Create and delete a file, cat, rm

cat and <<

We can use the command cat to create a text files from a shell, but this is a bit more involved in a notebook cell than in normal use in an interactive unix shell, and involves the use of the redirection marker <<,. known as the here-document structure.

cat << EOF
line 1
line 2
EOF

sends the text input between the markers (EOF here):

line 1
line 2

into the command cat (which just sends it to the screen). So that's one (slightly awkward) way to create a file ...

For example:

%%bash

# code between the next line and the 
# End Of File (EOF) marker will be saved in 
# to the file work/newfile.dat
# the symbols << and > involve 
# redirection
#
# << is called the here-document structure
# and redirects input (between markers EOF here) to the cmd
# > redirects output, to the file work/newererfile.dat 

cd ~/geog0111/notebooks

cat << EOF > work/newererfile.dat

# this will go into the file
hello world - this is some text in a file

EOF

# ls -l to see what we have: 73 Bytes here
ls -lh work/n*
-rw-r--r--  1 philiplewis  staff    73B 30 Sep 13:36 work/newererfile.dat

cat and <

We can also use cat to see what is in a file. We use the stdin redirection < to send text from a text file (work/newererfile.dat below) into the command cat:

%%bash
cd ~/geog0111/notebooks

cat < work/newererfile.dat
# this will go into the file
hello world - this is some text in a file

rm

We can use the command rm to delete a file:

%%bash
cd ~/geog0111/notebooks

rm work/newererfile.dat

Creating files

In notebooks, you will find it much easier to create files using the the menu item File -> New -> Text File to open a new text file (similarly for other types of file).

Exercise 3

  • Create a file work/newfile.dat using cat and check the new file size.
  • Use the menu item File -> Open to edit the file you have created and print the new file size
  • Use cat to show the new file content
  • delete the file

Exercise 4

Using the unix commands and ideas from above:

  • show a listing of the files in the relative directory geog0111 that start with the letter f
  • interpret the file permissions and sizes of the files in there

Summary

In this section, we have learned the following unix commands and symbols:

cmd meaning example use
~ twiddle / tilde - home cd ~/geog0111
. dot - current directory cd .
.. dot dot - one level up directory cd ..
* wildcard ls R*
< stdin redirection cat < afile.dat
<< here-document structure cat << EOF
text EOF > bfile.dat
> stdout redirection cat < afile.dat > bfile.dat
cd change directory cd ~/geog0111
pwd print working directory pwd
ls list ls README.md
ls -l long list ls -l README.md
ls -lh human-readable long list ls -lh README.md
chmod change mode (permissions) chmod 644 README.md
rm remove rm work/n*
755 rwxr-xr-x chmod 755 bin/*
644 rw-r--r-- chmod 644 README.md

We have seen that we can use the cell magic %%bash or ! to use unix commands in Python code cells in a notebook. This is a very basic introduction to unix, but it will allow you to make better use of the operating system and these notebooks.

You should know how to do some basic operations in a unix shell

You should recognise the special file characters (., .., ~, *).

You should know how to navigate the unix file structure using cd, pwd and these and /, and access information about files using ls.

You should have come across redirection (<<, < and > here).

You should know how to create files, remove files, and understand and change the file permissions.

If you are unsure of any of these, then try going over the material again, explore other resources you may find, and/or come to the Thursday help sessions and ask for help.


Last update: December 6, 2021