Programming is not just about writing code. It is also about verifying that the code does what it is supposed to do and, if needed, correcting it. This process of verification is called testing.
You have probably already tested your programs by executing them. When you test your program, you usually enter some input data and look if the result is correct. This is okay for a small program, but it gets harder as the program gets bigger. Bigger programs have more options what they can do based on the possible user input and configuration. Their manual testing becomes time-consuming, especially when it needs to be repeated after every change, and it becomes more likely errors slip unnoticed into our code.
Humans are not very good at performing boring repetitive tasks, that is the domain of computers. And, not surprisingly, that is the reason why developers write the code that verifies their programs.
Automated tests are functions checking, with no manual intervention, that all features of the tested program work correctly. If made right, the tests should be lean and give quick response whether the program has issues or not. The testing does not give us 100% proof that the code is without errors but it is still better than no testing at all.
The automated tests make modification of the code easier as you can faster find possible bugs in the existing functionality (aka regressions).
pytest
libraryUp to now, we have used only the modules that come installed with Python,
for example, modules such as math
or turtle
.
There are many more libraries that are not included in Python
but you can install them to your Python environment and use them.
There is a built-in library for testing called unittest
.
It is not particularly easy to use and we decided to go with the pytest
library instead which is not only easier to use but also popular among Python
developers.
To install the library run the following command in the command line:
> python3 -m pip install pytest
$ python3 -m pip install pytest
What is pip and why do we use it?
pip
is a Python command-line tool for installing 3rd-party
Python libraries from the Python Package Index (PyPI)
and other sources (e.g., Git repositories).
python3 -m pip install pytest
makes Python to install pytest
library from PyPI.
For help on how to use pip run python3 -m pip --help
.
python3 -m <command> or just <command>
python3 -m <command>
tells Python to execute a script from the
Python module named <command>
(e.g., python3 -m pip ...
).
In a properly configured Python environment, it should be possible to call
the <command>
directly, without the help of the python
command
(e.g., pip ...
)
To save ourselves the trouble of unnecessary complications with a possibly
misconfigured Python environment we recommend using the longer
python3 -m <command>
version.
We will show testing with a very simple example.
There is a function add
that adds two numbers.
There is another function test_add
that tests if the
add
function returns correct results for specific numbers.
Make a copy of the code into a file named test_addition.py
in a new empty
directory.
def add(a, b):
return a + b
def test_add():
assert add(1, 2) == 3
The naming of files and test functions matters
pytest
scans your code and
searches for the included tests. When found, these tests are executed.
By default, the names of the test files and the test functions must start with
the test_
prefix in order to be recognized as tests.
What does the test function do?
The assert
statement evaluates the logical expression that follows it.
If the result is not True
then it raises the AssertionError
exception
which is interpreted by pytest
as a failing test.
E.g, the assert a == b
command is equivalent to:
if not (a == b):
raise AssertionError
Assertions
We have already discussed the assert
command in exceptions.
As you see, the testing library borrowed the Python built-in assertion functionality and gave it a slightly different meaning.
Just make sure you are not running your tests with the -O
option enabled 😉
You execute tests with the command python -m pytest -v <path>
followed by the path to the file containing the tests.
You can omit the <filename>
argument and then python -m pytest -v
scans the current directory and runs tests in all files whose names start
with the test_
prefix.
You can also use a path to a directory where pytest
should searches for
the tests.
This command scans the given file and calls all functions that start
with the test_
prefix. It executes them and checks if they raise any exception,
e.g., raised by the assert
statement.
$ python3 -m pytest -v test_addition.py
============================= test session starts ==============================
platform linux -- Python 3.8.3, pytest-7.1.2, pluggy-1.0.0
rootdir: /tmp/test_example
collected 1 item
test_addition.py . [100%]
============================== 1 passed in 0.00s ===============================
If an exception occurs, pytest
shows a red message with
additional details that can help you find the bug and fix it:
============================= test session starts ==============================
platform linux -- Python 3.8.3, pytest-7.1.2, pluggy-1.0.0
rootdir: /tmp/test_example
collected 1 item
test_addition.py F [100%]
=================================== FAILURES ===================================
___________________________________ test_add ___________________________________
def test_add():
> assert add(1, 2) == 3
E assert 4 == 3
E + where 4 = add(1, 2)
test_addition.py:5: AssertionError
=========================== short test summary info ============================
FAILED test_addition.py::test_add - assert 4 == 3
============================== 1 failed in 0.01s ===============================
Try to run the test yourself. Modify the add
function or (its test) so that the
test fails.
It is common to write tests separately (in another file) from the tested regular code. This way, the code is easier to read and it can be distributed without the tests, if desired.
Let's split the test_addition.py
into two separate files, moving the add()
function into a new module addition.py
and keeping the tests in the old
test_addition.py
file. Import the tested function from the new module.
Run the test and see if it is passing.
Let's now try to add two different tests for a function for computing perimeter of a rectangle from custom functions
def find_rectangle_perimeter(width, height):
""" Calculate perimeter of a rectangle from the given sides.
"""
return 2 * (width + height)
Automated tests have to be able to run unattended. They are often executed automatically and the failures are reported via some sort of notification, e.g., by email.
In practical terms, this means that the tests must not depend on live
interaction with the user, e.g., the input
function will not work in tests.
Can we test user interaction in automated tests?
There are testing techniques allowing us to emulate user interaction in the user interfaces. But is that beyond the scope of this course.
This can make your work harder sometimes. Let's look at a more complex project, the 1D (one-dimensional) tic-tac-toe.
If you do not have the 1D tic-tac-toe program, the following sections are only theoretical.
If you study at home, complete the 1D tic-tac-toe lesson before continuing. The task description is at one-dimensional tic-tac-toe)..
The structure of the 1D tic-tac-toe code looks roughly like this:
import random # (and possibly other import statements that are needed)
def move(board, space_number, mark):
"""Return the board with the specified mark placed in the specified position"""
...
def player_move(board):
"""Ask the player what move should be done and return the updated board
with the move played.
"""
...
input('What is your move? ')
...
def tic_tac_toe_1d():
"""Start the game
It creates an empty board and run player_move and computer_move alternately
until the game is finished.
"""
while ...:
...
player_move(...)
...
# Start the game:
tic_tac_toe_1d()
If you import this module, Python executes all commands in it, from top to bottom:
The first command, import
, initializes the variables and functions of the
random
module. It is module from the standard Python library it is unlikely
that it would have any side effect to worry about.
The definitions of functions (def
statements and everything in them)
just define the functions but they do not execute them.
Calling the tic_tac_toe_1d
function starts the game.
The tic_tac_toe_1d
calls the player_move()
function which calls input()
.
This is an issue.
If you import this module to the tests, the input
fails and the module does
not get not imported.
If you want to import such a module from elsewhere, e.g., you would like
to use move()
in a different game, the import of the module itself will
start the 1D tic-tac-toe game!
The calling of tic_tac_toe_1d
is a side-effect and we need to remove it.
Okay, but you cannot start the game without it! What can we do about it?
There are two possible solutions. First, we detect if the module is imported
from another script or it is itself the main script and start the game only
if it runs as the __main__
module:
if __name__ == "__main__":
# Start the game:
tic_tac_toe_1d()
Second, we can create a new python file, e.g., game.py
and we move the
tic_tac_toe_1d()
call in it:
import tic_tac_toe
tic_tac_toe.tic_tac_toe_1d()
Obviously, the game.py
itself, cannot be tested because it calls input
indirectly. But almost empty and you can execute it only if you want to play.
After the fix, we can import the original module into the test module or other scripts.
A test for the original module could look like this:
import tic_tac_toe
def test_move_to_empty_space():
board = tic_tac_toe.computer_move("-" * 20)
assert len(board) == 20
assert board.count("x") == 1
assert board.count("-") == 19
Tests that verify that a program works correctly under correct conditions are called positive tests. An exception raised during the positive testing lead to failure of the test.
Tests that check behaviour in case of invalid inputs are called negative tests. The purpose of the negative testing is verification of the graceful handling of error states. Raising of an exception is often the expected behaviour of the tested code.
For example, the computer_move
function should raise an error
(e.g., ValueError
) when the board is full.
It is much better to raise an exception than doing nothing and silently letting the program get stuck elsewhere. You can use such function in a more complex program and be sure that it will raise an understandable error when it is called under bad conditions. The error helps you to fix the actual problem. The sooner you discover an error the easier is to fix it.
To test if your code raises an exception, use the with
statement and the
pytest.raises()
context manager.
We have not talked about the with
statement and context managers yet.
But don't worry, you will learn about them later. Just check how it is used
to test whether an exception is raised.
import pytest
import tic_tac_toe
def test_move_failure():
with pytest.raises(ValueError):
tic_tac_toe.computer_move("ox" * 10)
Let's now try to edit the function for getting a perimeter of rectangle so that it raises a ValueError if any of the sides is smaller or equal to zero. Add a negative test checking that the exception is raised as expected.