
Testing

Under Construction

What Is It

Most forms of testing are about isolating a particular characteristic of your system, and exploring it from a risk perspective. It could be:

  • Performance Testing: addresses the risk of not being able to support all of your users.
  • Usability Testing: tries to see whether people struggle to make sense of your software, usually because the assumptions of their Internal Models differ from those embedded in the system, or because the system isn't adequately transparent about its own model.
  • Security Testing: addresses the risk that your software could be used against you or its users by hackers.
  • Integration Testing: tests how the software works as a whole, and that it will work with other systems.
  • Corridor Testing: asks a few random people to use the system-under-test, to see whether or not it confuses them.
  • User Acceptance Testing: asks users to review new features and make sure they actually do what is required.
  • Regression Testing: makes sure that changes in new versions of the system haven't broken existing functionality.

How It Works

(Diagram: /img/generated/development_process_test.png)

The whole purpose of testing is to meet reality early, ahead of putting software in front of real users, where you face Operational Risks, like reputation damage and financial penalties.

Given this, the best approach to test planning should be risk-based: consider which risks you want to mitigate, and test accordingly:

  • Identify Risks
  • Evaluate Risks
  • Prioritise Risks
  • Plan tests from the top of the priority list down.

Examples

This should work at every level within a project: whether you are building a new feature or a whole system, identify the risks it faces, prioritise them, and plan your tests to address the most serious ones first.

Where It's Used

  • Waterfall was initially conceived with a long, manual testing phase, performed on the whole system after development.
  • Extreme Programming championed the use of Unit Tests in order to test individual subsystems, as well as having an On-Site Customer to act as a testing resource when needed.

Variations

Automated Tests

Often, the decision of whether to automate a test will be based on whether or not it can be expressed objectively. For example, checking that a REST endpoint "returns the right error code" is objective, and is therefore a candidate for automation.

Automated tests look roughly the same, irrespective of the scope they are trying to test.

  • We have a System Under Test, which may be a single class, or a whole executable.
  • We have some Input Conditions for the test, and some Expectations.
  • When the test is executed, we compare the actual outputs with the expected ones, giving us The Result.
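As a minimal sketch of these three parts, assuming JUnit 5 and a hypothetical PriceCalculator class:

```java
import org.junit.jupiter.api.Test;
import static org.junit.jupiter.api.Assertions.assertEquals;

class PriceCalculatorTest {

    @Test
    void appliesTenPercentDiscountOverOneHundred() {
        PriceCalculator calculator = new PriceCalculator();   // the System Under Test
        double actual = calculator.priceWithDiscount(200.0);  // the Input Conditions
        assertEquals(180.0, actual, 0.001);                   // Expectations vs. actual = The Result
    }
}
```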

Testing Process

A useful way to think about automated testing is that it turns the System Under Test into a Pure Function: This means that for a specific set of inputs, the system will produce a specific output, reliably, every time.

Getting complex systems to behave as pure functions can be costly, but there are techniques to help with this, such as Mocking. However, if you design as much of your software as possible in a pure-functional way to start with, automated testing becomes much easier.
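For instance, a component that reads the wall clock is not a pure function. A small sketch of removing that hidden input, here using the JDK's built-in fixed Clock rather than a mocking library (GreetingService is a hypothetical class taking a Clock in its constructor):

```java
import java.time.Clock;
import java.time.Instant;
import java.time.ZoneId;
import org.junit.jupiter.api.Test;
import static org.junit.jupiter.api.Assertions.assertEquals;

class GreetingServiceTest {

    @Test
    void greetsByTimeOfDay() {
        // Fixing the clock removes the hidden input (real time), so for these
        // inputs the System Under Test now behaves as a pure function.
        Clock nineAm = Clock.fixed(Instant.parse("2024-01-01T09:00:00Z"), ZoneId.of("UTC"));
        GreetingService service = new GreetingService(nineAm);

        assertEquals("Good morning", service.greeting());
    }
}
```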

Automated Testing has an interesting effect on managing Complexity Risk: although you may initially write a Unit Test (say) to mitigate the risk of having implemented a feature wrongly, you also gain insurance against future changes breaking that feature. That is to say, automated tests double as regression tests, and writing them up-front like this is better than retro-fitting regression tests after the fact.

-- how do automated tests mitigate complexity risk? tbd

Manual Tests

Manual Testing is, at some level, essential if your product is to be used by humans. Although UI-Automation tools such as Selenium allow you to script browser interactions, they cannot reliably catch every problem.

For example, ensuring the UI "looks ok and doesn't glitch" is entirely subjective: you'll need to express this in a manual test. Manual Tests are often described in Test Plans and Test Scripts in order to ensure repeatability, and manage Process Risk.
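By way of contrast, here is the kind of objective check that does automate well: a rough sketch assuming Selenium WebDriver, JUnit 5, a locally installed ChromeDriver, and a hypothetical login page and element id:

```java
import org.junit.jupiter.api.Test;
import org.openqa.selenium.By;
import org.openqa.selenium.WebDriver;
import org.openqa.selenium.chrome.ChromeDriver;
import static org.junit.jupiter.api.Assertions.assertTrue;

class LoginPageTest {

    @Test
    void loginButtonIsPresentAndEnabled() {
        WebDriver driver = new ChromeDriver();
        try {
            driver.get("https://example.com/login");  // hypothetical URL
            // An objective, scriptable expectation...
            assertTrue(driver.findElement(By.id("login")).isEnabled());
            // ..."the page looks right", by contrast, still needs a human eye.
        } finally {
            driver.quit();
        }
    }
}
```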

Since manual tests carry a much higher per-run cost, there is a tendency to want to save this cost by doing fewer releases. After all, fewer releases mean less manual testing, but batching work up like this may increase Process Risk.

How do you decide whether to keep a test manual, or to automate it? The more automated a test is, the more cheaply it can be re-run. However, automating it takes time up-front, and so adds Schedule Risk. Whether or not it's worth automating is, to some extent, going to depend on how much you value future time.
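A back-of-envelope way to frame that trade-off (every figure below is an assumption, for illustration only):

```java
public class AutomationBreakEven {
    public static void main(String[] args) {
        double manualHoursPerRun = 0.5;   // effort for one manual execution
        double automationHours   = 4.0;   // one-off cost of scripting the test
        double upkeepHoursPerRun = 0.05;  // maintenance amortised per automated run

        // Automation pays off once the saved manual effort exceeds its total cost:
        //   runs * manualHoursPerRun > automationHours + runs * upkeepHoursPerRun
        double breakEvenRuns = automationHours / (manualHoursPerRun - upkeepHoursPerRun);
        System.out.printf("Break-even after ~%.0f runs%n", breakEvenRuns);  // ~9 runs
    }
}
```

If you expect to run the test more often than that before it needs rewriting, automation is the better bet; if the feature is still in flux, the manual test may be cheaper.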

White-Box and Black-Box Testing

In its original conception, Black-Box Testing ignores the implementation details of a component and tests its interface only.

White-Box Testing, however, considers the components within the box and how they interact with one another in order to define the tests. This is fair enough if, for some reason, you are unable to test the components individually: knowing how something is implemented gives you an insight into where the bugs will hide, and therefore where the risks lie.

Testing Level

However, if possible, it's better to break open the white box and test the components themselves. This means you end up having "higher" and "lower" level tests, depending on the scope of the System Under Test. There are several advantages to this:

  • First, tests become less "brittle": the smaller the System Under Test, the less Context it needs to operate, therefore the more insulated it is to changes in other parts of the system. As a counter-example, if all of your tests run over the whole system, and the authentication system changes, does that break all the tests? This is an argument from Complexity-Risk.
  • Tests at the "whole system" level are usually longer-running since they require starting up the whole system, and also require more data and context to run. This is an argument both from Complexity-Risk and Process Risk.

Expanding on this then, the Testing Pyramid idea is that lower level, automated tests which run quickly should be common, while there should be fewer of the more expensive "whole system" level tests.

Testing Pyramid

Finally, since manual tests are run by people (who are comparatively slow and costly), these should be the rarest kind of test.
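In practice, build tooling can enforce this shape. A sketch using JUnit 5's @Tag (the tag name, test and fixture are assumptions): mark the expensive whole-system tests so that CI runs the fast majority on every commit and the slow minority less often, e.g. nightly via Maven Surefire's groups/excludedGroups filters.

```java
import org.junit.jupiter.api.Tag;
import org.junit.jupiter.api.Test;
import static org.junit.jupiter.api.Assertions.assertTrue;

class CheckoutEndToEndTest {

    // Excluded from the per-commit build and run nightly instead:
    // starting the whole system is slow but gives high-level assurance.
    @Tag("whole-system")
    @Test
    void customerCanCompleteCheckout() {
        SystemFixture system = SystemFixture.startWholeSystem();  // hypothetical helper
        assertTrue(system.checkoutSucceedsFor("test-customer"));
        system.shutdown();
    }
}
```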

Testing Team

Sometimes, testing is handled by external teams (possibly in other locales). This is often done as a cost-saving measure, but comes with some penalties such as:

  • Increased Communication Risk in terms of having to engage with an external company.
  • Agency Risk because the testing team are a business in their own right, who might be more interested in the goal of making money from you than shipping your product.
  • Obvious Coordination Risk in trying to arrange work across other teams, buildings, time zones or countries, and in not having control over exactly which staff are dealing with your product.
  • Invisibility Risk because at some level, the testing team need to understand what your software is for.

Test-Driven Development

Also called test-first development, the idea here (from Extreme Programming) is that you write the tests before the code, so that you think up-front about the requirements of the software you are writing. The aim is to minimise Complexity Risk by preventing developers from Gold Plating and getting them to do The Simplest Thing That Can Possibly Work.

Additionally, by having tests fail before they pass, you mitigate the risk of writing a "null test": one that passes when it should fail (see below).
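A minimal sketch of the rhythm, assuming JUnit 5 and a FizzBuzz class that doesn't exist yet:

```java
import org.junit.jupiter.api.Test;
import static org.junit.jupiter.api.Assertions.assertEquals;

class FizzBuzzTest {

    // Written first: it fails to compile, then fails to pass, and only then
    // do we write the simplest code that makes it green. Watching it fail
    // first also proves it isn't a "null test".
    @Test
    void multiplesOfThreeSayFizz() {
        assertEquals("Fizz", FizzBuzz.say(3));
    }
}
```

The simplest implementation that passes (say, one that always returns "Fizz") is acceptable: the next test you write will force it to generalise.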

Code Coverage

Code Coverage tools are a useful way of showing you which parts of your software might contain bugs due to lack of testing, which is really useful in the Risk Evaluation phase of test-planning.

Sometimes code coverage spawns its own Map And Territory Risks, though: people forget that the goal should be mitigating overall project risk (via delivering functionality and so on) and start to believe that the goal is delivering 100% code coverage. Writing tests to cover every get() method is a fool's errand which increases the overall codebase complexity for no real reduction in Feature Risk.
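For illustration, a hypothetical test like this raises the coverage number without reducing any real risk:

```java
import org.junit.jupiter.api.Test;
import static org.junit.jupiter.api.Assertions.assertEquals;

class CustomerTest {

    // Covers Customer.getName() and pleases the coverage tool, but there is
    // no plausible failure here for it to catch: pure coverage-chasing.
    @Test
    void getNameReturnsName() {
        Customer customer = new Customer("Ada");  // hypothetical class
        assertEquals("Ada", customer.getName());
    }
}
```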

Worse still is that having 100% code coverage does not guarantee an absence of bugs, or that the code will do what the users wanted it to do. Feature Risk is always there.

Risks Mitigated

There are so many different types of testing that this guide is not meant to be exhaustive. Instead, here are some of the main types of testing and the risks they mitigate:

  • Boundary Risk: System Integration Testing, CI Deployment, User Acceptance Testing
  • Dependency Risk: Integration Testing, System Integration Testing
  • Operational Risk: Performance Testing / Load Testing, Non-Functional Testing, Disaster Recovery Testing, Security Testing, Smoke / Sanity Testing
  • Software Risk: Unit Testing, Component Testing, End-To-End Testing, Functional Testing
  • Feature Risk: Browser-Based Testing, Accessibility Testing, Acceptance Testing (UAT), Beta Testing
  • Invisibility Risk: Usability Testing, Corridor Testing
  • Complexity Risk: Unit Testing, Automated Acceptance Testing, Integration Testing

Attendant Risks

Firstly, it can be easy to fool yourself with tests: just because your tests pass does not mean your code is perfect. Vigilance is required against Map And Territory Risk:

  • Do the tests explore the behaviour of the system the same way the users will?
  • Can you be sure you haven't written a "null test": one that passes when it should fail? (See the sketch after this list.)
  • Have you covered the "cracks" between the different parts of the system? Just because all the components of a bicycle are fine, it doesn't mean that the bike itself will work.
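A null test can be surprisingly easy to write by accident. One common shape, sketched with a hypothetical fixture that (wrongly) returns no data:

```java
import java.util.List;
import org.junit.jupiter.api.Test;
import static org.junit.jupiter.api.Assertions.assertTrue;

class NullTestExample {

    // Looks like it checks every price, but the fixture returns an empty
    // list: the loop body never runs, nothing is asserted, and the test
    // passes when it should fail.
    @Test
    void allPricesArePositive() {
        for (double price : loadPrices()) {
            assertTrue(price > 0);
        }
    }

    private List<Double> loadPrices() {
        return List.of();  // the bug: no test data at all
    }
}
```

Asserting that the fixture is non-empty, or watching the test fail first (as in Test-Driven Development, above), guards against this.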

Second, testing is a double-edged sword. While it allows you to mitigate various Feature Risks, by adding test code to your project you are necessarily increasing its complexity. Maintaining tests is hard work, and if you're not careful, running them can slow down builds and add delay, increasing Process Risk.

Third, if you are exploring functionality in order to flush out requirements, understand user behaviour or figure out performance characteristics, then there is no point in building tests yet: what you are doing is exploratory at best and the extra code will slow you down.

For these reasons, focus on writing the smallest number of tests that mitigates the risks.

See Also

  • Risk Based Agile Testing by Martin Ivison, which covers a lot of this ground in much more detail.

  • Continuous Integration, referred to above.

  • Risk-Adjusted Value

Notes:

  • bring our skills to bear whatever the risk
  • cf. with evolution
  • people are naturally risk-averse
  • investment banking, Kelly bets
  • development bets

How does this help?

  • with debugging
  • with planning work
  • with development
  • real options