The Layered Cake of Testing

Not having a comprehensive test strategy is like architecting a building without knowing the environment in which it will be built and how it will be stressed.

What’s a test strategy?

This is a plan on what and how you plan to test the different concepts or patterns without you application. Ideally, your test strategy is decided during your architectural design phase of your application. The best outcomes are driven by choosing frameworks, libraries, patterns and conventions that emphasize the ease of which these decisions can be tested and maintained.

The 4 (or maybe 5) layers of testing

A test strategy should be a multi-layered approach to ensure quality code. Each layer has its purpose and should be used in accordance to its strength and weakness. Static code analysis and end-to-end tests are both important, and one cannot replace the other. This is because they have completely different responsibilities and a completely different set of pros and cons.

With that said, let’s take a look at the layers and discuss the intentions of each.

Static code analysis

This includes linting, type checking, code-flow analysis … TypeScript, Reason, Flowtype, ESLint, TSLint are all examples of static code analysis.

Critical focus

Speed
“Set and forget”
Automated
Verifies code patterns and syntax

When to run

These should be run as frequently as possible. Optimally, they are continuously running while developing.

Code smells

Ambiguous types on large objects, like TypeScript’s any
Overly typing entities, rather than allowing for type inference
Not using the compiler’s build as a pass-fail mechanism for your pipeline
Not taking advantage of automated “fixing” of code when misaligned from standards

Coverage

As close to 100% of the codebase should be evaluated.

Recommendation

Although ReasonML is a great solution due to being both completely type safe and sound, it’s not the most well-known and does have a cognitive challenge due to its very FP nature. My default recommendation is TypeScript, due to it being type safe (though not sound), but is very well-known and is written and feels very much like JavaScript with types, and not a completely different language.

With this, one should have ESLint and Prettier run automatically at file save or at least at each commit. These tools allow the automation of code “fixing” when there is a misalignment of code from the configured standards. Running this as a part of local development process removes the need for commenting on common syntactical “nits”, allowing developers to focus on what really matters in code reviews.

Unit testing

These are super light-weight tests that run assertions against the most atomized parts of code, the units. These are just testing the result for a given input for a function. These functions/units are mostly pure, stateless functions, so there is no need to mock or stub anything other than the input/arguments provided. In other words, there is nothing external to the unit tested that’s needed, no servers, no APIs, no DOM, no HTTP … nothing.

Example

import { fnToTest } from './some-module';

describe('A group of units with related functionality', () => {
  it('test that fnToTest returns x when y', () => {
    const input = 'y';
    const expected = 'x';

    const received = fnToTest(input);
    expect(received).toBe(expected);
  });
});

Critical focus

Speed
Simplicity
Low effort
Great dev experience
Verifying internal structures

When to run

These should be run, at minimum, at every git commit.

Code smells

Stubbing functions or methods that have side-effects
Mocking (other than the arguments passed) additional objects and global variables
beforeAll for testing “setup”
afterAll for testing “teardown”
You can’t just simply run the test node myunit.test.js without running other processes or servers

Coverage

Coverage is a dangerous proposition as it can lead to low-value testing just for the sake of numbers. I think the majority of the app’s functionality should have unit tests, but whether that’s 70%, 80% or 90% is quite arbitrary and quickly surpasses the point of diminishing return.

My view is any function that has hand-written logic in it should have both positive and negative testing. Here are some easy rules that apply to all testing:

Only test hand-written logic; don’t test native functions, or methods from external libraries or frameworks
The more complex the hand-written logic, the more priority should be placed on testing
The more critical or “hot path” the function, the more priority should be placed on testing
Don’t test functions that primarily call out to external entities (usually require mocking) and return the response

The focus should be on the value (quality) of the tests, not the number of tests (quantity).

Recommendation

My very favorite unit testing library is tape, but it does require quite a bit of manual configuration at scale. For better “out of the box” configuration for scale, Jest is a great alternative that breaks the capability of running the unit test file alone (e.g. node test.js), but the benefits that come with it are too great to ignore.

Unit tests should reside right along the files they test. Example:

- todo-list/
  | - todo-filter.ts
  | - todo-filter.test.ts

The Jest library has everything needed for unit testing. There should be no additional libraries or frameworks needed.

Integration testing

These are more similar to unit tests than they are other types of tests. These tests are still input-output driven, but since it tests the composition of units, it tests at a higher, more complex level, so mocking/stubbing is almost always required.

Mocking and stubbing should be done without the need for additional processes, servers or APIs. The mocks or stubs should be static and side-effect free.

Example

import { moduleToTest } from './top-module';
import { mockedRequest } from './mock-request';
import { mockedResponse } from './mock-response';
import { expectedReturn } from './mock-expected-return';

jest.mock('../http-client/index', () => {
  return mockedResponse;
});

describe('A module or abstraction that runs multiple units', () => {
  it('test that moduleToTest returns this when that', () => {
    const received = moduleToTest(mockedRequest);
    expect(received).toBe(expectedReturn);
  });
});

Critical focus

Speed
Completeness
Good dev experience
Verifying internal structures

When to run

These should be run at minimum at every git push.

Code smells

beforeAll for testing “setup”
afterAll for testing “teardown”
You can’t just simply run the test node myunit.test.js without running other processes or servers

Coverage

All full suites of units should be tested up to the point of needing to run additional processes, servers or needing a real DOM. These should test as much surface area as possible with as few tests as possible and should be contained to just testing the local codebase or module. Never test the functionality of external/third-party functionality.

Recommendation

Continuing with the Jest recommendation from above, Jest can also handle integration testing with its ability to mock modules.

View-component testing

These are analogous to unit or integration tests for React or Vue components. These can be run along with your unit and integration tests. They are input and output based and do not require additional process, servers or live DOM. The difference between a view-component test is that the output of data, the output is a rendered component. Assertions are made against the component’s properties.

Example

import { shallow } from 'enzyme';
import { ComponentToTest } from './component';
import { res } from './mocked-response';

describe('Tests for ComponentToTest ensuring proper rendering', () => {
  it('should render an error message when response has validation errors', () => {
    const wrapper = shallow(<ComponentToTest response={ res } />);
    const expected = 'Password must be longer than 8 characters';
    expect(wrapper.find('.input-error').text()).to.equal(expected);
  });
});

Critical focus

Little mocking
Simplicity
Low effort
Good dev experience
Verifying rendered component states

When to run

These should be run at minimum for each PR to the main branch, but could be run at every git push.

Code smells

Testing a native library or framework function
Testing something that is too simple
Testing if component has a class or text that is not driven by logic or is static
Heavy use of beforeAll for testing “setup”
Heavy use of afterAll for testing “teardown”

Coverage

All view-components that have complexity, logic, derived data or branching should be tested.

Recommendation

Jest will continue our testing capabilities with the addition of JSDom for mocking the DOM in order for us to synthetically render our components for testing.

End-to-end testing (aka functional testing/e2e)

This type of testing is to mimic user interaction to the closest degree. It requires a running app and running APIs to simulate the real environment as much as possible. Tests require a Web driver to mimic clicking, typing, navigating and submitting data to a running application against appropriate testing stages.

Critical focus

Functional completeness
Predictability and reliability
Mimics real world usage
Verifying the API contracts between external, dependent systems
Can be run against live systems and mocked systems

When to run

These should be run for each PR to the main branch.

Code smells

You’re testing a native library or framework function
You’re testing something that is too simple
You’re testing the DOM or browser function

Coverage

Critical user flows, security and privacy related features, anything that’s considered high-risk. The intention of a e2e test is to verify the entire user flow within a running environment. Avoid testing functionality that can more easily be tested though unit or integration tests or redundantly testing something that’s already well covered in your unit or integration tests.

Recommendation

Playwright, Puppeteer, Cypress are good options. I prefer Playwright due to the support of Microsoft and it being cross-browser.

Cerebral Ideas

The Layered Cake of Testing

What’s a test strategy?

The 4 (or maybe 5) layers of testing

Static code analysis

Critical focus

When to run

Code smells

Coverage

Recommendation

Unit testing

Example

Critical focus

When to run

Code smells

Coverage

Recommendation

Integration testing

Example

Critical focus

When to run

Code smells

Coverage

Recommendation

View-component testing

Example

Critical focus

When to run

Code smells

Coverage

Recommendation

End-to-end testing (aka functional testing/e2e)

Critical focus

When to run

Code smells

Coverage

Recommendation

Additional references