Regression testing explained

Regression testing (rarely, non-regression testing[1]) is re-running functional and non-functional tests to ensure that previously developed and tested software still performs as expected after a change.[2] If it does not, the failure is called a regression.

Changes that may require regression testing include bug fixes, software enhancements, configuration changes, and even substitution of electronic components (hardware).[3] As regression test suites tend to grow with each found defect, test automation is frequently involved. The evident exception is GUI regression testing, which normally must be executed manually. Sometimes a change impact analysis is performed to determine an appropriate subset of tests (non-regression analysis[4]).

Background

As software is updated or changed, or reused on a modified target, the emergence of new faults and the re-emergence of old faults are quite common.

Sometimes re-emergence occurs because a fix gets lost through poor revision control practices (or simple human error in revision control). Often, a fix for a problem will be "fragile" in that it fixes the problem in the narrow case where it was first observed but not in more general cases which may arise over the lifetime of the software. Frequently, a fix for a problem in one area inadvertently causes a software bug in another area.

When a feature is redesigned, some of the same mistakes that were made in its original implementation may recur in the redesign. In most software development situations, it is considered good coding practice, when a bug is located and fixed, to record a test that exposes the bug and re-run that test regularly after subsequent changes to the program.[5]
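The practice of recording a test that exposes a fixed bug can be sketched as follows. This is a minimal illustration only: the function `average` and the bug it is assumed to have had (a crash on empty input) are hypothetical and not taken from any particular project.

```python
import unittest


def average(values):
    """Return the arithmetic mean of values, or 0.0 for an empty sequence.

    Hypothetical scenario: an earlier version raised ZeroDivisionError
    on an empty list, and the check below is the fix.
    """
    if not values:  # the fix: handle the empty case explicitly
        return 0.0
    return sum(values) / len(values)


class AverageRegressionTest(unittest.TestCase):
    def test_empty_input_does_not_crash(self):
        # Reproduces the originally reported failure (hypothetical bug).
        self.assertEqual(average([]), 0.0)

    def test_general_case_still_works(self):
        # Guards against a "fragile" fix that breaks the normal path.
        self.assertEqual(average([2, 4, 6]), 4.0)
```

Saved in a test module, these cases could be re-run after every later change with `python -m unittest`, so that a re-emergence of the fault is reported as a failing test.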

Although this may be done through manual testing procedures using programming techniques, it is often done using automated testing tools.[6] Such a test suite contains software tools that allow the testing environment to execute all the regression test cases automatically. Many projects have automated continuous integration systems that re-run all regression tests at specified intervals and report any failures (which could imply a regression or an out-of-date test).[7]

Common strategies are to run such a system after every successful compile (for small projects), every night, or once a week. Those strategies can be automated by an external tool.

Regression testing is an integral part of the extreme programming software development method. In this method, design documents are replaced by extensive, repeatable, and automated testing of the entire software package throughout each stage of the software development process. Regression testing is done after functional testing has concluded, to verify that the other functionalities are working.

In the corporate world, regression testing has traditionally been performed by a software quality assurance team after the development team has completed work. However, defects found at this stage are the most costly to fix. This problem is being addressed by the rise of unit testing. Although developers have always written test cases as part of the development cycle, these test cases have generally been either functional tests or unit tests that verify only intended outcomes. Developer testing compels a developer to focus on unit testing and to include both positive and negative test cases.[8]
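The difference between positive and negative test cases can be illustrated with a short sketch. The function `parse_port` and its valid range are assumptions chosen for the example; the point is only that the tests check rejected inputs as well as intended outcomes.

```python
def parse_port(text):
    """Parse a TCP port number from a string (hypothetical function under test)."""
    port = int(text)  # raises ValueError for non-numeric input
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    return port


def test_positive_cases():
    # Positive tests: valid input yields the intended outcome.
    assert parse_port("80") == 80
    assert parse_port("65535") == 65535


def test_negative_cases():
    # Negative tests: invalid input must be rejected, not silently accepted.
    for bad in ["0", "65536", "-1", "http", ""]:
        try:
            parse_port(bad)
        except ValueError:
            continue
        raise AssertionError(f"expected ValueError for {bad!r}")
```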

Techniques

The various regression testing techniques are:

Retest all

This technique re-runs every test case against the current program to check its integrity. Although it is expensive, because every case must be re-run, it ensures that the modified code has not introduced errors.[9]

Regression test selection

Unlike retest all, this technique runs only a part of the test suite, because re-running every test is expensive. It is worthwhile only if the cost of selecting that part of the suite is less than the cost of the retest-all technique.
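One simple way to select a subset is to keep only the tests that exercise a changed file. The sketch below assumes a coverage map from test names to source files is already available; the map, the file names, and the function `select_tests` are all hypothetical.

```python
# Hypothetical coverage map: test name -> source files it exercises.
COVERAGE = {
    "test_login": {"auth.py", "session.py"},
    "test_checkout": {"cart.py", "payment.py"},
    "test_profile": {"auth.py", "profile.py"},
}


def select_tests(coverage, changed_files):
    """Return, sorted by name, the tests that touch at least one changed file."""
    changed = set(changed_files)
    return sorted(name for name, files in coverage.items() if files & changed)


selected = select_tests(COVERAGE, ["auth.py"])
print(selected)  # ['test_login', 'test_profile']
```

Building and maintaining such a map has its own cost, which is why selection pays off only when that cost stays below the cost of re-running the whole suite.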

Test case prioritization

This technique orders the test cases so as to increase a test suite's rate of fault detection. Test case prioritization techniques schedule test cases so that those with a higher priority are executed before those with a lower priority.
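A prioritization scheme needs some score by which to order the tests. The sketch below uses past failures per second of runtime as that score; this heuristic, the `CaseInfo` record, and the sample numbers are assumptions made for illustration, not a prescribed method.

```python
from dataclasses import dataclass


@dataclass
class CaseInfo:
    name: str
    past_failures: int  # hypothetical: how often this test has caught a fault
    runtime_s: float    # hypothetical: typical runtime in seconds


def prioritize(tests):
    """Order tests by past failures per second of runtime, highest first."""
    return sorted(tests, key=lambda t: t.past_failures / t.runtime_s, reverse=True)


suite = [
    CaseInfo("test_report", past_failures=1, runtime_s=10.0),
    CaseInfo("test_login", past_failures=6, runtime_s=2.0),
    CaseInfo("test_search", past_failures=2, runtime_s=4.0),
]
ordered = [t.name for t in prioritize(suite)]
print(ordered)  # ['test_login', 'test_search', 'test_report']
```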

Types of test case prioritization

Hybrid

This technique is a hybrid of regression test selection and test case prioritization.
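A hybrid plan can be sketched as selection followed by ordering. The coverage map, the priority values, and the function `hybrid_plan` below are hypothetical and serve only to show the two steps combined.

```python
# Hypothetical data: files each test covers, and a priority score per test.
COVERAGE = {
    "test_login": {"auth.py"},
    "test_checkout": {"cart.py", "payment.py"},
    "test_refund": {"payment.py"},
}
PRIORITY = {"test_login": 3, "test_checkout": 1, "test_refund": 2}


def hybrid_plan(coverage, priority, changed_files):
    """Select the tests affected by the change, then order them by priority."""
    changed = set(changed_files)
    # Step 1: regression test selection.
    selected = [name for name, files in coverage.items() if files & changed]
    # Step 2: test case prioritization of the selected subset.
    return sorted(selected, key=lambda name: priority[name], reverse=True)


plan = hybrid_plan(COVERAGE, PRIORITY, ["payment.py"])
print(plan)  # ['test_refund', 'test_checkout']
```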

Benefits and drawbacks

Regression testing is performed when changes are made to the existing functionality of the software or when a bug in the software is fixed. Regression testing can be achieved through multiple approaches; if a retest-all approach is followed, it provides certainty that the changes made to the software have not affected the existing, unaltered functionality.[10]

In agile software development, where software development life cycles are very short, resources are scarce, and changes to the software are very frequent, regression testing might introduce a lot of unnecessary overhead.

In a software development environment which tends to use black box components from a third party, performing regression testing can be tricky, as any change in the third-party component may interfere with the rest of the system (and performing regression testing on a third-party component is difficult, because it is an unknown entity).

Uses

Regression testing can be used not only for testing the correctness of a program but often also for tracking the quality of its output.[11] For instance, in the design of a compiler, regression testing could track the code size and the time it takes to compile and execute the test suite cases.
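Tracking output quality usually means comparing measurements against a stored baseline. The sketch below assumes that lower values are better for every metric and that a 5% tolerance is acceptable; the metric names, numbers, and the function `find_regressions` are hypothetical.

```python
def find_regressions(baseline, current, tolerance=0.05):
    """Return metrics whose current value exceeds the baseline by more than tolerance.

    Assumes lower is better for every metric (for example size or time).
    """
    regressions = {}
    for name, old in baseline.items():
        new = current[name]
        if new > old * (1 + tolerance):
            regressions[name] = (old, new)
    return regressions


# Hypothetical measurements for one test program.
baseline = {"code_size_bytes": 10_000, "compile_time_s": 2.0}
current = {"code_size_bytes": 10_200, "compile_time_s": 2.5}

print(find_regressions(baseline, current))
# {'compile_time_s': (2.0, 2.5)}
```

Here the code size grew by 2%, which is within the tolerance, while the compile time grew by 25% and is reported.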

Regression tests can be broadly categorized as functional tests or unit tests. Functional tests exercise the complete program with various inputs. Unit tests exercise individual functions, subroutines, or object methods. Both functional testing tools and unit-testing tools tend to be automated and are often third-party products that are not part of the compiler suite. A functional test may be a scripted series of program inputs, possibly even involving an automated mechanism for controlling mouse movements and clicks. A unit test may be a set of separate functions within the code itself or a driver layer that links to the code without altering the code being tested.
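The two categories can be contrasted in a small sketch. The unit test calls a function directly, while the functional test runs a complete program as a separate process and checks its output. Both the function `total` and the one-line stand-in program are hypothetical, and the program is deliberately trivial.

```python
import subprocess
import sys


def total(numbers):
    """Unit under test: add up a list of integers."""
    return sum(numbers)


# A tiny stand-in for a complete program (hypothetical):
# it prints the sum of its command-line arguments.
PROGRAM = "import sys; print(sum(int(a) for a in sys.argv[1:]))"


def run_program(*args):
    """Run the program as a separate process and return its printed output."""
    completed = subprocess.run(
        [sys.executable, "-c", PROGRAM, *args],
        capture_output=True, text=True, check=True,
    )
    return completed.stdout.strip()


def test_unit():
    # Unit test: exercises one function in isolation.
    assert total([1, 2, 3]) == 6


def test_functional():
    # Functional test: exercises the whole program through its inputs and outputs.
    assert run_program("1", "2", "3") == "6"
```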

Notes and References

  1. Pezzè, Mauro; Young, Michal (2008). Software Testing and Analysis: Process, Principles, and Techniques. Wiley. "Testing activities that focus on regression problems are called (non) regression testing. Usually 'non' is omitted."
  2. Basu, Anirban (2015). Software Quality Assurance, Testing and Metrics. PHI Learning. ISBN 978-81-203-5068-7.
  3. National Research Council.
  4. Boulanger, Jean-Louis (2015). CENELEC 50128 and IEC 62279 Standards. Wiley. ISBN 978-1119122487.
  5. Kolawa, Adam; Huizinga, Dorota (2007). Automated Defect Prevention: Best Practices in Software Management. Wiley-IEEE Computer Society Press. p. 73. ISBN 978-0-470-04212-0.
  6. "Automate Regression Tests When Feasible". http://safari.oreilly.com/0201794292/ch08lev1sec4
  7. daVeiga, Nada (2008-02-06). "Change Code Without Fear: Utilize a Regression Safety Net". Dr. Dobb's Journal.
  8. Dudney, Bill (2004-12-08). "Developer Testing Is 'In': An interview with Alberto Savoia and Kent Beck". Retrieved 2007-11-29.
  9. Duggal, Gaurav; Suri, Bharti (2008-03-29). Understanding Regression Testing Techniques. National Conference on Challenges and Opportunities. Mandi Gobindgarh, Punjab, India. CiteSeerX 10.1.1.460.5875.
  10. Yoo, S.; Harman, M. (2010). "Regression testing minimization, selection and prioritization: a survey". Software Testing, Verification and Reliability. 22 (2): 67–120. doi:10.1002/stvr.430.
  11. Kolawa, Adam. "Regression Testing, Programmer to Programmer". Wrox.