A Time Window Based Reinforcement Learning Reward for Test Case Prioritization in Continuous Integration

Year	Authors	Publisher	DOI
2019	Wu, Zhaolin Yang, Yang Li, Zheng Zhao, Ruilian	ACM	10.1145/3361242.3361258

Year

Authors

Publisher

DOI

2019

Wu, Zhaolin Yang, Yang Li, Zheng Zhao, Ruilian

ACM

10.1145/3361242.3361258

Techniques	Applicability
TCP	Industry Motivation Industry Evaluation

Techniques

Applicability

TCP

Industry Motivation Industry Evaluation

Experiment subject(s)	Industrial Partner	Programming Language
GSDTSR (5555 TCs) plus datasets from ABB Robotics (up to 2086 TCs) Industrial open-source, large scale		Unclear

Experiment subject(s)

Industrial Partner

Programming Language

GSDTSR (5555 TCs) plus datasets from ABB Robotics (up to 2086 TCs) Industrial open-source, large scale

Unclear

Effectiveness Metrics	Efficiency Metrics	Other Metrics
Average Percentage of Faults Detected (APFD)	Execution time

Effectiveness Metrics

Efficiency Metrics

Other Metrics

Average Percentage of Faults Detected (APFD)

Execution time

Information Approach	Algorithm Approach	Open Challenges
	Machine learning-based	use windows of dynamic size

Information Approach

Algorithm Approach

Open Challenges

Machine learning-based

use windows of dynamic size

Abstract

Continuous integration refers to the practice of merging the working copies of all developers into the mainline frequently. Regression testing for each mergence is characterized by continually changing test suite, limited execution time, and fast feedback, which demands new test optimization techniques. Reinforcement learning is introduced for test case prioritization to save computing resources in continuous integration environment, where a reasonable reward function is highly important for learning strategy, since the process of reinforcement learning is a reward-guided behavior. In this paper, APHFW, a novel reward function is proposed by using partial historical information of test cases effectively for fast feedback and cost reduction. The experiments are based on three open-source data sets, and the results show that the proposed reward function is more cost-effect than other reinforcement learning rewards in continuous integration environment.