Automated Code Review Comment Classification to Improve Modern Code Reviews

M. Ochodek; Miroslaw Staron; Wilhelm Meding; Ola Söder

doi:10.1007/978-3-031-04115-0_3

Automated Code Review Comment Classification to Improve Modern Code Reviews
Paper in proceeding, 2022

Modern Code Reviews (MCRs) are a widely-used quality assurance mechanism in continuous integration and deployment. Unfortunately, in medium and large projects, the number of changes that need to be integrated, and consequently the number of comments triggered during MCRs could be overwhelming. Therefore, there is a need for quickly recognizing which comments are concerning issues that need prompt attention to guide the focus of the code authors, reviewers, and quality managers. The goal of this study is to design a method for automated classification of review comments to identify the needed change faster and with higher accuracy. We conduct a Design Science Research study on three open-source systems. We designed a method (CommentBERT) for automated classification of the code-review comments based on the BERT (Bidirectional Encoder Representations from Transformers) language model and a new taxonomy of comments. When applied to 2,672 comments from Wireshark, The Mono Framework, and Open Network Automation Platform (ONAP) projects, the method achieved accuracy, measured using Matthews Correlation Coefficient, of 0.46–0.82 (Wireshark), 0.12–0.8 (ONAP), and 0.48–0.85 (Mono). Based on the results, we conclude that the proposed method seems promising and could be potentially used to build machine-learning-based tools to support MCRs as long as there is a sufficient number of historical code-review comments to train the model.

Machine learning

BERT

Modern Code Reviews

Author

M. Ochodek

Poznan University of Technology

Miroslaw Staron

University of Gothenburg

Chalmers, Computer Science and Engineering (Chalmers), Software Engineering (Chalmers)

Other publications Research

Wilhelm Meding

Ericsson

Other publications Research

Ola Söder

Axis Communication AB

Lecture Notes in Business Information Processing

1865-1348 (ISSN) 18651356 (eISSN)

Vol. 439 LNBIP 23-40
9783031041143 (ISBN)

14th International Conference on Software Quality, SWQD 2022
Vienna, Austria,

Subject Categories (SSIF 2011)

Other Computer and Information Science

Information Science

Information Systemes, Social aspects

DOI

10.1007/978-3-031-04115-0_3

Publication data connected to DOI

More information

Latest update

6/30/2025

Automated Code Review Comment Classification to Improve Modern Code Reviews Paper in proceeding, 2022