Using knowledge units of programming languages to recommend reviewers for pull requests: an empirical study

Published 2023 in Empirical Software Engineering

ABSTRACT

Determining the right code reviewer for a given code change requires understanding the characteristics of the changed code, identifying the skills of each potential reviewer (expertise profile), and finding a good match between the two. To facilitate this task, we design a code reviewer recommender that operates on the knowledge units (KUs) of a programming language. We define a KU as a cohesive set of key capabilities that are offered by one or more building blocks of a given programming language. We operationalize our KUs using certification exams for the Java programming language. We detect KUs from 10 actively maintained Java projects from GitHub, spanning 290K commits and 65K pull requests (PRs). We generate developer expertise profiles based on the detected KUs. We use these KU-based expertise profiles to build a code reviewer recommender (KUREC). We compare KUREC’s performance to that of seven baseline recommenders. KUREC ranked first along with the top-performing baseline recommender (RF) in a Scott-Knott ESD analysis of recommendation accuracy (the top-5 accuracy of KUREC is 0.84 (median) and the MAP@5 is 0.51 (median)). From a practical standpoint, we highlight that KUREC’s performance is more stable (lower interquartile range) than that of RF, thus making it more consistent and potentially more trustworthy. We also design three new recommenders by combining KUREC with our baseline recommenders. These new combined recommenders outperform both KUREC and the individual baselines. Finally, we evaluate how reasonable the recommendations from KUREC and the combined recommenders are when those deviate from the ground truth. We observe that KUREC is the recommender with the highest percentage of reasonable recommendations (63.4%). Overall we conclude that KUREC and one of the combined recommenders (e.g., AD_HYBRID) are overall superior to the baseline recommenders that we studied. Future work in the area should thus (i) consider KU-based recommenders as baselines and (ii) experiment with combined recommenders.

PUBLICATION RECORD

Publication year
2023
Venue
Empirical Software Engineering
Publication date
2023-05-09
Fields of study
Computer Science
Identifiers
DOI 10.1007/s10664-023-10421-9 arXiv 2305.05654
External record
Open on Semantic Scholar
Source metadata
Semantic Scholar

CITATION MAP

EXTRACTION MAP

CLAIMS

No claims are published for this paper.

CONCEPTS

No concepts are published for this paper.

REFERENCES

CORMS: a GitHub and Gerrit based hybrid code reviewer recommendation approach for modern code review
2022cited by this paper
Assessing the Alignment between the Information Needs of Developers and the Documentation of Programming Languages: A Case Study on Rust
2022cited by this paper
Towards Mining OSS Skills from GitHub Activity
2022cited by this paper
Diversified Third-Party Library Prediction for Mobile App Development
2022cited by this paper
DevRec: Multi-Relationship Embedded Software Developer Recommendation
2022cited by this paper
Modeling Review History for Reviewer Recommendation: A Hypergraph Approach
2022cited by this paper
Assessing Developer Expertise from the Statistical Distribution of Programming Syntax Patterns
2021cited by this paper
WhoReview: A multi-objective search-based approach for code reviewers recommendation in modern code review
2021cited by this paper
Detection and Elimination of Systematic Labeling Bias in Code Reviewer Recommendation Systems
2021cited by this paper
Representation of Developer Expertise in Open Source Software
2020cited by this paper
Modelling of Knowledge via Fuzzy Knowledge Unit in a Case of the ERP Systems Upgrade
2020cited by this paper
Workload-aware reviewer recommendation using a multi-objective search-based approach
2020cited by this paper
Towards medical knowmetrics: representing and computing medical knowledge using semantic predications as the knowledge unit and the uncertainty as the knowledge context
2020cited by this paper
Using a Context-Aware Approach to Recommend Code Reviewers: Findings from an Industrial Case Study
2020cited by this paper
Mitigating Turnover with Code Review Recommendation: Balancing Expertise, Workload, and Knowledge Distribution
2020cited by this paper
Soft Sensors Based on Deep Neural Networks for Applications in Security and Safety
2020cited by this paper
Identifying Experts in Software Libraries and Frameworks Among GitHub Users
2019cited by this paper
The impact of feature reduction techniques on defect prediction models
2019cited by this paper
Reviewer Recommendation using Software Artifact Traceability Graphs
2019cited by this paper
Algorithms for estimating truck factors: a comparative study
2019cited by this paper
WhoDo: automating reviewer suggestions at scale
2019cited by this paper
Who should make decision on this pull request? Analyzing time-decaying relationships and file similarities for integrator prediction
2019cited by this paper
Classes
2019cited by this paper
Data of SemanticWeb as Unit of Knowledge
2019cited by this paper
A Hierarchical Clustering algorithm based on Silhouette Index for cancer subtype discovery from genomic data
2019cited by this paper
Accurate Design Pattern Detection Based on Idiomatic Implementation Matching in Java Language Context
2019cited by this paper
Identifying the Cybersecurity Body of Knowledge for a Postgraduate Module in Systems Engineering
2018cited by this paper
PyDriller: Python framework for mining software repositories
2018cited by this paper
SCSMiner: mining social coding sites for software developer recommendation with relevance propagation
2018cited by this paper
An Empirical Comparison of Model Validation Techniques for Defect Prediction Models
2017cited by this paper
A hybrid approach to code reviewer recommendation with collaborative filtering
2017cited by this paper
Cybersecurity Curricular Guidelines
2017cited by this paper
A Large-Scale Study of the Impact of Feature Selection Techniques on Defect Classification Models
2017cited by this paper
Curating GitHub for engineered software projects
2017cited by this paper
Profile based recommendation of code reviewers
2017cited by this paper
Who should comment on this pull request? Analyzing attributes for more accurate commenter recommendation in pull-based development
2017cited by this paper
CVExplorer: Identifying candidate developers by mining and exploring their open source contributions
2016cited by this paper
Cross-Project Defect Prediction Using a Connectivity-Based Unsupervised Classifier
2016cited by this paper
Automatically Recommending Peer Reviewers in Modern Code Review
2016influential reference
CORRECT: Code reviewer recommendation at GitHub for Vendasta technologies
2016cited by this paper
Reviewer recommendation for pull-requests in GitHub: What can we learn from code review and bug assignment?
2016cited by this paper
EARec: Leveraging Expertise and Authority for Pull-Request Reviewer Recommendation in GitHub
2016cited by this paper
Search-Based Peer Reviewers Recommendation in Modern Code Review
2016cited by this paper
Automatically recommending code reviewers based on their expertise: An empirical comparison
2016influential reference
Clustering Mobile Apps Based on Mined Textual Features
2016cited by this paper
A novel approach for estimating Truck Factors
2016cited by this paper
Oracle Certified Professional Java SE 8 Programmer Exam 1Z0-809
2016cited by this paper
Improving Stability of Recommender Systems: A Meta-Algorithmic Approach
2015influential reference
Developers assignment for analyzing pull requests
2015cited by this paper
Matching GitHub Developer Profiles to Job Advertisements
2015cited by this paper
Who should review my code? A file location-based code-reviewer recommendation approach for Modern Code Review
2015influential reference
CoreDevRec: Automatic Core Member Recommendation for Contribution Evaluation
2015cited by this paper
Revisiting the Impact of Classification Techniques on the Performance of Defect Prediction Models
2015cited by this paper
Who should review this change?: Putting text and file location analyses together for more accurate recommendations
2015influential reference
Degree-of-knowledge
2014cited by this paper
Who Should Review this Pull-Request: Reviewer Recommendation to Expedite Crowd Collaboration
2014cited by this paper
Finding the Optimal Subspace for Clustering
2014cited by this paper
Ranking and Clustering Software Cost Estimation Models through a Multiple Comparisons Algorithm
2013cited by this paper
Java File I/O (NIO.2)
2013cited by this paper
Does bug prediction support human developers? Findings from a Google case study
2013cited by this paper
Convergent contemporary software peer review practices
2013cited by this paper
Expectations, outcomes, and challenges of modern code review
2013cited by this paper
Using developer interaction data to compare expertise metrics
2013cited by this paper
How to effectively use topic models for software engineering tasks? An approach based on Genetic Algorithms
2013cited by this paper
Software Fault Prediction Using Quad Tree-Based K-Means Clustering Algorithm
2012cited by this paper
Towards a more realistic evaluation: testing the ability to predict future tastes of matrix factorization-based recommenders
2011cited by this paper
A degree-of-knowledge model to capture source code familiarity
2010cited by this paper
Towards identifying software project clusters with regard to defect prediction
2010cited by this paper
The promises and perils of mining GitHub
2009cited by this paper
Expert recommendation with usage expertise
2009cited by this paper
Influence and correlation in social networks
2008cited by this paper
Supporting software evolution using adaptive change propagation heuristics
2008influential reference
Determining Implementation Expertise from Bug Reports
2007cited by this paper
An Approach to Outlier Detection of Software Measurement Data using the K-means Clustering Method
2007cited by this paper
Does a programmer's activity indicate knowledge of code?
2007cited by this paper
Design Pattern Detection Using Similarity Scoring
2006cited by this paper
How developers drive software evolution
2005cited by this paper
K-means clustering via principal component analysis
2004cited by this paper
Subspace clustering for high dimensional data: a review
2004cited by this paper
Algorithms for clustering high dimensional and distributed data
2003cited by this paper
Expertise Browser: a quantitative approach to identifying expertise
2002cited by this paper
Evaluating expertise recommendations
2001influential reference
Expertise recommender: a flexible recommendation system and architecture
2000influential reference
Agents to assist in finding help
2000cited by this paper
Authoritative sources in a hyperlinked environment
1999cited by this paper
Silhouettes: a graphical aid to the interpretation and validation of cluster analysis
1987cited by this paper
Relative Deprivation and the Gini Coefficient
1979cited by this paper
Diversiﬁed Third-Party Library Prediction for Mobile App Development
year unknowncited by this paper

CITED BY

Automating Code Review: A Systematic Literature Review
2025cites this paper
RGPRec: A RAG‐Enhanced GNN for Personalized Task Recommendations in Open‐Source Communities
2025cites this paper
Understanding Open Source Contributor Profiles in Popular Machine Learning Libraries
2024cites this paper
TeReKG: A temporal collaborative knowledge graph framework for software team recommendation
2024cites this paper
Predicting post-release defects with knowledge units (KUs) of programming languages: an empirical study
2024influential citation
Deep learning-based software engineering: progress, challenges, and opportunities
2024cites this paper
Predicting long time contributors with knowledge units of programming languages: an empirical study
2024cites this paper