H. Watanabe, "Token-Level SZZ: Fine-Grained Identification of Bug-Inducing Commits in Java Projects," Master's thesis, Graduate School of Science and Technology, Kyoto Institute of Technology, 2024. | |
ID | 949 |
Category | Thesis |
Tags | bug-inducing commits fine-grained identification java projects szz token-level master's thesis |
Title |
Token-Level SZZ: Fine-Grained Identification of Bug-Inducing Commits in Java Projects |
Title (English) |
|
Author |
Hiroya Watanabe |
Author (English) |
Hiroya Watanabe |
Key |
Hiroya Watanabe |
School |
Graduate School of Science and Technology, Kyoto Institute of Technology |
Publisher address |
|
Month |
2 |
Year |
2024 |
URL |
|
Note |
|
Annotation |
|
Abstract |
SZZ is the de facto standard method for identifying bug-inducing commits. Its accuracy relies heavily on source code management systems such as Git, because it traces the history of source code changes (i.e., commit histories) back to bug-inducing commits. However, these systems have been reported to introduce biases into commit histories because they store only line-level changes. Such coarse-grained line-level changes can cause the commit history to be tracked inaccurately and reduce the performance of SZZ. To address this challenge, we examine the accuracy of SZZ on token-level changes, which provide finer-grained information for tracing commit histories than line-level changes, and we discuss the potential benefits and pitfalls of using token-level changes for SZZ. In experiments on 68 OSS projects, we found that SZZ with token-level histories identifies two new bug-inducing commits that are missed with line-level histories. Furthermore, our manual analysis of the identified commits indicates that token-level histories reduce false-positive bug-inducing commits caused by source code formatting and whitespace changes. However, this improvement in detecting bug-inducing commits comes with a trade-off: a decrease of 0.081 in overall accuracy, as measured by the F1 score. Consequently, we summarize three potential benefits and five pitfalls of using token-level and line-level tracking for SZZ. |
Electronic file | h-watanabe-master-v1.0.0.pdf (application/pdf) [publicly accessible] |
BibTeX entry |
@masterthesis{id949, title = {Token-level {SZZ}: Fine-grained Identification of Bug-inducing Commits in {Java} Projects}, author = {Hiroya Watanabe}, school = {Graduate School of Science and Technology, Kyoto Institute of Technology}, month = {2}, year = {2024}, } |
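The abstract's contrast between line-level and token-level tracking can be illustrated with a minimal Python sketch. It does not reproduce the thesis's tooling, which this record does not describe. The tokenizer, the helper names (`tokenize`, `changed_units`, `changed_lines`, `changed_tokens`), and the Java-like snippets are all hypothetical and chosen only for illustration. The sketch assumes that a formatting-only edit rewrites lines without changing the token sequence.

```python
import difflib
import re

# Hypothetical, simplified tokenizer (an assumption for illustration only):
# identifiers, integer literals, two-character comparison operators,
# and any other single non-whitespace character.
TOKEN_RE = re.compile(r"[A-Za-z_]\w*|\d+|[<>=!]=|\S")


def tokenize(source):
    """Split source text into tokens, discarding all whitespace."""
    return TOKEN_RE.findall(source)


def changed_units(old_units, new_units):
    """Return the units of the old version that a diff marks as replaced or deleted."""
    matcher = difflib.SequenceMatcher(None, old_units, new_units, autojunk=False)
    changed = []
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if tag in ("replace", "delete"):
            changed.extend(old_units[i1:i2])
    return changed


def changed_lines(old, new):
    """Line-level view: each line is one unit."""
    return changed_units(old.splitlines(), new.splitlines())


def changed_tokens(old, new):
    """Token-level view: each token is one unit, whitespace is ignored."""
    return changed_units(tokenize(old), tokenize(new))


# Hypothetical Java-like snippets: a formatting-only edit and a real fix.
OLD = "if (x>0) { return x; }\n"
FORMATTED = "if (x > 0) {\n    return x;\n}\n"
FIXED = "if (x>=0) { return x; }\n"

print(changed_lines(OLD, FORMATTED))
print(changed_tokens(OLD, FORMATTED))
print(changed_tokens(OLD, FIXED))
```

In this sketch the formatting-only edit marks the whole original line as changed at the line level but leaves no changed tokens, while the real fix changes only the `>` token. This is the kind of difference that, according to the abstract, lets token-level histories avoid false positives caused by formatting and whitespace changes.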