Tweet | |
M. Kondo, D. M. German, Y. Kamei, N. Ubayashi, and O. Mizuno, "An Empirical Study of Token-Based Micro Commits," Empirical Software Engineering, 29(148), 33 pages, September 2024. | |
ID | 957 |
分類 | 学術論文誌(査読付) |
タグ | commits empirical micro study token-based |
表題 (title) |
An Empirical Study of Token-Based Micro Commits |
表題 (英文) |
|
著者名 (author) |
Masanari Kondo,Daniel M. German,Yasutaka Kamei,Naoyasu Ubayashi,Osamu Mizuno |
英文著者名 (author) |
Masanari Kondo,Daniel M. German,Yasutaka Kamei,Naoyasu Ubayashi,Osamu Mizuno |
キー (key) |
Masanari Kondo,Daniel M. German,Yasutaka Kamei,Naoyasu Ubayashi,Osamu Mizuno |
定期刊行物名 (journal) |
Empirical Software Engineering |
定期刊行物名 (英文) |
|
巻数 (volume) |
29 |
号数 (number) |
148 |
ページ範囲 (pages) |
33 pages |
刊行月 (month) |
9 |
出版年 (year) |
2024 |
Impact Factor (JCR) |
4.1 (2022) |
URL |
https://doi.org/10.1007/s10664-024-10527-8 |
付加情報 (note) |
|
注釈 (annote) |
|
内容梗概 (abstract) |
In software development, developers frequently apply maintenance activities to the source code that change a few lines by a single commit. A good understanding of the characteristics of such small changes can support quality assurance approaches (e.g., automated program repair), as it is likely that small changes are addressing deficiencies in other changes; thus, understanding the reasons for creating small changes can help understand the types of errors introduced. Eventually, these reasons and the types of errors can be used to enhance quality assurance approaches for improving code quality. While prior studies used code churns to characterize and investigate the small changes, such a definition has a critical limitation. Specifically, it loses the information of changed tokens in a line. For example, this definition fails to distinguish the following two one-line changes: (1) changing a string literal to fix a displayed message and (2) changing a function call and adding a new parameter. These are definitely maintenance activities, but we deduce that researchers and practitioners are interested in supporting the latter change. To address this limitation, in this paper, we define micro commits, a type of small change based on changed tokens. Our goal is to quantify small changes using changed tokens. Changed tokens allow us to identify small changes more precisely. In fact, this token-level definition can distinguish the above example. We investigate defined micro commits in four OSS projects and understand their characteristics as the first empirical study on token-based micro commits. We find that micro commits mainly replace a single name or literal token, and micro commits are more likely used to fix bugs. Additionally, we propose the use of token-based information to support software engineering approaches in which very small changes significantly affect their effectiveness. |
論文電子ファイル | preprint (application/pdf) [一般閲覧可] |
BiBTeXエントリ |
@article{id957, title = {An Empirical Study of Token-based Micro Commits}, author = {Masanari Kondo and Daniel M. German and Yasutaka Kamei and Naoyasu Ubayashi and Osamu Mizuno}, journal = {Empirical Software Engineering}, volume = {29}, number = {148}, pages = {33 pages}, month = {9}, year = {2024}, impactfactor = {4.1 (2022)}, } |