ファジングが発見した不具合の自動修正技術

项目来源

日本学术振兴会基金(JSPS)

项目主持人

吉田則裕

项目受资助机构

立命館大学

项目编号

24K02923

立项年度

2024

立项时间

未公开

项目级别

国家级

研究期限

未知 / 未知

受资助金额

18330000.00日元

学科

ソフトウェア関連

学科代码

未公开

基金类别

基盤研究(B)

关键词

ファジング ;

参与者

戸田航史；藤原賢二；槇原絵里奈

参与机构

福岡工業大学；奈良女子大学

项目标书摘要：Outline of Research at the Start:ファジングとは,不具合を引き起こす可能性がある入力の自動生成とプログラムの自動実行を繰り返すプロセスを指す.AFLに代表されるファジングツールは,大規模OSSから数多くの不具合を発見している.しかし,ファジングは不具合を引きおこす可能性がある入力を開発者に提示するのみであり,プログラム中のバグ位置やバグの修正方法は提示しない.本研究では,ファジングが提示する不具合を引き起こす可能性がある入力を基に,プログラム中のバグ位置を特定し,自動修正を行う技術の確立を目指す.まず,ファジングと相性の良いバグ位置特定技術を明らかにし,特定したバグを修正するために有用な修正パターンを明らかにする。

排序方式：时间相关性
显示方式：列表摘要

1.Development and benchmarking of multilingual code clone detector

关键词：
Benchmarking;Syntactics;Benchmark testing;Block extraction;Clone detection;Code blocks;Code clone;Detection performance;Language extensibilities;Parser generation;Source codes;Target language

Zhu, Wenqing;Yoshida, Norihiro;Kamiya, Toshihiro;Choi, Eunjong;Takada, Hiroaki
《Journal of Systems and Software》
2025年
219卷
期
期刊

The diversity of programming languages is growing, making the language extensibility of code clone detectors crucial. However, this is challenging for most existing clone detection detectors because the source code handler needs modifications, which requires specialist-level knowledge of the targeted language and is time-consuming. Multilingual code clone detectors make it easier to add new language support by providing syntax information of the target language only. To address the shortcomings of existing multilingual detectors for language scalability and detection performance, we propose a multilingual code block extraction method based on ANTLR parser generation, and implement a multilingual code clone detector (MSCCD), which supports the most significant number of languages currently available and has the ability to detect Type-3 code clones. We follow the methodology of previous studies to evaluate the detection performance of the Java language. Compared to ten state-of-the-art detectors, MSCCD performs at an average level while it also supports a significantly larger number of languages. Furthermore, we propose the first multilingual syntactic code clone evaluation benchmark based on the CodeNet database. Our results reveal that even when applying the same detection approach, performance can vary markedly depending on the language of the source code under investigation. Overall, MSCCD is the most balanced one among the evaluated tools when considering detection performance and language extensibility. © 2024 The Author(s)

...

2.Leveraging Context Information for Self-Admitted Technical Debt Detection

关键词：
Computer programming languages;Computer software selection and evaluation;Software design;Software quality;CodeBERT;Context information;Context-Aware;Context-aware detection;Development activity;False positive;Performance;Self-admitted technical debt;Technical debts;Technical understanding

Yonekura, Miki;Kashiwa, Yutaro;Lin, Bin;Fujiwara, Kenji;Iida, Hajimu
《33rd IEEE/ACM International Conference on Program Comprehension, ICPC 2025》
2025年
April 27, 2025 - April 28, 2025
Ottawa, ON, Canada
会议

Self-Admitted Technical Debt (SATD) refers to nonoptimal software design or implementation that is acknowledged and explicitly documented in the code by developers. Detecting SATD and understanding its evolution can help developers better manage their development activities and monitor the software quality. In recent years, numerous approaches have been proposed to automatically identify SATD. However, these approaches still suffer from a high number of false positives (i.e., non-SATD comments being detected as SATD). To further advance this field, in this paper, we conduct an empirical study to evaluate the performance of the state-of-theart SATD detection tools and investigate the causes behind the false positives. By manually analyzing 135 false positive cases, we identify the main types of comments that are easily misclassified. To address this issue, we propose a new approach, CASTI, which integrates context information into CodeBERT, a pre-trained model for programming languages. Our evaluation demonstrates that CASTI can significantly reduce the false positives and that the context information does help improve the performance. © 2025 IEEE.

...

3.Multilingual Investigation of Cross-Project Code Clones in Open-Source Software for Internet of Things Systems

关键词：
Multiprocessing programs;Risk management;Clone analysis;Clone detection;Clone genealogy;Code clone;Code clone analyze;Defect propagation;Internet of thing system;Multilingual clone detection;Open-source softwares;Project codes

Zhu, Wenqing;Yoshida, Norihiro;Matsubara, Yutaka;Takada, Hiroaki
《IEEE Access》
2024年
12卷
期
期刊

The prevalence and impact of code clones in software systems have been widely studied in the past few decades. However, the focus has primarily been on intra-project clones. Our work comprehensively investigates cross-project code clones in open-source software for Internet of Things (IoT) systems across multiple programming languages. This work addresses the prevalence of cross-project code clones in IoT systems and their impact on software maintainability. We collected 122 IoT system repositories in nine languages from GitHub and grouped them according to their primary functionality in IoT systems. We used MSCCD, a multilingual code clone detector to detect Type-3 code clones for each group. The results show that cross-project clones exist in more than 30% of the projects, particularly in communication-related functionalities. We tracked the historical evolution of these clones and classified them according to the revision history and changing trend of similarity. The results show that 95% cross-project clones are untouched. Moreover, clones with decreasing similarities were over 72%. Therefore, the same clone detector may no longer detect these clones. We also investigated whether these cross-project code clones lead to defect propagation by analyzing the commit message to determine the commits that fixed a defect. We identified nine defect propagation instances, of which seven remain unfixed. Our work contributes to understanding cross-project code clones, highlighting the importance of automated clone management tools for improving the quality and security of IoT system software to mitigate the risks associated with unresolved defects and inconsistencies in IoT software development. © 2013 IEEE.

...

排序方式：时间相关性
显示方式：列表摘要