CodeAttention: translating source code to comments by exploiting the code constructs
- Source :
- Frontiers of Computer Science. 13:565-578
- Publication Year :
- 2018
- Publisher :
- Springer Science and Business Media LLC, 2018.
Abstract
- Appropriate comments on code snippets provide insight into code functionality and aid program comprehension. However, because authoring comments is costly, many code projects do not contain adequate comments. Automatic comment generation techniques have been proposed to generate comments from pieces of code and reduce the human effort of annotating code. Most existing approaches attempt to exploit certain correlations (usually manually specified) between code and generated comments; these correlations are easily violated when coding patterns change, and the performance of comment generation then declines. In addition, recent approaches ignore the code constructs and treat code snippets as plain text. Furthermore, previous datasets are too small to validate the methods and show their advantages. In this paper, we propose a new attention mechanism called CodeAttention to translate code to comments, which is able to utilize code constructs such as critical statements, symbols, and keywords. By focusing on these specific points, CodeAttention can capture the semantic meaning of code better than previous methods. To verify our approach across a wider range of coding patterns, we build a large dataset from open projects on GitHub. Experimental results on this large dataset demonstrate that the proposed method outperforms existing approaches in both objective and subjective evaluation. We also perform ablation studies to determine the effects of the different parts of CodeAttention.
- Subjects :
- Source code
Information retrieval
General Computer Science
Exploit
Plain text
Computer science
media_common.quotation_subject
Program comprehension
020207 software engineering
02 engineering and technology
computer.file_format
Theoretical Computer Science
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
computer
media_common
Coding (social sciences)
Details
- ISSN :
- 2095-2236 and 2095-2228
- Volume :
- 13
- Database :
- OpenAIRE
- Journal :
- Frontiers of Computer Science
- Accession number :
- edsair.doi...........b71c7bdc3edef4a3d0f4e1fd8942cea4
- Full Text :
- https://doi.org/10.1007/s11704-018-7457-6