计算机科学
一致性(知识库)
剽窃检测
余弦相似度
分级(工程)
特征(语言学)
光学(聚焦)
源代码
相似性(几何)
透明度(行为)
特征模型
数据挖掘
欧几里德距离
编码(集合论)
过程(计算)
人工智能
编程风格
公制(单位)
程序设计语言
程序代码
组分(热力学)
特征向量
编码(社会科学)
特征提取
情报检索
软件
自然语言处理
静态分析
机器学习
钥匙(锁)
作者
Marek Horváth,Matúš Motyka,Emília Pietriková
标识
DOI:10.1109/sami68106.2026.11420357
摘要
This study introduces a framework for automatic detection of stylistic and structural similarity in student programming assignments. The approach uses static analysis to extract lexical, syntactic, and stylistic features from C source code. Each submission is represented as a numerical feature vector, which can be compared using cosine similarity, Euclidean distance, and Manhattan distance metrics. The framework includes an interactive component that visualizes similarity results and highlights cases that may require closer inspection. In practice, this approach simplifies the verification of programming style consistency and supports the detection of possible plagiarism in large groups of students. The solution can be easily integrated into existing grading workflows, allowing instructors to quickly focus on the most relevant cases. By making detailed feature comparisons accessible and interactive, the system strengthens transparency in the evaluation process and contributes to higher standards of academic integrity.
科研通智能强力驱动
Strongly Powered by AbleSci AI