Commit
·
a9e021e
1
Parent(s):
f62a688
Update the model card to refer to the report PDF
Browse files
README.md
CHANGED
|
@@ -217,6 +217,8 @@ We trained LoRA adapters across all linear layers of the experts and attention b
|
|
| 217 |
|
| 218 |
## Evaluation
|
| 219 |
|
|
|
|
|
|
|
| 220 |
### Code Quality
|
| 221 |
|
| 222 |
We used SonarQube to evaluate the quality, verbosity, and complexity of Java code generated for the [ComplexCodeEval](https://github.com/ComplexCodeEval/ComplexCodeEval) and [MultiPL-E Java](https://huggingface.co/datasets/nuprl/MultiPL-E/viewer/humaneval-java) benchmarks.
|
|
@@ -225,7 +227,7 @@ The fine-tuned and base models achieve a similar pass@1 metric for code generati
|
|
| 225 |
|
| 226 |
The fine-tuned model achieves this metric while generating fewer lines of code.
|
| 227 |
|
| 228 |
-
For code quality, we see a dramatic reduction in both the number and density of Sonar issues, split among bugs, security vulnerabilities, and code smells (see the [Glossary](#glossary) for definitions).
|
| 229 |
|
| 230 |
| Metric | Base Model | Fine-tuned Model |
|
| 231 |
|--------|------------|------------------|
|
|
|
|
| 217 |
|
| 218 |
## Evaluation
|
| 219 |
|
| 220 |
+
For a comprehensive analysis with detailed metrics and additional comparisons between the base model and fine-tuned model, see our [detailed evaluation report](https://huggingface.co/SonarSource/SonarSweep-java-gpt-oss-20b/blob/main/report.pdf).
|
| 221 |
+
|
| 222 |
### Code Quality
|
| 223 |
|
| 224 |
We used SonarQube to evaluate the quality, verbosity, and complexity of Java code generated for the [ComplexCodeEval](https://github.com/ComplexCodeEval/ComplexCodeEval) and [MultiPL-E Java](https://huggingface.co/datasets/nuprl/MultiPL-E/viewer/humaneval-java) benchmarks.
|
|
|
|
| 227 |
|
| 228 |
The fine-tuned model achieves this metric while generating fewer lines of code.
|
| 229 |
|
| 230 |
+
For code quality, we see a dramatic reduction in both the number and density of Sonar issues, split among bugs, security vulnerabilities, and code smells (see the [Glossary](#glossary) for definitions). For granular breakdowns by issue type and severity, refer to the [detailed evaluation report](https://huggingface.co/SonarSource/SonarSweep-java-gpt-oss-20b/blob/main/report.pdf).
|
| 231 |
|
| 232 |
| Metric | Base Model | Fine-tuned Model |
|
| 233 |
|--------|------------|------------------|
|