AbbasSabra committed
Commit a9e021e · 1 Parent(s): f62a688

Update the model card to refer to the report PDF

Files changed (1)
  1. README.md +3 -1
README.md CHANGED
@@ -217,6 +217,8 @@ We trained LoRA adapters across all linear layers of the experts and attention b
 
 ## Evaluation
 
+For a comprehensive analysis with detailed metrics and additional comparisons between the base model and fine-tuned model, see our [detailed evaluation report](https://huggingface.co/SonarSource/SonarSweep-java-gpt-oss-20b/blob/main/report.pdf).
+
 ### Code Quality
 
 We used SonarQube to evaluate the quality, verbosity, and complexity of Java code generated for the [ComplexCodeEval](https://github.com/ComplexCodeEval/ComplexCodeEval) and [MultiPL-E Java](https://huggingface.co/datasets/nuprl/MultiPL-E/viewer/humaneval-java) benchmarks.
@@ -225,7 +227,7 @@ The fine-tuned and base models achieve a similar pass@1 metric for code generati
 
 The fine-tuned model achieves this metric while generating fewer lines of code.
 
-For code quality, we see a dramatic reduction in both the number and density of Sonar issues, split among bugs, security vulnerabilities, and code smells (see the [Glossary](#glossary) for definitions).
+For code quality, we see a dramatic reduction in both the number and density of Sonar issues, split among bugs, security vulnerabilities, and code smells (see the [Glossary](#glossary) for definitions). For granular breakdowns by issue type and severity, refer to the [detailed evaluation report](https://huggingface.co/SonarSource/SonarSweep-java-gpt-oss-20b/blob/main/report.pdf).
 
 | Metric | Base Model | Fine-tuned Model |
 |--------|------------|------------------|