Update README.md
README.md
CHANGED
@@ -32,11 +32,11 @@ As of 18 Mar 2025, augmenting models with the Feedback-Edit Inference Time Scali
 
 | Model | Arena Hard (95% CI) |
 |:-----------------------------|:----------------|
-| Llama
+| Llama-3.3-Nemotron-49B-Instruct + **Feedback-Edit ITS** | **93.4 (-1.1, 1.0)** |
 | Llama-3.1-Nemotron-70B-Instruct + **Feedback-Edit ITS** | 92.7 (-1.2, 0.9) |
 | o1-mini-2024-09-12 | 92.0 (-1.2, 1.0) |
 | o1-preview-2024-09-12 | 90.4 (-1.1, 1.3) |
-| Llama
+| Llama-3.3-Nemotron-49B-Instruct | 88.3 (-1.6, 1.6) |
 | claude-3-5-sonnet-20241022 | 85.2 (-1.4, 1.6) |
 | Llama-3.1-Nemotron-70B-Instruct | 84.9 (-1.7, 1.8) |
 