zhilinw commited on
Commit
abf2a35
·
verified ·
1 Parent(s): ec0bb46

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -32,11 +32,11 @@ As of 18 Mar 2025, augmenting models with the Feedback-Edit Inference Time Scali
32
 
33
  | Model | Arena Hard (95% CI) |
34
  |:-----------------------------|:----------------|
35
- | Llama Nemotron Super + **Feedback-Edit ITS** | **93.4 (-1.1, 1.0)** |
36
  | Llama-3.1-Nemotron-70B-Instruct + **Feedback-Edit ITS** | 92.7 (-1.2, 0.9) |
37
  | o1-mini-2024-09-12 | 92.0 (-1.2, 1.0) |
38
  | o1-preview-2024-09-12 | 90.4 (-1.1, 1.3) |
39
- | Llama Nemotron Super | 88.3 (-1.6, 1.6) |
40
  | claude-3-5-sonnet-20241022 | 85.2 (-1.4, 1.6) |
41
  | Llama-3.1-Nemotron-70B-Instruct | 84.9 (-1.7, 1.8) |
42
 
 
32
 
33
  | Model | Arena Hard (95% CI) |
34
  |:-----------------------------|:----------------|
35
+ | Llama-3.3-Nemotron-49B-Instruct + **Feedback-Edit ITS** | **93.4 (-1.1, 1.0)** |
36
  | Llama-3.1-Nemotron-70B-Instruct + **Feedback-Edit ITS** | 92.7 (-1.2, 0.9) |
37
  | o1-mini-2024-09-12 | 92.0 (-1.2, 1.0) |
38
  | o1-preview-2024-09-12 | 90.4 (-1.1, 1.3) |
39
+ | Llama-3.3-Nemotron-49B-Instruct | 88.3 (-1.6, 1.6) |
40
  | claude-3-5-sonnet-20241022 | 85.2 (-1.4, 1.6) |
41
  | Llama-3.1-Nemotron-70B-Instruct | 84.9 (-1.7, 1.8) |
42