3 changes: 2 additions & 1 deletion .gitignore
@@ -3,4 +3,5 @@ __pycache__
 config.py
 reads
 sftp*
-.DS*
+.DS*
+venv/
5 changes: 5 additions & 0 deletions .vscode/settings.json
@@ -0,0 +1,5 @@
+{
+    "cSpell.words": [
+        "punct"
+    ]
+}
9,111 changes: 9,111 additions & 0 deletions data/RobustMath/DeepSeek-R1-Distill-Llama-8B/DeepSeek-R1-Distill-Llama-8B.csv

Large diffs are not rendered by default.

858 changes: 858 additions & 0 deletions data/RobustMath/clean_robust_math.json

Large diffs are not rendered by default.

1,202 changes: 1,202 additions & 0 deletions data/RobustMath/noisy_math_attack.json

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions data/RobustMath/noisy_robust_math_30.json

Large diffs are not rendered by default.

64,462 changes: 64,462 additions & 0 deletions data/gsm/DeepSeek-R1-Distill-Llama-8B/DeepSeek-R1-Distill-Llama-8B.csv

Large diffs are not rendered by default.

340 changes: 340 additions & 0 deletions data/gsm/DeepSeek-R1-Distill-Llama-8B/gsm8k_noisy_punct_10.csv

Large diffs are not rendered by default.

@@ -0,0 +1 @@
+Question,Answer - LLM,Answer - Ground Truth
@@ -0,0 +1 @@
+Question,Answer - LLM,Answer - Ground Truth
45,698 changes: 45,698 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct.csv

Large diffs are not rendered by default.

45,825 changes: 45,825 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_converted.csv

Large diffs are not rendered by default.

51,741 changes: 51,741 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_10_converted.csv

Large diffs are not rendered by default.

51,436 changes: 51,436 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_30_converted.csv

Large diffs are not rendered by default.

52,702 changes: 52,702 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_50_converted.csv

Large diffs are not rendered by default.

51,741 changes: 51,741 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_punct_10.csv

Large diffs are not rendered by default.

51,436 changes: 51,436 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_punct_30.csv

Large diffs are not rendered by default.

52,702 changes: 52,702 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_punct_50.csv

Large diffs are not rendered by default.

51,159 changes: 51,159 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_punct_new_10.csv

Large diffs are not rendered by default.

52,080 changes: 52,080 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_punct_new_30.csv

Large diffs are not rendered by default.

29,891 changes: 29,891 additions & 0 deletions data/gsm/Mathstral-7b-v0.1/Mathstral-7b-v0.1_noisy_punct_new_10.csv

Large diffs are not rendered by default.

30,460 changes: 30,460 additions & 0 deletions data/gsm/Mathstral-7b-v0.1/Mathstral-7b-v0.1_noisy_punct_new_30.csv

Large diffs are not rendered by default.

47,202 changes: 47,202 additions & 0 deletions data/gsm/Meta-Llama-3-8B-Instruct/Meta-Llama-3-8B-Instruct_noisy_punct_new_10.csv

Large diffs are not rendered by default.

47,915 changes: 47,915 additions & 0 deletions data/gsm/Meta-Llama-3-8B-Instruct/Meta-Llama-3-8B-Instruct_noisy_punct_new_30.csv

Large diffs are not rendered by default.

33,510 changes: 33,510 additions & 0 deletions data/gsm/Mistral-7B-Instruct-v0.2/Mistral-7B-Instruct-v0.2_noisy_punct_new_10.csv

Large diffs are not rendered by default.
