3 changes: 2 additions & 1 deletion .gitignore
@@ -3,4 +3,5 @@ __pycache__
 config.py
 reads
 sftp*
-.DS*
+.DS*
+venv/
5 changes: 5 additions & 0 deletions .vscode/settings.json
@@ -0,0 +1,5 @@
{
"cSpell.words": [
"punct"
]
}
9,111 changes: 9,111 additions & 0 deletions data/RobustMath/DeepSeek-R1-Distill-Llama-8B/DeepSeek-R1-Distill-Llama-8B.csv

Large diffs are not rendered by default.

858 changes: 858 additions & 0 deletions data/RobustMath/clean_robust_math.json

Large diffs are not rendered by default.

1,202 changes: 1,202 additions & 0 deletions data/RobustMath/noisy_math_attack.json

Large diffs are not rendered by default.

1 change: 1 addition & 0 deletions data/RobustMath/noisy_robust_math_30.json

Large diffs are not rendered by default.

64,462 changes: 64,462 additions & 0 deletions data/gsm/DeepSeek-R1-Distill-Llama-8B/DeepSeek-R1-Distill-Llama-8B.csv

Large diffs are not rendered by default.

340 changes: 340 additions & 0 deletions data/gsm/DeepSeek-R1-Distill-Llama-8B/gsm8k_noisy_punct_10.csv

Large diffs are not rendered by default.

@@ -0,0 +1 @@
Question,Answer - LLM,Answer - Ground Truth
@@ -0,0 +1 @@
Question,Answer - LLM,Answer - Ground Truth
45,698 changes: 45,698 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct.csv

Large diffs are not rendered by default.

45,825 changes: 45,825 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_converted.csv

Large diffs are not rendered by default.

51,741 changes: 51,741 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_10_converted.csv

Large diffs are not rendered by default.

51,436 changes: 51,436 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_30_converted.csv

Large diffs are not rendered by default.

52,702 changes: 52,702 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_50_converted.csv

Large diffs are not rendered by default.

51,741 changes: 51,741 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_punct_10.csv

Large diffs are not rendered by default.

51,436 changes: 51,436 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_punct_30.csv

Large diffs are not rendered by default.

52,702 changes: 52,702 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_punct_50.csv

Large diffs are not rendered by default.

51,159 changes: 51,159 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_punct_new_10.csv

Large diffs are not rendered by default.

52,080 changes: 52,080 additions & 0 deletions data/gsm/Llama-3.1-8B-Instruct/Llama-3.1-8B-Instruct_noisy_punct_new_30.csv

Large diffs are not rendered by default.

29,891 changes: 29,891 additions & 0 deletions data/gsm/Mathstral-7b-v0.1/Mathstral-7b-v0.1_noisy_punct_new_10.csv

Large diffs are not rendered by default.

30,460 changes: 30,460 additions & 0 deletions data/gsm/Mathstral-7b-v0.1/Mathstral-7b-v0.1_noisy_punct_new_30.csv

Large diffs are not rendered by default.

47,202 changes: 47,202 additions & 0 deletions data/gsm/Meta-Llama-3-8B-Instruct/Meta-Llama-3-8B-Instruct_noisy_punct_new_10.csv

Large diffs are not rendered by default.

47,915 changes: 47,915 additions & 0 deletions data/gsm/Meta-Llama-3-8B-Instruct/Meta-Llama-3-8B-Instruct_noisy_punct_new_30.csv

Large diffs are not rendered by default.

33,510 changes: 33,510 additions & 0 deletions data/gsm/Mistral-7B-Instruct-v0.2/Mistral-7B-Instruct-v0.2_noisy_punct_new_10.csv

Large diffs are not rendered by default.

33,651 changes: 33,651 additions & 0 deletions data/gsm/Mistral-7B-Instruct-v0.2/Mistral-7B-Instruct-v0.2_noisy_punct_new_30.csv

Large diffs are not rendered by default.

36 changes: 36 additions & 0 deletions data/gsm/Qwen2.5-1.5B-Instruct/Qwen2.5-1.5B-Instruct.csv
@@ -0,0 +1,36 @@
Question,Answer - LLM,Answer - Ground Truth
Janet’s ducks lay 16 eggs per day. She eats three for breakfast every morning and bakes muffins for her friends every day with four. She sells the remainder at the farmers' market daily for $2 per fresh duck egg. How much in dollars does she make every day at the farmers' market?,"<|im_start|>system
You are Qwen, created by Alibaba Cloud. You are a helpful assistant.<|im_end|>
<|im_start|>user

Think step by step through the following problem and clearly show each step of your reasoning. Ensure the final answer is clearly indicated by ending with {The final answer is}.
question:
Janet’s ducks lay 16 eggs per day. She eats three for breakfast every morning and bakes muffins for her friends every day with four. She sells the remainder at the farmers' market daily for $2 per fresh duck egg. How much in dollars does she make every day at the farmers' market?
<|im_end|>
<|im_start|>system
To determine how much money Janet makes from selling her eggs at the farmers' market, we need to follow these steps:

### Step 1: Calculate the total number of eggs laid per day.
Janet's ducks lay 16 eggs per day.

### Step 2: Determine how many eggs are eaten for breakfast.
Janet eats 3 eggs for breakfast every morning. Since there are 7 days in a week (assuming a standard workweek), we can calculate the total number of eggs eaten for breakfast over a week as follows:
\[ \text{Total eggs eaten for breakfast} = 3 \times 7 = 21 \]

### Step 3: Subtract the eggs eaten for breakfast from the total number of eggs laid per day.
\[ \text{Remaining eggs after breakfast} = 16 - 21 = -5 \]
Since it's not possible to have negative eggs, this indicates that all the eggs laid are used up before the end of the day, so no eggs remain after breakfast.

### Step 4: Calculate the number of eggs used for baking muffins.
Janet uses 4 eggs to bake muffins every day.

### Step 5: Subtract the eggs used for baking muffins from the remaining eggs.
\[ \text{Remaining eggs after baking muffins} = 0 \]

### Step 6: Calculate the total revenue from selling the remaining eggs.
Since there are no eggs left after both breakfast and muffin baking, there is no revenue generated from selling any eggs. Therefore, Janet makes $0 from the farmers' market on any given day.

### Final Answer:
\[
\boxed{0}
\]<|im_end|>",18
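
The row above pairs a model transcript ending in `\boxed{0}` with a ground-truth value of 18. A minimal sketch of how rows in this `Question,Answer - LLM,Answer - Ground Truth` layout might be scored is shown below. The helper names `extract_boxed` and `score_rows`, the shortened sample row, and the exact-string comparison are assumptions for illustration, not code from this PR.

```python
import csv
import io
import re

# Matches \boxed{...} with no nested braces (an assumption about the answer format).
BOXED = re.compile(r"\\boxed\{([^{}]*)\}")


def extract_boxed(text):
    """Return the content of the last boxed expression in text, or None."""
    matches = BOXED.findall(text)
    return matches[-1].strip() if matches else None


def score_rows(csv_text):
    """Yield (predicted, truth, is_correct) for each data row."""
    # csv handles quoted fields that span several lines, as the transcripts do.
    reader = csv.DictReader(io.StringIO(csv_text))
    for row in reader:
        predicted = extract_boxed(row["Answer - LLM"])
        truth = row["Answer - Ground Truth"].strip()
        yield predicted, truth, predicted == truth


# Hypothetical, shortened stand-in for the Qwen2.5-1.5B-Instruct row above.
sample = (
    "Question,Answer - LLM,Answer - Ground Truth\n"
    'How much does Janet make?,"Step 1...\n\\[\n\\boxed{0}\n\\]",18\n'
)
results = list(score_rows(sample))

# The ground truth follows from the question: (16 - 3 - 4) eggs sold at $2 each.
expected = (16 - 3 - 4) * 2

print(results, expected)
```

Exact string comparison is the simplest choice here. A real evaluation would probably need to normalise numeric formats such as `18.0` or `$18` before comparing.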
55,782 changes: 55,782 additions & 0 deletions data/gsm/Qwen2.5-1.5B-Instruct/Qwen2.5-1.5B-Instruct_converted.csv

Large diffs are not rendered by default.
