add evaluation results for weblab-10b models by kojima-takeshi188 · Pull Request #85 · Stability-AI/lm-evaluation-harness

kojima-takeshi188 · 2023-08-27T01:24:22Z

Created models/matsuo-lab/ directory and stored evaluation results of weblab-10b models.
Updated README.md to add the results to Leaderboard.

mkshing

@kojima-takeshi188 Hi, first of all, I am sorry for my late review. It's belated, but congrats on releasing such amazing models. Before merging this PR, I left one comment regarding the base model.

Thank you in advance.

mkshing · 2023-10-11T23:36:56Z

models/matsuo-lab/weblab-10b/harness.sh

+
+MODEL_NAME="weblab-10b"
+MODEL_ARGS="pretrained=matsuo-lab/${MODEL_NAME},torch_dtype=auto"
+TASK="jcommonsenseqa-1.1-0.3,jnli-1.1-0.3,marc_ja-1.1-0.3,jsquad-1.1-0.3,jaqket_v2-0.2-0.3,xlsum_ja-1.0-0.3,xwinograd_ja,mgsm-1.0-0.3"


@kojima-takeshi188 For "base" models, prompt version 0.2 or 0.1 is fair to use. (If 0.3 is used for one base model, then for a fair comparison all base models in the leaderboard have to be re-evaluated with 0.3 and the leaderboard updated.)

Please refer to this script.
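As a rough illustration of the suggestion above, the task list could be switched from 0.3 to 0.2 as sketched below. This assumes the trailing number in each task name is the prompt version and that a 0.2 variant is registered for every task, neither of which is confirmed in this thread. `BASE_TASK` is a hypothetical variable name used only for this example, and the referenced script remains the authoritative source.

```shell
#!/bin/sh
# Task list as submitted in this PR (every versioned task ends in "-0.3").
TASK="jcommonsenseqa-1.1-0.3,jnli-1.1-0.3,marc_ja-1.1-0.3,jsquad-1.1-0.3,jaqket_v2-0.2-0.3,xlsum_ja-1.0-0.3,xwinograd_ja,mgsm-1.0-0.3"

# Assumption: the trailing "-0.3" is the prompt version.
# Replace it only at the end of each comma-separated entry, so the
# task version in "jaqket_v2-0.2-..." and the unversioned
# "xwinograd_ja" entry are left untouched.
# BASE_TASK is a hypothetical name, not taken from the repository.
BASE_TASK="$(printf '%s' "$TASK" | sed -E 's/-0\.3(,|$)/-0.2\1/g')"

echo "$BASE_TASK"
```

Anchoring the pattern on the following comma or end of string keeps the substitution from touching any other "0.3" that might appear inside a task name.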

acc12649wd added 2 commits August 27, 2023 09:56

add evaluation results for weblab-10b models

7e0c091

add evaluation results for weblab-10b models

20a7315

kojima-takeshi188 requested a review from jon-tow as a code owner August 27, 2023 01:24

kojima-takeshi188 changed the title ~~Weblab 10b~~ add evaluation results for weblab-10b models Aug 27, 2023

mkshing requested review from mkshing and mrorii and removed request for jon-tow October 11, 2023 23:32

mkshing reviewed Oct 11, 2023

kojima-takeshi188 wants to merge 2 commits into Stability-AI:jp-stable from kojima-takeshi188:weblab-10b
