support ShareGPT dataset as data file#305
support ShareGPT dataset as data file#305tukwila wants to merge 15 commits intovllm-project:mainfrom
Conversation
f8c2231 to
d246dee
Compare
1cf7e56 to
e98bd0e
Compare
|
This seems external to the GuideLLM. Can you please move all code and documentation to |
1d840bc to
a347948
Compare
Done |
|
Sorry I forgot about this PR due to the sudden flurry of new PRs. Can you also move the changes in |
394f505 to
d904a7e
Compare
Done |
jaredoconnell
left a comment
There was a problem hiding this comment.
Is the requirements.txt supposed to include all dependencies? I had to install datasets and transformers for it to work.
It may be beneficial to also note that you need to run it with the HF_TOKEN value set.
Once I addressed these it appears to have worked.
yes, i updated and retest it. |
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
f9b581c to
ae8945b
Compare
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
ae8945b to
15a9fd8
Compare
…into support_sharegpt
…into support_sharegpt
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
…into support_sharegpt
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
eb71524 to
5d0f804
Compare
…into support_sharegpt
Signed-off-by: guangli.bao <guangli.bao@daocloud.io>
Summary
Details
I hope data file can support ShareGPT as benchmark test data such as: ShareGPT_V3_unfiltered_cleaned_split.json; In this PR, user can abstract testing prompts from origin file and filter human prompts (10 < words < 1000) to save into local file, refer to:
Test Plan
Related Issues
Use of AI
## WRITTEN BY AI ##)