New Data Questions

Hello, all! I find your dataset fascinating, and I am glad you posted new chats from this summer. But I am having some trouble understanding the formatting. It has changed significantly since the first data dump, and the documentation does not address these changes. I have listed the major issues below, can you clarify?

1. There is no longer any context text in the new files, was this dropped?

2. In some (but not all) of the files there are no longer any user profiles. Was this dropped in the middle of the data collection?

3. There is also only one evaluation metric ("eval_score"), rather than three ("breadth","engagement", and "quality"). Was the paradigm changed from the first rounds? And what is "profile_match" all about?

4. How are we supposed to know which participant is the bot and which is the human? Are they consistently labeled (e.g. participant1 is always human) or is there a separate key we need?

In summary, this is a fantastic resource but I am not sure how useful it is without understanding how the data was assembled. Or, is there an updated data dictionary available anywhere?


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

New Data Questions #23

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

New Data Questions #23

Description

Metadata

Metadata

Assignees

Labels

Projects

Milestone

Relationships

Development

Issue actions