Conversation
sharegpt
conversations where from is human/gpt. (optional: first row with role system to override default system prompt)
Note: type: sharegpt opens special configs: - conversation: enables conversions to many Conversation types. Refer to the ‘name’ here for options. - roles: allows you to specify the roles for input and output. This is useful for datasets with custom roles such as tool etc to support masking. - field_human: specify the key to use instead of human in the conversation. - field_model: specify the key to use instead of gpt in the conversation.
datasets:
path: ...
type: sharegpt
conversation: # Options (see Conversation 'name'): https://github.com/lm-sys/FastChat/blob/main/fastchat/conversation.py
field_human: # Optional[str]. Human key to use for conversation.
field_model: # Optional[str]. Assistant key to use for conversation.
# Add additional keys from your dataset as input or output roles
roles:
input: # Optional[List[str]]. These will be masked based on train_on_input
output: # Optional[List[str]].pygmalion
sharegpt.load_role
conversations where role is used instead of from
sharegpt.load_guanaco
conversations where from is prompter assistant instead of default sharegpt
sharegpt_jokes
creates a chat where bot is asked to tell a joke, then explain why the joke is funny