From 857a80b70e0d0ede4ec70aade7f1c504967ea0aa Mon Sep 17 00:00:00 2001 From: NanoCode012 Date: Sun, 21 May 2023 23:28:06 +0900 Subject: [PATCH] Format dataset types --- README.md | 14 +++++++------- 1 file changed, 7 insertions(+), 7 deletions(-) diff --git a/README.md b/README.md index ae926e02f..1b93f503a 100644 --- a/README.md +++ b/README.md @@ -35,31 +35,31 @@ Go ahead and axolotl questions!! Have a dataset in one of the following format (JSONL recommended): -- alpaca: instruction; input(optional) +- `alpaca`: instruction; input(optional) ```json {"instruction": "...", "input": "...", "output": "..."} ``` -- jeopardy: question and answer +- `jeopardy`: question and answer ```json {"question": "...", "category": "...", "answer": "..."} ``` -- oasst: instruction +- `oasst`: instruction ```json {"INSTRUCTION": "...", "RESPONSE": "..."} ``` -- gpteacher: instruction; input(optional) +- `gpteacher`: instruction; input(optional) ```json {"instruction": "...", "input": "...", "response": "..."} ``` -- reflection: instruction with reflect; input(optional) +- `reflection`: instruction with reflect; input(optional) ```json {"instruction": "...", "input": "...", "output": "...", "reflection": "...", "corrected": "..."} ``` -- sharegpt: conversations +- `sharegpt`: conversations ```json {"conversations": [{"from": "...", "value": "..."}]} ``` -- completion: raw corpus +- `completion`: raw corpus ```json {"text": "..."} ```