diff --git a/README.md b/README.md
index 84ff70a44..91678f849 100644
--- a/README.md
+++ b/README.md
@@ -81,12 +81,8 @@ Have dataset(s) in one of the following format (JSONL recommended):
-See all formats
+See other formats
-- `alpaca`: instruction; input(optional)
-  ```json
-  {"instruction": "...", "input": "...", "output": "..."}
-  ```
 - `jeopardy`: question and answer
   ```json
   {"question": "...", "category": "...", "answer": "..."}
@@ -103,14 +99,6 @@ Have dataset(s) in one of the following format (JSONL recommended):
   ```json
   {"instruction": "...", "input": "...", "output": "...", "reflection": "...", "corrected": "..."}
   ```
-- `sharegpt`: conversations
-  ```json
-  {"conversations": [{"from": "...", "value": "..."}]}
-  ```
-- `completion`: raw corpus
-  ```json
-  {"text": "..."}
-  ```
 > Have some new format to propose? Check if it's already defined in [data.py](src/axolotl/utils/data.py) in `dev` branch!
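
Note on the record shapes touched by this diff: the README recommends JSONL, meaning one JSON object per line. The sketch below illustrates that convention for the four named formats in the hunks above. It is only an illustration under stated assumptions: `REQUIRED_KEYS`, `missing_keys`, `to_jsonl`, and `from_jsonl` are hypothetical helpers, not part of axolotl, and the key sets are read off the README examples rather than from `data.py`.

```python
import json

# Required top-level keys per format, copied from the README examples in the diff.
# `input` is left out for alpaca because the README marks it as optional.
# This table is an assumption for illustration, not axolotl's own schema.
REQUIRED_KEYS = {
    "alpaca": {"instruction", "output"},
    "jeopardy": {"question", "category", "answer"},
    "sharegpt": {"conversations"},
    "completion": {"text"},
}


def missing_keys(record, fmt):
    """Return the sorted required keys that `record` lacks for format `fmt`."""
    return sorted(REQUIRED_KEYS[fmt] - set(record))


def to_jsonl(records):
    """Serialize records as JSONL: one JSON object per line."""
    return "\n".join(json.dumps(record) for record in records)


def from_jsonl(text):
    """Parse JSONL text back into a list of dicts, skipping blank lines."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]
```

A record can be checked before it is written, for example `missing_keys({"question": "q"}, "jeopardy")` reports the `answer` and `category` keys as absent.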