diff --git a/README.md b/README.md index ca36acbcb..d032d1d31 100644 --- a/README.md +++ b/README.md @@ -261,6 +261,12 @@ See sample configs in [configs](configs) folder or [examples](examples) for quic datasets: - path: vicgalle/alpaca-gpt4 type: alpaca # format from earlier + + # huggingface repo with specific configuration/subset + datasets: + - path: EleutherAI/pile + name: enron_emails + type: completion # format from earlier # local datasets: